Neural Networks: Computing Partial Derivatives

We will compute the partial derivative of the cross-entropy cost function with respect to one weight in each connection, with a sigmoid activation function for each layer.
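For reference, here is one common way to write the quantities involved. This is a sketch only: the symbols (a for activations, z for weighted inputs, w for weights, y for targets, L for the output layer) are assumptions made for this illustration, and biases are left out for brevity.

```latex
\begin{aligned}
C &= -\sum_{j} \left[ y_j \ln a_j^{(L)} + (1 - y_j) \ln\left(1 - a_j^{(L)}\right) \right] \\
z_j^{(\ell+1)} &= \sum_{i} w_{ji}^{(\ell)} \, a_i^{(\ell)} \\
a_j^{(\ell+1)} &= \sigma\left(z_j^{(\ell+1)}\right), \qquad \sigma(z) = \frac{1}{1 + e^{-z}}
\end{aligned}
```

With this notation, "one weight in each connection" means a single entry w_{ji} linking node i in layer ℓ to node j in layer ℓ+1.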

Given

Therefore

Takeaways:

  • We can compute the partial derivative of the cost with respect to a weight by multiplying the value of the node in layer ℓ by the 𝛿 of the connected node in layer ℓ+1
  • The 𝛿's of the nodes in layer ℓ can be computed from the 𝛿's of the nodes in layer ℓ+1
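The two takeaways can be checked numerically. Below is a minimal sketch in plain Python, assuming a tiny 2-3-2 network with no biases, sigmoid activations in every layer, and 𝛿 defined as the derivative of the cost with respect to a node's weighted input. The function names, the weight values, and the sample input are all hypothetical, chosen only for illustration. It compares the gradients obtained from the two rules against finite-difference estimates.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def forward(weights, x):
    # weights[l][j][i] connects node i in layer l to node j in layer l+1
    # (assumption: no bias terms, to keep the sketch short).
    activations = [list(x)]
    for W in weights:
        prev = activations[-1]
        activations.append(
            [sigmoid(sum(w * a for w, a in zip(row, prev))) for row in W]
        )
    return activations

def cross_entropy(output, y):
    return -sum(t * math.log(o) + (1 - t) * math.log(1 - o)
                for o, t in zip(output, y))

def backprop(weights, x, y):
    acts = forward(weights, x)
    # Output layer: for sigmoid + cross-entropy, delta simplifies to a - y.
    delta = [a - t for a, t in zip(acts[-1], y)]
    grads = [None] * len(weights)
    for l in range(len(weights) - 1, -1, -1):
        # Takeaway 1: dC/dw[l][j][i] = (node value in layer l) * (delta in layer l+1)
        grads[l] = [[d * a for a in acts[l]] for d in delta]
        if l > 0:
            # Takeaway 2: deltas of layer l computed from deltas of layer l+1,
            # scaled by the sigmoid derivative a * (1 - a).
            delta = [
                sum(weights[l][j][i] * delta[j] for j in range(len(delta)))
                * acts[l][i] * (1 - acts[l][i])
                for i in range(len(acts[l]))
            ]
    return grads

def numerical_grad(weights, x, y, l, j, i, eps=1e-6):
    # Central finite difference on a single weight, restoring it afterwards.
    original = weights[l][j][i]
    weights[l][j][i] = original + eps
    plus = cross_entropy(forward(weights, x)[-1], y)
    weights[l][j][i] = original - eps
    minus = cross_entropy(forward(weights, x)[-1], y)
    weights[l][j][i] = original
    return (plus - minus) / (2 * eps)

# Hypothetical example values.
weights = [
    [[0.1, -0.2], [0.4, 0.3], [-0.5, 0.2]],  # layer 0 (2 nodes) -> layer 1 (3 nodes)
    [[0.3, -0.1, 0.2], [-0.4, 0.5, 0.1]],    # layer 1 (3 nodes) -> layer 2 (2 nodes)
]
x, y = [0.5, -1.0], [1.0, 0.0]

grads = backprop(weights, x, y)
max_error = max(
    abs(grads[l][j][i] - numerical_grad(weights, x, y, l, j, i))
    for l in range(len(weights))
    for j in range(len(weights[l]))
    for i in range(len(weights[l][j]))
)
print(f"largest gap between backprop and finite differences: {max_error:.2e}")
```

If the two rules hold, the printed gap should be tiny, limited only by floating-point and finite-difference error. Note that the 𝛿's are propagated from the output layer backward, so each layer's gradients reuse work already done for the layer after it.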
