- Reinforcement Learning with TensorFlow
- Sayon Dutta
- 277字
- 2021-08-27 18:51:53
The gradient descent algorithm
The gradient descent algorithm is an optimization algorithm to find the minimum of the function using first order derivatives, that is, we differentiate functions with respect to their parameters to first order only. Here, the objective of the gradient descent algorithm would be to minimize the cost function with regards to
and
.
This approach includes following steps for numerous iterations to minimize :
used in the above equations refers to the learning rate. The learning rate is the speed at which the learning agent adapts to new knowledge. Thus,
, that is, the learning rate is a hyperparameter that needs to be assigned as a scalar value or as a function of time. In this way, in every iteration, the values of
and
are updated as mentioned in the preceding formula until the value of the cost function reaches an acceptable minimum value.
The gradient descent algorithm means moving down the slope. The slope of the the curve is represented by the cost function with regards to the parameters. The gradient, that is, the slope, gives the direction of increasing slope if it's positive, and decreasing if it's negative. Thus, we use a negative sign to multiply with our slope since we have to go opposite to the direction of the increasing slope and toward the direction of the decreasing.
Using the optimum learning rate, , the descent is controlled and we don't overshoot the local minimum. If the learning rate,
, is very small, then convergence will take more time, while if it's very high then it might overshoot and miss the minimum and diverge owing to the large number of iterations:

- 構(gòu)建高質(zhì)量的C#代碼
- Canvas LMS Course Design
- 機(jī)器學(xué)習(xí)及應(yīng)用(在線實(shí)驗(yàn)+在線自測(cè))
- 影視后期制作(Avid Media Composer 5.0)
- CSS全程指南
- 城市道路交通主動(dòng)控制技術(shù)
- Maya 2012從入門到精通
- 傳感器技術(shù)應(yīng)用
- 可編程控制器技術(shù)應(yīng)用(西門子S7系列)
- AutoCAD 2012中文版繪圖設(shè)計(jì)高手速成
- INSTANT Autodesk Revit 2013 Customization with .NET How-to
- 運(yùn)動(dòng)控制器與交流伺服系統(tǒng)的調(diào)試和應(yīng)用
- 走近大數(shù)據(jù)
- 基于企業(yè)網(wǎng)站的顧客感知服務(wù)質(zhì)量評(píng)價(jià)理論模型與實(shí)證研究
- 基于ARM9的小型機(jī)器人制作