Beautiful explanation, simply beautiful !
Very well explained,
And the production quality is out of this world
Thank you sir 🙏🏼
GREAT channel! Subscribing!
Currently struggling through global multivariate nonlinear optimization of a fortunately-differentiable nonconvex, high-dimensional scalar function.
Appreciate your very clear explanation!
ty
Stochastic gradient descent is magical thinking that actually works. Sometimes you get lucky.
Hi Professor Kutz, do you still have a bounty on errata? I think your going rate used to be around $0.25 per instance 😊. I believe there might be a minor erratum around timestamp 7:23. At that point you state that f is the map from input to output (effectively the forward propagation through the network), but in the context of gradient descent you actually want the gradient of the error function that you mentioned on the previous slide (around timestamp 6:55).
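To make the distinction concrete, here is a minimal gradient-descent sketch in Python; the toy network f, the single data point, and the learning rate are all made up for illustration, not taken from the video:

```python
import numpy as np

# Toy setup: "network" f(x, w) = w * x and a target output y_true.
# Gradient descent minimizes the error E(w) = (f(x, w) - y_true)^2,
# so the update uses dE/dw, not f itself.
x, y_true = 2.0, 6.0        # one training pair (made up)
w = np.random.randn()       # random initial weight
eta = 0.05                  # learning rate (assumed value)

for _ in range(100):
    y_pred = w * x                    # forward map f
    grad = 2 * (y_pred - y_true) * x  # dE/dw by the chain rule
    w -= eta * grad                   # descend on E, not on f

print(w)  # approaches y_true / x = 3.0
```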
19:00
Can you explain what the "weights" are to a non-computer science person? (There's so much jargon, my brain)
Weights are all the connections between the neurons (the neurons themselves are some sort of nonlinear function). So the weights are the network's memory, and they are what has to be trained. The weights are initialized with random values before training starts. It's important that the weights are not set symmetrically before training begins.
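Here is a small NumPy sketch of that symmetry point; the layer sizes and scales are arbitrary choices:

```python
import numpy as np

rng = np.random.default_rng(0)

# Two hidden units, three inputs. If both units start with identical
# weights, they compute the same output, receive the same gradient,
# and stay identical forever; random initialization breaks that tie.
W_sym = np.full((2, 3), 0.5)               # symmetric start
W_rand = rng.normal(scale=0.1, size=(2, 3))  # random start

x = rng.normal(size=3)       # one input vector (made up)
print(np.tanh(W_sym @ x))    # two identical activations
print(np.tanh(W_rand @ x))   # two different activations
```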
"weights" can be considered as coefficients "a" and "b" in the following equation:
y = a*x + b.
Weights occupy specific "positions" that determine the mathematical structure of the computational model they build.
For example, in the equation above, "a" represents the linear term (the slope) and "b" the constant term (the y-intercept): "a" occupies the position of the linear coefficient, and "b" the position of the constant. The pair (a, b) does not define a linear model until we assign those positions to "a" and "b".
"Weights" in a neural network model are something like this, but in a much higher dimensional space. Hope this helps.
Think of a house with a set of attributes: # of bedrooms, door color, # of washrooms, # of floors (plus whatever else you want to add, like # of windows). Each attribute gets a weight that says how much it affects the house price: # of bedrooms may have a high weight (a large coefficient), while door color may have a low one. Together the weights make up a formula into which you can plug the attributes of your own house and estimate its price.
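Something like this, as a sketch; every feature value and weight below is hypothetical:

```python
import numpy as np

# Hypothetical features of one house: [bedrooms, washrooms, floors, windows]
features = np.array([3, 2, 2, 10], dtype=float)

# Hypothetical learned weights: bedrooms matter a lot, windows barely.
weights = np.array([40_000, 15_000, 20_000, 500], dtype=float)
bias = 50_000  # base price, also made up

price = weights @ features + bias
print(f"estimated price: ${price:,.0f}")  # $245,000
```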
Are you related to Matthew McConaughey?
What counts as "a lot" of weights? The problem I have had is that it's hard to find a step where the derivatives of all the parameters decrease the error at once. Usually there are a parameter or two that significantly increase the sum of squared errors (or mean squared error), so the step can't be big, and one of the many parameters keeps the parameter set from moving toward a minimum. In other words, the valley the parameter set is trying to walk down is very narrow. I have lots of real data, and GD always has trouble optimizing it. There are many algorithms that are much better when the number of parameters is under 25. The professor's visual example is fine for teaching, but it is too simple.
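That narrow-valley behavior is easy to reproduce on an ill-conditioned quadratic; a sketch (the condition number and step size are arbitrary choices):

```python
import numpy as np

# E(w) = 0.5 * (w1^2 + 100 * w2^2): a valley 100x steeper along w2.
scales = np.array([1.0, 100.0])
w = np.array([10.0, 1.0])

# The steep direction caps the stable step size (eta must stay below
# 2/100), so progress along the shallow w1 direction crawls.
eta = 0.019
for _ in range(50):
    w -= eta * scales * w  # gradient of the quadratic is scales * w

print(w)  # w2 has essentially converged; w1 is still far from 0
```

Momentum or second-order methods (e.g., Newton or L-BFGS) handle this kind of narrow valley much better than plain GD, which matches the experience described above.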