Logistic Regression Indepth Maths Intuition In Hindi

Поделиться
HTML-код
  • Опубликовано: 1 дек 2024

Комментарии • 69

  • @parultiwari8734
    @parultiwari8734 10 месяцев назад +4

    wish i could add more thousands of likes from my side. such a great explanation!!
    Thank you sir!

  • @sairabano7968
    @sairabano7968 2 года назад +6

    Thanks, Krish for making videos in Hindi. You always make things easy to understand.

  • @SachinSharma-hv3wm
    @SachinSharma-hv3wm 2 года назад +1

    Thank u so much krish sir for making videos in hindi.....aapka way of explanation bhut easy hota hai...aap complex chizo ko bhi easy bna dete ho😊😊

  • @kmishy
    @kmishy Год назад +2

    after 1 year, today I understood why do we have log term in cost function of logistic 22:00

  • @SachinKumar-zl6ku
    @SachinKumar-zl6ku 2 года назад +2

    You are doing amazing work man

  • @hades840
    @hades840 11 месяцев назад +2

    23:22 need to keep in mind ? because i am very bad with logs

  • @UmerFarooq-zv1ky
    @UmerFarooq-zv1ky 2 месяца назад

    explanation is good.
    But the Explanation of Nitish sir Campusx is another level.

  • @himanshugamer1888
    @himanshugamer1888 11 месяцев назад

    Superb Explanation Sir ❤❤

  • @osamaosama-vh6vu
    @osamaosama-vh6vu 2 года назад

    Great explantion thank u dear sir be happy😍

  • @chaotic_singer13
    @chaotic_singer13 6 месяцев назад +1

    The intuition is good but if you can help us with a proper derivation and also about the thought process i.e. how do we thought the way we though. It will be deep!!!

  • @sankhadipbera9271
    @sankhadipbera9271 3 месяца назад

    quality content ❤‍🔥❤‍🔥

  • @kiddo7094
    @kiddo7094 2 года назад

    maja aa gya quick and understable

  • @poizn5851
    @poizn5851 2 года назад

    Thank you so much this one clear my whole droughts

  • @aradhyakanth8409
    @aradhyakanth8409 2 года назад +1

    Thank you sir.

  • @programmingcodewithmukesh2138
    @programmingcodewithmukesh2138 Год назад

    thank you sir ..so helpful for me

  • @singhramniwassinghsinghram7676
    @singhramniwassinghsinghram7676 2 года назад

    Very Helpful Video

  • @krishj8011
    @krishj8011 6 месяцев назад

    great tutorial

  • @Deeprfc12346
    @Deeprfc12346 6 месяцев назад

    bhai aap ak video. Text Mining and Sentiment Analysis pe bna dijiye.

  • @arshad1781
    @arshad1781 2 года назад +1

    Nice 👍

  • @sohildoshi2655
    @sohildoshi2655 2 года назад

    You are legend!!

  • @sabbiruddinakash7181
    @sabbiruddinakash7181 6 месяцев назад

    Thank you sir

  • @prakharagarwal9448
    @prakharagarwal9448 2 года назад +1

    Krish, when will next community session start

  • @jannatunferdous103
    @jannatunferdous103 3 месяца назад

    Pass=1 and Fail=0 till okay, but what is higher than 1? and how study hour can be less than 0? time can not be less than 0.

  • @AmmarAhmedSiddiqui
    @AmmarAhmedSiddiqui 10 месяцев назад

    local minima se global nikalne time ap ne dundi mar di !..

  • @rohitjagtap5228
    @rohitjagtap5228 2 года назад

    thanks a lot

  • @pkumar0212
    @pkumar0212 4 месяца назад

    👌

  • @sureshsingam7291
    @sureshsingam7291 2 года назад

    The maths for logistic regression you upload in ml playlist is completely different from hindi playlist which is correct🙆‍♂️😰😰

    • @tammy4994
      @tammy4994 Год назад

      Even I had the same confusion, @krishnaik could you please clarify?

  • @praneetnayak6757
    @praneetnayak6757 2 года назад +2

    IN that case what does "Maximum Likelihood" mean?

    • @rehmanahmadchaudhry2548
      @rehmanahmadchaudhry2548 Год назад

      maximum likelihood is used to simply estimate the parameters i.e. coffcients, these cofficients are further used in odds, log odds

    • @anamitrasingha6362
      @anamitrasingha6362 Год назад

      Likelihood of parameters means what's the probability of having observed the particular distribution of the dataset that you have with your right now given that I choose a particular set of parameters. What maximum likelihood estimation says is that you want to find that set of parameters that maximises the probability of having observed that distribution of the dataset that you have. You do that by taking the gradient of the likelihood/log-likelihood function with respect to the parameters and equating to 0, then solving for those parameters

  • @HAVINOSH
    @HAVINOSH Год назад +1

    do we need not need to square the last equation ?

    • @parul15137
      @parul15137 Год назад +1

      no

    • @uroojmalik8454
      @uroojmalik8454 Год назад

      @@parul15137 why???

    • @beerajsaikia
      @beerajsaikia 6 дней назад

      @@uroojmalik8454 because squared error and linear sigmoid makes it non-convex

  • @aradhyakanth8409
    @aradhyakanth8409 2 года назад +1

    ​sir, for classification we have classifier model. so, why logistic Regression

    • @atulkadam6345
      @atulkadam6345 2 года назад

      You can use any model whichever gives you best performance wrt training and testing data

    • @RonaldoRewind-cr7
      @RonaldoRewind-cr7 2 года назад +1

      logestic regression is a classification problem its name is regression but actually it is classifier problm

    • @Ram_jagat
      @Ram_jagat Год назад

      @@RonaldoRewind-cr7 exactly

    • @prabhatupadhyay7526
      @prabhatupadhyay7526 9 месяцев назад

      Because in logistic regression we take sigmoid function and sigmoid return data between o to 1.

  • @barwalgayatri4655
    @barwalgayatri4655 Месяц назад

    Bessssssssssssssssttttttttttttttt

  • @kulbhushansingh1101
    @kulbhushansingh1101 2 года назад +1

    Sir you didn't teach here about loss function in logistic regression

    • @kartikeysingh5781
      @kartikeysingh5781 11 месяцев назад +1

      Loss function will be the same as regression just you have to replace the hypothesis function by hypothesis function for logistic regression

    • @kulbhushansingh1101
      @kulbhushansingh1101 10 месяцев назад +1

      @@kartikeysingh5781 thanks kartikey, I got it that was 1 year ago 😂

  • @beingaiiitian4559
    @beingaiiitian4559 7 месяцев назад

    9:48

  • @shivamgondkar6183
    @shivamgondkar6183 10 месяцев назад

    Hello sir
    Can you please provide notes in pdf form?
    Thanks

  • @ankurgbpuat
    @ankurgbpuat Год назад

    Please tell us why a log function is used as a cost function(if you know at all)

    • @shaileshkumar-rg9tg
      @shaileshkumar-rg9tg Год назад

      if you know -we are all ears.

    • @ankurgbpuat
      @ankurgbpuat Год назад

      @@shaileshkumar-rg9tg Sure thing! It's done to ensure the cost function is convex.

    • @anamitrasingha6362
      @anamitrasingha6362 Год назад

      there are various flavors of ML algorithms. In logistic regression with the approach of trying to learn a discriminative function that can classify a point into a particular label => a function f:X->Y such that f(datapoint) = class_label(belonging to set Y). Since these class labels are discrete if you try to use a mean_squared_error loss function you will get an expression of the loss function which will not be a convex function, I have attempted a proof of it but it involves a bit of intricate mathematics. You can do that by showing that the hessian of the loss function is neither positive semi-definite nor negative semi-definite hence it's neither convex nor concave. When you use a loss function which is a logistic loss function you get a concave function and you basically would need to do a gradient ascent to get to the maxima of the concave function. Again these involve concepts from Convex Optimization which you may attempt to read if interested from Boyd.

  • @rahuldogra7171
    @rahuldogra7171 Год назад

    Can you explain probabilistic approach for logistic regression?

    • @arshiyakhan7757
      @arshiyakhan7757 Год назад

      Maximum likehood

    • @anamitrasingha6362
      @anamitrasingha6362 Год назад

      Let's say you have a 2-class classification problem. You henceforth assume that your random variable Y values come from a Bernoulli distribution, with each label being either 0 or 1. This random variable Y can take on value 1 based on some probability theta(say), since probabilities of a pmf add up to one hence you can also infer that Y takes on value 0 with probability (1-theta). Now you have a dataset with you consisting of features X(n features say) and your target Y and the number of observations(samples) you have is m(say). What you want to learn is a function mapping f such that f: X -> Y. This f can be a probabilistic function as well. You define the probability of having observed a particular datapoint taking on the y value as say 1 given its features x as Pr(y_i=1|x_i). What you want now is to find the probability of having observed the values of Y across the dataset in the particular order(like y_1 takes value 1, y_2 takes value 0, these y values are what you have from the dataset) given the features X across the whole dataset(in the same order) , so basically Pr(Y|X;theta) this is read as the probability of having observed Y given that you have observed X parameterised by theta. You now define your likelihood function as L(theta) which means the likelihood of theta => the probability of having observed this Y given X. Since each of the observations/samples are independent and they are believed to have come from the same bernoulli distibution(with replacement) or in short i.i.d you say that the Pr(Y|X;theta) = product across all i (Pr(y_i = 1|x_i; theta). Why I did this is because of the independence property in probability which says the Pr(A and B) = Pr(A)*Pr(B) if event A and event B are independent. You now take a log on both sides so as to make your calculation easier and it becomes summation across all i (log(Pr(y_i=1|x_i; theta)). This is called your log-likelihood. What you now want to do is find the value of theta for which this expression is maximized which is known as maximum likelihood estimation. I should also add that this theta is assumed to be a function of w^Tx => g(w^Tx) where typically your g is a sigmoid function. So when you take the gradient you also have to substitute this function in the log-likelihood expression and then you take the gradient w.r.t w.

  • @shrutisingh9801
    @shrutisingh9801 8 месяцев назад

    y=0, y=1, y is predicted value right?

  • @abhishektiwari9673
    @abhishektiwari9673 2 года назад

    Sir this ML playlist is enough to learn complete machine learning.

  • @SorryMe-o1y
    @SorryMe-o1y 4 месяца назад

  • @GhulamMustafaSherazi
    @GhulamMustafaSherazi Год назад

    Sir I want to ask how z= theta0 + theta1 X1 converted to z = theta tranpose of x. waiting for your reply.

    • @iamravimeena
      @iamravimeena Год назад

      Here we have theta = [theta0, theta1] and X = [1, X1], we are transposing theta matrix to get a single value after the multiplication, which is our hypothesis. z = theta0 + theta1 * X1 is another way of writing it. But z = theta transpose * X is a general way (in case if we have multiple features(X.columns > 2)).

  • @cs_soldier5292
    @cs_soldier5292 Год назад +1

    Kuch samaj mein nhi aaya sir

  • @as8401
    @as8401 2 года назад +4

    Sir sorry but subkuch dimag ke uppar se chala gaya

    • @MovieOk-p8q
      @MovieOk-p8q 9 месяцев назад +1

      Shi bola

    • @supriyasaxena5053
      @supriyasaxena5053 3 месяца назад +1

      Thanks maine puri vid dekhne se pehle ye comment dekh liya

    • @as8401
      @as8401 3 месяца назад

      @@supriyasaxena5053 मुझे खुशी है की आपका टाईम मेने बाचाया

  • @arshad1781
    @arshad1781 2 года назад +1

    Thanks 👍