Machine Learning Project in Python: Predicting California Housing Prices

Поделиться
HTML-код
  • Опубликовано: 19 окт 2024

Комментарии • 41

  • @GregHogg
    @GregHogg  Год назад

    Take my courses at mlnow.ai/!

  • @michaelle1229
    @michaelle1229 6 месяцев назад +2

    Thank you, Greg, for this amazing tutorial, helped me finish how to predict house prices in Melbourne.

    • @michaelle1229
      @michaelle1229 6 месяцев назад

      I have a question, does neural networks can be helpful or useful to reduce any noise/interference or overfitting on the final dataset? If so, what conclusion do you make in this case?

  • @rednax25
    @rednax25 2 года назад +3

    Please let us know how we can encourage you to do more videos like this......Thanks for the wonderful explanations. I really love your style of teaching.

    • @GregHogg
      @GregHogg  2 года назад +1

      Thanks so much Alexander - which is my middle name by the way!

  • @andreiardelean9324
    @andreiardelean9324 Год назад +4

    The video was very informative but I have what might be a silly question, however I am new to ML. How can we interpret the predicition values? I have seen you during the entire video mentioning here and there "oh this is overfitting" or "there is no overfitting here". I would like to know how can we differentiate overfitting by looking at the prediction values. Unless I am missing something?

  • @agnesdemarco9386
    @agnesdemarco9386 6 месяцев назад

    You're the best! Brilliant and clear explanation!!!! Thank youuu!!!

    • @GregHogg
      @GregHogg  6 месяцев назад

      Very welcome 😁😁

  • @yanhu2011
    @yanhu2011 Год назад

    Amazing video! Do you currently have a video that talks different AI models ? Linear regression, neural networks etc? And when and why one should be used for a particular data set?

  • @chenute-o8r
    @chenute-o8r Год назад +1

    16:00 , i find a easy way to do:
    median_house_value = 'median_house_value'
    all_columns = list(data.columns)
    all_columns.remove(median_house_value)
    new_column_list = all_columns + [median_house_value]
    data_new = data[new_column_list]
    data_new

    • @chenute-o8r
      @chenute-o8r Год назад +1

      also 19:50 it can use another easy and auto way to split train_test
      from sklearn.model_selection import train_test_split
      X = data_new.drop(['median_house_value'], axis=1)
      y = data['median_house_value']
      X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

  • @sgpleasure
    @sgpleasure 3 года назад +1

    Thanks for the new video. Would using a PIPELINE be cleaner and recommended by you?

    • @GregHogg
      @GregHogg  3 года назад +1

      You're welcome! Pipelines are great. I've never personally used them, but I know how they work and they're pretty awesome.

  • @denysshushpanov7630
    @denysshushpanov7630 2 года назад

    Great video!
    I have a question though, how would you approach data preprocessing while making predictions on trained model with the new data (let’s say user input or streaming)?

    • @GregHogg
      @GregHogg  2 года назад

      As long as you have some predefined processing function that doesn't depend on anything, you would just apply that to every input

  • @bnneit777
    @bnneit777 8 месяцев назад

    Hi everyone! MSE and RMSE are too large, whether they are reasonable? All the predicting house price tasks I had done had the big MSE and RMSE like that.

  • @ayamouallem3619
    @ayamouallem3619 2 года назад +1

    Great explanation !! but i have a question , how can i know the correct split for train , test and validation , is it ok if test and val cross in some data ??

    • @GregHogg
      @GregHogg  2 года назад

      Roughly 70, 15, 15, at random

  • @mathhack8647
    @mathhack8647 3 года назад +1

    @39:00 the validate MSE is worse then the training. The model did not generalize but does this mean that's an overfitting issue.?

    • @GregHogg
      @GregHogg  3 года назад +2

      Yes, we overfit to the training set :)

  • @MohdIrshad-hv4vw
    @MohdIrshad-hv4vw 3 года назад

    That is good
    It will really helpful for me

  • @germanshepherd6638
    @germanshepherd6638 6 месяцев назад

    Why didn't you show us how to create the API and upload it?

  • @arsheyajain7055
    @arsheyajain7055 3 года назад

    Great project!!

    • @GregHogg
      @GregHogg  3 года назад +1

      Thank you ☺️

  • @mdashik2291
    @mdashik2291 Год назад

    Could u make a video for roadmap of machine learning & deep learning with Coursera in 2023 plz❤️

    • @GregHogg
      @GregHogg  Год назад +1

      I'd say my roadmap is still valid

  • @anitube7360
    @anitube7360 3 года назад

    can we have a project tutorial on real-time facial emotion detector in python......... plz tell the different approaches/methods that we can take to complete this project.

    • @GregHogg
      @GregHogg  3 года назад

      Okay this will be added to the to-do list!

  • @norbertishimwe228
    @norbertishimwe228 8 месяцев назад

    On 39:14 why did you predict on the train data instead of the test data?

    • @rohitsali2603
      @rohitsali2603 4 месяца назад

      Because we have to calculate the mse of training data not testing data
      Because we want to know how much accurate our model is by learning from training data

  • @longtruong9935
    @longtruong9935 3 года назад

    could you provide the notebook link?

    • @GregHogg
      @GregHogg  3 года назад

      Thank you very much for reminding me - it's there now.

  • @andrewsnell2309
    @andrewsnell2309 2 года назад

    I am doing this in google collab and and it keeps saying Import "tensorflow.keras..." could not be resol

    • @GregHogg
      @GregHogg  2 года назад +1

      Are you importing exactly the same way?

  • @thummapalasuresh292
    @thummapalasuresh292 2 года назад

    Could you please provide the dataset....

  • @__________________________6910
    @__________________________6910 2 года назад

    U are best