Kaggle Earthquake Prediction Challenge

Поделиться
HTML-код
  • Опубликовано: 13 сен 2024
  • Data Science coding challenge time! The popular Data Science competition website Kaggle has an ongoing competition to solve the problem of earthquake prediction. Given a dataset of seismographic activity from a laboratory simulation, participants are asked to create a predictive model for earthquakes. In this video, I'll attempt the challenge as a way to teach 3 concepts; the Data Science mindset, Categorical Boosting, and Support Vector Regression models. I'll be coding this using python from start to finish in the online Google colab environment. Enjoy!
    Code for this video:
    github.com/llS...
    Please Subscribe! And Like. And comment. Thats what keeps me going.
    Want more education? Connect with me here:
    Twitter: / sirajraval
    instagram: / sirajraval
    Facebook: / sirajology
    Join us at the School of AI:
    theschool.ai/
    More learning resources:
    www.kaggle.com...
    www.analyticsv...
    blog.griddynam...
    towardsdatasci...
    accio.github.i...
    kernelsvm.tripo...
    www.saedsayad....
    / support-vector-regress...
    scikit-learn.o...
    Join us in the Wizards Slack channel:
    wizards.herokua...
    Please support me on Patreon:
    www.patreon.co...
    Signup for my newsletter for exciting updates in the field of AI:
    goo.gl/FZzJ5w
    Hit the Join button above to sign up to become a member of my channel for access to exclusive content! Join my AI community: chatgptschool.io/ Sign up for my AI Sports betting Bot, WagerGPT! (500 spots available):
    www.wagergpt.co

Комментарии • 133

  • @druestaples3044
    @druestaples3044 5 лет назад +46

    I really appreciate these kaggle competitions you do cause it’s helping me gain confidence to do my own, which in turn is adding projects to my portfolio!

  • @X_platform
    @X_platform 5 лет назад +75

    The best video you have made to date.
    Cutting down the meme, rap, special effects really helped me focus more.
    Keep up the good work!

  • @amitkhetan5062
    @amitkhetan5062 4 года назад +6

    Hey Dude,
    I'm a DS Practitioner from India and I really appreciate 🙌 your work. A year ago, I was working in Mumbai and heard about a few things that got viral about you. I cannot choose to ignore that but I personally feel that you've been a strong influencer in the whole data science community. Just be yourself and keep doing what you do.. with citations and references, of course. :D #LetsMakeIndiaAGreatNation 🙌

    • @anilchaudhry804
      @anilchaudhry804 3 года назад

      dude u first go learn datastructures and algorithms

  • @НатальяСоколова-я1г

    Thanks Siraj! Study ML for 3 weeks already and want you to shoot more episodes with more focus on data-preparation and all that stuff with features because when scikit-learn has all algorithms made for us how could we get the max power from them tuning hypersparams and so on. I think you get what I mean. This episode seems pretty easy so I want to see you really making art from it!! Greets from Russia

  • @danielmiranda7315
    @danielmiranda7315 5 лет назад +15

    Make sure you join the competition in Kaggle. Otherwise, kaggle download command will fail with forbidden error.

  • @pothny3371
    @pothny3371 5 лет назад +13

    Best takeaway of this video is "from IPython.display import RUclipsVideo"

  • @CodeEmporium
    @CodeEmporium 5 лет назад +2

    Dude you should make a video on Lingvo (a framework for sequence-sequence modeling) I'd love to hear your thoughts ~

  • @YeeYeez
    @YeeYeez 5 лет назад +2

    Glad you re-did this. More to the point and cleaned up. Keep it up! Papers with code - super cool!

  • @MrMatheus195
    @MrMatheus195 5 лет назад +2

    With this video we now know how nervous Siraj is when coding hahah

  • @temitopeoyeyemi899
    @temitopeoyeyemi899 5 лет назад +2

    what else can i say......YOU ARE GOOD BRO

  • @zamazalotta
    @zamazalotta 5 лет назад +1

    Don't know about earthquakes, but this guy's hand-waving (both literally and allegorically) must have triggered the tornado in Alabama.

  • @thomaswyrick4648
    @thomaswyrick4648 5 лет назад +1

    Siraj, this video was gold. Thanks for everything that you do.

  • @terrydouglasjayasuriya5162
    @terrydouglasjayasuriya5162 Год назад

    Thanks for the "tutorial" Siraj. Great video and has sure stimulated some new ideas on prediction modeling.

  • @Simba-qm5qs
    @Simba-qm5qs 5 лет назад

    Waouwh RUclips videos can be shown on Jupyter :O That is so cool :) Siraj you're just the best !

  • @haoyilu8380
    @haoyilu8380 5 лет назад

    Thank you, Siraj! I have two questions: 1. Does catboost perform better than LGB and XGBoost? 2. Could you please do a video talking about the whole process of feature engineer? I found feature is more important than models but I don’t know how to create fantastic features based on the original data.

  • @SUGATORAY
    @SUGATORAY 5 лет назад +2

    Kaggle API would work to fetch the list of competitions one has signed up for. So, every person who is trying to fetch a kaggle competition dataset to Colab, will first need to signup for the competition and then use the api.

    • @SUGATORAY
      @SUGATORAY 5 лет назад +1

      @Siraj: perhaps if you include this somewhere in the video, for someone new to kaggle competitions following your video will be easier. I had not signed up for the competition at first and tried fetching the list of datasets from kaggle (following your live stream) and got stuck there for a while as it kept on throwing error, until I did some digging and finally signed up for the competition. It is a small detail that could save one a lot of time and trouble.
      Thanks for the great videos. :)

  • @shalabhgreatest
    @shalabhgreatest 5 лет назад +1

    Hi Siraj,
    In step 4 you use chunksize=150_000 in read_csv. Can you explain how does 150_000 work?
    Thanks

  • @NeoCodingGod
    @NeoCodingGod 5 лет назад

    @Siraj What's super exciting is hearing what you have to say! Whenever I watch your videos I know I'll not get disponited. You should do more videos about you & what you're up to etc.

  • @kabijoshi5949
    @kabijoshi5949 Год назад

    What a day I found your video. I just didnt get the code to confirm your claims.

  • @quarkquark7882
    @quarkquark7882 2 года назад

    Zarah effect random is the best earthquake forecast in world it can predicted minutea before it happen in epicentrum.i challenge kaggle.

  • @user-ym5kh5pq2x
    @user-ym5kh5pq2x 2 года назад

    Earthquake prediction is possible .. time and magnitude .. with 1% error .. Prediction Formula is generated .. and passed testing and verification phase ..

  • @NetViewer-hw2tu
    @NetViewer-hw2tu 5 лет назад

    Understanding those faults movements will get to nothing because there is always new ones appearing...understanding the cause's of those movements will be something...😊😊

  • @debasishjana3125
    @debasishjana3125 5 лет назад

    I think you did a live session with this problem. Did you delete it? I couldn't find that on your channel. That was also a great video.

  • @felipeozoski
    @felipeozoski 5 лет назад

    Thank you so much Siraj!! You are the man! Cheers

  • @brianchaplin278
    @brianchaplin278 2 года назад

    Great job

  • @quocanhhbui8271
    @quocanhhbui8271 5 лет назад

    Pls pls keep up what you are doing with kaggle. You helped me a lot

  • @akanksharastogi8741
    @akanksharastogi8741 5 лет назад

    Would love to see a video by you on SLAM algorithms! Keep up the good work and keep inspiring!

  • @aidenstill7179
    @aidenstill7179 5 лет назад +1

    Do a lesson on creating your own deep learning framework. please or tell me where it can be studied

  • @AsifAhmed-bf5hg
    @AsifAhmed-bf5hg 5 лет назад +4

    Can you please make video on neural network application on neuroscience

    • @SirajRaval
      @SirajRaval  5 лет назад

      search siraj math of neuroscience

  • @kaushalmistry4007
    @kaushalmistry4007 5 лет назад +2

    Siraj make some videos on solving Numerai Challenge

  • @jithendrayenugula7137
    @jithendrayenugula7137 5 лет назад

    This is really a helpful video,siraj!!! Helped me a lot... Thanks a lot

  • @invisibleimpostor299
    @invisibleimpostor299 5 лет назад +1

    Yay!
    The older hand intro?????

  • @miguelflorian4022
    @miguelflorian4022 5 лет назад

    Wow man, you made my day!! Thanks a lot!

  • @DeependraTube
    @DeependraTube 5 лет назад +1

    Very nice !
    I want to ask if we can tweak this AI /ML Algo. to predict "Crimes Probability at any place" and "People's gathering at any event" ???

    • @nikelias1995
      @nikelias1995 5 лет назад +1

      There is already software which predicts crime locations and the probabilities for crime

  • @earthnote
    @earthnote 3 года назад

    can you make one for real earthquakes? just for fun, maybe use IRIS earthquake database

  • @k.alipardhan6957
    @k.alipardhan6957 5 лет назад

    I like these Kaggle videos, but it would be nice to see even more advanced videos

  • @abilashvr4802
    @abilashvr4802 5 лет назад

    Thanks for the all your Videos Siraj, its really awesome ! Could you please demonstrate how should we approach Multivariate Time Series Prediction?

  • @etienneekpo348
    @etienneekpo348 5 лет назад

    Amazing Video Siraj, thanks for sharing.

  • @perlindholm4129
    @perlindholm4129 5 лет назад

    Idea - Could a cause of a vulcano eruption be that not wanted material got under into the lava? Like CO2 for instance. So a eruption has some very important meaning and is not just a physical effect or random noise event. I assume the lava_model() probably iterates like a sun_model(). So faults in the iteration is not wanted and the lava_model() needs to get rid of them.
    So wonder if you can insert a removable problem in a lava_model() and let the model find it and then move it.
    // Per Lindholm

  • @shubham2450
    @shubham2450 5 лет назад

    Siraj great video man... really helpful

  • @fyaa23
    @fyaa23 5 лет назад

    Nice video! Shouldn't you split the training data again into two sets for parameter optimization?

  • @AbhishekKumar-mq1tt
    @AbhishekKumar-mq1tt 5 лет назад +1

    Thank u for this awesome video

  • @munawarhussain7878
    @munawarhussain7878 4 года назад

    I Learn a lot from you boss, learn a lot.

  • @coffeebuzz69
    @coffeebuzz69 5 лет назад

    Great video! Could you do a video about genetic algorithms for hyperparameter tuning? maybe compare to other methods like gridsearchcv.

  • @elisimic4371
    @elisimic4371 5 лет назад

    really high quality video Siraj.

  • @HarshitDaftary
    @HarshitDaftary 5 лет назад

    First time you made a proper video. This really helps to learn.

  • @Pa-ow1nj
    @Pa-ow1nj 5 лет назад

    thank you man great explanation !

  • @chournsolidet5406
    @chournsolidet5406 5 лет назад

    I am new to Machine Learning and Deep Learning. I have learned some basic and done some small experiments. Now the school task is to join the challenge of research in Kaggle. I have no idea how to start. Any suggestions? I do appreciate your response. Thanks in advance.

  • @bambangkurniawan7992
    @bambangkurniawan7992 5 лет назад

    Siraj, can you make tutorial about prediction keywords with high volume or will be booming in 1/2 months from google trends data using machine learning? it would be awesome. Thanks

  • @vamsikalyan8013
    @vamsikalyan8013 5 лет назад +1

    please do vedios on kaggle for bigginers,
    thank your

  • @aquibk
    @aquibk 5 лет назад

    Can you please make videos on probabilistic graphical models from very basics.

  • @qusha588
    @qusha588 5 лет назад

    Is there any chanels like Siraj's?

  • @_monnie_
    @_monnie_ 5 лет назад

    THANK YOU SO MUCH !!

  • @ali985411
    @ali985411 5 лет назад

    My Visualization graphs are different from yours while I'm implementing your code with same data, I'm a complete beginner (rather just started learning data science). Please help me in this regard.

  • @radwaneberredai6622
    @radwaneberredai6622 5 лет назад +1

    HEY SIRAJ WE need kaggle for beginner

  • @sampletextsampletext7051
    @sampletextsampletext7051 4 года назад

    can you explain how [24] works because it gives
    TypeError: string indices must be integers

  • @nitin-goyal
    @nitin-goyal 5 лет назад

    Next, VSB Power Line Fault Detection?

  • @etienneekpo348
    @etienneekpo348 5 лет назад

    Ahhaha, "Time to Rap" . Wanting on the2019 AI Rap ... : )

  • @deepaknenyana
    @deepaknenyana 5 лет назад

    thanks siraj

  • @junkbingo4482
    @junkbingo4482 2 года назад

    well it's like financial time series; when i was young i learned things, and i seriously doubt one can find out smtg useful.....

  • @deltatv9335
    @deltatv9335 5 лет назад

    Thanks buddy🕺

  • @asharkhan6714
    @asharkhan6714 5 лет назад

    Another Great Video!

  • @gauravchaudhari9279
    @gauravchaudhari9279 5 лет назад

    Awesome! :)

  • @samuelsama8780
    @samuelsama8780 5 лет назад

    WHAT IS A PERSON IN DATA SCIENCE CALLED A QUANT

  • @phillipmarnell9966
    @phillipmarnell9966 4 года назад

    You make this so clear! But... using Colab, I'm getting this outdated API message, and a '403 Forbidden' message Any ideas on fix/workaround?
    Saving kaggle.json to kaggle.json
    User uploaded file "kaggle.json" with length 68 bytes
    Warning: Looks like you're using an outdated API Version, please consider updating (server 1.5.6 / client 1.5.4)

  • @priyankachauhan8503
    @priyankachauhan8503 5 лет назад

    Make a video on twitter sentiment analysis in python with dataset

  • @siraj3434
    @siraj3434 5 лет назад

    the code in step 4 gives me an invalid syntax on the last line
    return pd.Series.(strain)
    What am I doing wrong?

  • @TaddeusBuica
    @TaddeusBuica 5 лет назад

    Hey, i'd like to learn more about stateful LSTMs

  • @EricHathaway
    @EricHathaway 5 лет назад

    awesome

  • @ravisinghpatel4263
    @ravisinghpatel4263 5 лет назад

    can this type of data be classified as segmented data?

  • @thankyouthankyou1172
    @thankyouthankyou1172 5 лет назад

    9:38 so, the data was downloaded to the 'cloud'? and we cannot see the data locally? how to check the data in the cloud? where is the data exactly?

  • @vidusha123
    @vidusha123 5 лет назад +1

    I’ve never seen him so uncomfortable lol.
    Great video nevertheless I learnt a lot. Thank you so much sir.

  • @proturguenev1820
    @proturguenev1820 5 лет назад +1

    estamoh en primer lugar somos el mejor pais de chile

  • @yimingzhao3081
    @yimingzhao3081 5 лет назад

    i just dont usually see you use deep learning/tensorflow to solve problems like this, why? is that not fit to solve kaggle cases?

  • @annkauffman8026
    @annkauffman8026 4 года назад

    IOError: Could not find kaggle.json. Make sure it's located in /root/.kaggle. Or use the environment method 😢
    neither when I typed by myself nor copied uploading kaggle.json and next steps from github :(
    btw Siraj has same error mv: target '/root/kaggle/kaggle.json' is not a directory however his code is worked.

    • @annkauffman8026
      @annkauffman8026 4 года назад

      yikes, I missed second '&&' before chmod >_

  • @bhuvaneshs.k638
    @bhuvaneshs.k638 5 лет назад

    Finally..!

  • @GauravGuptaa
    @GauravGuptaa 5 лет назад

    @Siraj help....
    When trying to download earthquake data, it shows forbidden 403 error in Google colab

    • @randykat
      @randykat 5 лет назад

      You need to join the competition on the kaggle website to have access to download the data. On the kaggle website do a search for LANL Earthquake Prediction then click the join the competition button.

    • @GauravGuptaa
      @GauravGuptaa 5 лет назад

      @@randykat thanks but i already did enrolled.
      anyways i figured it out :D
      i was running out of storage space :P

  • @張軒豪-z3r
    @張軒豪-z3r 5 лет назад

    Hi!~It's a great video. Help me a lot. I try to do the same step to learn,
    but i stuck in step 4. I try to debug but it's still error about " module 'numpy' has no attribute 'quantile' ".
    I already try install higher numpy. As numpy==1.15.1 and upgrade.
    But error still the same. I see you video has no this problem. why it happen?

    • @stickyas1
      @stickyas1 5 лет назад

      !pip install kaggle
      !pip install numpy==1.16.4
      !pip install catboost

  • @mrme8521
    @mrme8521 Год назад

    watching this makes realize how much of a rookie I am...

  • @jusgaming99
    @jusgaming99 5 лет назад

    i did not understand your command for uploading file

  • @nayeemsunny2573
    @nayeemsunny2573 5 лет назад

    hey bro i'm the first viewer woho!!!!!!!!!!

  • @vedantkulkarni1711
    @vedantkulkarni1711 5 лет назад

    I'm unable to import the LANL-Earthquake-Prediction dataset by following the code from the video. I get the error "403 - Forbidden". How to fix this?

    • @crjuncher1
      @crjuncher1 5 лет назад

      have same issue

    • @antoniamendo
      @antoniamendo 5 лет назад

      Did you enroll in the competition?
      Also there is a typo error, in the video he wrote ~/kaggle/kaggle.json, but file has to be placed in ~/.kaggle/kaggle.json.
      The github code is cleaner:
      github.com/llSourcell/Kaggle_Earthquake_challenge/blob/master/Earthquake_Challenge.ipynb

  • @tylerreichardt3270
    @tylerreichardt3270 4 года назад

    Can someone explaine why this error is occuring:
    Warning: Looks like you're using an outdated API Version, please consider updating (server 1.5.6 / client 1.5.4)
    403 - Forbidden
    TIA

    • @anitakhandelwal553
      @anitakhandelwal553 4 года назад

      This might have happened when you tried to use the kaggle API for downloading the data. You have to first accept the rules of the competition (on Kaggle) in order to use the API .

  • @martincastellano89
    @martincastellano89 5 лет назад

    hi there is a gpt2 video??? (by siraj)

    • @aiwebbiz8532
      @aiwebbiz8532 5 лет назад

      blog.openai.com/better-language-models/

  • @NarendraKumar-qt8sj
    @NarendraKumar-qt8sj 5 лет назад

    !kaggle competitions download -c LANL-Earthquake-Prediction
    I am getting 403 - Forbidden error while i am trying to download the dataset from kaggle

    • @SUGATORAY
      @SUGATORAY 5 лет назад

      Narendra: Go to Kaggle and sign up for the competition from the competition’s page. This needs to be done manually first, before kaggle let’s you access the competition’s data through the api. Good luck. 👍

    • @NarendraKumar-qt8sj
      @NarendraKumar-qt8sj 5 лет назад

      @@SUGATORAY Thank u ,it helped

    • @wasgeht2409
      @wasgeht2409 5 лет назад

      @@SUGATORAY I did it but it didn't work , get the same error

    • @SUGATORAY
      @SUGATORAY 5 лет назад

      @@wasgeht2409 Follow the instructions as in this Github gist: gist.github.com/sugatoray/9244867c89fd3868ec8e5a5ba7341901#file-kaggle_import_dataset_public-ipynb

  • @AnkitSingh-lb9fj
    @AnkitSingh-lb9fj 5 лет назад

    i am not able to download the dataset following error is shown
    Warning: Your Kaggle API key is readable by other users on this system! To fix this, you can run 'chmod 600 /root/.kaggle/kaggle.json'
    403 - Forbidden

    • @yashmjain95
      @yashmjain95 5 лет назад

      You have to join the kaggle competition first. This is why you are getting the forbidden error

  • @werthersoriginal
    @werthersoriginal 5 лет назад +6

    Just slap a GPS collar on to my ex wife. The closer she gets to your location, an earthquake is a comin. Bam, project done! No ML needed.

  • @陳中彥
    @陳中彥 4 года назад

    Hi Siraj, I doubt the pseudocode you shown in 22:13 was AdaBoost, rather than gradient boosting. Or if I misunderstood the concept?
    Check this link out www.quora.com/What-is-the-difference-between-gradient-boosting-and-adaboost
    Nice video by the way. Helps me a lot.

  • @pranjalpathak4498
    @pranjalpathak4498 5 лет назад

    He should have told that he is actually using google colab and not jupyter

    • @stickyas1
      @stickyas1 5 лет назад

      I was thinking he used Kaggle kernel. but he is taking about google colab around 5:10.

  • @AnkitSingh-lb9fj
    @AnkitSingh-lb9fj 5 лет назад

    just want to know whether is it legal to discuss an ongoing challenge??

  • @nayeemsunny2573
    @nayeemsunny2573 5 лет назад

    bro while i'm trying to download datset it's giving me "403 forbidden" help me bro

    • @randykat
      @randykat 5 лет назад +1

      You need to join the competition on the kaggle website to have access to download the data. On the kaggle website do a search for LANL Earthquake Prediction then click the join the competition button.

    • @nayeemsunny2573
      @nayeemsunny2573 5 лет назад

      @@randykat thanks

  • @adityabachhawat8727
    @adityabachhawat8727 5 лет назад

    why making a video twice?

  • @user-ym5kh5pq2x
    @user-ym5kh5pq2x 2 года назад

    Please provide your data .. and I will predict the next coming earthquake .. time-wise and magnitude ..

  • @antoniamendo
    @antoniamendo 5 лет назад

    Viva Chile weon

  • @dhruvrnaik
    @dhruvrnaik 5 лет назад

    ♥️♥️🔥

  • @rickyzhang
    @rickyzhang 5 лет назад

    No offense. I don’t see the value that you added descriptive statistics in one single value to all rows as features. You must do a prank or make yourself like a fool.

    • @antoniamendo
      @antoniamendo 5 лет назад

      does the magic. He iterates over chunks, not single rows. I was wondering the same.

  • @arashkhajooei5944
    @arashkhajooei5944 4 года назад +4

    Useless information that explained everything unclearly ! it shows that you just memorized it and didn't understand it from the depth. I'm just wasted my time watching this.

  • @xrystalskyes2838
    @xrystalskyes2838 5 лет назад

    Lol we live on a flat earth right

  • @EhsanIrshad
    @EhsanIrshad 5 лет назад

    first

  • @MichaelMantion
    @MichaelMantion 5 лет назад

    how does a womens vajaja muscle predict earthquakes?