How to Create a Dataset for Machine Learning |

Поделиться
HTML-код
  • Опубликовано: 21 ноя 2024

Комментарии • 51

  • @4year4thyear
    @4year4thyear 3 года назад +9

    My professor recently voluntold me to build an AI model for data in our research lab. Thank you for this lifeline!

  • @BenjaSerra
    @BenjaSerra 4 года назад +19

    Nowdays every tutorial just talk about how to create deep learning models, but almost none talks about how to prepare your own dataset and how important is data in machine learning.
    Subscribed += 1

    • @BenjaSerra
      @BenjaSerra 4 года назад +3

      If is posible for you, would be great if you make a video about how to prepare your own datasets with sample code, depending in which case are you working (text clasification, images, etc...), as I said, there's almost no video about it. Thank you and greetings from Chile!

    • @sushantrauthan5704
      @sushantrauthan5704 4 года назад +3

      Couldn't agree more

  • @TheAmit4sun
    @TheAmit4sun Год назад +1

    I am sold. Just subscribed you. This is such an important topic but there is hardly any good videos on it. God bless

  • @Darksagan
    @Darksagan Год назад +1

    I have no idea why I searched this topic. But you did a great job explaining it. lol

  • @MyGraden
    @MyGraden Год назад +1

    can you send or show an example of a clean ready for machine learning dataset?

  • @lakeguy65616
    @lakeguy65616 4 года назад +17

    This is a great and important topic I hope you'll spend more time talking about it, particularly numerical data sets. Cleaning, accounting for missing values, scaling, normalizing, etc...

    • @JordanHarrod
      @JordanHarrod  4 года назад +2

      Definitely planning to go deeper into this on future AI 101 videos!

    • @farahwael8806
      @farahwael8806 2 года назад

      @@JordanHarrod Hello Jordan
      I want to study at Harvard university. How I can do this! Please answer me 🌸🌸🌸

  • @codebits4461
    @codebits4461 3 года назад +2

    I like your hair, and great video. The content is exactly what I was looking for and more😁

  • @rnsfebay1
    @rnsfebay1 Год назад +2

    Thanks!

  • @anandsheth5490
    @anandsheth5490 2 года назад +1

    Jordan - what would you say is the minimum quantity of data required? for example, for a specialized NLP. 20,000 lines of text? 50K?

  • @amiryavariabdi8962
    @amiryavariabdi8962 3 года назад +3

    Dear the artificial intelligence community
    I am pleased to introduce DIDA dataset, which is the largest handwritten digit dataset. I will be grateful, if you could help me to introduce this dataset to the community.
    Thanks

  • @parietal100
    @parietal100 Год назад +1

    Great overview which I have shared with my team. Thanks.

  • @opejohn1116
    @opejohn1116 4 года назад +1

    Hello Jordan. Thanks for this Video. I want to ask how you feel about using permutation and combinations to generate more rows for a dataset.

  • @alexplus3
    @alexplus3 4 года назад +7

    Jordan your great with AI topics❤🙏

  • @aguslimanto6766
    @aguslimanto6766 4 года назад +1

    Hi, how do I create a chemical structure dataset to predict the QSAR? I'm looking for a simple tutorial for my future Ph.D.
    study..thank you

  • @marcoprimo4042
    @marcoprimo4042 4 года назад +1

    How do I practically create create a dataset, meaning what program or software do I use.

    • @JordanHarrod
      @JordanHarrod  4 года назад +1

      There are a few options - if I'm using a public, well-known dataset, you can usually load it from an existing machine learning API (ex. TensorFlow, Keras, PyTorch, fast.ai) using one of their predefined methods. If you're using something custom, things become a lot more complicated, and it depends a lot on what data you're loading, the format it is in, and how much pre-processing you need to do to get it to the point where you can use it as training data.

  • @BuddhiKavindra
    @BuddhiKavindra 2 года назад +1

    Thanks! This is so clear explanation.

  • @slametwidodo727
    @slametwidodo727 4 года назад

    Data set is very important before we process data into scientific data

  • @LofiMoodCrafts
    @LofiMoodCrafts 2 года назад

    Hey Jordan where can I get NETWORK INTRUSION DETECTION SYSTEM DATASET?

  • @adamderose9468
    @adamderose9468 2 года назад

    i may have missed it but reCAPTCHA is a cool ex of crowd sourcing labeling

  • @HolyFacts
    @HolyFacts Год назад

    Thank you and Subscribed !

  • @_abhishek_08_
    @_abhishek_08_ Год назад

    hey how can I reach out to you ? incase of any doubts? I am a engineering student and for my final project I need to build a project which needs data set and I have no clue where to start

  • @anthonyjoshua7148
    @anthonyjoshua7148 3 года назад

    How can I develop dataset of student result to predict their project topic

  • @0xredpill
    @0xredpill 4 года назад

    thanks for sharing such knowledge in 20AI

  • @muneebabbas2424
    @muneebabbas2424 2 года назад

    Can I have a dataset of textfiles (.txt) for training and testing is viable for machine learning algorithms generally? I'm very new to this area and want to know before I start trying to create machine learning algorithms

  • @semilshah8252
    @semilshah8252 3 года назад

    I am working on a project Sentimental Analysis Based On Social Media Post and i need to create dataset in which data is extracted from instagram,facebook,twitter. Please describe a step by step process how should i go for it?

    • @olaniyiajayi4319
      @olaniyiajayi4319 3 года назад

      You should go with scrapping those social media you mentioned. There are libraries out there that can help you scrape easily from each site.

    • @hocineb8483
      @hocineb8483 Год назад

      Bro thought Jordan is ChatGPT or smth

  • @DeepFrydTurd
    @DeepFrydTurd 2 года назад

    Ai is like a baby and we are the parents. its in its infancy and will develop into a self aware adult

  • @benvolioombese9109
    @benvolioombese9109 4 года назад

    I've really liked this !!
    How can I learn this? Please

  • @abdelbassethechaichi4422
    @abdelbassethechaichi4422 3 года назад

    That was really helpful, thank you very much

  • @viniciusoliveira4798
    @viniciusoliveira4798 4 года назад +1

    this is so helpful

  • @vauths8204
    @vauths8204 Год назад

    im ready to simp. this was a great video and really helps me with my goal of ruling the world thank you future queen

  • @surendranathreddy7114
    @surendranathreddy7114 3 года назад

    Amazing! Thank you!

  • @md.masudurrahman5852
    @md.masudurrahman5852 10 месяцев назад

    hi ,can you please show ai dataset in live,,,?

  • @xabisontloko3574
    @xabisontloko3574 4 года назад

    Nice, thank you for this great video.

  • @njmagay0223
    @njmagay0223 3 года назад

    How to make dataset in Facebook?🥺

  • @shashwatvaibhav2769
    @shashwatvaibhav2769 4 года назад

    So, you hardly blink...right??

  • @auguststas7770
    @auguststas7770 Год назад

    cool

  • @haz1615
    @haz1615 11 месяцев назад

    i thought its a practical vid but its just talking video

  • @chris12081989
    @chris12081989 29 дней назад

    useless?

  • @BERNARD7269
    @BERNARD7269 4 года назад

    Who taught her to speak like that

  • @codebits4461
    @codebits4461 3 года назад +2

    I like your hair, and great video. The content is exactly what I was looking for and more😁