Python Data Science Tutorial: Analyzing the 2019 Stack Overflow Developer Survey

Поделиться
HTML-код
  • Опубликовано: 19 ноя 2024

Комментарии • 145

  • @coreyms
    @coreyms  5 лет назад +55

    I hope everyone finds this useful. Going out and practicing with real-world data like this is one of the best (if not the best) way to practice your skills. If any of you do your own analysis that's interesting then be sure to share it with us in the comment section below!
    PS I forgot to mention that as usual, the code for this video can be found in the description section.

    • @shpazhist
      @shpazhist 5 лет назад +4

      Corey, can you please do a video where you explain how to get to that level of intermediate-advanced developer where one can play with code really-really easily? In essence, say if one wants to learn about new library or framework or a new challenge - - can you please explain your process how do you go about actually learning that field rather than doing videos on standalone topics as IMO that would really help a lot of people to sorta move to the next level a lot. Thx

    • @bodekolton5961
      @bodekolton5961 3 года назад

      I know it's quite off topic but do anybody know a good website to stream new tv shows online?

    • @troylorenzo6430
      @troylorenzo6430 3 года назад

      @Bode Kolton Lately I have been using Flixzone. Just google for it :)

    • @maddoxronin9643
      @maddoxronin9643 3 года назад

      @Troy Lorenzo yea, have been watching on Flixzone for since april myself =)

    • @bodekolton5961
      @bodekolton5961 3 года назад

      @Troy Lorenzo thank you, I went there and it seems like a nice service :D I really appreciate it!

  • @mrityunjayahiremath3951
    @mrityunjayahiremath3951 5 лет назад +70

    Python + Data Science + Corey = ❤️

  • @Ghasakable
    @Ghasakable 5 лет назад +19

    Man, you are awesome, you are the reason that I open RUclips, please if you come to Japan one day, I will do my best to show you good places here in Tokyo.

    • @coreyms
      @coreyms  5 лет назад +4

      That'd be awesome!

  • @Vasko10100
    @Vasko10100 5 лет назад +3

    Finally, i understand analyzing Data in Python! I went from Coursera via EDX, UDEMY etc. But this.... this is a pure gold sir! Please, do some course or learning path for Data Science with Machine Learning in Python in depth( let it be if needed a paid learning path). You are one of the best! If you somehow manage to come in Macedonia, tell me, i want to bow before you!

    • @coreyms
      @coreyms  5 лет назад +3

      Thanks! Im still working on a Pandas series. 4 videos are complete but I'm working to finish the rest soon. Glad to have you as a subscriber and member. You all make it possible for me to release these for free. Thanks again!
      media.giphy.com/media/WvUyMX1gB1gnC/giphy.gif

  • @afternoontiger9319
    @afternoontiger9319 5 лет назад +14

    I'm definitely interested in a future tutorial of using pandas with this data! Otherwise a fantastic job

  • @AvantjerTube
    @AvantjerTube 5 лет назад +5

    Thanks Corey!
    I’ve been following your excellent tutorials for quite awhile.
    Just wanted to say that I’m excited and looking forward to your Pandas course(s).

  • @michaelworkspace
    @michaelworkspace 5 лет назад +25

    In case anybody ran into an error like "UnicodeDecodeError: 'charmap' codec can't decode byte 0x81" while reading the csv file like I had, the fix is to add a encoding='utf-8' parameter to the with open() function. e.g. with open('survey_results_public.csv', encoding='utf-8') as f:

    • @Mayur7Garg
      @Mayur7Garg 5 лет назад +4

      Thx man! You are a savior.

    • @shivstn98
      @shivstn98 4 года назад +1

      It can be also fixed by using raw string I.e. use r before first quote
      Eg. r"hello world"

    • @jemand4036
      @jemand4036 4 года назад +1

      thank u so much, i had no idea what was going on!

    • @irfandy4hmad
      @irfandy4hmad 4 года назад +1

      Thanks bro. U saved our time

    • @michaelworkspace
      @michaelworkspace 4 года назад

      Irfandy Ahmad No problem bro. Have a good one!

  • @mancalos
    @mancalos 3 года назад

    Finally a project not using either the Titanic or Iris datasets. Corey, you are a gem. Thanks for all the effort you put into this and every other tutorial.

  • @sanukurien2752
    @sanukurien2752 5 лет назад +1

    @Corey, I love watching your videos when I have my morning coffee. It's elegant and super informative. Thank you!

  • @MAli-os2yj
    @MAli-os2yj 5 лет назад +3

    Hi Corey! Love your videos man! I have finished your Flask series and currently doing the Django series. I have a few suggestions for future videos around the Blog Website that involve some ML and Data Science:
    1. Introduce 'like and comment' functionality to the application which allows a user to comment and like posts by other users.
    2. Then, integrate a recommendation system engine to the Blog application, the system will recommend content to the individual user based on his own posts and the posts that he has liked.
    3. Also, a user can look into the profile of other users and see a word cloud of all posts that the other has submitted, this way they can get a rough idea of what the other uses posts.
    Keep Doing the good work man!

  • @Strider1066
    @Strider1066 4 года назад

    I liked the incremental approach of coding by examples .

  • @muralitechtimes6487
    @muralitechtimes6487 Год назад

    You are true MASTER of coding. We are now a days not finding your other videos. Do it for us which re very help us to grow.

  • @anaesthesiadolorosa7100
    @anaesthesiadolorosa7100 5 лет назад +1

    It's awesome that you've started making data science videos. Please keep them coming! ;)

  • @jayjay6804
    @jayjay6804 5 лет назад +1

    As hobbiest my first inclination would be to extract the data using pivot tables in libreoffice. Thank you for demonstrating the use of python for this purpose.

  • @kizzavincent
    @kizzavincent 5 лет назад +1

    Very nice to have you back. Many thanks.

  • @mylesmontclair
    @mylesmontclair 5 лет назад +1

    Great to have you back producing great content for us. Thanks, Corey!

  • @GuilhermePalazzoRodrigues
    @GuilhermePalazzoRodrigues 5 лет назад +9

    That was incredible, really easy to follow and replicate, thank you very much.

  • @yosefco3
    @yosefco3 5 лет назад +2

    Hi great Corey ! Thank you very much on your channel , your flask tutorial is the best i ever seen ! Please could you do a flask-api tutorial too ?

  • @Sauhros
    @Sauhros 5 лет назад

    Just what I needed to start off my winter break. I can't wait for your pandas series! Keep up the great work Corey.

  • @yomajo
    @yomajo 4 года назад +1

    It seems like Counter() could be a topic of its own. I need to dig more into that. Awesome code-along experience!

  • @Jo-id9zm
    @Jo-id9zm 5 лет назад +6

    You're doing God's work, Corey!

  • @BoukariYameogo
    @BoukariYameogo 5 лет назад +2

    This is what I was waiting for! Thank you!

  • @ajb4964
    @ajb4964 5 лет назад +7

    Wow, please do more like this.

  • @dadoll1660
    @dadoll1660 5 лет назад +1

    Wow, this is Amazing! Please make more videos for this DS Tutorial series. Thank you Corey...

  • @akhilrapalli4118
    @akhilrapalli4118 5 лет назад +1

    wow what a video from an expert on python like u verymuch corey

  • @rangavembar
    @rangavembar 5 лет назад +1

    Brilliant as always!! Looking forward to the Pandas series!!

  • @백영래-u3x
    @백영래-u3x 5 лет назад +1

    Thank you for Python Data Science Tutorial! Keep it up, please!

  • @philinnc
    @philinnc 5 лет назад

    Looking forward to seeing a future video where pandas is used to analyze this same data. Really appreciated your stepping through the thought process, something I haven't seen in many other data science tutorials. Approaches to data cleaning is another interesting topic that would be nice to see covered in a future video. Recently there were these FEC reports... :-)

  • @bishantadhikari3440
    @bishantadhikari3440 5 лет назад

    Your videos are great and is probably the most easily understandable in all of youtube.
    Also, can you make videos about implementing AI in python cause there are not any good ones that meet your standard

  • @KevinOMalleyisonlysmallreally
    @KevinOMalleyisonlysmallreally 5 лет назад

    You produce some of the best tutorials on python. Itd really help if you did one on advancing jupyter notebook use, and seeing up sublime to work with pandas.

  • @DistortedV12
    @DistortedV12 5 лет назад +1

    Wow Corey you do it all; thanks for the helpful tutorials!

  • @theDrewDag
    @theDrewDag 5 лет назад

    Please make this the first of a Data Science playlist. PLEASE.

  • @_boris
    @_boris 5 лет назад +1

    Your videos are so brilliant as I want to keep them to myself only... 💎
    ...however, way too good not to share them with others. Shared with my classmates - hopefully, they find it useful :)

  • @Lfppfs
    @Lfppfs 4 года назад

    Awesome tutorial, thanks Corey!

  • @ZaidShaikh_619
    @ZaidShaikh_619 5 лет назад +1

    Data Science + Corey Sir = YAY!!!

  • @hserdarkuyuk
    @hserdarkuyuk 5 лет назад +1

    Please do similar live coding, these are great to improve our perspective

  • @HunniiChii
    @HunniiChii 5 лет назад +1

    Hope to see more of these videos from you in the future!

  • @ramil17998
    @ramil17998 5 лет назад +1

    Thanks a lot for the video! I love it cause its as real as it can get!

  • @amnah.-.4708
    @amnah.-.4708 5 лет назад

    Hi Corey, thank you for the content you put out, it is beyond exceptional. Also, it would be great if you could make an in-depth course on SpaCy, and its application in Data Science.

  • @borgestheborg
    @borgestheborg 5 лет назад +2

    Oh baby! Never clicked on a video so fast in my life! :D

  • @arnopisspot5115
    @arnopisspot5115 5 лет назад

    Looking forward the panda series boss. Thank you for all your hard work

  • @SagarSapkota-eo4ei
    @SagarSapkota-eo4ei 5 лет назад +1

    Hi Corey great and amazing content ..thanks a lot buddy

  • @gaunterodimm3569
    @gaunterodimm3569 5 лет назад +1

    Great video! This video helped me soooooooo much!

  • @DistortedV12
    @DistortedV12 5 лет назад +2

    From this, looks like you can't go wrong with javascript, python, SQL and a C type language. Wonder how long these will dominate.

  • @rosyndapee8129
    @rosyndapee8129 5 лет назад

    Nice! What about a future video with some visualization of geospatial data? I.e. using some GPS tracks converted into CSV file with all points lat and long. Would you suggest to use postrgres or postgis to then make queries to the geodata?

  • @alexanderten5497
    @alexanderten5497 5 лет назад +1

    Notification from Corey and Iam here to like the video

  • @nikhillingam4630
    @nikhillingam4630 5 лет назад +1

    absolutely brilliant work well done

  • @aydinjalilov2328
    @aydinjalilov2328 4 года назад

    it blew my mind.

  • @gtosXD
    @gtosXD 5 лет назад +1

    Thank you!!!! Really great video!

  • @sandeepgaur2407
    @sandeepgaur2407 5 лет назад

    Awesome tutorial like always. Please make video on asyncio in python.. please please.

  • @shivamkumraa
    @shivamkumraa 5 лет назад +8

    Who else liked before watching this. Just 1000% sure this guy can't make any shit.

    • @Jmzz542
      @Jmzz542 4 года назад +1

      what makes you say that?

  • @rangabharath4253
    @rangabharath4253 5 лет назад +1

    Awesome as always

  • @javierrivasseau8956
    @javierrivasseau8956 Год назад

    Hi Corey, how are you? i have been watching your series and videos, and think they are the very best out there, i really like to learn with you, I would like to ask you some things if you could solve them, when you make the matplotlib series, you separate the convertedcomp for all the devs and for the python dev and javascript dev, can you upload a video to solve it? I tried anyway and could not. ?

  • @Medhusalem
    @Medhusalem 4 года назад

    Hey Corey, I have seen the previous videos on DataFrames with Pandas where you used Jupyter as well. You mentioned in this video that it is nice that you can run code brackets or just certain lines of code.
    My question would be then, what is the difference between Jupyter and Spyder? Since Spyder can also execute certain code lines, without loading everything in again and has a super awesome functionality where you have a variable manager in which you can view every variable you declared.

  • @md.akib5124
    @md.akib5124 5 лет назад +1

    heeiiii corey. awesome to see another amazing tut from you

  • @skandrskon7091
    @skandrskon7091 4 года назад

    for line in csv_reader :
    counts[line['hobbyist'] ] += 1
    really great idea for increment on counts dict
    thanks

  • @arkhamian7586
    @arkhamian7586 5 лет назад +1

    Don't stop doing this.

  • @daltonmarhufu9549
    @daltonmarhufu9549 5 лет назад

    Great vid! Really helpful, could you possibly do a Pandas version of this?

  • @Shubh_chintak_for_you
    @Shubh_chintak_for_you 5 лет назад

    Hey @corey please make videos for Airflow DAG's also. They are basically python codes so it would fit nicely within your python series

  • @jackleone4150
    @jackleone4150 5 лет назад +2

    If anyone else is facing the UnicodeDecodeError try adding this second argument to the open function =>
    with open('data/survey_results_public.csv', encoding="utf8")

    • @sylvanmasha
      @sylvanmasha 5 лет назад

      wow, thank you so so much. i was facing the same problem

  • @kingslypaul2999
    @kingslypaul2999 5 лет назад

    Can u make a video on pyttxs3 ...or something better .....thanks for all your God skilled contents that makes ppl understand anything u teach

  • @syedabdulsalam4659
    @syedabdulsalam4659 5 лет назад +2

    Finally you are back :p

  • @736939
    @736939 5 лет назад

    Thank you Corey! Continue please: ScikitLearn, TensorFlow

  • @ЛеонидКлыков
    @ЛеонидКлыков 5 лет назад

    Hello, Corey! Please append your matplotlib tutorial with a lesson about Legend Picking

  • @mitos95176
    @mitos95176 5 лет назад +1

    Now I want to perform this task using pandas.

  • @BrendanMetcalfe
    @BrendanMetcalfe 5 лет назад +1

    nice vid! thanks

  • @graybits4111
    @graybits4111 5 лет назад

    thanks for this great video
    the total % for each dev type should be 100%, which is not the case

    • @coreyms
      @coreyms  5 лет назад

      Not necessarily. They can choose multiple languages. So for example, for front-end developers, most said they know JavaScript, but they also knew other languages as well. So each language is a percentage of how many developers said they know a particular language. Hope that clears it up

    • @graybits4111
      @graybits4111 5 лет назад

      @@coreyms it is clear now, thanks

  • @rahulsharma-st5yg
    @rahulsharma-st5yg 5 лет назад +1

    @corey: when to expect some data science real life projects?

  • @sprk2009
    @sprk2009 5 лет назад

    Your videos are awesome. I am new to Python and preparing for Python certification. Can you please let me know which one of the following programs are good to proceed? Thanks in advance.
    1. PCAP: Programming Essentials in Python from Netacad
    2. Microsoft 98-381 Introduction to Programming Using Python

  • @aryanvikash1967
    @aryanvikash1967 5 лет назад

    Corey please create a tutorial on python coroutine.. async await its so confusing .. please

  • @morganfree100
    @morganfree100 5 лет назад +1

    More please sir..(:-

  • @nowyouknow2249
    @nowyouknow2249 5 лет назад +1

    Nailed it

  • @anandinamdar4054
    @anandinamdar4054 5 лет назад +1

    Legend 🙌🙌

  • @ebrahimm.roshdy4487
    @ebrahimm.roshdy4487 5 лет назад

    Please do it using pandas since it's way easier and so common for data scientists

  • @rolandgerard6064
    @rolandgerard6064 5 лет назад +1

    man, you are good 👍

  • @johnnya590
    @johnnya590 5 лет назад +1

    I don't know if you realize this, but at the start of EVERY video you say "Hey there how's it going everybody?" :)

  • @ashissahoo4158
    @ashissahoo4158 3 года назад

    Is the pandas version of the last question has been uploaded?

  • @pingojr4048
    @pingojr4048 5 лет назад

    Can you do something for home bill management?

  • @Medhusalem
    @Medhusalem 4 года назад +1

    It didn't work somehow and I had to do a google search on my error: 'charmap' codec can't decode byte 0x81 in position 5739.
    So I had to change the with open part and specify the encoding part, after that everything worked fine.
    *with open('data/survey_results_public.csv', encoding='utf-8') as f:*

  • @opst1704
    @opst1704 5 лет назад

    If one day I decide to learn Python, then do I waste my time by learning C language now?
    I am learning C, primarily because I want to understand how memory works and DSA is easier to understand with C. Will this knowledge be helpful if I decide to learn Python? Seems like C and Python have a lot of differences, in Python there are so many things that I don't need to worry about.
    One more question, is it difficult to find out what is happening behind scenes in Python, for example: implementation of input() function?
    Great videos, the main reason why I am considering to learn Python.

  • @chinmayabarik557
    @chinmayabarik557 5 лет назад

    Please do a pandas tutorial with this survey results sir..

  • @darrenlefcoe
    @darrenlefcoe 5 лет назад

    On inspection of the data. The csv.DictReader(f) throws 594 errors in the for loop >> line in csvReader:
    Not sure what causes the error, but this exists.
    Needs to be handled with
    try:
    except UnicodeDecodeError:
    who else sees this ?

  • @yuliu1105
    @yuliu1105 5 лет назад

    Please update pandas version of the 2nd half of the problem, my forloops don't work, bummer. Thank you

  • @amankothiyal7074
    @amankothiyal7074 4 года назад

    Hey can you analyse it using numpy library

  • @darrenlefcoe
    @darrenlefcoe 5 лет назад

    for line in csvReader: failed on line 16.
    UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 5739: character maps to
    problem with the data-set or other reason ?

    • @sylvanmasha
      @sylvanmasha 5 лет назад +1

      f anyone else is facing the UnicodeDecodeError try adding this second argument to the open function =>
      with open('data/survey_results_public.csv', encoding="utf8")
      this is a comment i found by Jack Leone. you're welcome

  • @hemanth6951
    @hemanth6951 5 лет назад +1

    i saw you in another RUclips channels

    • @coreyms
      @coreyms  5 лет назад +1

      Which one was that?

    • @hemanth6951
      @hemanth6951 5 лет назад +1

      @@coreyms some thing pyconf channel with some one interviewing with you last year

    • @coreyms
      @coreyms  5 лет назад +1

      @@hemanth6951 Oh yeah! I think that was Dan Bader. He's a great guy.

    • @hemanth6951
      @hemanth6951 5 лет назад +2

      @@coreyms yes yes yes finally I saw you and this second time I'm chatting with you I'm so happy and luky 😊😀😀

  • @marinbeslo7841
    @marinbeslo7841 4 года назад

    Is the video with pandas-way out? :)

  • @natesh1
    @natesh1 5 лет назад

    Please do a pandas tutorial series with this survey results too.

    • @bluekeybo
      @bluekeybo 5 лет назад

      You didn't watch the video till the end did you

  • @christiaan3315
    @christiaan3315 4 года назад

    Sorry. I am getting error messages... 'for line in csv.reader: row = next(self.reader) etc

  • @iangitonga2811
    @iangitonga2811 5 лет назад +2

    Great content.

  • @barungh
    @barungh 5 лет назад +1

    Badass !👍👌

  • @pavanmandikal
    @pavanmandikal 5 лет назад +1

    Please do make a video on numpy also

  • @boolloop
    @boolloop 5 лет назад

    I love your videos, but I wondered why you didn't use pandas for this?

    • @coreyms
      @coreyms  5 лет назад +3

      I mentioned the reason in the video. I wanted to show how to do this without Pandas first, and once I release my Pandas series then I will redo this video using Pandas as well. That way people who are unfamiliar with Pandas can do this as well

    • @boolloop
      @boolloop 5 лет назад

      @@coreyms neat, looking forward to it!

  • @strange_man
    @strange_man 5 лет назад +1

    The video that I need :D

  • @adildeshmukh4140
    @adildeshmukh4140 5 лет назад +1

    yes, please pandas

  • @gnuter3000
    @gnuter3000 5 лет назад +1

    greatgreatgreat

  • @samishabli5191
    @samishabli5191 5 лет назад

    i got this error: Yes
    Traceback (most recent call last):
    and it just shows me the first yes and no
    and in the end it says: return codecs.charmap_decode(input,self.errors,decoding_table)[0]
    UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 5739: character maps to

    • @atishayshukla1117
      @atishayshukla1117 5 лет назад +2

      Its an encoding issue with the file, open a file by specifying proper encoding something like: with open('data/survey_results_public.csv', encoding='utf8')

    • @ThePasupathi
      @ThePasupathi 5 лет назад

      @@atishayshukla1117 thank you..it works

  • @Ajithkumarsekar96
    @Ajithkumarsekar96 5 лет назад

    Almost everyone has used Javascript and html/css? From front-end developer to system-administrator

  • @shaguftakhan302
    @shaguftakhan302 3 года назад

    I clicked on this video just to see what data scientists do.

  • @olicairns8971
    @olicairns8971 5 лет назад

    This is much easier if you use pandas

    • @coreyms
      @coreyms  5 лет назад

      Agreed. I mentioned this in the video, but I wanted to show how to do this without Pandas first, and once I release my Pandas series then I will redo this video using Pandas as well. That way people who are unfamiliar with Pandas can do this as well