Data Analyst Portfolio Project | SQL Data Exploration | Project 1/4

Поделиться
HTML-код
  • Опубликовано: 24 сен 2024

Комментарии • 4,1 тыс.

  • @davidng9403
    @davidng9403 2 года назад +1774

    I'm doing this in Oct 2021 and apparently one of the code chunks where you need to convert new_vaccinations column to integer, the sum value now has exceeded 2,147,483,647. So instead of converting it to "int", you will need to convert to "bigint". Hope this helps everyone.

    • @martamonteverde6560
      @martamonteverde6560 2 года назад +26

      Totally helped! Thank youu!

    • @tayobello1935
      @tayobello1935 2 года назад +9

      Thank you , that was a real big help for me.

    • @TayuanRebelo
      @TayuanRebelo 2 года назад +17

      You totally saved my life! I was almost doing crazy!

    • @keipers
      @keipers 2 года назад +10

      Thanks, I can stop banging my head.

    • @DayFlix_n_Chill
      @DayFlix_n_Chill 2 года назад +2

      Totally helped! Thank you!

  • @amystucker4282
    @amystucker4282 3 года назад +321

    I'm transitioning from teaching to data analytics. I recently finished the Google Data Analytics Certification, and I've been struggling to figure out how to get my portfolio started. I found this video, and I got really excited to use this as a guided project. Then, I saw Alex was using Covid data, and I had to pause to consider if I could handle it. I lost my dad to Covid 4 months ago. This is personal.
    However, I think I need this now as another way to process through the continuing grief. The fact is, data is personal. Each of these data points describes people. Someone's father, mother, sister, brother, grandparent, friend, and so on. This data is reality, and it's so important to analyze it in order to truly show the impact that this pandemic has made. Especially, as things may be getting back to "normal" in some places around the world, this data proves that "normal" is not the same without the lives of so many. And maybe, just maybe, analysis and visualization of this dataset can encourage others to get the vaccine, and we can be more informed moving forward.

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 года назад +20

      Thank you for sharing Amy and I’m sorry for your loss. It definitely does give you some interesting insights into the real data rather than just seeing what is on news websites.

    • @esperanzafeliciayakubu7794
      @esperanzafeliciayakubu7794 3 года назад +9

      So sorry for your loss. In Ghana we tell people to adhere to the protocols if they don't want to be used as data.

    • @garry6882
      @garry6882 3 года назад +3

      so did I , google data anlaytics certificate was too basic and so many theory!

    • @yashlokhande2859
      @yashlokhande2859 3 года назад

      @@garry6882 Yeah true

    • @yashlokhande2859
      @yashlokhande2859 3 года назад +6

      I lost my grandma, the closest person to my heart in this whole world. I know what the loss is, the dead may be just numbers for someone but only the people who lost someone close, knows the pain.

  • @veronicab2096
    @veronicab2096 3 года назад +577

    I’ve just started the video and I’m about 9 minutes in. I wanted to stop at this point to tell you I’m glad you showed your mistake live. It’s important for beginners to see that experts also make mistakes. It is encouraging. As a bonus, I will better remember to save as xlsx because you pointed it out. So thank you for leaving that in.

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 года назад +95

      I make a ton of mistakes! haha the cuts that I took out were 5+ minutes of me having to research something or figure something out and then coming back. Didn't want to keep that boring stuff in lol

    • @Major_Data
      @Major_Data 3 года назад +29

      Agreed. The realness is encouraging for us mere mortals (and the mistakes help give me time to catch up to Alex...)

    • @veronicab2096
      @veronicab2096 3 года назад +3

      @@Major_Data well let’s not get carried away here. This is THE Alex the Analyst we are talking about! 🤣

    • @Major_Data
      @Major_Data 3 года назад +8

      @@veronicab2096 oh, I didn't mean catch up to him on skills. I just meant keep up with him on the video. 🤣

    • @ivanott7196
      @ivanott7196 3 года назад +3

      Years ago I said no to programming because I always find myself making mistakes (I'm a GIS Engineer). Then came my friend, a software developer also make tons of mistakes. I questioned him because he has a degree in software engineering with years of experience, then he replied "I'm a human, not a computer. Even human to human also sometimes struggle to communicate" haha

  • @bwhle
    @bwhle 2 года назад +346

    This was a lifesaver when I needed a portfolio quickly for a job interview while still being a beginner.
    It was easy to follow and I could base my own projects on it. This definitely was part of me getting the job!

    • @user-gy4ck9ci8x
      @user-gy4ck9ci8x 2 года назад +2

      How did the J interview go

    • @agatanakoncuswiata
      @agatanakoncuswiata 2 года назад +2

      I wish I could give a "bravo" reaction to this comment! Congrats! Hope to hop on your boat soon enough and happy to have found this video!

    • @nikhilbhute07
      @nikhilbhute07 2 года назад +4

      how did you mention this project in your portfolio? like summary of this project. please help me with this.

    • @omotayosheyi2175
      @omotayosheyi2175 Год назад

      Please where did you host your portfolio?

    • @omotayosheyi2175
      @omotayosheyi2175 Год назад +1

      i mean how do you paste projects on your github if that's where we'll be hosting all our projects

  • @kacieheath2120
    @kacieheath2120 4 месяца назад +66

    I'm doing this in May 2024 and had trouble with inserting data. Here is the solution that worked for me:
    Select DB in SSMS > Tasks > Import Data > Next > Data source: Flat file source > Select CSV file (make sure you have saved the excel file as a CSV) > > Next > Destination: Microsoft OLE DB Driver for SQL Server > Click on Properties and enter server name, change to Windows Authentication, Select your Database name and test connection > Next > Next > Finish

    • @Saleeboy10
      @Saleeboy10 4 месяца назад +2

      Thank you soo much😭😭😭❤️

    • @transplantman2287
      @transplantman2287 4 месяца назад +1

      I've been googling my way through errors for TWO MONTHS, thank you so much for this.

    • @kacieheath2120
      @kacieheath2120 4 месяца назад

      @@transplantman2287 I hear you! I looked into the 2016 redistribution download and everything but nothing worked other than this. Hoping I can figure it out soon but this way for sure worked and got all the data inserted

    • @pranavbhawane7591
      @pranavbhawane7591 3 месяца назад +5

      How to find a server name

    • @kacieheath2120
      @kacieheath2120 3 месяца назад

      @@pranavbhawane7591 When you first open the SQL Server studio, you will get the pop-up to connect. In this pop-up, you'll see the server type, server name, authentication... The server name is listed here. I would suggest copy and paste this before connecting to your server that way you already have it when you type in the name for the database

  • @StasLeo1987
    @StasLeo1987 9 месяцев назад +52

    For me, this way worked:
    Select DB in SSMS > Tasks > Import Data > Next > Data source = Flat file source > Select CSV (!) file (you have to change expected file extension during file selection) > Locale = English (United States) (i don't know if it matters though) > Next > Destination > Microsoft OLE DB Provider for SQL Server > Next > Next > Finish

    • @godswillekanem
      @godswillekanem 8 месяцев назад

      Hey I need help. After uploading my data, doesn't show up under the database. I have refreshed repeatedly but no luck with it. Any idea how I can solve this issue? thanks

    • @brendachiri2890
      @brendachiri2890 8 месяцев назад

      Same,I’ve encountered the same problem

    • @godswillekanem
      @godswillekanem 8 месяцев назад

      when choosing destination, select "microsoft ole db provider for sql server"@@brendachiri2890

    • @JunSim-t2o
      @JunSim-t2o 8 месяцев назад

      @@godswillekanem have you checked "Tables" under database?

    • @deeplakshmiyadav
      @deeplakshmiyadav 6 месяцев назад

      Thanks man. You really saved my day with this method. I've wasted 2 entire days trying to import data already!!

  • @jjsinclair94
    @jjsinclair94 3 года назад +514

    Great content, as per. You helped me to get my dream job as a modelling analyst in the last couple of weeks, I am forever grateful to you!

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 года назад +39

      That's so awesome! Congratulations!!

    • @CE-vd2px
      @CE-vd2px 3 года назад +9

      Great to here. What did the interviewers look for? Excel,SQL, and Tableau/PowerBi? I'm asking because I want to make sure I am on the right track.

    • @binodrai3653
      @binodrai3653 3 года назад +7

      @@CE-vd2px Please check previous videos of Alex, he has made some good videos on DA skills

    • @CE-vd2px
      @CE-vd2px 3 года назад +4

      @@binodrai3653 I did but I like to verify.

    • @binodrai3653
      @binodrai3653 3 года назад +6

      @@CE-vd2px in addition to what you have mentioned learn Python or R.

  • @wisdomchinaza6882
    @wisdomchinaza6882 Год назад +23

    For those who are currently working on their portfolio project and can't import Excel file using any of the methods Alex showed.
    I tried this and it worked.
    Save your file as CSV and apply the first method Alex showed right click the portfolio project, select task, in the dialogue box, select flat file source, browse your folder to select the CSV file (ensure the extension on the browse dialogue box is .CSV, so you can see your CSV files and select it, destination option select SQL server and click next till finish"
    Hope it helps. Alex thanks again for the video . I can now go ahead with my project

  • @katl3597
    @katl3597 Год назад +469

    Hi Alex! Doing this in July 2023 and I think the SMSS features have changed a bit. Instead of using SQL Server Native Client 11.0 it is now Microsoft OLE DB Provider for SQL Server. This took me a while to figure out and import. Hope this helps!

  • @alijokhio2995
    @alijokhio2995 Год назад +234

    To the ones stuck on 19:10 , use the following code to convert into float datatype when ''Null"
    Select location, date, total_cases,total_deaths,
    (CONVERT(float, total_deaths) / NULLIF(CONVERT(float, total_cases), 0)) * 100 AS Deathpercentage
    from PortfolioProject..covidDeaths
    order by 1,2

    • @normatinoco9956
      @normatinoco9956 Год назад +3

      So stuck in this part! thank youuuuuuuuuuu!

    • @solomonadeniyi4197
      @solomonadeniyi4197 Год назад +1

      Ah yes. Thank you. I would love to understand more about how to query this

    • @alijokhio2995
      @alijokhio2995 Год назад +2

      @@solomonadeniyi4197 brother its all about data type. Some of the data in "Numeric" data type is actually float, thats why we need to convert it

    • @CanalIGUINorante
      @CanalIGUINorante Год назад

      Yay! You saved my life

    • @solomonadeniyi4197
      @solomonadeniyi4197 Год назад

      @alijokhio2995 thank you so much, brother

  • @sukumarsarambekar648
    @sukumarsarambekar648 3 года назад +72

    Just when I thought this guy has done a great deal of work for beginners like me, he goes on to do greater things like this. Really thank you Alex -from an aspiring analyst

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 года назад +3

      You're most welcome! So glad it's helpful!

    • @k-EE-VIPESHDUNKWAL
      @k-EE-VIPESHDUNKWAL 3 года назад

      @@AlexTheAnalyst can you tell What is the Primary key in this Table?

    • @ruhultusar8390
      @ruhultusar8390 2 года назад

      @@k-EE-VIPESHDUNKWAL In short you can say it's the unique identifire of every rows in a table.

  • @SilasEdet
    @SilasEdet 6 месяцев назад +34

    For those doing it from March 2024, you should cast all the columns to floats. Took me days to figure it out. The data in the tables are more and int or even bigint doesn't cut it

    • @maddie8564
      @maddie8564 5 месяцев назад +3

      Thank you so much! At 1:06 I got stuck because of the constant errors I was getting. When I read your comment and casted as float on the line code reading ", SUM(CAST(vac.new_vaccinations as float)) OVER (Partition by dea.location order by dea.location,dea.date) as RollingPeopleVaccinated" I finally got the query to work as it was suppoded to.

    • @SilasEdet
      @SilasEdet 5 месяцев назад

      @@maddie8564 I'm glad I could help

    • @hafsaabdullahi1250
      @hafsaabdullahi1250 5 месяцев назад

      thank you so much i was getting errors and this helped

    • @maryanal3476
      @maryanal3476 4 месяца назад

      Thank you!

    • @fahadshaikh9099
      @fahadshaikh9099 4 месяца назад +1

      did you also find out that not all the data gets loaded onto sql for some reason? I only got about 2600 rows in smss. Not sure what to do to fix that

  • @agatanakoncuswiata
    @agatanakoncuswiata 2 года назад +161

    Man... you're amazing. Don't congratulate us to have stayed until the end! It's TONS of work you've done and it's only to be applauded and appreciated! I bet many agree with me - against your sore throat you guided us through soooo many ideas and possibilities! Thank you so much and wish you MILLIONS views!

    • @AlexTheAnalyst
      @AlexTheAnalyst  2 года назад +29

      Working on your education isn't easy, I'm proud you made it to the end - shows commitment :)

    • @Ts18762
      @Ts18762 2 года назад

      Thanks alot for this great walkthrough Alex. 👏 👏
      Just a quick question for the audience, I am looking for a good budget laptop. My primary purpose is to be able to run all the DB applications online or offline including SQL, PowerBI. Please suggest under $600-800

    • @naziaali642
      @naziaali642 2 года назад +1

      Hey did u get the data from jan 2020 bcz now i am doing it and same website is consisting of data from feb 2020 to september 2022

    • @s.i.8852
      @s.i.8852 Год назад

      I did everything up till the end and keep refreshing the View tab but it won’t appear.

    • @LSOluwatobi
      @LSOluwatobi Год назад

      ❤ATA❤

  • @RemotePossibilities1
    @RemotePossibilities1 3 года назад +40

    This is SO helpful, thanks Alex! For anyone looking for the intermediate-level SQL queries, they're near the end - here are the timestamps:
    51:20 Joins
    62:00 CTEs
    66:30 Temp tables
    70:20 Views

    • @nickarrigo5540
      @nickarrigo5540 3 года назад +1

      great comment craig!

    • @eniolababafemi8549
      @eniolababafemi8549 2 года назад

      Hi, please can you help with the particular dataset that was used in this video?

    • @RemotePossibilities1
      @RemotePossibilities1 2 года назад

      @@eniolababafemi8549 Please see the link to it in the video description.

    • @s.i.8852
      @s.i.8852 Год назад

      I did everything up till the end and keep refreshing the View tab but it won’t appear.

    • @jakecwieka9415
      @jakecwieka9415 Год назад +1

      @@s.i.8852 Make sure you are filtered to Portfolio Project and not Master when you create it. Then refresh and it should show up

  • @BetterLife95
    @BetterLife95 Год назад +40

    I just completely watched this priceless video. Many thanks Alex for teaching us free of cost and guiding us all along. I am an Economist and have been learning data analysis since 2022 and it is amazing that I have found your channel and you.

  • @jeremiahwooten226
    @jeremiahwooten226 2 года назад +60

    I can’t express how much I appreciate you creating these tutorials. As a beginner prospective analyst looking to add projects to a portfolio, I was lost and overwhelmed. Your videos are a godsend. Thanks a lot

    • @AlexTheAnalyst
      @AlexTheAnalyst  2 года назад +2

      So awesome to hear it!

    • @oghenechucknorris3634
      @oghenechucknorris3634 2 года назад +3

      @@AlexTheAnalyst Godsend is an understatement. As a beginner, getting something together from what you have studied is probably the hardest and having someone show you how to get that done with all the errors along the way, makes it seem possible. Glad to have found this channel.

    • @furrywarrior8765
      @furrywarrior8765 2 года назад +6

      I totally agree. I graduated from the data analytic course recently and was also struggling with the portfolio project. Alex's video is definitely a godsend and very helpful.

    • @anilani9331
      @anilani9331 Год назад

      Sir where can I get this data set

    • @s.i.8852
      @s.i.8852 Год назад

      I did everything up till the end and keep refreshing the View tab but it won’t appear.

  • @drizzle1866
    @drizzle1866 2 года назад +55

    What I really want to thank you for is making this real. I mean, keeping all your own mistakes in the video is a booster for Aspiring Analysts like myself.
    "Yes, it can get messy. We just need to keep calm and figure it out". Thanks once again, Alex

  • @Rinkutoki
    @Rinkutoki 2 года назад +14

    As someone who had to track this type of data since the early days of the pandemic, the project hits a little different. I wish I had found data analysis tools sooner, spent days of work finding, loading and analyzing this information when we didn't have that kind of time. This isn't just a tutorial project just to learn, its the real deal!

  • @kolawolekafayat6487
    @kolawolekafayat6487 Год назад +10

    Every time I watch this part, it makes me smile. "I don't understand, don't ask me". Thank you Alex.

  • @ushnakhan198
    @ushnakhan198 2 года назад +35

    38:23 The reason we are not getting correct total deaths when we break down by continents is that in our query we are using MAX(cast(total_deaths as int)), so what it does is it returns the maximum total_death from that particular continent(ex in Oceania continent it returned 910, which is the total death count for Australia and highest in its continent). We need to replace the query by- select continent, sum(new_deaths)
    from coviddeaths
    where continent!=''
    group by continent;
    Hope this helps!

    • @AlexGreece45
      @AlexGreece45 2 года назад +2

      Exactly! Because we're requesting the total of all deaths of the countries in each continent and not the HIGHEST death count 🙂

    • @lizejackson3248
      @lizejackson3248 2 года назад +2

      Thanks for pointing it out. I was also confused by the code Alex was suggesting because it didn't make sense so I also applied your suggestion.

    • @joseeduardocordoba2512
      @joseeduardocordoba2512 Год назад +1

      Yes, I also noticed that, if you do it correctly you will notice that Europe had way more deaths than North America.

    • @UnrealK9999
      @UnrealK9999 Год назад

      bro i made like that, the thing is that the numbers of north america are higher than US+CAD, idk why

    • @UnrealK9999
      @UnrealK9999 Год назад +1

      nvm, NA has more countries not only US and CAD, thanks a lot!!

  • @hoeeeng
    @hoeeeng 3 года назад +11

    I'm using MySQL and had a problem when importing the .csv files to the MySQL server. The entire records didn't go into the server, so I changed all the empty values to NULL with Pandas and I finally got the entire records. For people who are facing the same issue. AND I really appreciate your videos!

    • @joshuajosh5181
      @joshuajosh5181 Год назад +1

      Hey, do you mind explaining this a bit

    • @akinsanyaoluwatomisin513
      @akinsanyaoluwatomisin513 Год назад

      I'm having issues importing the CSV files into Mysql as well. Unfortunately, I don't know much about Python. Is there any other way around this?

    • @SaOD18
      @SaOD18 Год назад

      @@akinsanyaoluwatomisin513 Hi there, have you been able to figure out a way around it? Having the same issue :(

    • @xintianzhang8056
      @xintianzhang8056 Год назад

      Thanks for your suggestion !! I am a Mac users, your comment is a life saver!

    • @PK-tc8ib
      @PK-tc8ib Год назад

      @@joshuajosh5181 Use pandas to replace all empty cells with a space and save as a new modified CSV. Select Table Data Import Wizard under Table in MySQL....Hope this will help

  • @jackjeffries6872
    @jackjeffries6872 2 года назад +51

    For those using Mac and found MySql as their preferred option; before you import data, be sure to reformat the date to yyyy-mm-dd so that MySql recognizes the date as an actual date. To do this, highlight the column and press control+1, go to date and select the format I mentioned above. Hope this helps someone!

    • @rrromal
      @rrromal 2 года назад +3

      Were you able to import the data? im still having trouble

    • @SBSpaceRocks
      @SBSpaceRocks 2 года назад

      Hey Jack. I formatted it to yyyy-mm-dd and then imported the files. But are you able to see the results sorted by date when we order by date? I tried cast(date as date) as date1 but still the result is not ordered by date.

    • @hi.imferg
      @hi.imferg 2 года назад +1

      @@rrromal I'm having trouble importing as well. I'm using Azure Data Studio on Mac. Is that what you were using and were you able to figure out the issue?

    • @rrromal
      @rrromal 2 года назад +2

      @@hi.imferg No I was using MySql, turns out I had to change data types

    • @sushantpatil4035
      @sushantpatil4035 2 года назад

      @@rrromal how you import files in mysql??? , please tell me

  • @leoangelovb
    @leoangelovb Год назад +3

    It is currently 2023 and I am doing this since yesterday May 29, 2023.
    Challenges encountered so far:
    1. Using Macbook Air M1, and SQL Server Management Studio (SSMS) is not natively compatible, I have to use Azure SQL Edge virtual container in Docker(which makes my Mac a bit warm). I have to use Azure Data Studio for SQL IDE. Both Docker and Azure Data Studio can run natively in M1.
    2. I am not able to import Excel files to the database, I need to convert them to CSV. Azure Data Studio can only import flat files like CSV and JSON.
    3. Before importing, most of the numeric attributes needed to be declared into a float data type based on the error-caching of Azure Data Studio.
    All in all, it was a great dive, thank you, Alex!

  • @candacedillon97
    @candacedillon97 2 года назад +21

    Ok so something I want to point out to the newbies, like me, who may be frustrated, don't give up and don't be afraid to ask. Also, I spent an hour trying to figure out how the results pane was kept so clean.... Highlight the text, then click "execute". If you just do the text or code and then click execute, your results pane will stack and eventually you won't be able to see any of your results. Highlight, then execute. This is something nobody explains. Even the Google certificate course doesn't tell you this little detail that is important.

    • @blessingkamar6670
      @blessingkamar6670 2 года назад +1

      thanks this was helpful

    • @EGstill85
      @EGstill85 Год назад +1

      Is there a free version of an app that allows you to edit XLSX files, or do you have to pay for some version of Microsoft 365? Thanks

    • @curiousss4960
      @curiousss4960 Год назад

      Thank you. I really wish this was explained.

  • @gillianchapman5106
    @gillianchapman5106 2 года назад +51

    This has been beyond helpful! I've been taking lots of SQL classes at Datacamp, and as you mentioned in your video on Datacamp, one of the drawbacks is not necessarily knowing how to apply the concepts within the software. This helped me feel a lot more confident navigating and applying concepts with SQL Server :)

  • @cthecheese1620
    @cthecheese1620 2 года назад +4

    I appreciate that you talk about where you have issues, as a developer I'm so used to having people breeze through projects in their video without talking about the hiccups they had on their first run before recording. Although it doesn't provide much help in the way of solving problems while trying to follow along, it does help make it clear that solving the weird problems is a part of the journey (and in some cases a part of the fun)!
    Edit: For those curious, we're now at 92m cases as of August 2022 with a Death Percentage of 1.12%

  • @kelechieva7473
    @kelechieva7473 Месяц назад +2

    Just finished the video after weeks of starting it and I'm so happy I finished it!
    I really want to thank you Alex for taking out time and teaching this cost free, it's my first SQL portfolio and I must say it worth the time spent.

  • @ilovelifely
    @ilovelifely 2 года назад +6

    Running this on windows is sooooo much easier than mac ! just fyi...On a mac you have to go to your terminal and program it in there to be compatible ... a lot of work to even come up with the right code! And Alex you ROCK! Thank you so much, wish there were more people like you out in the world giving back the same way you do!!

    • @juanmanuelarevalomadrid1937
      @juanmanuelarevalomadrid1937 7 месяцев назад

      Hey! I'm struggling trying to import the files from cvs to my sql on my Mac, did you figure it out, would you mind explain me how do it, thank you so much

  • @ninoshka81
    @ninoshka81 Год назад +11

    I just started learning sql 2 days ago, and now I'm doing my first project with your help!
    So easy to understand!

    • @AlexTheAnalyst
      @AlexTheAnalyst  Год назад +1

      Good work! :D

    • @munachinwanedo494
      @munachinwanedo494 Год назад

      nice one nino, pls keep us updated. i too just started sql last 2 weeks

    • @euniceduah4793
      @euniceduah4793 Год назад

      @@munachinwanedo494 I just started too.

    • @euniceduah4793
      @euniceduah4793 Год назад

      hi, so to import the data, did you have to purchase Microsoft server 2022? I think it is not free anymore

  • @SouVirtualWorld
    @SouVirtualWorld 3 года назад +8

    This is great to start with thanks Alex :) . If anyone like me had an issue with the NULL value for the Date column after importing the excel files to SQL Server, you might need to change the "Excel version" in SQL Server Import and Export Wizard to Microsoft Excel 2016. Also check your system date format to make sure it is the same format as in the video. These changes resolved my issue.

  • @Misspearl01
    @Misspearl01 28 дней назад +1

    I have started and stopped this video many, many times, but today I got to the end and I'm glad I did.
    If you're struggling, that's okay.
    Take a break and come back when you're ready.
    Happy learnings y'all.
    And, big thanks Alex🎉

  • @kavishshah5586
    @kavishshah5586 Год назад +13

    This is my first project on SQL. I couldn't have done without you.
    Thanks a lot

  • @erikaruv22
    @erikaruv22 2 года назад +7

    You're a life saver! i just finished the google data analytics course and since im new to all of this i had no idea what to do or where to start to make my portfolio to apply for jobs. Really appreciate you ❤

  • @JennaKiely
    @JennaKiely 3 года назад +8

    How you explain these steps is so kind and encouraging and makes nervous beginners such as myself feel fully capable!! THANK YOU. Honestly, without your videos, I probably would have felt too intimidated by data analytics and kept searching for another profession to change into (like you, I'm transitioning from healthcare!)

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 года назад +1

      I'm so glad to hear that! That makes doing all of this worth it :D

  • @codencolor
    @codencolor 3 дня назад

    I just started with it and realized the data is no longer available with so many comments about it. Thank you for sharing the data in the links, you are so considerate.

  • @courtneyheld8740
    @courtneyheld8740 Год назад +11

    I’m only 20 mins in and already learned more than my 4 years of bachelors schooling lolololllol. Plus I LOVE how you make tiny mistakes ( we are human, it happens, but they are common something we as beginners might make) and show us directly how to avoid or problem solve and fix them!!! You’re my hero !

    • @amosadeleke4548
      @amosadeleke4548 Год назад

      HI Courtney, do you mind figuring out what is wrong with this code?
      select location, date, total_cases, total_deaths, (total_deaths/total_cases)*100 as deathpercentage
      from [covid death]
      order by 1,2
      Operand data type nvarchar is invalid for divide operator.(this is the output i keep getting)

    • @harikrishna7634
      @harikrishna7634 Год назад +1

      @@amosadeleke4548 "Operand data type nvarchar is invalid for divide operator" suggests that one or more of the columns used in the calculation for the death percentage field may be of data type nvarchar, which cannot be used in mathematical operations.
      To fix this issue, you need to ensure that the data type of both the total_cases and total_deaths columns is numeric, such as integer or float, before performing the division operation. One way to achieve this is by converting the data types of these columns using the CAST or CONVERT functions.
      You can use the below code:
      SELECT location, date, total_cases, total_deaths,
      (CAST(total_deaths AS float) / CAST(total_cases AS float)) * 100 AS deathpercentage
      FROM [covid death]
      ORDER BY 1, 2

    • @allanemenike812
      @allanemenike812 Год назад

      @@harikrishna7634hi i tried this and it didn’t give me the result i wanted

    • @maganrodeo129
      @maganrodeo129 7 месяцев назад

      @@amosadeleke4548 what i did here to solve this is i go to his github to download the coviddeaths and covidvaccination the author said that the dataset have been change by the source that is why you get that error so try to do it again as for me i get stuck at 1:09:30 temp table

  • @N1246-c2f
    @N1246-c2f 3 года назад +94

    For the query at 58:56, I was getting an error:
    "Arithmetic overflow error converting expression to data type int. Warning: Null value is eliminated by an aggregate or other SET operation."
    For those of you getting the same thing, change the "int" into "bigint", apparently its due to the sum function.

    • @mayzin5453
      @mayzin5453 2 года назад +1

      Thank u so much you’re a life saver!

    • @alejandrareyesvaldez8532
      @alejandrareyesvaldez8532 2 года назад +1

      thank you!!

    • @ericbrown2135
      @ericbrown2135 2 года назад +6

      Great catch! I recently noticed this earlier in the video too - int allows max number of 2,147,483,647, bigint allows up to 9 quintillion (9,233,372,036,854). So when vaccinations surpassed ~2bn the video became just slightly obsolete 😀

    • @N1246-c2f
      @N1246-c2f 2 года назад

      Glad all of you worked it out in the end!

    • @Pjsmurphy
      @Pjsmurphy 2 года назад +1

      you saved me thank you!!!!

  • @elphasluyuku4167
    @elphasluyuku4167 Год назад +6

    My Only Regret is not finding your Channel sooner Alex. You have helped me so much and I am super grateful.

  • @PK-tc8ib
    @PK-tc8ib Год назад +2

    Thank you soooo much, Alex.
    For those using Mac and having trouble importing xlsx into MySQL……Here’s my solution:
    1.reformat date to yyyy-mm-dd and save as CSV
    2.use python to replace all empty values in this CSV with a space
    (import pandas……fillna()…….)
    3. create a new database in MySQL, and select Table Data Import Wizard under Table
    4. Select your new modified CSV file and import…Hope this will help!!

    • @profmike1373
      @profmike1373 10 месяцев назад

      Hey, please can you help me. I am still having trouble using MySQL. Please help!

  • @RoshanBholah
    @RoshanBholah Год назад +6

    For anyone using Azure Data Studio and having trouble importing the CSV files with the SQL Import extension. You will need to change around with the Data Types that Azure identifies. e.g. change Small Int to Int or nvarchar(1) to nvarchar(50). In addition, I made all columns to allow null values. Making these corrections then worked and both CSV files imported into my database.

    • @curiousss4960
      @curiousss4960 Год назад +3

      I'm using the same software the author is and having same issues. I will probably have to go back into excel and change a LOT of the data types because almost all of the numerical columns are showing up as nvarchars. Makes any math functions impossible to use.

    • @sflax91
      @sflax91 Год назад

      @@curiousss4960 Did you ever figure this out? I feel like I'm running into the same issues. No matter what I do to the data in excel (making sure all numbers are actually in a number format), every time i try to perform a calculation in SQL i still gets "0"s. I even tried creating a table from scratch in SQL, specifying what each column's data type should be, and kept running into the same issue.

    • @juanmanuelarevalomadrid1937
      @juanmanuelarevalomadrid1937 7 месяцев назад

      how did you do with covid vaccination table??

  • @neelam5170
    @neelam5170 2 года назад +12

    For people who r geeting "arithmetic overflow error converting expression to data type int", this is because the value in the data exceeds the integer limit,
    Try converting new_vaccinations to BIGINT in place of INT.
    Like....
    SUM(CONVERT(BIGINT, cv. new_vaccinations))
    --cv is the covidVaccinations table
    This will remove the error.
    Hope this helps...

    • @Chinwenduh
      @Chinwenduh 2 года назад

      Thanks, I had the error. i was stuck lol.

    • @isaamanov5892
      @isaamanov5892 2 года назад

      Thanks dude!

    • @jamescloonan
      @jamescloonan 2 года назад +1

      Appreciate you my dude!!!

  • @JordanVillarreal
    @JordanVillarreal Год назад +21

    I did data analysis and error management in a HUGE database for sprint. The work I did was extremely similar to this. The database was way messier and more confusing than this, but if you can work out the problems as you go like he is, this is basically what I did everyday.

    • @donald01
      @donald01 Год назад

      Hi, you seem to have prior experience of related datasets so pardon my asking. I'm at the part where I divide total deaths by total cases to get the deathpercentage. However upon running the query , I'm getting an error message that says 'operand data type nvarchar is invalid for divide operator'.
      Can you kindly suggest how solve it?
      Thanks

  • @julianoviedo
    @julianoviedo Год назад +2

    Dear Alex, as of July 2023 it was a real pain to install SSMS 19 and SQL server 2022. Many errors whilst trying to Connect to the server, and then to import the data through the Wizard...but I managed to find solutions to everything.
    Right now I just finished the whole Project Tutorial and feel grateful to you for dedicating time and effort to teach us. I'm looking forward to finish the whole Bootcamp and become a greater Data Analyst day by day.
    Cheers!

    • @favourchukwumam3179
      @favourchukwumam3179 Год назад +1

      Hi, i could not find population column in CovidDeaths Table, pls did you experience this?

    • @julianoviedo
      @julianoviedo Год назад

      @@favourchukwumam3179 Hello mate.
      I used Alex's original data file in the description box. Nowadays the data set has changed drastically in order (i.e. columns).
      I just went to the website "ourworldindata" and downloaded the file in .CSV, then a quick search with ctrl+F found the "population column" in the AW spot in the data sheet.
      Hope this helps! Cheers.

    • @gustavo-yv1gk
      @gustavo-yv1gk Год назад

      Hi @julianoviedo, how did u download SSMS 19?

    • @gauravvadpillay7364
      @gauravvadpillay7364 Год назад

      Hey there can you help me with the issues regarding uploading excel file into SQL

  • @p.m4118
    @p.m4118 3 года назад +17

    You are doing an excellent job filling the gaps between knowledge and experience! Just finished Google’ s data analytics and building my portfolio, all thanks to your videos. Many thanks!

    • @anthonysaturno7220
      @anthonysaturno7220 2 года назад

      How long did it take you to complete the course?

    • @p.m4118
      @p.m4118 2 года назад

      I did it in 2 months, but if you’re going to follow thorough every link,, sub course and indexes, then you’ll need more than 6 months!

    • @johnmestyanek4766
      @johnmestyanek4766 2 года назад

      This! I'm finishing mine right now as well and feel blessed to have found Alex's amazing videos.

    • @MohdDanish-kv9sw
      @MohdDanish-kv9sw 2 года назад

      is this course any good ???

  • @sergiochavezlazo5362
    @sergiochavezlazo5362 Год назад +4

    Thank u so much for the video! Something that I think it is important to clarify that the variable new_vaccinations refers to the number of doses available and not to the number of people vaccinated. If this exercise is done with the updated data, the cumulative percentage exceeds 100% in many countries. I think it should be reinterpreted as 'ratio of doses administered per 100 inhabitants'.

  • @bikramshrestha4233
    @bikramshrestha4233 Год назад +7

    For whoever is having trouble in inserting their data in temp table, make sure to have the same data type of the columns in the covidDeaths table and temp table. And I found it better to specify the columns that you're inserting data in as:
    INSERT INTO #PercentageOfVaccinatedPopulation(location,date,population,new_vaccinations,TotalInLocation)

    • @chasbjoes
      @chasbjoes 10 месяцев назад +1

      I actually did mine a bit differently because I noticed that people_vaccinated is a running total and more accurate than the new_vaccinations (since that column was only getting updated once in a while instead of each day new vaccinations were being performed 💡). I don't know about the earlier data sets, but the one I pulled on 11 November 2023 actually shows an interesting twist too that I didn't think about. If a person was quarantined outside of their resident country and received the first "vaccination" in the country they got it, then the spreadsheet shows more vaccinated people than residents in some cases. That also means though, that if they returned to their country of origin after they were initially "vaccinated" and received a booster there, then they will show again in the people_vaccinated for that country. Food for thought.

  • @cesskinyua
    @cesskinyua 2 месяца назад +1

    Doing this in July 2024, If still you have problems trying to import the data for me this worked:
    task>import flat file(choose the first one)> next>browse the location>new table name (will be autocompleted)>next >next>next>finish. Hope this helps.

  • @mariahhayes5089
    @mariahhayes5089 Год назад +307

    Absolutely amazing video, I have finally realised that as a beginner in the financial investment market, you can achieve close to nothing yourself because you still have a lot to learn. Trading with a professional broker is more profitable and my advice for beginners is to always take advantage of that.

    • @raymondbarnes5264
      @raymondbarnes5264 Год назад

      Instead of timing the market, you should try to diversify your portfolio in order to get a dollar-cost average when it’s time to retire. Keep in mind that you don't need a ton of money to invest. Investing in small amounts can build long-term wealth too!

    • @helenoliver4838
      @helenoliver4838 Год назад

      I'm an amateur making terrible picks and I really need assistance.

    • @greenquake11931
      @greenquake11931 Год назад

      Trading is Profitable with the help of a seasoned broker managing your trading account.

    • @mariahhayes5089
      @mariahhayes5089 Год назад

      @@helenoliver4838 'BRIDGET MARY TUROW"".

    • @helenoliver4838
      @helenoliver4838 Год назад

      @@mariahhayes5089 How can i reach her?.

  • @mrinalmishra1437
    @mrinalmishra1437 2 года назад +7

    For those who are doing it on MySql, kindly select a smaller dataset per say in a group of 50 from countries in each continent datewise, for both the total no. Of deaths and total vaccinations table.

  • @verbingnoun
    @verbingnoun 3 года назад +8

    Extremely useful video! Thanks so much. I play around a lot with data in SQL but was having a challenging time figuring out how to display my knowledge in a portfolio project. This really gave me a solid idea of how and which skills to include. As a healthcare professional looking to switch careers, this was a great topic as well.

  • @urmimalaray7226
    @urmimalaray7226 Год назад +1

    Please do no congratulate us at the end. It is my privilege that you put in so much hard work to show how things work. I came here to learn how to 'play around' with the downloaded single file (any form ) and make it multiple files and do analysis. I must say I learn so much new things in SQL as for self learner in this almost 4 months journey, I feel comfortable working with Excel, Power Query, power pivot ,DAX ,Power BI etc. and when ever I encounter with SQL tables ,my main focus is always make simple query in SQL and then copies/import them into Excel for analysis. From today onwards, I would like try the reverse just to come out from my comfort zone. Thanks for your video. It is al so helping me to create portfolio of my work as this this the time to do that.

  • @fpassow1
    @fpassow1 2 года назад +6

    I really appreciate the inclusion of mistakes and research time, etc. I've been working with software long enough that I can google my way through all kinds of things. But I'm trying to start a data analyst career. And I get depressed when I read the "requirements" in job listings. So it's great to see what reality is for someone who got a job... and has their own RUclips channel.

  • @B3TT3R_YOU
    @B3TT3R_YOU 2 года назад +4

    Thank you Alex for making this learning process an easy and interesting journey. I'm certain that I'm getting a Data Analyst job very soon. I can't believe that I can finally analyze data using SQL, I've tried learning for months from other sources until I found yours.
    Thank you Alex.

    • @AlexTheAnalyst
      @AlexTheAnalyst  2 года назад +1

      I'm so glad it's helpful! So awesome to hear :)

  • @juliangrant4747
    @juliangrant4747 Год назад +3

    Hi Alex! You are one of the real heroes.
    During the course of my graduate school studies and doing work on the side, I have had the great fortune of attending seminars, classes, workshops, etc. in great educational institutions from Europe, North America, and Asia.
    On that note, I have the chance to sit down maybe in the presence of great minds in my field.
    What I find common among them is the level of emotion, engagement, conviction, and insightfulness when they deliver they disseminate their knowledge and wisdom.
    Yeah, maybe I'm also a sucker for an excellent education.
    But man! Sorry to put you in the spot, but you're one of those few people I put in the category of those great educators.
    You are brilliant!
    I would compare the great Sal Khan (of Khan Academy) at least in the field you teaching.
    Much Thanks and appreciation.
    I'm a constant consumer of your content and enjoy extensively learning from them.
    Hoping you continue your great work.
    So happy to hear the last part of this video.

  • @alvarobayardi7576
    @alvarobayardi7576 Год назад +1

    Thanks Alex, you're great!!!
    Today I finished my first project in SQL and I plan to complete all your videos. It's one of the best courses I've ever done, congratulations!
    I wish success to all students who, like me, seek to grow professionally.

    • @KickNojh.p1
      @KickNojh.p1 Год назад

      How were you able to load the data.? I'm using the 19.1 version and for some reason it does not give me the option of import 64 bit and load the data as he mentions. Please help.

  • @umaisbhatti7806
    @umaisbhatti7806 3 года назад +4

    I really appreciate your efforts to train other beginner level students to Data Analytics (I'm one of them). Pls keep making such fruitful vedios for us. You're the best man!!!

  • @DaKillaZZZ64
    @DaKillaZZZ64 2 года назад +5

    You're actually the best Alex, thank you so much, I'm so glad people like you are on this planet man, you make it better

    • @AlexTheAnalyst
      @AlexTheAnalyst  2 года назад

      So glad to hear it! Thanks for watching :D

  • @joeyfung7508
    @joeyfung7508 3 года назад +31

    7:25 I believe the reason is because you saved the excel file as .xlsx (64 bit), so you won't be able to use the 32 bit "SQL Server Import and Export Wizard" to import the data. Try saving the excel file as .xls instead (32 bit). At least that works for me :) Awesome content btw!

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 года назад +3

      Definitely could be? I should look into that

    • @wilsonjunior3196
      @wilsonjunior3196 2 года назад

      True. Changing from xlsx to xls works without the error coming up

    • @Monika-vu3zs
      @Monika-vu3zs 2 года назад

      How did you put this big data through workbench wizard ....it would take months......can you help me

    • @chriszhang2805
      @chriszhang2805 2 года назад +3

      I tried to save as .xls, but it shows that some data might be lost in the .xls file.

    • @xMastJedi
      @xMastJedi 2 года назад

      xls have only 64k rows.
      xlsx have +1m rows.
      xls is Excel 1997 format and have nothing to do with 32/64k. Its simple NOT compressed format. Wide used to exchange data between many systems and have status 'spreadsheet standard' in the old days.
      change ext of ANY xlsx file to .zip and you can open it by any archive program.

  • @AndreasTibell
    @AndreasTibell Год назад +3

    Hi Alex, I just want to give a huge thanks for making these videos. I have tried a couple of other courses but none have made as much sense as you do. I love the fact that you focus on the practical aspects instead of endlessly explaining the theory behind it. Since I´m not looking to reinvent the field but more learn how use the necessary tools, this is exactly what I´m looking for. Keep doing what you are doing, precisely like you are doing it!

  • @lizejackson3248
    @lizejackson3248 2 года назад +7

    For those getting an error message "ORDER BY list of RANGE window frame has total size of 1020 bytes. Largest size supported is 900 bytes." at time 1:00:09 in video, the solution I found is the following:
    SUM(cast(vac.new_vaccinations as bigint)) OVER (Partition by dea.Location Order by dea.Date)

  • @LukeBarousse
    @LukeBarousse 3 года назад +6

    I see from your task bar you're burning the midnight oil, Alex!
    Such a great showcase of building your portfolio for a data analyst, love this content!

  • @adrianamejia3345
    @adrianamejia3345 3 года назад +12

    Just stopped here to say: This is gold!!

  • @nankundabridget9783
    @nankundabridget9783 9 месяцев назад +1

    Doing this in December 2023. I am a SQL beginner and a on my journey to breaking through the data analyst field and am so thankful to you ALEX. I have completed my first project.

    • @gaurabkundu9088
      @gaurabkundu9088 9 месяцев назад

      Hey I am doing this project right now, For the second query I did this sql code
      -- Analyzing the likelihood of death if contracting Covid-19 in a specific country
      Select Location,date,total_cases,total_deaths,
      CONVERT(DECIMAL(18, 2), (CONVERT(DECIMAL(18, 2), total_deaths) / CONVERT(DECIMAL(18, 2), total_cases)))*100
      as DeathPercentage
      From Covid_Data_Exploration_Project..CovidDeaths
      order by 1,2;
      But I am getting every value in the total_deaths column as NULL. Can you help?

  • @ghazella_alizadeh
    @ghazella_alizadeh 2 года назад +6

    Hey Alex, thank you so much. I'm certain that I'm getting a Data Analyst job very soon. I can't believe that I can finally analyze data using SQL, I've tried learning for months from other sources until I found yours. I recently finished the Linkedin Data Analytics Certification, and I've been struggling to figure out how to get my portfolio started. I want to tell you I’m glad you showed your mistake live. I really want to thank you for making this real. It’s important for beginners to see that experts also make mistakes and how can they solve. It is encouraging. As a bonus, I will better remember to save as .xlsx because you pointed it out :) keep going :D

    • @AlexTheAnalyst
      @AlexTheAnalyst  2 года назад

      So glad to hear it :)

    • @triumphotamiri7904
      @triumphotamiri7904 2 года назад

      @@AlexTheAnalyst Please help me out, am getting same errors when I try importing from the Portfolio project database
      But the alternative way you used isn't bringing up Microsoft Excel in the data Source in my system I don't know why

    • @rayanstar7
      @rayanstar7 2 года назад

      Update? Got a job?

    • @sheilaolayo2155
      @sheilaolayo2155 2 года назад +1

      @@triumphotamiri7904 I'm not sure if you figured this yet, but if you didn't.. I was having the same exact problem so what I did was save the files in csv format and then import them as flat data sources since the Microsoft Excel data source was missing

  • @matts9577
    @matts9577 Год назад +8

    Thank you so much Alex, I really feel this going to get me an actual shot for interviews and a new career, my online courses don’t seem to impress many employers lol

  • @mk8829
    @mk8829 2 года назад +21

    Thank you for doing this. I finished SQL on Khan, and I know you never stop learning, but doing the Covid project, I was able to put it on my github and portfolio, but for me I had a hard time understanding how you came up with the more complex queries at the end, that I thought maybe I’m not ready to apply for jobs? Is it just me, or did anyone else feel lost still even after teaching yourself sql? I was proud of being about to submit it on my github but felt less confident in my abilities because I don’t think I could have figured out those queries on my own.

    • @diegorebolledo7718
      @diegorebolledo7718 2 года назад

      Thanks to your comment I discovered the Khan Academy's SQL course, it's just amazing how it teaches SQL in such an efficient way, definitively worth it. Thanks to you man.

    • @brittanyyoung3928
      @brittanyyoung3928 2 года назад +8

      I just want to let you know, that I am at this point where I am learning SQL and I am feeling less than confident in myself because I am unsure how to figure out on my own what syntax to use to get certain results and how to apply them correctly. So you're not alone. I see that this comment it 5 months old how are you doing and feeling now?

    • @chibilume
      @chibilume Год назад

      yea I have 2 SQL certificates and have watched most of the popular SQL videos on YT but still am not that confident. It's weird because w/ anything else I learn and spend so much time w/ I would be more confident in.

    • @jessicaBee234
      @jessicaBee234 3 месяца назад

      @@brittanyyoung3928 hey girl , presently at this phase now . just want to know how you feel now and if you were able to gain confidence .

  • @kaili919
    @kaili919 5 месяцев назад +2

    I'm doing it in April 2024. The new_cases and new_deaths columns are reported every week, not daily anymore. You can just add new_cases 0 in the where statement and it will show you the weekly cases.

  • @TayuanRebelo
    @TayuanRebelo 2 года назад +4

    I just want to make sure that you know the value of the work that you have been doing.
    This is so far my favorite portfolio project vídeo on RUclips. I enjoyed every moment of it. Especially the errors/mistakes!
    I’m learning analytics and I face those errors I feel so frustrated and wondering if one day I will be really good at it. But watching your video I could see that the mistakes and experiments are part of the process!!!
    Thank you very much!!!

  • @ChrisHasFunYT
    @ChrisHasFunYT 2 года назад +4

    Can't thank you enough for this content!!
    Small note at 48:38 - I believe SUM(new_cases) and SUM(new_deaths) will be the same as new_cases/new_deaths (won't reflect the true totals) because you are grouping by date. I think it may be more accurate to use MAX(total_cases) and MAX(total_deaths) given that the values are compounded each day and will never decrease. Would love to hear your thoughts/feedback. Really loving this channel - keep up the great work.

    • @FilipFlorin
      @FilipFlorin Год назад

      Thank you for this! I had the same syntax as the video but new cases and new deaths were not being summed up and I was so confused.

    • @adebayoolaitanjoseph5589
      @adebayoolaitanjoseph5589 6 месяцев назад

      You're right. I was having issues with this.

  • @sarahghaya5461
    @sarahghaya5461 3 года назад +8

    Thank you Alex. I was really lost for the past 6 months, not knowing where to start! you really helped me a lot as a beginner, looking forward to more portfolio projects!:)

  • @sj1795
    @sj1795 11 месяцев назад +1

    Alex, THANK YOU!!! I can't express my gratitude enough for everything you've done for us. I just made it through your SQL series (Beginner, Intermediate, & Advanced). Hooray! You make the process so simple and straightforward, which makes it SO easy to follow along. The project component is the BEST part. You're breaking down what we've learned and showing how we can actually use it. This is a HUGE missing link in today's education system. My Biomedical Engineering PhD program funding feel through (absolutely heartbreaking). I made it 2/3 of the way through and wrote 90 pages of a thesis that I was unable to use. While I was able to get a Master's degree, it was not in the subject/career I had wanted. My world was thrown upside down. I had no idea what I was going to do. Then, data analysis and YOU showed up in my life. You (and your "Become a Data Analyst" bootcamp) have served as a beacon of light and hope for my future. Bless you! And THANK YOU again!

  • @giovannigalvez5046
    @giovannigalvez5046 Год назад +3

    Alex, your videos are amazing and you have really helped me get to a place in sql where i feel like i can jump off and just continue to increase my skills and value. I will continue to return to your content regularly and support how i can.

  • @imolar50
    @imolar50 Год назад +15

    Hi Alex! Thank you very much for this video, I have been watching these videos thanks to the bootcamp playlist and I have learned a lot.
    For those who have problems uploading the files to SQL Server, what I did was transform the files to CSV and import them as Flat File, in the same way that Alex mentions it as the first option, at some point it will ask for the destination and I selected the one for “Microsoft OLE DB Provider for SQL Server”, or something like that.
    I know very little about this software, so I don't know exactly what I did, but it worked. I hope it works for you too.
    Greetings from Mexico!

    • @AyushKumar-wx6tn
      @AyushKumar-wx6tn Год назад

      Thanks man, you are a lifesaver

    • @mirandab8813
      @mirandab8813 Год назад

      thank you so much! life saver!

    • @arunhazarayou1
      @arunhazarayou1 Год назад

      While importing as excel(all formats) most of my data went NULL. but CSV plus Flatfile import worked. Can't thank you enough.

    • @clarissacampos9735
      @clarissacampos9735 Год назад

      Thank you so much, your answer definetely helped me!!

    • @LilyNa-hg4ti
      @LilyNa-hg4ti Год назад

      YOU ARE A LIFESAVER

  • @swolliston
    @swolliston Год назад +5

    finally finished and submitted to my github account. So proud of myself! Thank you Alex!!

    • @VitorAbreu1001
      @VitorAbreu1001 Год назад

      Congratulations on finnishing the project! Did you had any troubles with data types? Im having a lot of problems 🥲

    • @amtzgaming3688
      @amtzgaming3688 Год назад

      @@VitorAbreu1001 i also having error like this "Operand data type nvarchar is invalid for divide operator." i tried to change the data type to float..but not working

    • @VitorAbreu1001
      @VitorAbreu1001 Год назад

      @@amtzgaming3688 The code will look a bit different from Alex because I went for the updated dataset instead but the error you are saying I managed to fix with the code I have on the GitHub!

    • @JunSim-t2o
      @JunSim-t2o 8 месяцев назад

      @@amtzgaming3688
      Select location, date, total_cases,total_deaths,
      (CONVERT(float, total_deaths) / NULLIF(CONVERT(float, total_cases), 0)) * 100 AS Deathpercentage
      from PortfolioProject..covidDeaths
      order by 1,2

  • @jiajuanlu4652
    @jiajuanlu4652 3 месяца назад

    Thank you for doing this tutorial I've learn SQl from Udemy and Datacamp. But I really have no idea how to practice SQL until I found your channel! Such a gem.

  • @ezhankhan1035
    @ezhankhan1035 8 месяцев назад +7

    Great Project - Thanks Alex!
    Had some trouble importing in the data, but figured out a fix. I did it as a Flat File (CSV) and for some reason, the data types which were recommended were completely wrong (e.g. Varchar was given for some, instead of Float). So I had to manually change a few of the data types myself. Tedious, but was the only solution here :)

    • @conorkelly9642
      @conorkelly9642 8 месяцев назад +1

      @ezhankhan1035 - How did you change the data types? It is blocking me from changing them when I am uploading the CSV file in SQL Server Import & Export Wizard

    • @ezhankhan1035
      @ezhankhan1035 8 месяцев назад

      @@conorkelly9642 ​ I think I used a separate import wizard, just for flat files (CSV files).
      I right-clicked the database (which I wanted to create the table in) - went to tasks - import flat file. This way then allowed me to change any data types and
      all that good stuff. Hope it helps!

    • @IndigoWolf13
      @IndigoWolf13 5 месяцев назад +1

      Thank you for this comment! It helped me get my data uploaded.

  • @emmasaldana2724
    @emmasaldana2724 Год назад +3

    It took forever, but i finally figured out how to import the covid files. Save files in excel as CSV, right click your portfolio database, click task, import data. The the import wizard pops up, on the top where is says data source choose at Flat File Source, then browse for you file (make sure you select all files on the bottom right, so your CSV files can be seen), then click next, click next again, under destination choose Microsoft OLE DB Provider for SQL Server, confirm the server name is correct, click next, click your file name if it's not already, then next, run immediately should already be checked, click next then finish. When the execution is successul, hit close.

    • @favourbarididum2623
      @favourbarididum2623 Год назад

      Hey, thanks for this. Followed your steps one-by-one, however some error messages popped up when I was performing the running and finishing up step.
      Stopped: prepare for execute
      Stopped: Pre-execute
      Error: Executing
      Stopped: Copying to [dbo].[Covid-Deaths]
      Stopped: Post-executed

    • @emmasaldana2724
      @emmasaldana2724 Год назад

      @@favourbarididum2623 sorry, i wish i did!

    • @katspeony
      @katspeony Год назад +1

      worked for me :)

    • @iam_fancy
      @iam_fancy Год назад

      @Emma thanks for this

  • @pariabagherifaez9437
    @pariabagherifaez9437 8 месяцев назад +4

    You need to converts a values to a float datatype
    Select location, date, total_cases, total_deaths, (cast(total_deaths as float)/cast(total_cases as float))*100 as DeathPercentage
    From PortfolioProject..covidDeaths
    Order by 1,2

    • @zerox227x
      @zerox227x 7 месяцев назад +1

      Its been so long since i program that i forgot that float is to carry anything after a 0.000. thank you

    • @fahadshaikh9099
      @fahadshaikh9099 4 месяца назад

      did you notice that the excel data loaded doesnt contain the full rows but shows only 2600 rows?

  • @playtpuspower3241
    @playtpuspower3241 Год назад +2

    Took me over a week to translate MSSMS's syntax into bigquery in order to do this project but I'm smarter now because of it and also feel like an SQL god 😤 thanks sm for this tutorial Alex appreciate it a million

    • @NaturalNae
      @NaturalNae Год назад

      Im trying to do that now. Do you have any tips?

    • @opelimson5579
      @opelimson5579 Год назад

      i just downloaded the data today but i am finding it confusing cos the some columns are empty

    • @opelimson5579
      @opelimson5579 Год назад

      @@NaturalNae can you send the dataset you downloaded to my email

  • @mahmudulhasantuhin5327
    @mahmudulhasantuhin5327 2 года назад +9

    Those who are having trouble while trying to cast the matrices as integer value(apparently the numbers have grown lately ), Arithmetic overflow error is likely to occur. Try BIGINT instead of INT

    • @BobbyDaDan
      @BobbyDaDan 2 года назад +1

      This comment should be pinned.

    • @coffeeyum9
      @coffeeyum9 2 года назад +1

      Thank you!

    • @txx8302
      @txx8302 2 года назад +1

      Really Thank you...

    • @cakennedy2009
      @cakennedy2009 2 года назад

      I converted to bigint but now it's showing multiple instances of the same dates and that's giving me the wrong sums

  • @MrJawnzMan
    @MrJawnzMan 3 года назад +22

    When I see that smiling face in the thumbnail I know it’s gonna be a great video!

  • @1689JeffChavez
    @1689JeffChavez Год назад +3

    On my way through the boot camp. WHohoo! Portfolio Project!

    • @1689JeffChavez
      @1689JeffChavez Год назад

      -- Operand data type nvarchar is invalid for divide operator.
      I'm getting this error on this
      SELECT Location, date, total_cases, total_deaths, (total_deaths/total_cases) *100 as DeathPercentage
      FROM CovidDeaths_updated
      I checked chatgpt and I was directed to change the columns to INT by doing this:
      ALTER TABLE CovidDeaths
      ALTER COLUMN total_deaths INT;
      After doing so, my data seems inaccurate. Did I miss anything?
      By the way, I also ch went back and updated the data types in Excel and tried importing it again. But I am still getting the same error.
      TIA.

  • @balajeedevesha493
    @balajeedevesha493 Год назад +2

    Doing this in 2023, the data columns of total_cases and total_deaths are assigned as nvchar which leads to issues in doing the death proportion operation, you need to cast the columns as decimal for the operation to work: I used (CAST(total_deaths AS decimal(12,2)) / CAST(total_cases AS decimal(12,2)))*100 also choose the decimal function first entry based on the numbers you get as if it is given too low it will result in nvchar to decimal overflow error.

  • @WesTsidE231
    @WesTsidE231 Год назад +3

    I just completed this project and I would say this is really fantastic. I appreciate the effort you put in putting this video together. Thanks for your help Alex🙌

    • @M-fb7yv
      @M-fb7yv Год назад

      Hi
      I also want to complete this project.
      But I am enable to export this large dataset from Excel to Microsoft SQL.
      As the data has grown since the video was released.
      Were you able to get all rows in SQL ,which were lakhs in number approximately

    • @shilpashah-qf8qb
      @shilpashah-qf8qb Год назад

      Hey please help me making this project I am getting problem in importing data.... which version u used developer or express please tell...

    • @osaelartey816
      @osaelartey816 Год назад

      ​@@shilpashah-qf8qbI'm also having problem importing the data from Excel to SQL server, I keep getting Microsoft.Ace.Oledb.12

    • @innocentnduaguba
      @innocentnduaguba 11 месяцев назад

      please can you help me with the data set?
      I'd be very grateful

  • @colinhicks5135
    @colinhicks5135 3 года назад +7

    Spent a whole weekend on this but man did I learn a lot. Ready to get into part two visualizations next weekend. Thanks your guidance.

  • @GriefHC
    @GriefHC 3 года назад +12

    Thank you so much for these videos, Alex. You're really helping me prepare for my transition to a data analytics position.

  • @leprincessuu
    @leprincessuu Год назад

    I know you said you felt like you were rambling as the video progress, but please don't think so! It really helps understand the thought process behind each query. Thank you so much Alex for these videos! :)

  • @jamesmikky4012
    @jamesmikky4012 7 месяцев назад +9

    I am doing this in 2024, if you notice this error(divide by zero error encontered at 48:42, insect function
    __________Nullif(sum(new_deaths), 0)/nullif(sum(new_cases), 0)*100 as deathpersntage________
    Hope this helps

    • @maryanal3476
      @maryanal3476 5 месяцев назад +1

      I have the same issue. I had to CAST both sums due to data format. In my case it looks:
      Nullif(sum(cast(new_deaths as int), 0)/nullif(sum(cast(new_cases as int), 0)*100
      I get an error -- The sum function requires 1 argument(s).

  • @SocheatOu
    @SocheatOu 3 года назад +19

    What a coincidence!! I’m currently doing the Google Data Analytics Certificate Program. Already completed 6/8 courses in 3 weeks. Now in Course 7 learning about R. Found this dataset last week and plan to use it for my personal project. Very excited to see you work on this exact same dataset ;-)

    • @gianglearning3509
      @gianglearning3509 3 года назад +1

      You finished 6/8 courses in 3 weeks, wow, how is great ^^

    • @SocheatOu
      @SocheatOu 3 года назад +2

      @@gianglearning3509 cos of lockdown 😁

    • @anoukc6928
      @anoukc6928 3 года назад +4

      Wow! You are fast! I’m still at 2/8

    • @RiteshRaj-nn7ig
      @RiteshRaj-nn7ig 3 года назад +2

      @@anoukc6928 And I am still at 1/8 in one week 😀😀

    • @anoukc6928
      @anoukc6928 3 года назад +1

      @@RiteshRaj-nn7ig how do you like the course so far?

  • @projects3726
    @projects3726 Год назад +7

    to whoever is still having issues with importing
    Option 1: ruclips.net/video/JVP9frj2VSQ/видео.html
    Options 2: ruclips.net/video/brGq2by7DvA/видео.html
    Options 2 worked for me. Feel free to add more options as they arrive

    • @aashibansal1556
      @aashibansal1556 Год назад

      Thank you! That was really helpful.

    • @SleezyCabbage
      @SleezyCabbage 7 месяцев назад

      The second solution was the fastest and most straightforward, learned a new trick with the /quiet at the same time!

    • @Indranil-qj5jo4qf8s
      @Indranil-qj5jo4qf8s 6 месяцев назад

      I tried both but not working, in the second solution cmd prompt showing error while executing the installation.

  • @bakaresopefoluwa3883
    @bakaresopefoluwa3883 10 месяцев назад

    Hello Alex, I'm doing this in Nov 2023. I've identified the root cause for our inaccurate total deaths breakdown by continent at 37:00 The issue lies in the cumulative total deaths column, which isn't accurate. Instead of relying on the 'Max' aggregate function, I now use the 'SUM' function for the new deaths column instead of the total deaths column that you used in the video, providing a more accurate total. This has been beyond helpful! Thank you so much for putting the time and effort into this! You're too kind

    • @bakaresopefoluwa3883
      @bakaresopefoluwa3883 10 месяцев назад

      yeah sure. What problem do you have there?@ChristabelAdesua

    • @Theonlyone142
      @Theonlyone142 9 месяцев назад

      Hello, if you can please also help me. I have typed in exactly what he typed on 39:28 and I’m getting invalid project name error

  • @aishwarygawande6466
    @aishwarygawande6466 3 года назад +7

    Can't thank you much for taking this initiative and helping beginners like me to apply the SQL skills to real-world data. Waiting for the second one.
    You are awesome buddy.

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 года назад +1

      So glad to hear that! Thanks for watching! :D

  • @Coctvdt
    @Coctvdt Год назад +3

    11-Aug-2023
    Having problems with importing the data or csv files. Then, You should save your Excel file to a lower version is can be a 2003 version of excel. Once save the file you can now do the Data Transfer of SQL.
    THEN Hit NEXT.
    Instead of using SQL Server Native Client 11.0, It is now Microsoft OLE DB Provider for SQL Server. This took me a while to figure out and import.
    Hope this helps!

  • @krystlestevens2585
    @krystlestevens2585 Год назад +4

    Great video and channel Alex. When creating the view around the 70 minute mark, I had to add "USE PortfolioProject GO" before the "CREATE VIEW" to get the view to show in the object explorer. Hope it helps someone. So thankful for the channel!!!

    • @mariamaghamalyan1897
      @mariamaghamalyan1897 7 месяцев назад

      Thank you!

    • @StarWisperer
      @StarWisperer 7 месяцев назад

      thanks! couldn't understand what am I doing wrong, adding USE .. GO saved me

  • @sajanasigdel9821
    @sajanasigdel9821 Год назад +1

    Had a problem calculating TotalDeathPercent and took me 2 hours to figure out the solution. I looked through the comments and performed below steps ,
    1) uninstalled my Microsoft SQL server 2023 and installed Microsoft SQL server 2019.
    2) Imported Excel file rather than Flat text through Import Export Wizard as shown by Alex.
    3) Converted the ncharvar Total deaths and total cases to Float using Command ALTER TABLE CovidDeaths ALTER COLUMN total_Cases FLOAT.
    Worked like a Charm. Hope it helps :)

    • @mangolos
      @mangolos Год назад

      wow great, thank you very much

    • @bikramshrestha4233
      @bikramshrestha4233 Год назад

      Seems like you don't even have to re-install different versions of server. I just changed the data type of total_deaths and total_cases to FLOAT

  • @Joi_SS
    @Joi_SS 3 года назад +12

    I was just about to find some courses to continue SQL and Python study. Now I can follow this project! Thank you so much!!!

    • @alexvargas5907
      @alexvargas5907 2 года назад +2

      Forget the courses, projects gives you real useful knowledge!

  • @shinej11
    @shinej11 3 года назад +4

    Alex, you're awesome. Thank you very much for taking out time to make these videos. These are priceless. I was struggling to create a project. Thanks a lot again.

  • @HarshPatel6293
    @HarshPatel6293 3 года назад +5

    Invaluable content for aspiring data analysts! This is such a great channel. Thanks Alex!

  • @NB20079
    @NB20079 Год назад +1

    HI ALEX, I am from Philippines ,I appreciate you and your heart to helping us, you are my stepping stone in this career, Thank you again and God bless you!

    • @SieanElpidama
      @SieanElpidama 6 месяцев назад

      any update bro? im also from the philippines

  • @Major_Data
    @Major_Data 3 года назад +24

    "This guy's smart, I should hire him..."

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 года назад +2

      :D

    • @CE-vd2px
      @CE-vd2px 3 года назад

      @@AlexTheAnalyst Hey Alex, I didn't realize how expensive Excel is: Should I buy excel or make due with Google Sheets?

    • @mugdhaparbat9776
      @mugdhaparbat9776 3 года назад

      @@CE-vd2px if you’re a student, you can get office 365 free on Mac with your student email ID

    • @habdata1338
      @habdata1338 3 года назад +1

      @@CE-vd2px download wps office.Which is free and has most features of excel and cross platform compactible.