End to End Data Analytics Project (Python + SQL)

Поделиться
HTML-код
  • Опубликовано: 15 янв 2025

Комментарии • 277

  • @ankitbansal6
    @ankitbansal6  9 месяцев назад +23

    Please like the video as it takes lots of effort to record these videos.
    Checkout my high quality data analytics courses :
    www.namastesql.com/

    • @revathigangisetty9065
      @revathigangisetty9065 9 месяцев назад +1

      Hi Ankit, can you please kindly help the below requirement.
      create table #temp ( DepartmentId int, Name varchar(255),Hiredate date,Sal float,Ruleid int)
      Insert into #temp (DepartmentId,Name,Hiredate,Sal) values ( 10,'Sai','2021-10-23',5500)
      Insert into #temp (DepartmentId,Name,Hiredate,Sal) values ( 10,'Sairam','1999-10-23',6000)
      Insert into #temp (DepartmentId,Name,Hiredate,Sal) values ( 10,'Saikrishna','2002-10-23',3000)
      Insert into #temp (DepartmentId,Name,Hiredate,Sal) values ( 10,'Sair','2021-10-23',5000)
      Insert into #temp (DepartmentId,Name,Hiredate,Sal) values ( 10,'Raj','2015-10-23',8000)
      Insert into #temp (DepartmentId,Name,Hiredate,Sal) values ( 10,'SRK','2021-10-23',5000)
      Create table #Rules (Departmentid int,Ruleid int, Condition Varchar(2000))
      Insert into #Rules values ( 10,1,'Name like ''Sai%''')
      Insert into #Rules values ( 10,2,'Hiredate >= ''2000-01-01''')
      Insert into #Rules values ( 10,3,'sal >= 5000')
      output
      Departmentid Name Hiredate sal ruleid
      10 Sai 10/23/2021 5500 2,10,30
      10 Sairam 10/23/1999 6000 2,30
      10 Saikrishna 10/23/2002 3000 2,10
      10 Sair 10/23/2021 5000 2,10,30
      10 Raj 10/23/2015 8000 10,30
      10 SRK 10/23/2021 5000 10,30

    • @revathigangisetty9065
      @revathigangisetty9065 9 месяцев назад +1

      DepartmentId is used to join #temp and #Rules.
      check each condition dynamically from #Rules against the corresponding rows in #temp.
      If a condition is satisfied, concatenate the Ruleid value from #Rules to the existing Ruleid in #temp

    • @ankitbansal6
      @ankitbansal6  9 месяцев назад +2

      ​@@revathigangisetty9065 send the problem on sql.namaste@gmail.com

    • @revathigangisetty9065
      @revathigangisetty9065 9 месяцев назад

      @@ankitbansal6 i shared the details in the email ankit. its urgent requirement ankit please kindly help me. i will be eagerly waiting for your input.

    • @revathigangisetty5734
      @revathigangisetty5734 9 месяцев назад +1

      i shared the problem to you via email and tried multiple times but i didnot get the exact solution. please kindly help ankit

  • @HarshGupta-rn6db
    @HarshGupta-rn6db 9 месяцев назад +10

    You are the torchbearer for many Ankit, trust me!
    Cheers!

  • @ankushjain4128
    @ankushjain4128 9 месяцев назад +6

    Great video! Loved the easy explanation of the full ETL process and data analysis. Keep up the good work!

    • @HaroutBlack
      @HaroutBlack 8 месяцев назад

      Jesus is the only way to healing, restoration and salvation to all souls. Please turn to him and he will change your life, depression into delight, soul heading from hell to heaven all because of what he did on the cross
      “Whoever calls upon the name of the Lord shall be saved” Romans 10:13

  • @tenzinchoepheldev
    @tenzinchoepheldev 12 дней назад +1

    Very informative and helpful - one thing what I like about ur video is that u r very detailed oriented but at the same time u explains all to the point without bla bla . Thanks for the big tutorial 🎉🎉🎉

  • @jaybharat3949
    @jaybharat3949 7 месяцев назад +12

    First of all, thank you very much for the project. But there’re couple of concerns which you may have overlooked -
    1. In every query where we’re doing analysis based on sales, I guess it’s better to consider sale_price*quantity as sales rather than sale_price only
    2. In the last problem, profit should have been taken into consideration rather than sale_price
    But again I’m repeating that, above are some modification we can make to get the best business answers, otherwise in the video you’ve shown the approach how to handle an end to end project - from that pov it’s absolutely fine. We students can make necessary changes.

    • @ankitbansal6
      @ankitbansal6  7 месяцев назад +5

      Thanks for the feedback. Appreciate it 🙂

    • @IMRANAHMED-eh7yl
      @IMRANAHMED-eh7yl 6 месяцев назад +1

      I have noticed same thing but overall good Video for learning

    • @KennethDarby-o3j
      @KennethDarby-o3j 4 месяца назад +1

      I assumed the sale_price is the result of this calculation, which would make the way he does it correct.

    • @mugim1588
      @mugim1588 3 дня назад

      Indeed. Thanks for pointing out.

  • @randyawuri1664
    @randyawuri1664 8 месяцев назад +2

    You sir, are a wonderful teacher! I am currently learning Data Engineering. This video has enabled me to catch up and understand some core concenpts that I found challenging because I missed live classes. Thank you!

  • @xyz1234544
    @xyz1234544 3 дня назад

    Thanks a lot for this fantastic project tutorial! It was super informative and well-explained. I’d love to see more content like this-keep inspiring learners like me!

    • @ankitbansal6
      @ankitbansal6  2 дня назад

      I'm glad you found it helpful! More projects coming soon.

  • @danishsajan4114
    @danishsajan4114 Месяц назад

    Great help for someone just stepping into the data world! Thanks

  • @divyanshisharma4054
    @divyanshisharma4054 8 месяцев назад +1

    I thoroughly enjoyed this video and followed along with you. Thanks for this. Please keep posting more of such end to end analysis problems. Thanks a ton for taking the effort to make these videos so that we keep learning :)

  • @vivekkumar6546
    @vivekkumar6546 9 месяцев назад +1

    Ankit bhai no words.. new things learned today..❤

  • @piyushsharma8294
    @piyushsharma8294 9 месяцев назад +1

    Thank you bro for the best explanation i’ve ever seen about this topic.

  • @joydeepsarkar3092
    @joydeepsarkar3092 8 месяцев назад +1

    Very Helpful video Ankit! Thanks a lot for the efforts you have put for creating this guided project. Looking forward for many more guided projects :)

  • @paulperez2005
    @paulperez2005 Месяц назад

    I have been looking for projects to get experience from since leaving college. This was great and reminded me of my previous course work

  • @jeniyakulshreshtha7555
    @jeniyakulshreshtha7555 9 месяцев назад

    Today only i was seraching for data analytics ..and just saw your vedio ...thank you so much..pls post more such content on data analytics and any course on this as well please 😊

  • @nikhilreddy4582
    @nikhilreddy4582 9 месяцев назад +1

    Thankyou so much Ankit sir For this project👍

  • @anushar3361
    @anushar3361 3 месяца назад

    Great start for learning. Provided complete knowledge how the ETL works and improved the self confidence. Thank you so much for wonderful lecture about the project

  • @satishwalaki
    @satishwalaki 9 месяцев назад

    This was really helpful as a beginner me. thanks a lot & we need more n more videos like this.

  • @mrrathore55
    @mrrathore55 6 месяцев назад

    Thanks Ankit for this wonderful end to end course,I have gone through it and cleared many of the doubts.

  • @ShekoniTaofeek
    @ShekoniTaofeek 7 месяцев назад

    Thank you very much. Never believed I could really joy it as much as I did. I really really appreciate your effort

  • @macx8360
    @macx8360 9 месяцев назад +1

    thanks , i was waiting for this ....honestly

  • @gunaravi6224
    @gunaravi6224 7 месяцев назад

    Hello Ankit Sir, am a fresher and am looking forward to start my career in data analytics or data engineering field. This video is awesome and very helpful for me. Thank you so much for making the End ot End videos like this. And the way you explaining each and everything is very nice and clear. Keep it up Sir

  • @ravitejatavva7396
    @ravitejatavva7396 9 месяцев назад

    Wow, Appreciate your effort and work! 😍

  • @shreecrafts651
    @shreecrafts651 5 месяцев назад

    Thank you so much!! You are the motivation and this is the first step for me!! It means a lot thank you so much Ankit🙏🙏

  • @kadapalarakeshreddy910
    @kadapalarakeshreddy910 3 месяца назад

    The best video for beginners

  • @deviprasadgharpehai3315
    @deviprasadgharpehai3315 9 месяцев назад

    Thanks a lot bhai love nd support from berhampur❤

  • @koa1663
    @koa1663 7 месяцев назад +3

    how do you document your findings. please make a video on how you put everything together for a portfolio showcasing your skills in python and sql

  • @sakeenasammi1768
    @sakeenasammi1768 5 месяцев назад

    I liked the way you explained every query... Keep it up..

  • @sshibam
    @sshibam 2 месяца назад

    extremely insightful thank you sir

  • @reddysekhar459
    @reddysekhar459 9 месяцев назад

    Thank you so much for the video! great video👍

  • @ijazma54
    @ijazma54 23 дня назад

    Ankit this is brilliant.

  • @anoopdube9581
    @anoopdube9581 9 месяцев назад +1

    Liked your video, I am from Australia.

  • @JayeshTatipamul
    @JayeshTatipamul Месяц назад

    Ankit Sir, it is a really very helpful video !! Thank you so much !!

  • @aniket9595
    @aniket9595 8 месяцев назад

    Thanks a lot Ankit for creating such an interesting project

  • @bodybuildingmotivation5438
    @bodybuildingmotivation5438 9 месяцев назад

    really love this kind of content, please make more video like this.

  • @emadeconsultinginc
    @emadeconsultinginc 2 месяца назад

    Great work and thanks for sharing this

  • @mohitjoshi8984
    @mohitjoshi8984 9 месяцев назад +2

    Thank you ankit it's really helpfull.
    Can you make a practical video on A/B Testing plzz

  • @NalyticsEnterprises
    @NalyticsEnterprises Месяц назад

    Well done! The sales price column should be multiplied by the quantity since there is a discount on each product

    • @ankitbansal6
      @ankitbansal6  Месяц назад

      You are right, I should have mentioned that!

  • @subhajitdutta9983
    @subhajitdutta9983 5 месяцев назад

    Thanks for this awesome project.

  • @himanshshakk3083
    @himanshshakk3083 6 месяцев назад +15

    Bro I think you have done mistake in top 10 highest revenue generating products category you should use SUM(orders_quantity*selling_price) because sum of only selling price cannot produce the revenue it should be multipied by quantity

    • @KeepLearningAndGrowing19
      @KeepLearningAndGrowing19 6 месяцев назад +3

      While watching the video I also has thought the same. Could you please clarify it @ankitbansal6

    • @tenzinchoepheldev
      @tenzinchoepheldev 12 дней назад

      It is not clearly mentioned. That cost price is total price of item price * qty or just item price . Ur question is valid and need more clarification but if cost price is the total price for item * qty then Sir is right .

  • @gurutech7398
    @gurutech7398 9 месяцев назад

    It's really great bro❤

  • @shivakumar-ic9dh
    @shivakumar-ic9dh 6 месяцев назад

    Simple and good explanation😊

  • @adityavamsi12
    @adityavamsi12 5 месяцев назад

    Thank You so much for your content

  • @abolisabne4887
    @abolisabne4887 9 месяцев назад

    Thankyou so much for this Vedio Ankit.

  • @tarekmustafa2525
    @tarekmustafa2525 4 месяца назад

    Excellent tutorial, thank you!

  • @AsifDawood
    @AsifDawood 5 месяцев назад

    Great video. Thanks for sharing

  • @countryroads11
    @countryroads11 28 дней назад

    Great knowledge man

  • @bhanujuneja58
    @bhanujuneja58 6 месяцев назад +1

    Very nice video. Just a little doubt. In the last question we have to calculate subcategory with highest growth by profit but in the video highest growth by sales is calculated. So the one given in the video is right or not?

  • @iscixe
    @iscixe 6 месяцев назад

    Thank you very much. The explanation is very helpful.

  • @proud_indian0161
    @proud_indian0161 8 месяцев назад

    Very nice explanation. Thanks

  • @SunilKumar_67
    @SunilKumar_67 9 месяцев назад +3

    Can't add the ODBC Sql Server even after having the MySQL that was installed for working on your SQL Course. I am getting an error message saying TEST FAILED. Can you please share any link that can help us to get that server name present in the list of servers available to connect or better a follow up video showing how to do it on your system. Thanks in advance

  • @NigyAntony
    @NigyAntony 8 месяцев назад

    Oh my God Amazing project

  • @waqarbinjamil3577
    @waqarbinjamil3577 2 месяца назад +1

    I found doing analysis more easy by Python rather than SQL... Its because may be my conviction is more dominant over Python..

  • @vijaypratap8173
    @vijaypratap8173 9 месяцев назад

    Awesome😍😍😍😍

  • @arunkumarkesavan6077
    @arunkumarkesavan6077 6 месяцев назад

    Superb explanation !!!

  • @tenzinchoepheldev
    @tenzinchoepheldev 8 месяцев назад

    Thanks for the effort 🎉😂

  • @othmaneallouch326
    @othmaneallouch326 6 месяцев назад

    Great content! thank you so much for sharing !

    • @ankitbansal6
      @ankitbansal6  6 месяцев назад +1

      Glad you enjoyed it!

    • @othmaneallouch326
      @othmaneallouch326 6 месяцев назад

      @@ankitbansal6 thanks to you i'm discovering a whole new level of sql with your constant use of cte and complex analysis, I will surely watch every single video in your channel

  • @akashjha7277
    @akashjha7277 9 месяцев назад

    Great video❤

  • @chethuchethu6530
    @chethuchethu6530 6 месяцев назад

    great vedio

  • @-es2bf
    @-es2bf 3 месяца назад +1

    Hi Ankit,
    Thanks for your video. I work at a company where we load 70-80 files into our database each month. These files can be in formats like CSV, XLSX, or TXT etc..., and they all contain data. While each file is similar, there are always some differences. Currently, we use bulk insert in SQL Server to handle this, but things can go wrong, leading to some manual work. For example, one of the files might look different from the previous month.
    Do you think it would be better to use a mix of Python and SQL Server for this task, or is SQL Server sufficient? I feel that SQL Server's error messages can be poor, making it hard to troubleshoot why a table sometimes fails to load.
    Thanks!

  • @SidIndian082
    @SidIndian082 6 месяцев назад

    excelllent Sir

  • @AnushaGanipisetty-t5u
    @AnushaGanipisetty-t5u 4 месяца назад +1

    Hi, In my C drive i dont have .Kaggle folder so do we need to create new folder for .Kagglejson file to save

  • @shubhamtayde3797
    @shubhamtayde3797 6 месяцев назад +2

    cant't we direclty open csv file in python using read_csv no need of kaggle API

  • @yadavikasAnalyst
    @yadavikasAnalyst 8 месяцев назад +1

    Sir when you solved the first question why didnt you multiplied the quantity column with the sale price?? For total revenue by product I'd?

    • @jaybharat3949
      @jaybharat3949 7 месяцев назад

      Yes, I think likewise. Revenue should have been qty*sale_price

  • @sagarjadhav3564
    @sagarjadhav3564 5 месяцев назад +1

    Hi Ankit , What should I do If I don't see ODBC driver for SQL server in ODBC data source administrator ?

  • @hamzazahidulislam3490
    @hamzazahidulislam3490 7 месяцев назад

    Thank you so much

  • @rakeshkoli4384
    @rakeshkoli4384 8 месяцев назад

    thanks bro and kindly make video on ETL process

  • @amithvalentine1479
    @amithvalentine1479 8 месяцев назад

    hey Ankit, great content, thanks fro the video! How did you obtain the list the columns names with the datatypes and memory allocation to create the new empty table?

    • @ankitbansal6
      @ankitbansal6  8 месяцев назад

      You can right click on the table name in the browser and choose create to

  • @brishtisaha4785
    @brishtisaha4785 8 месяцев назад

    Sir please upload more SQL projects with datasets from kaggle.. Thank you for this..

  • @ActiveLifePicks
    @ActiveLifePicks 8 месяцев назад

    Hello Ankit, all these queries can be made while using pandas library, can you tell why your made sql server database and used sql for queries?

  • @Rohit-r1q1h
    @Rohit-r1q1h 5 месяцев назад

    Bhai tumne company ka experience karwa diya kaise kaam hota he ❤it man

  • @Rohitchauhan-kg7sp
    @Rohitchauhan-kg7sp 3 месяца назад

    great video

  • @PravinMane-el5fo
    @PravinMane-el5fo 8 месяцев назад

    Nice one

  • @JugesDebnath
    @JugesDebnath 21 день назад

    Hi, please review the code at 28:05, I'm a bit confused when you have done group by product_id. I can see multiple entries of the same product "sub_category " with different product_id; hence grouping by product_id will not reflect the actual revenue generation for each product. So, I have done the following:
    SELECT sub_category AS product,
    sum(sale_price) AS sales
    FROM df_orders
    GROUP BY sub_category
    ORDER BY sales DESC
    LIMIT 10;

  • @subodhthore6454
    @subodhthore6454 9 месяцев назад +1

    For first question, Don't we need to multiply sale_price with the quantity for revenue generated for each product?

    • @ankitbansal6
      @ankitbansal6  9 месяцев назад +1

      We can do that. I assumed it was total sales in the sale price.

    • @subodhthore6454
      @subodhthore6454 9 месяцев назад

      @@ankitbansal6 ok sir

    • @yousrasaadaoui1238
      @yousrasaadaoui1238 8 месяцев назад

      same question. my answer is
      SELECT top 10 product_id, SUM(sale_price * quantity) AS total_revenue
      FROM df_order
      GROUP BY product_id
      ORDER BY total_revenue DESC;

  • @PANDURANG99
    @PANDURANG99 7 месяцев назад

    We can use power bi

  • @navaneeth6414
    @navaneeth6414 9 месяцев назад

    Hi Ankit, Wonderful Explanation. Can I have those DDL statement in SqL Server. Not able to find in Github

  • @beaver499
    @beaver499 8 месяцев назад +1

    Hi Sir! I am not able to connect the SQL Server at 22:10. I have used the same syntax because the driver name is same still getting errors. Please help

    • @Hustler19
      @Hustler19 6 месяцев назад

      Bro, I'm facing the same problem! do you have resolve it?

    • @sambeetpanda6919
      @sambeetpanda6919 6 месяцев назад

      @@Hustler19 hey were u able to solve this,m having same issue as well

  • @sowmya6471
    @sowmya6471 9 месяцев назад

    V useful

  • @rizwan1396
    @rizwan1396 4 месяца назад

    Great work sir , sir i have one problem im using mysql and for third questionsy output showing me only 2 rows for same query why is that

  • @kuntalchowdhury5336
    @kuntalchowdhury5336 8 месяцев назад

    Excellent.

  • @rajvardhan465
    @rajvardhan465 8 месяцев назад

    Your a goat 🐐 in data filed sir

  • @074_umairkidwai8
    @074_umairkidwai8 6 месяцев назад +1

    Showing error in changing order date to datetime

  • @ayat5483
    @ayat5483 6 месяцев назад

    Brother, I am new to data science. I want to know which language will be good to learn data analysis? I have learned python, numpy and am currently learning panda.

  • @WiseCoder-rp2zn
    @WiseCoder-rp2zn 8 месяцев назад

    Good Video, Keep it up

  • @friendofnature1662
    @friendofnature1662 6 месяцев назад

    Hello and congratulations on the course! In the first part where I have to put the json file in the .kaggle file this is doesn't t seed to exist So what can I do?

  • @gameply347
    @gameply347 8 месяцев назад

    Sir will we have a PowerBI course like tableau ?

  • @kishorereddy3155
    @kishorereddy3155 5 месяцев назад

    Hi Ankit where can I can these type of queries to get hands on can you please tell

  • @kantipudisaisowmith9628
    @kantipudisaisowmith9628 8 месяцев назад

    Can you explain how to connect sql server on mac with jupytor

  • @vishnucruz4529
    @vishnucruz4529 Месяц назад

    How did you get the .kaggle folder ? Did you create one ?

  • @hritikapal683
    @hritikapal683 9 месяцев назад

    Can we expect something related to cloud as well?

    • @ankitbansal6
      @ankitbansal6  9 месяцев назад +1

      Here you go
      ruclips.net/video/52CWagk3-jw/видео.html

  • @saiteja-gb8ho
    @saiteja-gb8ho 9 месяцев назад

    Thank you so much Ankit bro

  • @Ranidalvi-n3o
    @Ranidalvi-n3o 5 месяцев назад

    please share the link for Microsoft Sql server management studio installation.

  • @abdallahfahmy-h6k
    @abdallahfahmy-h6k Месяц назад

    why did not you remove nulls ?

  • @MayankGadiya-uq1el
    @MayankGadiya-uq1el 8 месяцев назад

    "for each category which month had highest sales"
    above question has different output from your youtube video and sql queries that u have provided in sql file.

  • @abdallahfahmy-h6k
    @abdallahfahmy-h6k Месяц назад

    in last query : which sub category had highest growth by profit in 2023 compare to 2022 i think it is by price not profit?

  • @rahulagrawal622
    @rahulagrawal622 9 месяцев назад

    Hi Ankit
    Can you please also show us
    How to add triggers
    To update the data every month on database
    Thanks

  • @sulaimans4720
    @sulaimans4720 8 месяцев назад

    where can i find dataset ankit ?

  • @satyajeet7633
    @satyajeet7633 8 месяцев назад

    In the last question of sql
    Should we calculate growth by profit change or sale change because the question is for profit change
    Also in total revenue quantity of each order is not included

  • @NetTakeAways
    @NetTakeAways 3 месяца назад

    Hi,my table is not reflecting the changes in mysql tables which I have done in jupyter notebook. Even after saving and refreshing . Please help me out in this.

  • @randyawuri1664
    @randyawuri1664 8 месяцев назад

    Can this be done using VScode?