Secret To Optimizing SQL Queries - Understand The SQL Execution Order

Поделиться
HTML-код
  • Опубликовано: 23 ноя 2024

Комментарии • 180

  • @cumbi-mongo
    @cumbi-mongo Год назад +279

    Great video! One addition: The "EXPLAIN" command is an invaluable tool for optimizing SQL queries. It provides a detailed execution plan, allowing the developers to understand how the database engine processes a query. By analyzing the execution plan, you can address the performance bottlenecks with proper optimizations, e.g. proper indexes.

    • @Omar-ic3wc
      @Omar-ic3wc Год назад +5

      Thanks for sharing this.

    • @luis5d6b
      @luis5d6b Год назад +5

      Thanks a lot for the addition, really good :)

    • @ksm1847
      @ksm1847 Год назад +5

      @cmertayak - I second you. It's an awesome command I use many times at my work to optimise. My go to command to improve queries execution.

    • @zeelthumar
      @zeelthumar Год назад +3

      Thanks for sharing

    • @DavisTibbz
      @DavisTibbz 9 месяцев назад

      Oh yes, if you run EXPLAIN in some desktop client like Mysql Workbench, shows you detailed chart diagram of your Query, quite useful

  • @uzair004
    @uzair004 Год назад +70

    Opt for indexes with SELECT, WHERE, JOIN clauses.
    Use full column comparison to get data instead of half or computed comparison (i.e startsWith)
    Avoid ORDER_BY on large data retreval
    Use limit of smaller number with pagination for more data.

    • @sampri22
      @sampri22 7 месяцев назад +1

      Could you explain how? What if i need large data retrieved with order by. How would i use limit and pagination in this case? Thanks

    • @beebeewijaya1374
      @beebeewijaya1374 5 месяцев назад

      ​@@sampri22 for data processing and analytics? better way to do this is dump your database data and put into BigQuery or Hadoop, they have better resources for processing a large data

  • @JohnS-er7jh
    @JohnS-er7jh Год назад +27

    One of the best SQL videos I have come across, just the way it is put together and the infographics. If you are learning SQL, you really should understand the mechanics behind optimizing queries, how databases work. Just adding more hardware or VM resources will not fix the issue if your queries are not optimized properly.

    • @rembautimes8808
      @rembautimes8808 9 месяцев назад +2

      Very well presented, thanks for explaining SARGAble concept

  • @ayazahamed8254
    @ayazahamed8254 Год назад +12

    The way you explained with the animations are Awesome. Great Job. Very Well Explained.

  • @abhinav10x
    @abhinav10x 7 месяцев назад +3

    Very profound, please share more on SQL like windows and CTE, your explanation is very approachable.

  • @TheSocialDrone
    @TheSocialDrone 2 месяца назад +1

    Thank you, @bytebytego, for breaking down the topic with clear visuals and simple narration!

  • @Qith629
    @Qith629 2 месяца назад +1

    the way you help us visualize this is next level. Thank you!

  • @CyberMew
    @CyberMew Год назад +98

    Very good intro. Would like a more detailed explanation on more complex queries.

    • @adicide9070
      @adicide9070 Год назад +13

      they don't do detailed explanations. it's basically "use indexes". don't sort lots of data. well, thanks.

    • @davidlee588
      @davidlee588 Год назад +4

      @@jonbaird9718agreed, RUclips is made for juniors

  • @SalmanSayyad-q1h
    @SalmanSayyad-q1h Год назад +3

    bro this way of teaching is really really make sense. thanks a lot for these visuals.

  • @MrSuriyam
    @MrSuriyam 8 месяцев назад +2

    Hi Sir thank you 🙏 for taking the time to explain the SQL. Sorry Iam new and very helpful.

  • @JosephDSilva-i6j
    @JosephDSilva-i6j Год назад +2

    Additionally, for the optimizer to "make up" a reasonably good plan (from the various alternatives), it needs to know a bit about the data (value) distribution. This is where STATISTICS / ANALYZE (depends on the DB vendor) come handy. It helps the optimizer do estimates for the various steps (rows, size of data, etc.) of each plan, and figure out which of the different plans is the best candidate to execute. Therefore it is important to collect this information on critical columns (usually join, where clause columns). It is also important to keep this information regularly refreshed so that the optimizer does not make bad decisions based on stale statistics. Very bad things can happen with stale statistics.

  • @vivektarab1959
    @vivektarab1959 17 дней назад

    0:08 - Start
    1:25 - Using Index on Join Columns significantly improve the Join
    1:47 - Next Step is use of Where Clause
    2:13 - Lets
    3:14 - To write Sargable Queries
    4:18 - Optimising
    5:21 - Remember - Order of Optimization

  • @stpaquet
    @stpaquet Год назад +26

    Understanding how the DB engine works with indexes is key. you may assume that a WHERE purchase_date >= 2022 AND purchase > 100 would be the same if you have indexes on purchase_date and purchase, but it might be required to have a composite index... Order in the WHERE clause may also be important as it helps reducing the dataset before applying the second condition.

    • @MiningForPies
      @MiningForPies Год назад +3

      WHERE order has no effect on most sql systems. The only way you can force SQL to filter data first is to use a derived query.

  • @deni_.s
    @deni_.s Месяц назад

    That's the gap I needed to fill. Couldn't find this info wrapped in the right words and animations as here. Thanks a lot for the content! Hoping to find more relevant videos to expand my knowledge of SQL.
    Currently struggling with performance of my queries on big datasets (~6mil rows). Not clear how to avoid functions and computations during search and filtering in some cases. Biggest struggle so far

  • @gabrielb.962
    @gabrielb.962 Год назад +29

    Index usage tip: When using params in your query (e.g., select .... where year > ?), databases may not utilize an index if it is unbalanced. For instance, if you have approximately 1 million rows with year = 2022 and only 1000 rows with year = 2023, the database cannot predict whether the parameter will be useful for filtering. To resolve this issue, pass the value directly in the query itself, allowing the execution plan to determine if the index is suitable for the intended purpose.

    • @stpaquet
      @stpaquet Год назад +3

      As I wrote in my comment, good understanding on how you db engine works is key. And they are all different. So never assume that a good query on a MySQL will be a good query on Postgres, Oracle or any SQL engine.

    • @maf_aka
      @maf_aka Год назад +9

      this opens the gate for SQL injection, don't do this

    • @maksymbaranovskyi8362
      @maksymbaranovskyi8362 Год назад +1

      @@maf_aka I think the idea was not to use prepared statements *where you don't need them.* E.g. if you already have validation in place that ensures your received value is enum (number, null, etc.) - you can be sure no SQL injection is possible there - so no need to use prepared statements *there.*

    • @lethern2
      @lethern2 Год назад

      Ok, but then you get a different query plan for each (different parameter / set of parameters) query

    • @stpaquet
      @stpaquet Год назад

      @@lethern2 yep. but that's why you need to understand how your db engine works

  • @mobiledisco
    @mobiledisco Месяц назад

    Thank you. SELECT coming practically at the end of the process was a hard thing to get my head around, let alone remember, when I first began with SQL. Still is TBH. The fact it means something more like 'display' than 'go and get' (what we typically mean by 'select' in conversational English) was a hurdle too. Wish I'd come across this video back then.

  • @連文瑞-o5n
    @連文瑞-o5n 10 месяцев назад +2

    This is the best explanation I've ever seen. Big thumbs for you!

  • @karunakaranr2473
    @karunakaranr2473 7 месяцев назад

    Thank you for your time and effort to explain any of the subjects. Really like it and more over able to register the concept in mind easily. Thanks again,.

  • @lucyk7292
    @lucyk7292 Год назад +10

    Thank you for a fantastic visualization of the SQL queries execution order. That's exactly what I have been missing in the other materials. I really appreciate your style of teaching

  • @zackwong1000
    @zackwong1000 Год назад +10

    You should select from the orders table then join the customers since your where clause is a column in orders table! Your SQL is joining on unnecessary rows from orders & customers!

  • @RobinSingh-ms3zt
    @RobinSingh-ms3zt 2 месяца назад

    You are awesome at explaining any concept. Thank you so much

  • @naveenverma2951
    @naveenverma2951 7 месяцев назад

    oh my goodness, this is too good for non IT background jumping ship to see where AI will land. Thx. You are my 3blue1brown for IT

  • @AliBensoukehal
    @AliBensoukehal Год назад +4

    Simple and to the point explanation. Love it. Thanks 👍

  • @JeremyChone
    @JeremyChone Год назад +4

    wow, what an awesome introduction to SQL optimization.

  • @bonita799063
    @bonita799063 5 месяцев назад

    What a Video , Voice , Explanation , Graphics and etc...well done mate

  • @user-rw6iw8jg2t
    @user-rw6iw8jg2t Месяц назад

    The best one , interms of Optimization !

  • @moneycrab
    @moneycrab Год назад +4

    I heard it called "predicate pushdown" when you move a condition earlier in the plan

  • @avijeethati5323
    @avijeethati5323 Год назад +2

    Excellent video explaining basic concepts in very short time..❤
    Impressive graphic animation, could you please share how the execution plan animation was done

  • @niketu99
    @niketu99 Год назад +1

    Superb video! Simple explanation on query optimisation.

  • @KEsh_123_4
    @KEsh_123_4 14 дней назад

    Great explanation in a short video.
    If orderby comes after the select then it will work on the data already read from disk, right?

  • @TravisMcAuley-e1z
    @TravisMcAuley-e1z 9 месяцев назад

    Awesome visualization, I've been loving all the short videos on this channel!
    Clarifying Q. The execution order has SELECT happening after HAVING, so this should mean that the calculated column total_spent doesn't exist at the time the HAVING clause is evaluated?

  • @caseyspaulding
    @caseyspaulding Год назад +1

    Wow. To the point with knowledge I can use today. Thank you.

  • @developersharif
    @developersharif Год назад +1

    *Explanation level is so beautiful!*

  • @huywarrior
    @huywarrior Год назад

    Lord Buddha. I'm looking for an active data flow visualization that can shorten data query response times! A great video, it saved me today. Leaving with 1 subscription as a fan! 🔍⚡

  • @int1128
    @int1128 4 месяца назад

    LOL just had an interview and had exact copy of example he is showong and explaing on 😂😂😂
    Thanks on this video realy helped.

  • @zixuanzhao6043
    @zixuanzhao6043 7 месяцев назад

    This query actually does not need to join customers table since all the fields are present in the orders table already. (unless there are invalid / dirty customer_id data in the orders table and you want to filter them out)

  • @antonboiko788
    @antonboiko788 Год назад

    cool, didn't think it's possible to include all these concepts in 6 min video. One thing, it's great to watch it when you want to summarise already existing knowledge

  • @gliderguld
    @gliderguld Год назад

    Well explained. However I do miss 1) the generation of more query-plans and selection amongs them (cost estimations) and (as an element herein) 2) different table access tactics (sequential scan, index access or index only).

  • @blackisblack22
    @blackisblack22 Год назад +2

    I have always thought that the Sql structure is poorly designed by not starting from FROM and placing the reference at the end of the statement, for example in a SELECT it should go just before ORDER BY, in an UPDATE the SET after WHERE, etc. Somehow they wanted to remedy the problem by introducing the WITH clause but I'm sure many regret that whoever designed the language should have worked a little harder at the time.

  • @twistedace6365
    @twistedace6365 Год назад

    thanks, helped clear up some issues I had.

  • @DerBarde2012
    @DerBarde2012 9 месяцев назад

    Your presentation is so pleasant to watch, is it manually key-framed in the video editor or are there tools to do that naturally?

  • @NiamorH
    @NiamorH Год назад +4

    Nice bird's-eye view introduction.
    It is not clear how to 'use appropriate indexes' to optimize for sorting, and how to implement pagination. Especially in your example where the sort order is made on an aggregate.

  • @AbhishekVaid
    @AbhishekVaid Год назад

    Something doesn't add well here. If you notice HAVING clause refers to 'total_spent' which is defined in SELECT, so dependency wise HAVING should be after SELECT and not before it.

  • @mer2550
    @mer2550 Год назад

    good things to practice for the interview. Thanks

  • @chobemaloso
    @chobemaloso 5 месяцев назад

    Thank you for sharing your knowledge

  • @fishinawaterbottle
    @fishinawaterbottle Год назад +7

    I feel like this is a bit misleading because sometimes where and select influence the first stage. As you said, when there’s a covering index, the database won’t read the entire table. So the select and where influence what is read from the source.
    Order and limit can also come it at the source as well if the index can be used with the order. You refer to this when you talk about “sorting the whole table”.
    CTEs and sub queries are not mentioned but that’s okay i guess.

  • @souhaildahmeni9961
    @souhaildahmeni9961 2 месяца назад

    Thanks for this kind of explanation

  • @prasasthaslife6270
    @prasasthaslife6270 6 месяцев назад

    Great video!! Very helpful! Thanku sir!

  • @MPIPolitics
    @MPIPolitics Год назад +1

    my app didnt reached 40 queries per second yet but i will implement that just in case my app will be next amazon :D

  • @jagmanderbazzad8425
    @jagmanderbazzad8425 Год назад

    Nice and simple explanation.Thanks

  • @sahandjavid8755
    @sahandjavid8755 10 месяцев назад

    Question: at the end of the video you mentioned do not sort the whole data and use pagination for optimizing ORDER BY and LIMIT. Those are the things I use for pagination! What do you mean by that?
    The other thing is from your video LIMIT happens after ORDER BY. How come it can help when ORDER BY has already happened?!
    Btw great videos and content, thank you for these

  • @nixjavi7220
    @nixjavi7220 8 месяцев назад

    these videos are amazing!!!! thanks!!!

  • @gokulkumarprasoon5158
    @gokulkumarprasoon5158 9 месяцев назад

    00:45 Understanding SQL query execution and optimization techniques
    01:30 Understanding SQL execution plans can optimize queries for better performance
    02:15 Optimizing SQL queries through index usage
    03:00 Writing soluble queries is essential for optimizing database performance.
    03:45 Sargable queries improve query performance.
    04:30 Understanding the SQL execution order is crucial for query optimization
    05:15 Optimizing SQL Queries with Indexes
    05:57 Understanding SQL execution order is key
    Crafted by Merlin AI.

  • @yashwanthbedre8220
    @yashwanthbedre8220 Год назад

    Best explanation ever

  • @MarredNDisenchanted
    @MarredNDisenchanted Год назад +3

    Having uses total_spent from the SELECT, so how come HAVING is executed before the SELECT?

    • @urgorka
      @urgorka 9 месяцев назад

      I'd say so too. This is error. First SELECT part is evaluated, then - HAVING part.

  • @Moali11
    @Moali11 Год назад

    Can you make a video explaining the difference between system design and software architecture?

  • @darrenklein6090
    @darrenklein6090 7 месяцев назад

    Thank you, this was really helpful.

  • @tallalmoshrif6643
    @tallalmoshrif6643 Год назад

    Great video, very informative and well explained bravo!

  • @nguyentanphuc4101
    @nguyentanphuc4101 2 дня назад

    Avoid using non-saragable condition (func or calc) on index column
    If has to use func on column, write a computed column or function-based index first.

  • @vasiliynet3425
    @vasiliynet3425 Год назад +1

    Awesome as usual! Thanks a lot!

  • @anirudh7463
    @anirudh7463 Год назад +3

    This stuff is gold. Thank you for making this available for free. Really appreciate it!

  • @GabrielGasp
    @GabrielGasp Год назад

    I always thought that the SELECT happened before HAVING, considering that we can use SELECT aliases in the HAVING filter.

  • @robbybankston4238
    @robbybankston4238 Год назад

    Love your channel. Your videos are great.

  • @mahmudulmohtasim7612
    @mahmudulmohtasim7612 Год назад +2

    In this example the 'total_spent' alias is already in use in the HAVING clause without defining. How is that possible?

    • @tianyuhu349
      @tianyuhu349 Год назад

      yes, I have the same question, it doesnt make sense...

  • @HariGajan-yl1en
    @HariGajan-yl1en 4 месяца назад

    Good insight Thanks!

  • @Piyush_Kumbhare
    @Piyush_Kumbhare 5 месяцев назад

    why don't you make a tutorial on SQL. I would like to watch it and I think it'll help a lot of people. By the way thank you very much for this amazing explanation.

  • @mikeshin77
    @mikeshin77 Год назад

    Very good video. It is really helpful.

  • @seraphfull
    @seraphfull Год назад

    thank you for your video,
    i working on IT with 10 years experience, but I never know the order between JOIN and WHERE,
    utill I watch this video

  • @sengs.4838
    @sengs.4838 Год назад

    So good explanations

  • @ThinhLe-eh9re
    @ThinhLe-eh9re Год назад

    Thanks for your sharing Bro's.

  • @99aabbccddeeff
    @99aabbccddeeff Год назад

    Excellent explanation, thanks!

  • @vi-2932
    @vi-2932 Год назад

    Thanks. Good to know! Useful!

  • @mohan1958
    @mohan1958 8 месяцев назад

    Great. Thanks for sharing..

  • @martingrof1685
    @martingrof1685 Год назад

    Amazing. Thank you!

  • @UrielVS
    @UrielVS Месяц назад

    Think the key component missed here is that you are utilizing a SELECT aggregate within the HAVING statement. To me that looks like SELECT has to come before HAVING, would it not?

  • @SudharshaunMugundan
    @SudharshaunMugundan Год назад

    Fantastic explanation.

  • @jamesayento1321
    @jamesayento1321 Год назад

    Thanks for this! Will there be a transcription soon?

  • @enockoloo3814
    @enockoloo3814 Месяц назад

    really cool - thanks.

  • @hyperborean72
    @hyperborean72 Месяц назад

    your write in predicate in one case '2023-01-01' and in another '01-01-2008' - which format is correct?

  • @sriteja2510
    @sriteja2510 Год назад

    Hi The actual plan should be derived from the explain and explain analyze right instead from the query?

  • @mwaikul
    @mwaikul Год назад +1

    What tool do you use to generate your animations?

  • @jerichaux9219
    @jerichaux9219 7 месяцев назад

    You guys are awesome!

  • @bilo1897
    @bilo1897 6 месяцев назад

    Great video!

  • @theawesomeharris
    @theawesomeharris Год назад

    hi, can you enable captions/subtitle for this video? thank you!

  • @thati2792
    @thati2792 4 месяца назад

    join before where?? not always!

  • @MuhammadFahreza
    @MuhammadFahreza Год назад

    I still don't understand the difference between first point noted on here 3:19 and second point noted on 3:23. Would you mind to re-explain it ? thank you!

  • @ronenTheBarbarian
    @ronenTheBarbarian Год назад

    Would building a cte table and then running a non-sargable query on it, should also be avoided?

  • @bhooshan25
    @bhooshan25 Год назад

    good explaination

  • @helal.ismail
    @helal.ismail Год назад

    Very simple and to the point, love the visualization too

  • @antonanton22743
    @antonanton22743 Год назад

    thanks a lot for your content

  • @bigbrother1211
    @bigbrother1211 Год назад

    Thank you so much!

  • @RZing
    @RZing 17 дней назад

    Thanks for the video but there is one mistake you've make, in HAVING clause you can't use the alias 'total_spend' which was defined in SELECT statement, becuase SELECT was executed after HAVING. THE CORRECT WAY IS : HAVING SUM(order_amount)>=1000

  • @apexyu164
    @apexyu164 Год назад

    so in the above example, which place we should index ?

  • @pinusong
    @pinusong Год назад

    Why are there no subtitles? I need subtitles. Thank you very much!

  • @sevsxes
    @sevsxes Год назад

    Will it be even faster if we always order where first and join after?

  • @Matthew8473
    @Matthew8473 9 месяцев назад

    This is top-notch in every aspect. I read a book with similar content, and it was top-notch. "Better Sleep Better Life" by William Brook

  • @VivekRaj-i3j
    @VivekRaj-i3j 4 месяца назад

    Won't it throw an error near Having total_spent as total_spent is an alias used in select clause and according to order of execution having will be executed before select and total_spent won't be recognised

  • @some1rational
    @some1rational Год назад +1

    Is that a typo in the first select clause, total spent should be total_spent?

    • @tianyuhu349
      @tianyuhu349 Год назад +1

      yes, i think so, and I have another question, 'Having' uses total_spent from the SELECT, so how come HAVING is executed before the SELECT? Doesnt make sense...

  • @shmuelsigler2918
    @shmuelsigler2918 Год назад

    מדהים!

  • @mePrafull
    @mePrafull Месяц назад

    Thanks!