Lecture 3.5 - Describing Numerical Data - Percentiles, Quartiles, and Interquartile range

Поделиться
HTML-код
  • Опубликовано: 30 сен 2024

Комментарии • 109

  • @dhruvchawla1752
    @dhruvchawla1752 3 года назад +318

    Explanation: Since many people were having trouble understanding the lecture (took a while for me as well)
    I have divided into subsections so you can read only the parts you didn't understand.
    ***100p Percentile***
    Percentile basically tells us how many values are more than or less than a given value in a dataset. So, if I say that you got 90 percentile in your class test, I mean that you scored more than or equal to 90 percent people in your class.
    In this case, p is 0.9 and (1-p) is 0.1, which means 100p is 90 and 100(1-p) is 10.
    So, the definition makes sense, you got more (or equal) marks than 100p% of your class and less (or equal) than 100(1-p)% of your class.
    ***How to calculate this
    ***
    Now suppose your teacher asks you who scored 75 percentile in your class of 40 students, i.e. which student in your class has more or equal marks as 75% of your class.
    1)First thing you will do is to arrange your data in ascending order.
    2)Second, you know that 100p is 75, which means p is 0.75 and since your class has 40 students, n is 40. So, now you get np to be equal to 0.75*40 = 30.
    3)Third, according to the rules if np is not an integer you find the smallest integer greater than n and the value at that position in the data will be your answer. But, since np is an integer (30), you will take the average of the values at the positions np and np+1 (30 and 31) and that will be your answer.
    ***Google Sheets Method
    ***
    In the video, they chose a dataset of 10 values (n = 10) and then they found the values of 10, 25, 30, 40, 50, 60, 75, 80, 90 and 100 percentiles of the data.
    Now, for these values, to find p, we divide all of them by 100, so the values of p are 0.1, 0.25, 0.3, 0.4, 0.5, 0.6, 0.75, 0.8, 0.9 and 1.
    I will only take the example of one of them to explain. Let's say you have to calculate the 25th percentile. The following is the explanation of the process and not the logic behind .
    1) First, you arrange the data in ascending order and number them 1, 2, 3....10.
    Dataset: [35, 38, 47, 58, 61, 66, 68, 68, 70, 79]
    1 2 3 4 5 6 7 8 9 10
    2) Next, you need to find the rank of 25th percentile, you will do this using the formula rank = p * (n -1) + 1. So, rank = (0.25 x 9) + 1 = 3.25.
    3) You then split 3.25 into its integer part (3) and its fractional part (0.25).
    4) You set the value of variable i = 3 (the integer part). x[i] represents the value at the position i in the dataset, so that will be the value at position 3 i.e. 47 and so x[1+1] will be the value at position 4 i.e. 58.
    5) Now, you substitute the values in the formula: Percentile = x[i] + fractional part * ( x[i + 1] - x[i])
    So, your answer for the value of 30th percentile will be: Percentile = 47 + 0.25 * (58-47) = 47 + 2.75 = 49.75
    The rest of the video (from 17:30) is easy to understand.

  • @shreyamehray4553
    @shreyamehray4553 2 года назад +58

    I love this teacher but I just cannot understand this lecture?????

  • @HimanshuYadav-qy5sm
    @HimanshuYadav-qy5sm 3 года назад +29

    I want to meet the people personally who liked the video

    • @subhanarayan1195
      @subhanarayan1195 3 года назад +7

      True.. One of the most confusing video..

    • @ShaswatManojJha
      @ShaswatManojJha 2 года назад +11

      🤣🤣 Exactly. They are teaching like a revision crash course.
      While the qualifier was allowing everyone in, i don't think people will survive degree level.

    • @raj_patel
      @raj_patel 2 года назад +2

      +1

  • @krishnamsettijayakrishnava3010
    @krishnamsettijayakrishnava3010 3 года назад +28

    I didn't understand percentile clearly. Please give some more examples to such topics

  • @MuskanKumari-im7jz
    @MuskanKumari-im7jz 28 дней назад +1

    Took me whole day to understand this lecture....thanks to dhruv chawla who explained this concept in comment section.....i also want to contribute to help so here are the explanations
    1. What is a Percentile?
    A percentile is a measure that tells you how a particular score compares to the rest of the group. Specifically, it indicates the percentage of scores in the dataset that are below (or sometimes equal to) a specific value.
    Example: If you are in the 90th percentile in your class, it means you scored higher than 90% of your classmates.
    2. Understanding p in Percentile Calculations
    In percentile calculations:
    p represents the percentile rank expressed as a decimal.
    For example:
    90th percentile → p = 0.90
    75th percentile → p = 0.75
    50th percentile (median) → p = 0.50
    100p converts the decimal back to a percentage.
    For p = 0.75, 100p = 75%
    100(1 - p) represents the remaining percentage.
    For p = 0.75, 100(1 - p) = 25%
    In summary:
    p is the percentile as a decimal.
    100p% is the percentile in percentage terms.
    3. How to Calculate a Percentile Rank
    Let’s walk through an example to understand how to calculate a percentile rank.
    Example Scenario:
    Dataset: Class scores of 40 students.
    Task: Find the 75th percentile score.
    Step-by-Step Guide:
    Step 1: Arrange the Data in Ascending Order
    First, list all 40 students' scores from lowest to highest.
    Step 2: Determine p and Calculate np
    For the 75th percentile: p = 0.75
    n is the total number of data points (students).
    n = 40
    Calculate np:
    np=0.75×40=30
    Step 3: Determine the Percentile Position
    If np is not an integer, round it up to the next whole number. The score at this position is the percentile.
    If np is an integer, take the average of the scores at positions np and np + 1.
    example:
    np = 30, which is an integer.
    Therefore, the 75th percentile is the average of the scores at positions 30 and 31.
    Step 4: Find the 75th Percentile Score
    Locate the 30th and 31st scores in your ordered list.
    Score at position 30: Let's say it's 85
    Score at position 31: Let's say it's 87
    Calculate the average:
    75 percentile =85+87/2=86
    Interpretation:
    A score of 86 is the 75th percentile.
    This means you scored higher than or equal to 75% of your class.

  • @shubhambiswas7904
    @shubhambiswas7904 2 года назад +12

    Numerical Summaries --> Percentiles
    ==============================================
    0:15 - Percentiles
    2:55 - Computing Percentile
    6:27 - Example
    8:13 - Computing percentile using google sheet
    17:24 - Quartiles
    19:31 - The Five Number Summary
    20:05 - The Interquartile Rang (IQR)
    20:55 - Section Summary
    22:10 - Module Summary
    (Added these time stamps for revision)

  • @sakshi5535
    @sakshi5535 2 года назад +7

    I think reading the notes of this lecture would be more helpful than learning from her lectures.... sorry but i didn't get anything u taught 👍🙏

    • @jainamkhadalia
      @jainamkhadalia Год назад +1

      where are the notes
      if you have please send
      or share link here

    • @awierdo69
      @awierdo69 11 месяцев назад

      Yeah damn right, this lecture basically is about her reading the slides out blindly.

  • @hameeyasmaheenpc2790
    @hameeyasmaheenpc2790 3 года назад +10

    10:28 The Google Sheets algorithms gives a different percentile value (37.5) compared to the value we calculated initially. It raises the question: shouldn't ultimately the percentile value be the same irrespective of the algorithm used?
    Well turns out there are 2 types of percentile values: Interpolated or Nearest Rank.
    So by my guess, the first algorithm employs one of these 2 methods and Google Sheets algorithm employs the other.

  • @teeyeskay1006
    @teeyeskay1006 3 месяца назад +4

    after seeing the comments ,I'm glad that I'm not the only one who is confused 🙂

  • @arushibhagat4810
    @arushibhagat4810 3 года назад +12

    Didn't understand anything, sorry.

    • @dhruvchawla1752
      @dhruvchawla1752 3 года назад +7

      I have divided into subsections so you can read only the parts you didn't understand.
      ***100p Percentile***
      Percentile basically tells us how many values are more than or less than a given value in a dataset. So, if I say that you got 90 percentile in your class test, I mean that you scored more than or equal to 90 percent people in your class.
      In this case, p is 0.9 and (1-p) is 0.1, which means 100p is 90 and 100(1-p) is 10.
      So, the definition makes sense, you got more (or equal) marks than 100p% of your class and less (or equal) than 100(1-p)% of your class.
      ***How to calculate this
      ***
      Now suppose your teacher asks you who scored 75 percentile in your class of 40 students, i.e. which student in your class has more or equal marks as 75% of your class.
      1)First thing you will do is to arrange your data in ascending order.
      2)Second, you know that 100p is 75, which means p is 0.75 and since your class has 40 students, n is 40. So, now you get np to be equal to 0.75*40 = 30.
      3)Third, according to the rules if np is not an integer you find the smallest integer greater than n and the value at that position in the data will be your answer. But, since np is an integer (30), you will take the average of the values at the positions np and np+1 (30 and 31) and that will be your answer.
      ***Google Sheets Method
      ***
      In the video, they chose a dataset of 10 values (n = 10) and then they found the values of 10, 25, 30, 40, 50, 60, 75, 80, 90 and 100 percentiles of the data.
      Now, for these values, to find p, we divide all of them by 100, so the values of p are 0.1, 0.25, 0.3, 0.4, 0.5, 0.6, 0.75, 0.8, 0.9 and 1.
      I will only take the example of one of them to explain. Let's say you have to calculate the 25th percentile. The following is the explanation of the process and not the logic behind .
      1) First, you arrange the data in ascending order and number them 1, 2, 3....10.
      Dataset: [35, 38, 47, 58, 61, 66, 68, 68, 70, 79]
      1 2 3 4 5 6 7 8 9 10
      2) Next, you need to find the rank of 25th percentile, you will do this using the formula rank = p * (n -1) + 1. So, rank = (0.25 x 9) + 1 = 3.25.
      3) You then split 3.25 into its integer part (3) and its fractional part (0.25).
      4) You set the value of variable i = 3 (the integer part). x[i] represents the value at the position i in the dataset, so that will be the value at position 3 i.e. 47 and so x[1+1] will be the value at position 4 i.e. 58.
      5) Now, you substitute the values in the formula: Percentile = x[i] + fractional part * ( x[i + 1] - x[i])
      So, your answer for the value of 30th percentile will be: Percentile = 47 + 0.25 * (58-47) = 47 + 2.75 = 49.75
      The rest of the video (from 17:30) is easy to understand.

    • @mehulpratapsinghyadav85
      @mehulpratapsinghyadav85 2 года назад

      Dhruv Chawla great explanation 💐

    • @komaldhumal9353
      @komaldhumal9353 Год назад

      Please send link video

  • @Werner.Heisenberg
    @Werner.Heisenberg 3 месяца назад +1

    If you can not understand the lecture read the top comment and then watch the lecture

  • @facts_funday
    @facts_funday 2 года назад +8

    She is a AMEZING teacher but i think she have to rush something 😅❤️

  • @gaurav-no1qh
    @gaurav-no1qh 2 месяца назад

    If anybody have problems to understand these teacher's lacture i have solution you must divided topic in all lectures and visualize it
    Concepts are very easy but the brain not understand easy concepts so, thank you🙏🙏 all the best

  • @sicksib.
    @sicksib. 3 месяца назад

    No introduction, no explanation, just stating the formulas. Nthg special as always , rely from free youtube videos

  • @yourturn_00
    @yourturn_00 11 месяцев назад +1

    I personally feel that it's confusing to introduce terms like 100p, at least here. Maybe they have some relevance or usefulness later but it would be just so much better if we can keep it more intuitive wherever possible.
    They could have just said that percentile rank = (number of values below x / N ) * 100.
    x is the value we want to find the percentile rank of.
    When we want to find the value which exists at a certain percentile rank, we use: value = percentile*(n+1)/100

  • @mercyyy8385
    @mercyyy8385 Год назад +2

    Yhii sbb agrr hindiii mee smjhaa diyaa hota tohh in sbb lectures ko smjhne me itna time nhi lagta. 😅

  • @akashpandey8253
    @akashpandey8253 3 года назад +7

    Kuch samjh nii aaya

    • @dhruvchawla1752
      @dhruvchawla1752 3 года назад +10

      I have divided into subsections so you can read only the parts you didn't understand.
      ***100p Percentile***
      Percentile basically tells us how many values are more than or less than a given value in a dataset. So, if I say that you got 90 percentile in your class test, I mean that you scored more than or equal to 90 percent people in your class.
      In this case, p is 0.9 and (1-p) is 0.1, which means 100p is 90 and 100(1-p) is 10.
      So, the definition makes sense, you got more (or equal) marks than 100p% of your class and less (or equal) than 100(1-p)% of your class.
      ***How to calculate this
      ***
      Now suppose your teacher asks you who scored 75 percentile in your class of 40 students, i.e. which student in your class has more or equal marks as 75% of your class.
      1)First thing you will do is to arrange your data in ascending order.
      2)Second, you know that 100p is 75, which means p is 0.75 and since your class has 40 students, n is 40. So, now you get np to be equal to 0.75*40 = 30.
      3)Third, according to the rules if np is not an integer you find the smallest integer greater than n and the value at that position in the data will be your answer. But, since np is an integer (30), you will take the average of the values at the positions np and np+1 (30 and 31) and that will be your answer.
      ***Google Sheets Method
      ***
      In the video, they chose a dataset of 10 values (n = 10) and then they found the values of 10, 25, 30, 40, 50, 60, 75, 80, 90 and 100 percentiles of the data.
      Now, for these values, to find p, we divide all of them by 100, so the values of p are 0.1, 0.25, 0.3, 0.4, 0.5, 0.6, 0.75, 0.8, 0.9 and 1.
      I will only take the example of one of them to explain. Let's say you have to calculate the 25th percentile. The following is the explanation of the process and not the logic behind .
      1) First, you arrange the data in ascending order and number them 1, 2, 3....10.
      Dataset: [35, 38, 47, 58, 61, 66, 68, 68, 70, 79]
      1 2 3 4 5 6 7 8 9 10
      2) Next, you need to find the rank of 25th percentile, you will do this using the formula rank = p * (n -1) + 1. So, rank = (0.25 x 9) + 1 = 3.25.
      3) You then split 3.25 into its integer part (3) and its fractional part (0.25).
      4) You set the value of variable i = 3 (the integer part). x[i] represents the value at the position i in the dataset, so that will be the value at position 3 i.e. 47 and so x[1+1] will be the value at position 4 i.e. 58.
      5) Now, you substitute the values in the formula: Percentile = x[i] + fractional part * ( x[i + 1] - x[i])
      So, your answer for the value of 30th percentile will be: Percentile = 47 + 0.25 * (58-47) = 47 + 2.75 = 49.75
      The rest of the video (from 17:30) is easy to understand.

    • @mationman6822
      @mationman6822 2 года назад

      @@dhruvchawla1752 thanks for your efforts

    • @raj_patel
      @raj_patel 2 года назад

      @@dhruvchawla1752 Thanks a lot

    • @komaldhumal9353
      @komaldhumal9353 Год назад

      ​@@mationman6822 konsa chanel he please send

  • @Dexter4o4
    @Dexter4o4 2 года назад +5

    Simple formula for finding percentile value is
    =Percentile *(n+1)/100 where n= no. of terms

  • @narangtamang5558
    @narangtamang5558 Год назад +1

    Please remake this video with more better explanation

  • @heyanaskhan
    @heyanaskhan 11 месяцев назад

    Didn't understand anything 🙂 !
    Mam is just focused on google sheets ..

  • @magithaganesamoorthi5468
    @magithaganesamoorthi5468 7 месяцев назад

    So in the exam i should use which algorithm to calculate percentile the first or the second?

  • @AyeshaZubia-yb1rz
    @AyeshaZubia-yb1rz 2 месяца назад

    8:00

  • @vishnupv6867
    @vishnupv6867 3 года назад +1

    np=10 =>not (70+79)/2, but 79 why?

  • @sumantkumar7661
    @sumantkumar7661 3 года назад +9

    This lecture was lengthy

    • @avinash5411
      @avinash5411 3 года назад +5

      I dont think its lengthy

    • @TheCricketRadio
      @TheCricketRadio 3 года назад +4

      Can't clearly understand what is the purpose of calculating measures of dispersion, especially the variance part

    • @innovative-minds
      @innovative-minds 3 года назад

      Are median and mode also sensitive to outliers?

    • @dhruvchawla1752
      @dhruvchawla1752 3 года назад +1

      @@TheCricketRadio The reason we calculate measures of dispersion is to be able to see how well the average represents the data.
      Imagine you have two datasets
      A = [4, 4, 5, 5]
      and
      B = [1, 2, 7, 8]
      The mean of both A and B is 4.5, but 4.5 represents A better than 4.5 represents B. This is because the values in B are more far apart, you can calculate yourself and verify that B will have higher variance i.e. the values in B are more spread out.
      This is why we use measures of dispersion.

    • @dhruvchawla1752
      @dhruvchawla1752 3 года назад +2

      @@innovative-minds No, median and mode are not sensitive to outliers.
      Outliers are values in the data which are on the extremes.
      Median is a measure of central tendency so the values on the extremes do not have an effect on it.
      Mode is the value that has the highest frequency in the data. This is also unaffected by outliers.

  • @fazlunnisha5768
    @fazlunnisha5768 11 месяцев назад

    What if theres no fractional part?

  • @quantumgamers69
    @quantumgamers69 Год назад

    Kaun sa Sahi hoga Computer wali ki Solve karne pe

  • @mohammedfuzailsirkazi6518
    @mohammedfuzailsirkazi6518 10 месяцев назад

    6:35 Why is np not 10x35=350

  • @AmitAnjunand
    @AmitAnjunand 3 месяца назад

    What is "P" ??

  • @AmitAnjunand
    @AmitAnjunand 3 месяца назад

    What is "P" ??

  • @kamleshsolanki230
    @kamleshsolanki230 Год назад

    Aap sab ka kya kahana hai about this writting

  • @SunnySingh-jv5oj
    @SunnySingh-jv5oj 3 года назад +5

    Kuch samjh nii aaya

    • @dhruvchawla1752
      @dhruvchawla1752 3 года назад +8

      I have divided into subsections so you can read only the parts you didn't understand.
      ***100p Percentile***
      Percentile basically tells us how many values are more than or less than a given value in a dataset. So, if I say that you got 90 percentile in your class test, I mean that you scored more than or equal to 90 percent people in your class.
      In this case, p is 0.9 and (1-p) is 0.1, which means 100p is 90 and 100(1-p) is 10.
      So, the definition makes sense, you got more (or equal) marks than 100p% of your class and less (or equal) than 100(1-p)% of your class.
      ***How to calculate this
      ***
      Now suppose your teacher asks you who scored 75 percentile in your class of 40 students, i.e. which student in your class has more or equal marks as 75% of your class.
      1)First thing you will do is to arrange your data in ascending order.
      2)Second, you know that 100p is 75, which means p is 0.75 and since your class has 40 students, n is 40. So, now you get np to be equal to 0.75*40 = 30.
      3)Third, according to the rules if np is not an integer you find the smallest integer greater than n and the value at that position in the data will be your answer. But, since np is an integer (30), you will take the average of the values at the positions np and np+1 (30 and 31) and that will be your answer.
      ***Google Sheets Method
      ***
      In the video, they chose a dataset of 10 values (n = 10) and then they found the values of 10, 25, 30, 40, 50, 60, 75, 80, 90 and 100 percentiles of the data.
      Now, for these values, to find p, we divide all of them by 100, so the values of p are 0.1, 0.25, 0.3, 0.4, 0.5, 0.6, 0.75, 0.8, 0.9 and 1.
      I will only take the example of one of them to explain. Let's say you have to calculate the 25th percentile. The following is the explanation of the process and not the logic behind .
      1) First, you arrange the data in ascending order and number them 1, 2, 3....10.
      Dataset: [35, 38, 47, 58, 61, 66, 68, 68, 70, 79]
      1 2 3 4 5 6 7 8 9 10
      2) Next, you need to find the rank of 25th percentile, you will do this using the formula rank = p * (n -1) + 1. So, rank = (0.25 x 9) + 1 = 3.25.
      3) You then split 3.25 into its integer part (3) and its fractional part (0.25).
      4) You set the value of variable i = 3 (the integer part). x[i] represents the value at the position i in the dataset, so that will be the value at position 3 i.e. 47 and so x[1+1] will be the value at position 4 i.e. 58.
      5) Now, you substitute the values in the formula: Percentile = x[i] + fractional part * ( x[i + 1] - x[i])
      So, your answer for the value of 30th percentile will be: Percentile = 47 + 0.25 * (58-47) = 47 + 2.75 = 49.75
      The rest of the video (from 17:30) is easy to understand.

    • @bsc-itai-01arunsamrat80
      @bsc-itai-01arunsamrat80 3 года назад

      @@dhruvchawla1752 bro the value of p that are we taking is up to us we can take any value

    • @komaldhumal9353
      @komaldhumal9353 Год назад

      ​@@bsc-itai-01arunsamrat80 video link send karo na please

  • @RoyAyush13
    @RoyAyush13 2 года назад

    what is small p

  • @kamleshsolanki230
    @kamleshsolanki230 Год назад

    Writting is very poor