C5W3L06 Bleu Score (Optional)

Поделиться
HTML-код
  • Опубликовано: 19 окт 2024

Комментарии • 52

  • @alignedbyprinciple
    @alignedbyprinciple 6 лет назад +1

    I read other people's tutorials regarding this topic and by far this is the best and easiest tutorial on bleu score. Thanks a lot.

  • @epistemophilicmetalhead9454
    @epistemophilicmetalhead9454 11 месяцев назад +1

    when it comes to translation, there can be >1 correct answers. BLEU (bilingual evaluation) score measures how correct a translation is by comparing it with the translation provided by actual people. modified precision is used to calculate BLEU score
    modified precision (word by word) = max number of times the word is supposed to appear / number of times the word is present in the translation.
    modified precision on bigrams is where you take two consecutive words at a time (like a slider) and then calculate using the same formula (but for a two word phrase this time)
    same goes for n-grams
    if the output is exactly equal to one of the references, all modified precision values (for 1,2,....n-grams) = 1.0
    combined BLEU score = BP*exp(sum of k modified precisions / n) where k goes from 1 to n and BP=brevity penalty (it penalizes translations that are too short because short translations have a higher chance of having higher modified precision scores)
    BP = 1 if output (machine translation) length>reference (human translation) length
    BP = exp(1- (machine translation length/human translation length)) otherwise

  • @thedissociation3666
    @thedissociation3666 6 лет назад +6

    Thank you and also your voice is so calming

  • @bobli3467
    @bobli3467 5 лет назад +21

    There is an error that BP:=exp(1-ref_output/MT_output)

  • @shayanhati2325
    @shayanhati2325 6 лет назад +7

    Please upload the full series. Eagerly waiting.

  • @wenkaidai8291
    @wenkaidai8291 6 лет назад +4

    Looking forward to your upload of full series of sequence models~

  • @Heyoo-vx5vt
    @Heyoo-vx5vt 5 лет назад +5

    I think that brevity penalty factor has to be
    exp(1-reference_output_length/MT_output_length) if MT_output_length

    • @DrJohnnyStalker
      @DrJohnnyStalker 5 лет назад +3

      Yes this is a typo. The original paper also sum over log(p) to scale Bleu between 0 and 1

  • @jimmylee2197
    @jimmylee2197 6 лет назад +6

    according to the original paper, does the BP under otherwise condition should be exp(1-reference_outut_length/MT_output_length)?

    • @ladingokalpay1849
      @ladingokalpay1849 2 года назад +1

      yes. it was corrected later in the coursera course.

  • @siddharthakantipudi2329
    @siddharthakantipudi2329 5 лет назад

    Please upload the full series !!!

  • @NikhilSharma-td3hr
    @NikhilSharma-td3hr 2 года назад

    Thank you.

  • @anthonyarmour1812
    @anthonyarmour1812 2 года назад

    Great video, but there's an error I've seen in every resource I've looked at. Had to find out from reading the original paper. Cumulative Bleu score = BP × exp( 1/n x sum(log(Pn))).. the log is an important difference. Video was great tho! I've seen like 5 resources that seem to have left the log out

  • @therri1227
    @therri1227 3 года назад

    Lecture L03 and L04 are missing from the playlist of the week

  • @jimmccarthy8087
    @jimmccarthy8087 5 лет назад +6

    Will the full series of Sequence Models be uploaded soon?

  • @utkarshraj9061
    @utkarshraj9061 5 лет назад +1

    Whole course in python or octave

  • @유기중-n1f
    @유기중-n1f 5 лет назад

    thank you andrew.

  • @Acha413
    @Acha413 5 лет назад +3

    at time 8:14 there is a mistake, the count clip for "the mat" should be 2, isn't it.?

    • @HTSFestival
      @HTSFestival 5 лет назад +2

      I'm afraid not. It might be max appearence in one sentence.

  • @veerudumpala1793
    @veerudumpala1793 6 лет назад +2

    Brevity Penality Factor:
    IF len(MT_output) == len(ref_output)
    then also exp(1-m/r) equals to 1? Right?

    • @aojing
      @aojing 5 лет назад

      Yes. But the equation has a typo. You should swap numerator and denominator.

  • @annawilson3824
    @annawilson3824 4 года назад +1

    Why only up to p_4 n-gram, if there are 6 words in reference #1? Up to p_5 is better, no?

    • @simoncrase5360
      @simoncrase5360 3 года назад +1

      There's an awful lot of 5-grams! Papineni et al (2002) states "... as can be seen in Figure 2, the modified n-gram precision decays roughly exponentially with n...", so I expect that 5-grams are a pain to calculate, and they don't add much precision to the score.

  • @arsalan2780
    @arsalan2780 6 лет назад

    Kindly upload full series

  • @RaviCHandra-fj8dr
    @RaviCHandra-fj8dr 6 лет назад +3

    please upload full series of sequence models. waiting for it.

    • @adamishay808
      @adamishay808 6 лет назад

      ruclips.net/p/PLBAGcD3siRDittPwQDGIIAWkjz-RucAc7

    • @RaviCHandra-fj8dr
      @RaviCHandra-fj8dr 6 лет назад

      Hey thanks for the reply

    • @simon5771
      @simon5771 6 лет назад

      You can do the whole course with the Jupyter Notebook projects and quizzes and get a certification on Coursera.

    • @ensaadghedada3328
      @ensaadghedada3328 5 лет назад

      @@adamishay808 The playlist does not exist.

  • @sawsanasjea4828
    @sawsanasjea4828 6 лет назад

    thank you . we need more example

  • @chrischappell7643
    @chrischappell7643 6 лет назад

    thank for upload this series ... plz add course about natural processing language that pro andrew mention is last past of full series about deeplearn in coursea :)) thk

    • @sainimohit23
      @sainimohit23 6 лет назад

      wtf is wrong with your english??

    • @sainimohit23
      @sainimohit23 5 лет назад

      @@Ahmed-fj5jq no it's not... I recommend you to buy it. It worth every penny you spend.

  • @mebeasensei
    @mebeasensei 5 лет назад +4

    "on the" appears twice?

    • @jackyangara1
      @jackyangara1 5 лет назад +1

      sentence 1 = 1 "on the", sentence 2 = 1 "on the".
      in unigram he mentioned that we take the maximum one.
      Thus, "on the" = 1

    • @BalaguruGupta
      @BalaguruGupta 4 года назад

      @@jackyangara1 "On the" is bigram right?

    • @jackyangara1
      @jackyangara1 4 года назад

      @@BalaguruGupta yes

    • @aynieeetube
      @aynieeetube 3 года назад

      For count clip, we take the max. occurence of the n-gram in the referenes. Not the count of the n-gram from each reference.

  • @aqwkpfdhtla9018
    @aqwkpfdhtla9018 6 лет назад +7

    There is a high pitch sound. It is so annoying.

  • @CTimmerman
    @CTimmerman 5 лет назад

    0:45 Reference 2 is not perfectly fine.

  • @SupunKandambige
    @SupunKandambige 4 года назад

    1:59 didn't know that he uses Slack

    • @vinson2233
      @vinson2233 3 года назад

      I thought my Slack give me notification

  • @ananthakrishnank3208
    @ananthakrishnank3208 Год назад

    2:40

  • @wangjohn6480
    @wangjohn6480 5 лет назад

    有老吴的 中文视频吗 这个真的听不懂

    • @Shen-w9q
      @Shen-w9q 4 года назад

      我觉得不错啊!讲的非常好

  • @alexminsky1
    @alexminsky1 5 лет назад +2

    One of the most boring lecturer I’ve ever seen! He’s great, though.

    • @Linz0r1s
      @Linz0r1s 5 лет назад +1

      nice one, you made me laugh idiot !

  • @bnglr
    @bnglr 2 года назад

    poorly explained. and the formula is wrong. Andrew is overrated