The U-Net (actually) explained in 10 minutes

Поделиться
HTML-код
  • Опубликовано: 21 ноя 2024
  • НаукаНаука

Комментарии • 123

  • @salmanzafarsatti1346
    @salmanzafarsatti1346 Год назад +52

    man, this video is such a great explainer. I was confused about the use of skip connections since a long a time, but he explained the intuition behind it very nicely.

  • @Anton_Sh.
    @Anton_Sh. Год назад +16

    This architecture is one of the truly brilliant ones in the world of deep learning in terms of its simplicity and efficiency.

  • @mayankukani9600
    @mayankukani9600 Год назад +32

    Why didn't I find your channel before. Please upload more content, the best content on Deep Learning I have seen.

  • @jayhu2296
    @jayhu2296 8 месяцев назад +1

    your explained under 10 minutes videos are goated

  • @nikhilchouhan1802
    @nikhilchouhan1802 7 месяцев назад +5

    You might not find my comment since the video is too old, but man I just want to thank you for this video. I am a student who has always been interested in computer graphics and related fields like game engines, physical rendering, ray tracing, etc, and jst didnt get the ML/AI hype everyone was on the past 2 years. I only ever managed to study ML basics for 2 weeks before I left it for good. But recently I got in a team where my friends were working on CNN based projects, and that made me learn about many basics about NNs and DL. This explaination for Unet seals the deal for me, and I will strive to work on integrating my two interests into one and hopefully create something I love.

  • @thebakareview8009
    @thebakareview8009 7 месяцев назад

    This channel deserves more subss!! Great content and delivery :)

  • @mridulsehgal7773
    @mridulsehgal7773 6 месяцев назад +2

    The best ever video you can get on Unet explaination

    • @Atreyuwu
      @Atreyuwu 2 месяца назад +1

      Not even close lol

  • @JohnZakaria
    @JohnZakaria 9 месяцев назад

    This was the best unet explanation I have ever seen

  • @LucaBovelli
    @LucaBovelli 6 месяцев назад

    dude thankssssss i thought this was another one of these things thatll take me 2 hours of youtube to *not* understand, but u saved me

  • @caleharrison5387
    @caleharrison5387 Месяц назад

    Thanks, this is really good. One thing that would be helpful is if the example was itself convoluted like the algorithm, to make easier to visualise the algo.

  • @ABCEE1000
    @ABCEE1000 2 месяца назад

    Man i like you ! . you are the best ! how you simplify thing and how you are careful to deliver the idea perfectly >> please keep this great presentation up >>

  • @niralpatel5889
    @niralpatel5889 7 месяцев назад

    This was great, would love a video on diffusion transformers! It looks like they are taking off and replacing U-Net's as the backbone to new diffusion models.

  • @hemalathat8773
    @hemalathat8773 2 месяца назад

    I LIKEED THE ANIMATIONS AND YOUR PTESENTING STYLE IN THE VIDEO. THANKS.

  • @s4lome792
    @s4lome792 6 месяцев назад

    Clearly explained. What caused my consfusion in the first place is, in the graphic in the original paper, why does the segmentation mask not have the same dimensionality than the input image?

  • @shubhamarle96
    @shubhamarle96 6 месяцев назад

    thanks for the video, I am trying to use U-net for anomaly detection in time series and your video gave me the idea.

  • @miguelxplayer9641
    @miguelxplayer9641 7 месяцев назад

    Dude, you're great. I'm from Portuga 🇵🇹 🟩🟨🟥🟥and I'm learning Machine Learning and Neural Networks. Thank you very much! I loved how you teach. You are intuitive and dynamic. A person is learning a difficult subject and still manages to laugh when watching the videos. I loved. I already subscribed and liked. I'm going to watch more of your videos now. Hugs from Portugal😉

  • @oblivitus.
    @oblivitus. 11 дней назад

    brilliant! thank you for this illustration!

  • @pushkar9021
    @pushkar9021 Год назад +3

    Continue this series, very helpful

  • @DannyGeisz-vb2dt
    @DannyGeisz-vb2dt 18 дней назад

    Thank you Rupert! Excellent, excellent explanation and intuition for this :)

  • @gokulsaisrinivas5312
    @gokulsaisrinivas5312 11 месяцев назад

    very good explanation of U-NET

  • @NoOne-p3e
    @NoOne-p3e 11 месяцев назад +1

    Extremely useful for beginners like me. This is very good

  • @jacobidoko3924
    @jacobidoko3924 9 месяцев назад

    Yooo...this is quality content right here. Thank you so much for putting this out

  • @dhanushs4833
    @dhanushs4833 3 месяца назад

    great vide mate , would love to see more brilliant stuff like this❤❤

  • @rogercomix5648
    @rogercomix5648 Месяц назад +1

    I liked it but you did not explain the role of the 3*3 kernel, and how it scans the pixels of the image at each layer, and the reason for the downsampling is because it is more expensive to increase the size of the kernel at each layer so we downsample the image so we get the same relative size differential as if we did increase the size of the kernel. Apart from that, it’s brilliant.

    • @vgtgoat
      @vgtgoat 3 дня назад

      I'm not an expert but here's what I understand. The conv filters on the earlier full resolution image will learn highly detailed features such as edges. The conv layers run on downsampled (lower resolution) images can't see edges because they're all fuzzy now, so they will learn more large-scale features, such as shapes, then objects. As for how the 3*3 kernel (filter) scans, I believe it's just a standard convolution which you can learn from other videos.

  • @pranavgandhiprojects
    @pranavgandhiprojects 3 месяца назад

    Hey just show this first video from your channel and immediately subscribed to your:) Great explaination with visuals

  • @puekai
    @puekai 8 месяцев назад +39

    Still don't know how it works

    • @vardhan254
      @vardhan254 5 месяцев назад +1

      me when reading goodfellow all night

    • @prathamghavri8591
      @prathamghavri8591 4 месяца назад +1

      @@vardhan254 dude hows the book ,what would u suggest so that one has a good read

    • @arf9759
      @arf9759 3 месяца назад +1

      No one really knows how/why a CNN works!

    • @cupckae1
      @cupckae1 2 месяца назад

      @@arf9759 Exactly and these make sense to machines during training(When backpropagating errors). this is the reason filters are initialized randomly and are trained.

    • @haruthunanyan7387
      @haruthunanyan7387 2 месяца назад

      ​@@arf9759 I don't know what are you referring to, but there's actually a mathematical proof why conv NN's are used in image classification. Check out geometric deep learning by michael bronstein

  • @nguyenangkhanh4971
    @nguyenangkhanh4971 6 дней назад

    great, hope you continue with the videos

  • @TheHopeOfTruth
    @TheHopeOfTruth 7 месяцев назад

    Thank you for great explanation.On basic level it helps better understand unet

  • @Natstranaut
    @Natstranaut Год назад +2

    Oh my god man. Awesome videos. Keep it up, I'm really enjoying them!

  • @liliznotatnikiem6755
    @liliznotatnikiem6755 3 месяца назад

    I’m interested at multiclass problems (recognising bike, human AND house). Also what would you choose instead of confusion matrix?

  • @Topninja6
    @Topninja6 7 месяцев назад

    Thank you so much. Now I just need to figure out how to implement this for my project lol

  • @VikashSingh-vd9cp
    @VikashSingh-vd9cp 5 месяцев назад

    bestvideo for understanding U-net model

  • @jsparger
    @jsparger Год назад +3

    This was extremely helpful. Thank you

  • @ubanaga
    @ubanaga 10 месяцев назад

    Very nice my friend, this has been most helpful

  • @faaz12356
    @faaz12356 4 месяца назад

    Very useful and great explanation.

  • @yyww4267
    @yyww4267 Год назад +2

    Really impressive vedio! And fun work at the end!!!!! LOVE LOVE LOVE!!!

  • @willlowtree
    @willlowtree Год назад

    i love your presentation style

  • @nagham96
    @nagham96 Год назад +1

    Thank you that was so helpful and cute! 🤩

  • @terjeoseberg990
    @terjeoseberg990 5 месяцев назад

    You didn't explain how the skip connections are connected across. What is the data that's transferred and how is it incorporated into the output half of the U-Net?

  • @TechHuntBD
    @TechHuntBD 6 месяцев назад

    Nice explanation

  • @pratyushsahoo4948
    @pratyushsahoo4948 8 месяцев назад

    Absolutely amazing work 🎉

  • @rippingmyheartwassoeasy
    @rippingmyheartwassoeasy 9 месяцев назад +1

    Thank you for creating this video! Its the best explaination of how a U-Net works that was easy to understand. The visual animation is superbly done!!

  • @sakethsreeram6981
    @sakethsreeram6981 7 месяцев назад

    Great presentation!, Easy to understand

  • @transcendingvictor
    @transcendingvictor 9 месяцев назад

    Thank you very much for the time put on doing thisvideo. Interesting and helpful :)

  • @xarisalkiviadis2162
    @xarisalkiviadis2162 7 месяцев назад

    Amazing video, cleared everything!

  • @coffeestudi0s
    @coffeestudi0s Год назад

    Yooo the effort haha. Amazing Video!!!

  • @mincasurong
    @mincasurong 5 месяцев назад

    Great summary, Great thanks

  • @JohnVinchi-bk2dw
    @JohnVinchi-bk2dw Год назад +1

    this is extreeeemely helpful,and funny

  • @DanielaFrankl-l8t
    @DanielaFrankl-l8t Год назад

    Woooooow! Finally I understood it , really great explanation, thank you

  • @ozzafar1982
    @ozzafar1982 6 месяцев назад

    great explanation thanks!

  • @aligreen786
    @aligreen786 10 месяцев назад

    Very nice explanation. Thanks a lot.

  • @ShwethaVme23b073
    @ShwethaVme23b073 7 месяцев назад

    wow awesome video and explanation

  • @hexeldev
    @hexeldev Год назад

    This video has been extremely useful. I subbed.

  • @r.walid2323
    @r.walid2323 7 месяцев назад

    thanks, good explanation

  • @atifadib
    @atifadib 6 месяцев назад

    If you want to just use the Decoder how would you do it?

  • @vijaykumarb9622
    @vijaykumarb9622 9 месяцев назад

    Great Explanation.

  • @Lautaro04000
    @Lautaro04000 6 месяцев назад

    nice video, very helpful

  • @BooleanDisorder
    @BooleanDisorder 9 месяцев назад

    What's the background music called in this video?

  • @siddhantpassi8237
    @siddhantpassi8237 Месяц назад

    Amazing video!

  • @_the_one_who_asked_
    @_the_one_who_asked_ Год назад

    Hi, thank u for this video. can u pls do a video to explain YOLO?

  • @HelloIamLauraa
    @HelloIamLauraa 4 месяца назад

    best explainer!! great video, I had an "aaaaááaaa" moment at 8:05

  • @amolkumar1538
    @amolkumar1538 Год назад +1

    This is Just awesome, great video

  • @dfparker2002
    @dfparker2002 10 месяцев назад

    This explains inference (I think) by decomposition (dividing) and recomposition (adding) images. Is that accurate?

  • @ABCEE1000
    @ABCEE1000 2 месяца назад

    whould you please make a presentation on 3D Unet . that would be really appreciated

  • @ny8828
    @ny8828 Год назад

    hi its very helpful, how can I reach the PowerPoint of it?

  • @kiraqueenyt5161
    @kiraqueenyt5161 11 месяцев назад

    such a well made video

  • @SyncMicroaga
    @SyncMicroaga 9 месяцев назад

    Hi. I find the video very interresting. As I'm at the begining, i'm little confused. please, can you also propose a pdf file ? thank yu. Nicely

  • @Ngochi-ff7hk
    @Ngochi-ff7hk 7 месяцев назад

    I still don't understand that the output is x2 or x3 or x4.I don't understand why that is the case?

  • @PAHADIBABAJI
    @PAHADIBABAJI 10 месяцев назад

    Very helpful

  • @HadbbdbdDhhdbd
    @HadbbdbdDhhdbd 7 дней назад

    Helpful

  • @ingenuity8886
    @ingenuity8886 6 месяцев назад

    Thank you very much bro...

  • @Manar-Sg
    @Manar-Sg 11 дней назад

    thank you so much!

  • @alirezasaberi6383
    @alirezasaberi6383 Год назад

    awesome! can you calso make similar (actually) for Unet++ and Unet3+ please??? thank you so much.

    • @rupert_ai
      @rupert_ai  Год назад +2

      Glad you liked it! Its not currently on my list of to-do videos as I like to cover the most popular fundamentals at the moment, but I'll let you know if I get around to it! :)

  • @sisami2109
    @sisami2109 Год назад

    very nice dude thank you so much

  • @gregorioosorio16687
    @gregorioosorio16687 Год назад

    Thanks for sharing!

  • @Nerthexx
    @Nerthexx Год назад

    If downsampling works by max-pooling, how does upsampling work? In traditional image processing, we would just interpolate image colors, but how does the network apply it's "convolution" in this process? I would understand "deconvolution", but in my mind it wouldn't work here.

  • @ajipboy
    @ajipboy 8 месяцев назад

    bro , immediate subscribe!

  • @shinobidattebayo7650
    @shinobidattebayo7650 2 месяца назад

    nice effort, but the sound of music is distracting.

  • @Atreyuwu
    @Atreyuwu 2 месяца назад

    I found this while looking up UNet ELI5...
    😭😭

  • @abhishekkanojia2816
    @abhishekkanojia2816 Год назад +1

    cool videos

  • @jaybrodnax
    @jaybrodnax 6 месяцев назад

    I feel like this is more a description to experts than an actual explanation of how and why it works.
    Questions I'm left with:
    What is the purpose of downsampling/upsampling (I'm guessing performance?)
    How is segmentation actually done by the u-net?
    How is feature extraction actually done?
    What are max pooling layers?
    What does "channel doubling" mean, and what does it achieve?
    How does the encoder know "these are the pixels where the bike is"?
    Why is it beneficial to connect the encoder features to the decoder features at each step, versus in the last step?
    How does unet achieve anything other than downscaling/upscaling performance efficiency? Where are the actual operations to derive features?
    How is u-net specifically applied for various use cases like diffusion? What does diffusion add or change, for example.

    • @abansalah4677
      @abansalah4677 6 месяцев назад +1

      (Disclaimer: I am a beginner, and this is not intended to be a complete answer.)
      You should read about convolutional layers and pooling layers to better understand this video. At any rate:
      A colored image has three channels: R, G, and B. A convolutional layer is specified by some spatial parameters (stride, kernel size, padding) and how many filters are there - the number of filters is the number of channels of the output. You can think of each filter as trying to capture different information. Doubling the channels, therefore, means using double the number of filters when using a stride of 2.
      The segmentation is done just like any ML task - the training data consists of pairs of images and their annotated versions. I think it's often hard to decipher the inner workings of a particular neural networks, and your question can/should be asked in a more general way - how do neural networks learn?

  • @poggiesgw
    @poggiesgw Год назад

    good stuff

  • @Grapemaid
    @Grapemaid Год назад

    Thanks a lot lot. I understand it!

  • @1.4142
    @1.4142 Год назад

    Dalle 3 is coming to gpt 4 and it can write text!

  • @Englishwithshima1993
    @Englishwithshima1993 10 месяцев назад

    Perfect

  • @MrMadmaggot
    @MrMadmaggot Год назад

    Now how they coded it?

    • @rupert_ai
      @rupert_ai  Год назад

      Hahaha well there are actually plenty of online code implementations available but I will see if I can get round to a code tutorial on the u-net sooner rather than later!

    • @rishabhbhardwajiitb178
      @rishabhbhardwajiitb178 10 месяцев назад

      @@rupert_ai can u provide one

  • @notrito
    @notrito 6 месяцев назад

    If anyone wonders how to concatenate the features if they don't match the size... they crop it.

  • @usaid3569
    @usaid3569 6 месяцев назад

    Great video champ

  • @boughouyasser7471
    @boughouyasser7471 3 месяца назад

    Make a video on I-JEPA

  • @frommarkham424
    @frommarkham424 2 месяца назад

    U NETS RULEEEEEEEEEEEEE

  • @007bindass007
    @007bindass007 11 месяцев назад

    Nice Comment: Useful 👍👍😎😎

  • @MacProUser99876
    @MacProUser99876 9 месяцев назад +1

    nice explanation. but why distracting background music?

    • @endlesshybrids
      @endlesshybrids 6 месяцев назад

      Agreed. Good explanation but I wish people would stop using background music.

  • @user-mn2bj1hw1vdtfhgh
    @user-mn2bj1hw1vdtfhgh 7 месяцев назад +1

    Me seeing the video at 1.5x 😂😅

  • @leoyu6400
    @leoyu6400 Год назад +2

    hope you can come back to life

    • @c.e1187
      @c.e1187 Год назад +1

      Is he dead?

    • @BooleanDisorder
      @BooleanDisorder 10 месяцев назад

      ​@@c.e1187nah, just busy I imagine. He was active on github in December so

  • @LucaBovelli
    @LucaBovelli 6 месяцев назад

    bro why did u stop making videos i need you lmao (its a painful lmao.)

  • @linamallek6900
    @linamallek6900 8 месяцев назад

    nice video, but ideo i hate the music in the background ( so disturbing )

  • @InoceramusGigas
    @InoceramusGigas 6 месяцев назад

    TIGHT TIGHT TIGHT

  • @timanb2491
    @timanb2491 Год назад

    goodgood

  • @luisluiscunha
    @luisluiscunha 5 месяцев назад

    You are very funny!

  • @jcpouce
    @jcpouce 9 месяцев назад

    music is too distracting... :(