Computer Vision: Crash Course Computer Science #35

  • Published: Nov 14, 2017
  • Today we’re going to talk about how computers see. We’ve long known that our digital cameras and smartphones can take incredibly detailed images, but taking pictures is not quite the same thing. For the past half-century, computer scientists have been working to help our computing devices understand the imagery they capture, leading to advancements everywhere, from tracking hands and whole bodies, biometrics to unlock our phones, and eventually giving autonomous cars the ability to understand their surroundings.
    Check out Origin of Everything here!
    / @pbsorigins
    Produced in collaboration with PBS Digital Studios: / pbsdigitalstudios
    Want to know more about Carrie Anne?
    about.me/carri...
    The Latest from PBS Digital Studios: • All PBS Digital Studio...
    Want to find Crash Course elsewhere on the internet?
    Facebook - / youtubecrash. .
    Twitter - / thecrashcourse
    Tumblr - / thecrashcourse
    Support Crash Course on Patreon: / crashcourse
    CC Kids: / crashcoursekids

Comments • 159

  • @danielkohwalter5481
    @danielkohwalter5481 6 years ago +258

    This is by far the greatest course about computers that I have had in my entire life. I work with full flight simulators for pilot training, and many things that I learnt here became so clear to me... We see many systems in a very superficial way due to those abstraction levels, and with these classes I can see what's behind the scenes, what's going on at a deeper level.
    Thank you, guys. Thank you very much for sharing all this knowledge and in a way so simple and easy to understand. You're the best!!!
    And I'm recommending the channel for everybody I know that likes computer science on any level of understanding!

  • @shtanaka121
    @shtanaka121 5 years ago +89

    This is probably the best explanation of computer vision I've ever seen in my life.

  • @yangashi
    @yangashi 6 years ago +480

    Your computer will detect when you are happy and start a forced 10GB update to wipe the smile off your face.

    • @soapyfruit4731
      @soapyfruit4731 6 years ago +5

      lol

    • @caleb-hill
      @caleb-hill 5 years ago +11

      @@Lecadre2 or you can upgrade to Linux

    • @jekytck
      @jekytck 5 years ago +14

      @@user-rj8th9mr3s Well, being unable to run games doesn't necessarily mean it's worse. There ain't only video games out there, ya know.

    • @chengong388
      @chengong388 5 years ago +4

      Your Windows computer*

  • @Firithfenion
    @Firithfenion 1 year ago +13

    "not to ask for updates if you are frustrated" LOL this course is so informative and entertaining at the same time. Very good job!

  • @aaronfox3613
    @aaronfox3613 6 years ago +63

    Our university's robotics team is currently using OpenCV so our autonomous drone can see and navigate the world. Lots of theory, documentation reading, and pulled hairs come along with computer vision, that's for sure.

    • @potterherry462
      @potterherry462 5 years ago +2

      Dude, I know how you feel. I have been learning machine learning, and most of the time it gets very frustrating.

  • @GiveMeCoffee
    @GiveMeCoffee 5 years ago +38

    I really love this show, it's a great way to introduce concepts before having a full lecture at a college class, or to have a wide general idea of what the career path will include.

  • @Luke87o26
    @Luke87o26 1 year ago +4

    I am currently studying Imaging Science at RIT and this is the best explanation I have ever found. One of the greatest refreshers of what’s going on since I got here.

  • @WistrelChianti
    @WistrelChianti 1 year ago +3

    Fascinating to get to this one in 2023 in the context of where things have gone since.

  • @hudsona4004
    @hudsona4004 4 years ago +18

    Anyone else watch these on 0.75 speed?

  • @smob0
    @smob0 6 years ago +187

    Seems like a convoluted way to process images.

    • @dustinjames1268
      @dustinjames1268 6 years ago +23

      Come up with a more efficient algorithm
      I'll wait

    • @skilet1
      @skilet1 6 years ago +35

      I see what you did there.

    • @rhubarbjin
      @rhubarbjin 4 years ago +13

      @@dustinjames1268 Still, you've got to *recognize* that there's a *kernel* of truth to the criticism.

  • @tahsinl
    @tahsinl 6 years ago +13

    Great video! I'm taking a Computational Vision course right now. It was nice to know what you were talking about.

    • @muhammadadeel1150
      @muhammadadeel1150 5 years ago +1

      Tahsin Loqman, may I have your email address? I am interested in this course.

  • @IceMetalPunk
    @IceMetalPunk 6 years ago +47

    For anyone who's interested, there's a (relatively) recent system called YOLO: You Only Look Once. Version 2 came out less than a year ago, if I remember right, and basically it uses computer vision techniques to classify many different objects in a scene in real-time video. As in, it's fast enough to fairly accurately detect and label many different objects in an arbitrary scene 24 times per second (24fps is a standard video frame rate). It's super interesting! :D

    • @Huntracony
      @Huntracony 6 years ago +3

      Do you know if it uses the information it got from previous frames?

    • @sophiacai8157
      @sophiacai8157 6 years ago +1

      Awesome naming for it!

    • @BUDA20
      @BUDA20 6 years ago +2

      Only the current frame. Also, in general you use downloaded pre-trained weights, so you can start detecting things easily. You can add new detections if you want, but of course that takes a lot of work. Real-time HD video needs around the GPU power of a GTX 1080 Ti, depending on the complexity of the weights, but you can also transcode a video with the labels added to it and watch the final results later.

    • @edmonda.9748
      @edmonda.9748 6 years ago +4

      @buda20,
      Thanks for mentioning the type of GPU required for this, which answered one of my questions, as I'm building my own workstation for deep CNNs, video object detection, ...
      Can you recommend minimum hardware specs? Seems to me a capable system has always been taken for granted.
      Thanks

    • @knowledgemagnet4077
      @knowledgemagnet4077 4 years ago +1

      Yes! 😁
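The two points made in this thread (each frame is processed on its own, and 24 fps sets the real-time target) can be illustrated with a rough sketch. This is not YOLO itself: `run_detector`, `frame_budget_ms`, and `dummy_detect` are hypothetical names invented for illustration, and the detector is a placeholder rather than a real model.

```python
import time


def frame_budget_ms(fps: float) -> float:
    """Time available per frame, in milliseconds, at a given frame rate."""
    return 1000.0 / fps


def run_detector(frames, detect, fps=24.0):
    """Run `detect` on each frame independently and record whether each call
    fit inside the per-frame time budget. No state is carried between frames."""
    budget = frame_budget_ms(fps)
    results = []
    for frame in frames:
        start = time.perf_counter()
        labels = detect(frame)  # hypothetical detector: frame -> list of labels
        elapsed_ms = (time.perf_counter() - start) * 1000.0
        results.append((labels, elapsed_ms <= budget))
    return results


def dummy_detect(frame):
    # Placeholder standing in for a real model; it just echoes the frame's tag.
    return [frame["tag"]]


if __name__ == "__main__":
    frames = [{"tag": "cat"}, {"tag": "dog"}]
    print(round(frame_budget_ms(24), 1))  # about 41.7 ms per frame at 24 fps
    print(run_detector(frames, dummy_detect))
```

At 24 fps the whole pipeline has roughly 41.7 ms per frame, which is why the GPU requirement comes up in the replies.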

  • @ozzyfromspace
    @ozzyfromspace 4 years ago +13

    You're an absolutely brilliant communicator! I'm doing a computer vision specialization on Coursera with the University of Buffalo and your high level intuition just gave me oodles of excitement. I dream of one day developing my own algorithm for real time navigation for data constrained systems. Thanks, really, this was a fabulous primer video, and certainly one I'll show my best friends. ☺️

  • @mattkuhn6634
    @mattkuhn6634 6 years ago +9

    Ooo, speech recognition and synthesis! I'm super excited for next week now - I'm a computational linguist, so this is my jam. Can't wait!

  • @microbuilder
    @microbuilder 6 years ago +107

    I *totally* understood all of this. Yeah, that's it...

  • @user-vq2nn7op3e
    @user-vq2nn7op3e 1 year ago +2

    funny and clear! This series is the best.

  • @user-oj3gb8nh2q
    @user-oj3gb8nh2q 6 years ago +7

    I found the narrator very pleasant to listen to. Also the video was very good.

  • @TheBassManBoy
    @TheBassManBoy 6 years ago +12

    I've used Photoshop for years, it's really cool to take a look under the hood of image processing.

  • @Filwoj00
    @Filwoj00 6 years ago +5

    The best online program, don't stop doin it!

  • @splanksplank7973
    @splanksplank7973 5 years ago +7

    Wow, you did a great job of making something difficult easy to understand! This video was a great help!

  • @WWG1-WGA
    @WWG1-WGA 1 year ago +1

    I love computer vision with maths and all ❤

  • @gamestv4875
    @gamestv4875 5 years ago +9

    Love to see the passion this woman has for her job.
    I lost my passion somewhere along the way.

  • @DuluthTW
    @DuluthTW 6 years ago +5

    Great lesson. I can't wait 'til next week. Thanks!

  • @Garentei
    @Garentei 5 years ago +2

    Paused because I noticed the Ghost in The Wires book on your shelf. Bought this book after a Kevin Mitnick conference I saw last year :)

  • @cikif
    @cikif 6 years ago +2

    The computer in the thumbnail looks like the one in Don't Hug Me I'm Scared Part 4. Which makes the topic even scarier.

  • @FredoCorleone
    @FredoCorleone 5 years ago +5

    Convolution just happened to pop out from nowhere. In case you are wondering, convolution is the operation that maps a set of values (also called N-tuple where N stands for the quantity of elements) to another set of values.
    Very simple example:
    _1,2,3,4_ is a 4-tuple
    _+1,+1,+2,+2_ is a simple convolution
    _2,3,5,6_ is a 4-tuple as result of applying the above convolution

    • @acidtears
      @acidtears 4 years ago

      Wasn't AlexNet responsible for CNNs becoming a thing?
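The 4-tuple example earlier in this thread adds one offset to each element, which is element-wise addition and not convolution in the signal- or image-processing sense. For comparison, here is a minimal sketch of a discrete 1-D convolution in "valid" mode: the kernel is flipped, slid across the input, and a weighted sum is taken at each position. The function name `convolve1d` and the sample kernel are illustrative assumptions, not something taken from the video.

```python
def convolve1d(signal, kernel):
    """'Valid' discrete convolution: flip the kernel, slide it across the
    signal, and take a weighted sum at each position where it fully overlaps."""
    flipped = kernel[::-1]
    n, k = len(signal), len(kernel)
    return [
        sum(signal[i + j] * flipped[j] for j in range(k))
        for i in range(n - k + 1)
    ]


if __name__ == "__main__":
    # A 4-element input and a 3-element kernel give 4 - 3 + 1 = 2 outputs.
    print(convolve1d([1, 2, 3, 4], [1, 0, -1]))  # [2, 2]
```

Each output value depends on a neighbourhood of inputs and not on a single element, which is what lets a kernel respond to local patterns such as edges.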

  • @gianlucadelillo8861
    @gianlucadelillo8861 6 years ago +3

    wonderfully explained

  • @GiorgosIoak
    @GiorgosIoak 6 years ago +7

    Just wondering where have you been 😊 Happy to see you again

  • @MrDXCool
    @MrDXCool 4 years ago +2

    Thanks a lot! It was a great introductory video to computer vision.

  • @WaitingInMyColdCell
    @WaitingInMyColdCell 6 years ago +2

    More useful than my whole semester CV course...

  • @XRaym
    @XRaym 6 years ago +2

    Apart from face recognition, OCR is another nice field of research for 'teaching computers' to see!

  • @cesarbrown2074
    @cesarbrown2074 6 years ago +2

    This could be where quantum computers shine. They can analyze all that data at once, basically seeing the bigger picture.

  • @kaheichan9
    @kaheichan9 6 years ago +3

    First couple of seconds of the video: wait a second, that looks familiar. Then I realised it’s footage of my hometown.

  • @brocksprogramming
    @brocksprogramming 6 years ago +2

    Way to go Carrie Anne!

  • @mrdraynay
    @mrdraynay 6 years ago +2

    Can't wait for next week!

  • @Huntracony
    @Huntracony 6 years ago +4

    Self driving cars often (also) use LIDAR, which has the great advantage of knowing distances, so the car is able to see in 3d. The (biggest) exception to this is Tesla, which decided that normal cameras work just fine, to which I say sure, but why not make it even better?

    • @sirtripalot2770
      @sirtripalot2770 6 years ago +5

      Good quality LIDAR sensors needed for automotive applications are expensive. One of Tesla's goals was to ship all of their cars with the hardware needed for automated driving, long before their cars were capable of automated driving. Shipping the initial cars without this feature but with LIDAR might have been too expensive to be feasible.
      Using this approach, Tesla collected millions of miles of consumer data about typical driving conditions. This treasure trove of data enables them to understand under what conditions what sensors fare poorly and gives a large test set to compare algorithms on.
      I don't know if Tesla made the right decision. They might have problems getting their system working in urban areas or other challenging environments without the more detailed data that LIDAR provides, but the low cost of Tesla's sensor package enabled them to use consumers to collect data for Tesla, and that is a significant advantage for Tesla's engineering design team.

  • @mrsaidulhasan
    @mrsaidulhasan 5 years ago +1

    Very excellent explanation. Thanks for your videos. Please upload videos on machine learning and artificial intelligence.

  • @swatijain6898
    @swatijain6898 6 years ago +2

    Ma'am, very nice video.
    Ma'am, please also make full course videos with very easy explanations, covering only the maths required for that course,
    because your explanation is very simple.

  • @bobbobety
    @bobbobety 5 years ago +7

    Awesome video! How exactly is this image processing software implemented - would it be in a low-level programming language like C, a high-level one like Python, or would it even be at the hardware level?

    • @TalSoikis
      @TalSoikis 4 years ago +9

      Shame no one answered before. Anyway, it can be both. Python can be used in production and in testing (there are really good libraries for complex computer vision like convolutional neural networks, object segmentation and so on). However, when really tight control over performance is needed, be it memory or computing speed, a low-level language like C or C++ is used. Hope that helped :D

    • @bobbobety
      @bobbobety 4 years ago +5

      @@TalSoikis Yeah awesome, thanks :D

  • @samuelgriffin1955
    @samuelgriffin1955 4 years ago +2

    Great video, very informative.

  • @magica2z
    @magica2z 5 years ago +1

    Best video series ever about computer science. Thank you.

  • @daserstereichen
    @daserstereichen 5 years ago +1

    At 5:52 you forgot to mention the bias value.
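Assuming the comment above refers to the usual artificial-neuron model (the comment itself does not say what is shown at 5:52), the bias is a constant added to the weighted sum of inputs before the activation function is applied. A minimal sketch, with the sigmoid activation and all names chosen purely for illustration:

```python
import math


def neuron(inputs, weights, bias):
    """Weighted sum of the inputs plus a bias, passed through a sigmoid."""
    z = sum(x * w for x, w in zip(inputs, weights)) + bias
    return 1.0 / (1.0 + math.exp(-z))


if __name__ == "__main__":
    # The weighted sum here is 1.0*0.5 + 2.0*(-0.25) = 0, so only the bias
    # decides which side of 0.5 the output lands on.
    print(neuron([1.0, 2.0], [0.5, -0.25], 0.0))        # 0.5
    print(neuron([1.0, 2.0], [0.5, -0.25], 2.0) > 0.5)  # True
```

Without the bias term, a neuron whose inputs are all zero would always produce the same output regardless of training.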

  • @morezco
    @morezco 6 years ago

    what a sweet world would be one that has computers capable of awareness of their surroundings

  • @totallynotgad
    @totallynotgad 6 years ago +25

    CVision + Neural Network + Bad AI = me
    by the way 5th

    • @Pugpono
      @Pugpono 6 years ago +1

      Lol. 😂

  • @Ernscht1987
    @Ernscht1987 6 years ago +1

    That's super cool^^ Thank you!!!

  • @TheShovon123
    @TheShovon123 4 years ago +1

    YOU ARE AMAZING!

  • @nagalakshmiduvvuri2316
    @nagalakshmiduvvuri2316 6 years ago +1

    thank you, this was helpful

  • @ravindumirihana2784
    @ravindumirihana2784 6 years ago +2

    This is awesome

  • @Baxtexx
    @Baxtexx 6 years ago +2

    Lol I just imagined this in the next patch of Windows:
    if (user.faceEmotion == frustrated) {
        restartNotice.Postpone(2);
    }
    Not that they would ever do that though...

  • @anthonyosnacz525
    @anthonyosnacz525 6 years ago

    KinaTrax uses computer vision to record kinematic data on baseball pitchers. Biomarkers are no longer a requirement and data can be tracked accurately in game. Computer vision is revolutionizing the game!

  • @mh47saiyyad21
    @mh47saiyyad21 1 year ago

    Extremely excellent. Thank you.

  • @Kid_illithid
    @Kid_illithid 1 year ago

    She said kernel so many times I can’t quit thinking about popcorn

  • @oldcowbb
    @oldcowbb 6 years ago +1

    that's really convoluted

  • @Roma-pv2pk
    @Roma-pv2pk 5 years ago +4

    *connects a function generator to an oscilloscope in the background for some fun sciency atmosphere*

  • @bnfgh123
    @bnfgh123 5 years ago +1

    When I started watching this video, I did not expect it would actually help me with my physiology course. I finally understand receptive fields :-D

  • @WiseWik
    @WiseWik 6 years ago +6

    Only when my Windows laptop is able to recognize that I'm not in the mood for an update will I pull that sticky tape off my webcam. That also means I'll never get updates :(

  • @DrachenKaiser
    @DrachenKaiser 6 years ago +2

    Thanks for the great video!

  • @ZZ-sb8os
    @ZZ-sb8os 6 years ago +1

    Carrie Anne you look so cute with your glasses on, you should keep them on for all your videos

  • @subrisubrika5652
    @subrisubrika5652 6 years ago +2

    You guys rock!!!!

  • @nikonissinen6772
    @nikonissinen6772 6 years ago +2

    My PC is already quite aware of its surroundings. Usually there's me, and there will be a hammer if the computer starts to misbehave.

  • @zenchiassassin283
    @zenchiassassin283 4 years ago

    I love convolutional neural networks

  • @edmonda.9748
    @edmonda.9748 6 years ago +1

    Can anybody recommend minimum hardware requirements for computer vision/object detection?
    Thanks

  • @cpatsalos1
    @cpatsalos1 6 years ago +1

    Make a video on Mercury cycle! Please

    • @B3Band
      @B3Band 6 years ago

      you make it

  • @Pllutus
    @Pllutus 6 years ago +2

    Where can I find the sources for this video???

  • @thomasw4422
    @thomasw4422 6 years ago +1

    Hey, I know that place! Sydney Olympic Park!

  • @ContentToHover
    @ContentToHover 6 years ago +2

    ....are internet connected microwaves a real thing?

  • @salemamer3428
    @salemamer3428 5 years ago +1

    Great !!

  • @walexkinopapy9598
    @walexkinopapy9598 6 years ago +1

    Good video, Anne. I need your insight on something: I am working on recognizing partially occluded license plates. Can you contribute to my research? Thanks.

  • @josegonzaleznunez3869
    @josegonzaleznunez3869 6 years ago +1

    Would you share a link for further reading?

  • @1000Marcopeters
    @1000Marcopeters 6 years ago +4

    "Abstraction is the key to build complex systems"

  • @mikeg9b
    @mikeg9b 6 years ago +3

    Yay Fei-Fei Li! Watch her TED talk too.

  • @shayanshamsi7540
    @shayanshamsi7540 6 years ago +1

    Will you guys be uploading every 2 weeks from now on, as you did with this video?

  • @Infantry12345
    @Infantry12345 6 years ago +1

    The clip of the tracking of the fingers, arms, and face of the guy reading from the book makes me think that some day soon there will be a presentation or something where they show a computer detecting sleight of hand in a magic trick. Would be a pretty neat way to show off the accuracy, anyway.

  • @angelavalderrama2221
    @angelavalderrama2221 6 years ago +5

    Nice👍

  • @user-su1gq6mc3c
    @user-su1gq6mc3c 6 years ago

    I really like this video

  • @Cubinator73
    @Cubinator73 6 years ago +7

    A computing device should never change behavior depending on highly subjective factors, it should only do what it is explicitly told to do.

    • @EclecticFruit
      @EclecticFruit 6 years ago

      If my computer can't lie then it's not really alive!

    • @IceMetalPunk
      @IceMetalPunk 6 years ago +1

      But then it's just a faster calculator :(

    • @horsesrmylife78
      @horsesrmylife78 6 years ago +4

      Even if something that is subjective is in the code, it’s still *in the code*, meaning that the computer is not making its own decisions. Only the programmer makes decisions based on subjective ideas.

    • @nibblrrr7124
      @nibblrrr7124 6 years ago +2

      The point is that the user's inputs to the program are no longer under the complete control of the user. Typing & clicking the same things might not get the same behavior/output, based on some ML classifier trying to interpret your mood/intentions/... more or less well.
      This *affective computing* approach is very different from regular UI design.

  • @brightfuturebf
    @brightfuturebf 6 years ago

    Plz leave a link to The Origin of Everything, would love to check it out.

    • @RaymondHng
      @RaymondHng 6 years ago

      The link is in the description.

  • @BrokebackBob
    @BrokebackBob 6 years ago

    Machine vision will match ours when we can shrink 1000s of processors each capable of 1000s of petaflops to the size of an eyeball connected to the equivalent of the human brain's vision center.

  • @tueemsyhu4846
    @tueemsyhu4846 6 years ago

    GREAT-VIDEO!!😁💻👀👂👍

  • @bondedomao
    @bondedomao 6 years ago +1

    I would trade all my privacy just so Windows does not ask to install updates when I'm mad!

  • @JM-us3fr
    @JM-us3fr 6 years ago +3

    So the government is watching me through my webcam?

  • @SexyBakanishi
    @SexyBakanishi 6 years ago +1

    Is the guy in the middle the secret brother Dave?

  • @Masoudy91
    @Masoudy91 6 years ago +1

    Big brother 😎

  • @JimPlaysGames
    @JimPlaysGames 6 years ago +2

    That Macintosh in the back needs some serious retrobright treatment.

    • @nibblrrr7124
      @nibblrrr7124 6 years ago

      Dang, you're right. I shall watch an 8BitGuy restoration video to soothe myself now.

    • @JimPlaysGames
      @JimPlaysGames 6 years ago

      Ah I knew there'd be an 8 Bit Guy fan around here somewhere :D

  • @zanekarl8391
    @zanekarl8391 6 years ago

    Does anyone know the titles of all the books in the background of the videos? The only ones I can make out are "Ghost in the Wires" and "Linear Systems and Signals".

    • @nibblrrr7124
      @nibblrrr7124 6 years ago

      The orange one is *Robot Builder's Bonanza* . The others are way too blurry to read unless someone recognizes the spine design.
      (I've tried extracting frames from the downloaded video and enhancing them... no luck :C)
      I guess we could always ask CrashCourse themselves? ^^

  • @user-vf2jo2sn9g
    @user-vf2jo2sn9g 6 years ago +1

    I want subtitles. Who stole the subtitles?

  • @melozzoreacts
    @melozzoreacts 4 years ago

    Designer is a Liverpool FC fan I see.

  • @qinyuping2823
    @qinyuping2823 4 years ago

    CCTV camera?

  • @Phie_Mc
    @Phie_Mc 6 years ago

    No edge!

  • @GarrettBSettles
    @GarrettBSettles 6 years ago

    yay!

  • @MarcoCortex
    @MarcoCortex 5 years ago +1

    I was 100% in until 80% of the video. Then, it was like...

  • @vuufke4327
    @vuufke4327 5 years ago

    I suppose these are the same kernels used in Photoshop

  • @hakanahlstrom8310
    @hakanahlstrom8310 6 years ago

    isn't it upper left corner?

  • @informativecontent4778
    @informativecontent4778 5 years ago

    Amazon Go is an example

  • @itswarhawk
    @itswarhawk 5 years ago

    So.... How do you play sudoku

  • @Wherrimy
    @Wherrimy 6 years ago

    According to this, I should never be asked to update...

  • @NeoShameMan
    @NeoShameMan 6 years ago +17

    YOLO (you only look once)

    • @IceMetalPunk
      @IceMetalPunk 6 years ago +1

      I was going to mention that.

  • @kdavid9228
    @kdavid9228 6 years ago

    How did you get 147? I can't understand.
    -185-186-186+233+233+233 = 142

    • @MaheshKiran
      @MaheshKiran 5 years ago +3

      Maybe it's a presentation error, but the concept is still clear from the next example, which equals 1.
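The arithmetic in the question above can be checked with a short sketch. It assumes the six numbers come from a 3x3 pixel patch multiplied by a vertical-edge kernel whose columns are -1, 0 and 1. The middle-column pixel values are not given in the comment, so zeros stand in for them here, which is harmless because the kernel multiplies that column by 0. The name `apply_kernel` is illustrative.

```python
def apply_kernel(patch, kernel):
    """Multiply each pixel by the kernel weight at the same position and sum."""
    return sum(
        p * k
        for patch_row, kernel_row in zip(patch, kernel)
        for p, k in zip(patch_row, kernel_row)
    )


# Vertical-edge kernel: subtract the left column, ignore the middle, add the right.
kernel = [[-1, 0, 1],
          [-1, 0, 1],
          [-1, 0, 1]]

# Left and right columns are taken from the comment; the middle column is a
# placeholder (assumption), since its weight is 0.
patch = [[185, 0, 233],
         [186, 0, 233],
         [186, 0, 233]]

if __name__ == "__main__":
    print(apply_kernel(patch, kernel))  # 142
```

With these inputs the sum is 142, which matches the commenter's calculation and not the 147 they quote from the video.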

  • @odiseezall
    @odiseezall 1 year ago

    Hear and speak you say? Well...

  • @ck7671
    @ck7671 6 years ago +1

    She spoiled the next video of 3blue1brown! He's literally in the middle of the subject of image recognition by deep learning.

    • @afourthfool
      @afourthfool 6 years ago +1

      This channel is for the jitterbug learners for whom reading and super breakout compress to the same function. Grant is more for the laid-back learners who leave good questions unsaid.