9. Constraints: Visual Object Recognition

  • Published: Nov 17, 2024
  • MIT 6.034 Artificial Intelligence, Fall 2010
    View the complete course: ocw.mit.edu/6-0...
    Instructor: Patrick Winston
    We consider how object recognition has evolved over the past 30 years. In alignment theory, 2-D projections are used to determine whether an additional picture is of the same object. To recognize faces, we use intermediate-sized features and correlation.
    License: Creative Commons BY-NC-SA
    More information at ocw.mit.edu/terms
    More courses at ocw.mit.edu

Comments • 41

  • @WepixGames 5 years ago +7

    R.I.P Patrick Winston

  • @user-ol2gx6of4g 7 years ago +6

    Wish the professor could expand the binary mask technique (for finding correlation) to higher dimensions and to non-binary cases.
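
One way the non-binary case could look is sketched below. This is not taken from the lecture: it swaps the binary mask for zero-mean normalized cross-correlation over grayscale values. The function names `normalized_correlation` and `best_match`, and the list-of-lists image layout, are assumptions made for illustration.

```python
from math import sqrt


def normalized_correlation(patch, window):
    """Zero-mean normalized correlation of two equal-sized grayscale grids.

    Returns a value in [-1, 1]; 0.0 if either grid has no contrast.
    """
    a = [v for row in patch for v in row]
    b = [v for row in window for v in row]
    if len(a) != len(b):
        raise ValueError("patch and window must have the same size")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    da = [v - mean_a for v in a]
    db = [v - mean_b for v in b]
    denom = sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    if denom == 0:
        return 0.0  # a flat region carries no pattern to correlate against
    return sum(x * y for x, y in zip(da, db)) / denom


def best_match(image, patch):
    """Slide the patch over the image; return (score, row, col) of the best offset."""
    ph, pw = len(patch), len(patch[0])
    best = (-2.0, 0, 0)  # any real score is >= -1
    for r in range(len(image) - ph + 1):
        for c in range(len(image[0]) - pw + 1):
            window = [row[c:c + pw] for row in image[r:r + ph]]
            score = normalized_correlation(patch, window)
            if score > best[0]:
                best = (score, r, c)
    return best
```

Subtracting the mean and dividing by the norms makes the score insensitive to uniform brightness and contrast changes, which a raw product of pixel values would not be. Extending to higher dimensions would mainly mean flattening more axes in the same way.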

  • @JeremyDeanLakey 10 years ago +9

    Interesting! The faces are easy to recognize when they're upside down or noisy, but not both. It seems that our brains rely on lower-level features specific to the individual face to recognize the faces when they're upside down, and on higher-level specific features to recognize the face if the photo is noisy but right-side-up.
    What I'm saying is that there are BOTH low-level and high-level features inside our brains that specifically identify Bill Clinton.

    • @sumit3195 5 years ago

      That's pretty accurate

  • @EranM 7 years ago +13

    Nice view 0:30 1:17 showing us the professor instead of the material he shows to his students

    • @MrFujinko 10 months ago

      Pc culture

  • @william22426 3 years ago +1

    This great man is worthy of being cloned. Your lectures are exquisite, thanks Mr. PATRICK WINSTON. Colombia resiste, May 2021

  • @hnkulkarni 3 years ago +1

    Thank you for this great lecture.

  • @kutilkol 6 years ago +2

    I don't understand 17:20. Why are 3 objects sufficient for 3-axis rotation and translation, and 2 objects sufficient for 1-axis rotation and translation? How so?

  • @rsd2dcc 4 years ago +2

    RIP.

  • @HaoHu-n9g 3 years ago +3

    6:30 Now Deep Learning is able to do this automatically. It is hard to describe the feelings I have when seeing this pre-deep learning era lecture with the current development of DL.

  • @SeaJay_Oceans 4 years ago +2

    Eye enjoy this very much. :-)

  • @adityapatane6789 1 year ago

    He is comparing only two images to find the corresponding points in the third image. So shouldn't he select only two points and get all the other corresponding points in the third image directly? Why does he need to select three points to get the other corresponding points? #MIT
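
The point-counting question can be checked numerically. The sketch below is not the lecture's code; it assumes orthographic projection and rotation about the vertical axis only, in which case the x-coordinate in an unknown view is a fixed linear combination `xu = alpha*xa + beta*xb` of the x-coordinates in two model views. Two unknowns need two corresponding points. If each view may also be shifted sideways, a constant term joins the equation (`xu = alpha*xa + beta*xb + tau`), giving three unknowns and so three points. The names `project_x`, `solve_alpha_beta`, and `predict_x` are hypothetical.

```python
from math import cos, sin, radians


def project_x(point, theta):
    """Orthographic x-coordinate of a point (x, z) rotated by theta about the vertical axis."""
    x, z = point
    return x * cos(theta) + z * sin(theta)


def solve_alpha_beta(xa1, xb1, xu1, xa2, xb2, xu2):
    """Solve xu = alpha*xa + beta*xb from two point correspondences (Cramer's rule)."""
    det = xa1 * xb2 - xa2 * xb1
    if abs(det) < 1e-12:
        raise ValueError("degenerate correspondences: model views or points too similar")
    alpha = (xu1 * xb2 - xu2 * xb1) / det
    beta = (xa1 * xu2 - xa2 * xu1) / det
    return alpha, beta


def predict_x(alpha, beta, xa, xb):
    """Predict where a point should appear in the unknown view."""
    return alpha * xa + beta * xb


if __name__ == "__main__":
    # Made-up object points (x, z) and view angles, for illustration only.
    points = [(1.0, 0.5), (-0.7, 1.2), (0.3, -0.9), (2.0, 0.1)]
    theta_a, theta_b, theta_u = radians(10), radians(40), radians(25)
    xa = [project_x(p, theta_a) for p in points]
    xb = [project_x(p, theta_b) for p in points]
    xu = [project_x(p, theta_u) for p in points]

    # Two correspondences fix alpha and beta ...
    alpha, beta = solve_alpha_beta(xa[0], xb[0], xu[0], xa[1], xb[1], xu[1])
    # ... and every remaining point must then land where the prediction says.
    for i in range(2, len(points)):
        print(predict_x(alpha, beta, xa[i], xb[i]), xu[i])
```

If the predicted and observed positions of the remaining points disagree, the unknown picture is not a view of the same object under these assumptions.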

  • @clinicalneuropsychology3973 7 years ago +2

    thank you for sharing.

  • @oudarjyasensarma4199 5 years ago +4

    What software does Professor Winston use for the demonstrations?

    • @mitocw 5 years ago +9

      Much of the material in 6.034 is reinforced by on-line artificial-intelligence demonstrations developed by us or otherwise available on the web. Those demonstrations developed by us are provided via the easy-to-use Java Web Start mechanism, which comes with the Java Runtime Environment, the so-called JRE. See the "Demonstrations" section of the course on MIT OpenCourseWare at: ocw.mit.edu/6-034F10.

  • @Tzvetkov 7 years ago +1

    Boom! Tetris for Jeff!

  • @praski149 5 years ago +3

    Somebody get this man some oxygen!

    • @GhostkillerPlaysMC 4 years ago +2

      I don't think he needs oxygen anymore since he's dead.. dick

  • @donbasti 2 years ago

    Where is lecture 8? Or is it mistakenly labeled?

    • @mitocw 2 years ago +1

      Here's lecture 8: ruclips.net/video/dARl_gGrS4o/видео.html. For more info and course materials, visit MIT OpenCourseWare at: ocw.mit.edu/6-034F10. Best wishes on your studies!

  • @mesutpiskin 8 years ago +8

    This lesson is great. I think the teacher is very tired :)

    • @hengyue6596 7 years ago +2

      Think that's his style.. Since he's been like that from the very first lecture..

    • @AbdallahHegab 7 years ago +1

      But is it useful, and can you do a project by the end of the course?

  • @RealMcDudu 8 years ago

    Can anybody explain why, in the projections, he *subtracts* the Ys*Sin(theta) instead of adding it? If we are in a vector space, subtracting Ys*Sin(theta) would mean our new point is going to be down and under our current point, and not up and above like it is shown on the graph... Did he make a mistake or did I just miss something?

    • @hailapathy2801 8 years ago

      I'm not familiar with linear algebra, so I'm not really sure. In fact, I don't really understand why the Ys*Sin(theta) term was included at all when Xs*Cos(theta) seemed to do just what he needed.
      That said, focus on the fact that he's solving for Xa there. Whether Ys*Sin(theta) is included in the equation or not, it won't affect the point's upward position on the graph, only the horizontal position. It may also be helpful to note that he states that he subtracts because the Ys*Sin(theta) vector is going in the wrong direction.

    • @conflikt89 8 years ago

      It's a subtraction because what he's actually doing is rotating the entire triangle. I wasn't sure this was possible so I derived the expression for Xa from Xa = s*cos(theta_s+theta_a) and s = Xs/cos(theta_s), and when you do some manipulations you can achieve the same result. Because he's rotating the entire triangle, there is an x component now associated with the angle y_s is on. He takes the new x component of the old x component, Xs, shown by Xs*cos(theta_a), and the x component of the old y component, Ys, shown by Ys*sin(theta_a) to get the new resultant x component. Essentially he treats X_s as the hypotenuse to a new triangle, the x component of which is the same as Xa. I feel like I've explained this poorly, if you still need clarification flick me a pm and I'll upload a graph and my derivation somewhere.

    • @guzmanchehab4020 8 years ago

      The sign there is just a choice. He intuitively used minus cause the new positions were getting "shorter".

    • @pegah9413 6 years ago

      I understand it this way: Take x_s first. x_s*cos(theta) gives you the projection of x_s "beneath" the xs (counter-clock-wise direction). Now, if you take x_s*cos(-theta) you take the projection of x_s in clock-wise direction which is what we want. But since cos(-theta) = cos(theta) you don't see a minus there. Now, for y_s, you have y_s*sin(-theta) which gives you -y_s*sin(theta).
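
The subtraction discussed in this thread follows from the angle-addition identity. A short derivation, assuming the point sits at distance r and angle φ from the x axis and is rotated counterclockwise by θ (the symbols r and φ are introduced here for illustration; x_s, y_s, and x_a follow the comments in this thread):

```latex
\begin{aligned}
x_s &= r\cos\varphi, \qquad y_s = r\sin\varphi \\
x_a &= r\cos(\varphi + \theta) \\
    &= r\cos\varphi\cos\theta - r\sin\varphi\sin\theta \\
    &= x_s\cos\theta - y_s\sin\theta
\end{aligned}
```

Rotating the other way (θ replaced by −θ) flips the sign of the sine term, so whether the term is added or subtracted depends on which direction is taken as positive.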

  • @qzorn4440 8 years ago

    very interesting, now back to my RPi3 and opencv AI logic. thanks.

  • @jasdeepsinghgrover2470 7 years ago

    I am just a first-year B.Tech student with a small query. With orthographic projections, we take views from mutually perpendicular directions. If my coordinate system is set with its axes parallel to the viewing directions, won't the computation be much easier? The views along the x axis and along the y axis will always share the same z coordinate, and the views along x and z will share the same y coordinate. So won't these conditions actually give the object in 3D (I mean a 3-dimensional array with known XYZ coordinates of all vertices)? Then we could rotate it and check whether we can generate similar 2D images from the 3D model.

    • @jasdeepsinghgrover2470 7 years ago

      experts please check this one

    • @saikiran1314 7 years ago

      jasdeep singh Grover try asking the question on stackoverflow, man.

    • @AbdallahHegab 7 years ago

      Is that a useful course, mate?

    • @user-ol2gx6of4g 7 years ago

      Not sure what you are asking but you don't get to take "mutually perpendicular directions" of pictures in practice.

  • @katateo328 2 years ago

    hahah, yeah, the power of storytelling :D not the power of love or power of dream :D is the real power! the other two are fake power, just for propaganda.

  • @deusvult5738 6 years ago

    "It's still not solved." Why am I wasting my time watching this then? Maybe I should just go back to actually solving it and shit.

    • @shakesbeer5171 11 months ago

      just fed the image to chatgpt. It recognizes it.