JPEG DCT, Discrete Cosine Transform (JPEG Pt2)- Computerphile

Computerphile

Просмотров 640 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 22 ноя 2024

Комментарии • 648

@joshhyyym 9 лет назад ⁺³⁶⁸
1:24 that is the best freehand sine wave I've ever seen.
@akshayaggarwal5925 7 лет назад ⁺²⁵
Cosine*
@bluerizlagirl 6 лет назад ⁺²⁵
It's exactly the same curve, just shifted to the left by π/2 .....
@davidjames1684 4 года назад ⁺⁴
Look again, he bottomed out the curve well past pi radians. That is a blatant error.
@itsbk6192 4 года назад ⁺¹¹
@@davidjames1684 lol you would hate to see what I can achieve freehand
@seanmatthewking 4 года назад ⁺⁴
David James Oh dear, that will not do. Fire up the guillotine and prep my guillotine dress.
@jawtheshark 8 лет назад ⁺⁷³⁷
18 years after university, I finally understand what my prof tried to explain....
@カラスKarasu 8 лет назад ⁺¹⁰
But it's the best way to get in touch with professionals a and get job offers in the field as a junior
@jawtheshark 8 лет назад ⁺²¹
With luck and understanding other parts well enough.
@DarthZackTheFirstI 5 лет назад ⁺⁵
doubt you need that to graduate (at least as bachelor) ... . took disney years to figure out how to handle fur of pets :P
@WahranRai 4 года назад ⁺⁵
Il vaut mieux tard que jamais
Better late than never
@DrPastah 3 года назад
@@カラスKarasu It is? How so?
@GtaRockt 8 лет назад ⁺⁴⁴⁵
I love it when I procrastinate and notice "hey, I need the stuff this guy's explaining in my exam!"
double win
@Mustombrider 6 лет назад ⁺¹
Was exactly my thoughts just before i saw this comment :D
@vitarkamudra4548 Год назад ⁺⁵
This young gentleman uses paper and pen to explain something so well, much better than many others using fancy cartoons and movies. Thanks!
@peterbonnema8913 8 лет назад ⁺⁷⁶
For anyone interested: this is roughly how the fourier transform on images works. The main 3 differences is that you don't do it on 8x8 blocks but on the image as a whole, you consider 'waves' with lots of different angles and not just vertically and horizontally and you also add in sine waves and not just cosine waves.
@askmiller 9 лет назад ⁺³²⁹
There's a few steps that he skipped.
as many of you might realize, images aren't all going to make those nice blocks of 8, so you need to pad the edges with a few pixels most of the time.
second, he never actually talked about how the DCT is mathematically performed. Basically it's matrix multiplication between your shifted values and a DCT matrix that's generated as basically a sum of a bunch of cosine saves.
third, i'm not sure if this is included with their huffman video or not, but the values are actually stored in 1's complement, which is interesting because they completely ignore 0's. In coding, there's basically a skip code which could mean either skip every remaining coefficient or skip a chain of several.
fourth, the DC value (top left) isn't just stored separately, but it also needs to be encoded in a separate way. Because the values are typically so much larger than the others, you don't actually store the DC value itself, but the comparison to the previous value. For instance, if the first value is 84 and the second is 85, you store 84 for the first block, then 1 for the second.
This is by far the best video I've ever seen for explaining jpeg, and all of the above isn't really necessary for anyone just curious about how jpeg works, but it's still cool stuff to know imo.
5 лет назад ⁺³⁵
the first is at 14:45
@МарияБалконская-к1с 5 лет назад
hey man, can you answer me, I need your help
@geot4647 5 лет назад ⁺³
Explains why JPEG lossless cropping never quite matches the original boundaries unless they hit edges of the frame.
@steelmagnum 4 года назад
I'm trying to work out step two you listed, applying the DCT but I'm not getting the correct values. Would you be able to help me out? I'm using google sheets to get the sumproduct of T M T'. For the DC I get -332.7 instead of -370. The AC values are just completely out of whack
@dhruv1846 3 года назад
@@steelmagnum use matlab image processing toolbox
@williamborrell4219 7 лет назад ⁺¹⁴
These series are fantastic for three reasons: 1) high quality information 2)organized, sequential presentation with examples 3) no youtube fluff
@doemaeries 9 лет назад ⁺²²⁰
6:11 nice how he did the trick with the pen without even stopping to talk
@ThePillow86 9 лет назад ⁺¹¹
doe maeries well spotted!
@Imagonem 9 лет назад ⁺⁶⁰
doe maeries This guy has skills. His freehand sinusoids are also pretty impressive.
@Celrador 9 лет назад ⁺¹¹
doe maeries Typical computer scientist. :P Almost half of my fellow students can do this aswell. And the drawing of the function? Well... If you are forced to draw them so often in your courses, you just get used to it, I guess. (He is competent and good though, nonetheless, it's just not THAT special in our field of nerds. :p)
@ashfaqadib8085 7 лет назад
I rewatched that part 4-5 times, so awesome that was.
@saultube44 4 года назад
Oh SOB, I didn't even notice it and I've seen this video a few times, only because of this comment, wow, awesome trick, smart guy, a little nervous had a energy release with that, it usually happens
@Sleeperknot 2 года назад
I am so glad that I clicked on this video out of the search results to learn something about DCT. I have to say that the quality of teaching in this video is simply top-notch. Many other videos out there simply explains how to calculate DCT, without ever relating to any practical usage at all. Some of them dwell only in the dark regions of the textbook filled with a lot of formulas.
@DanTheLetsPlayMan 9 лет назад ⁺⁹
Okay so during my university course we learned this in 90 minutes. And there are still some bits in this video that we never learned. This is so much better explained than anything we learned or that I could find on the topic online. Very awesome video!
@ViltsuV 9 лет назад ⁺²
Tried to understand this around a year ago, but Mike really put it in words better than any book I read. Thanks!
@EdEditz 9 лет назад ⁺²⁰¹
I'd love to experiment with changing the quantization numbers and see what weird images that would produce. Like glitch art maybe. :)
@maxemore 4 года назад ⁺¹⁰
I had the same thought after watching this
@Toksyuryel 3 года назад ⁺⁹
I bet you could do some really interesting stenography with this
@nayeemrafsan356 2 года назад ⁺²⁶
now those arts are being sold as NFT
@archermidland 2 года назад ⁺¹⁶
now those NFTs are worthless
@Chakaramba 3 года назад ⁺¹
After an online university lecture about JPEG compression, that video sets all the stuff in my head to the right places! Thanks for such a great example of tables with their input/output performed
@tylouww.1915 3 года назад ⁺¹
This is so cool! It's like Fourier Transform but only the cosine coefficient. In university class, analysis, the Prof always said it's being used all over the computer image and video compression, but never really gave an example, so now that I have one its really cool to see this at work
@miaowang4913 2 года назад
Thanks so much for the clear explanations! I was reading through different papers trying to understand the concept of DCT but always felt a gap here and there. This video gave a super lucid and straightforward understanding in a layman-friendly way.
@THEzTROLLlz 6 лет назад ⁺¹
Extremely well presented. There was a bit about what exactly the AC values represented that I didn't already know and this video didn't skip a beat.
@AnuragSyal 7 лет назад
This is by far the best video I have seen on JPEG compression. He explained the process thoroughly.
@taojiang2735 Год назад
This guys is so eloquent. make the jpeg so much easier to understand
@rathinavelsankaralingam2929 Год назад ⁺¹
Wow. Just wow. Hands down, the best video for understanding JPEG on the internet! Thank you Sir :)
@muralidharan6755 3 года назад
How I missed this great lecture about JPEG all these years. Well youtube won't recommend these videos and I searched for JPED compression and landed up here part 1 and part 2. Amazing :D ....
@shubhammguptaa 7 лет назад
This man explained this so easily which I was not able to understand through any article/book. Great job!!
@issamoudriss6564 8 месяцев назад
I think no one has ever explained frequency transformation as good as this video. Thank you man!
@son-tchori7085 9 лет назад
For those wondering what is a *macroblock*, it is a superset of *blocks*.
For instance, in *4:2:0 YCbCr* (subsampling by two both horizontally and vertically), a macroblock is 16x16 pixel², thus containing four 8x8 pixel² Y blocks + one 8x8 Cb block + one 8x8 Cr block.
@Loatroll 9 лет назад ⁺³
Very well done! I've been meaning to learn more about JPEG, and here you come along and explain it very coherently. Thanks for that!
@ai_is_a_great_place 2 года назад ⁺²
Branch Education just did a fantastic video on jpeg compression but this one is even more fantastic!
@hlilje 9 лет назад ⁺⁵⁸¹
Obamna
@ibrahim47 6 лет назад ⁺⁶
this is sickly true.
@a.wosaibi 5 лет назад ⁺⁴¹
That was such a discreet pen flip
@davidjames1684 5 лет назад ⁺⁶
looks more like a twirl to me, not a flip. Also, closer to 6:14, not 6:10.
@geot4647 5 лет назад ⁺¹
Bloke talks a bit loud and fast, though. Snowden semi-doppelganger with brow jewel to boot. Sorry, back to the graph now.....
@TeamGreedler 5 лет назад
@@a.wosaibi
discrete* :)
@RobinWootton 3 года назад
Brilliant - I've wondered this for 25 years (since meeting the .jpg format in 1996). Another well prepared lecture by Dr Pound
@lit2021 7 лет назад
This is the best basic explanation of JPEG compression that I've seen.
@kilésengati 9 лет назад
What I love about this channel is that it keeps me interessted in maths lessons.
@MrDivinity22 9 лет назад ⁺⁶⁹
I love the by-the-way-I-do-penspinning on 6:13 xD
@karl5874 5 лет назад ⁺³
Omg I paused the video, played it in slow motion a few times, practised the rotation by holding the pen with my other hand and after 10 minutes I did my first successful pen spin. I did not expect to learn that watching this video.
@henrikwannheden7114 9 лет назад
OMG! Mike just drew the most perfect sine curve I've ever seen drawn by hand! Impressive. Most impressive.
@crevlthe 3 года назад
and once again this channel comes to the rescue, doing a superb job in explaining a complex concept in an easy manner
@1knmd 3 года назад ⁺¹
Man, this is brilliant. I'm going to put the video to my college students and sit with them to watch it instead of giving the lecture myself.
@adiosm57 5 лет назад
This is the best explanation for the DCT process I've ever searched.
@ilkerylmaz 3 года назад ⁺¹
greetings from Turkey. we will do a jpeg algorithm this year at school. While researching I found this video. You explained it very well. I hope we can succeed too.
@ilkerylmaz 3 года назад
14 day for finish...
@stellamn 2 года назад
Very well done. Very clear explanation that included all necessary information to get an understanding of the entire process! I wished my prof would take that as an example of an efficient way of explaining a theory. He could save 50% of his time.
@tl8990 2 года назад ⁺¹
Thank you for saving my course project, Sir.
@screamingfungus_ 9 лет назад ⁺¹
Love the casual pen spin 6:14 .
@macronencer 9 лет назад ⁺¹
Worth waiting for! Thank you for this very enlightening explanation. I realise there's more to it than you showed, but I now have a very good idea of what's going on. It may sound odd to say this, but I think this is an important day in my life. I've been using JPEG since at least 1995, and twenty years later I've finally discovered some of its secrets. It's like having a deep talk with an old friend...
@sanjeevdubey8913 7 лет назад
You addressed the meaning of frequency in images, which others completely miss out . Thanks.
@5astelija75 4 года назад ⁺¹
6:14 dayum boi that penflip tho
@rishabhkash5077 4 года назад
Your knowledge with your great voice makes this subject more interesting
@hannahalsouqi7609 8 лет назад ⁺²
This is so fascinating!!! This video is gotten me more interested in compression than i already am. I love seeing math at work.
@pranavsreedhar1402 5 лет назад ⁺¹²
this is just awesome. Thank you for explaining JPEG in a compressed form
@5N34KY 6 лет назад ⁺¹
You explained this 100x better than my prof did in 1/100th of the time... Thank-you for this!
@awisecar9540 Год назад
The jpeg videos are probably some of my favorite computerphile videos! Well done! 😊
@PuglyWont 9 лет назад
Very nice explanation. I've had some ideas of how the encoding works, but seeing the cosine chart really clarified it.
@willmcpherson2 3 года назад
9:53 so satisfying when he reveals all the 0s that can be huffman-encoded!
@rageagainstthebath 9 лет назад
What a nice repetition of forgotten lectures back from college :) Thanks a lot!
@cringeycrocodile 9 лет назад
So the spectral method with truncation is the common practice in compressing images. Now I understand why we see the dirty patches from highly compressed jpeg images of texts or line works, which in fact have different weights (smaller in low freq. and greater in high freq.) from the pictures.
@karkinissan 8 лет назад
Well, that was much easier than reading 5 pages of the book. Thanks.
@TechyBen 9 лет назад
I always thought it was more complex than this, with JPEG using more systems/divisions or shapes/patterns for the image compression. I never realised the 8x8 sections were using just cosine waves. Wow.
@Apchenail 9 лет назад
You guys make the most interesting video on youtube, all channels considered. High level synthesis. Please keep doing what you do!
@baronvonmike 11 месяцев назад ⁺¹
Wonderfully done, but it would have been nice if you explained how the coefficient for each 8x8 DCT was calculated. I assume its just a straight accumulation of each pixel difference on the 8x8 block, hoping for a total of zero, but I'm left wondering.
@TheBoojah 9 лет назад ⁺¹
I used a hex editor to mess with the quantization table of an image, fun times! The picture comes out all weird looking, but once you know how it works you can achieve some interesting effects.
@itsRAWRtime007 8 лет назад
very nice series of videos. used them for preparing exams on multimedia systems.
@anatoliykosterev8856 5 лет назад
This is by far the best JPEG explanation I found. Thank you!
@sebastianamado7758 2 года назад
Incredible, clear and concise explanation. Greetings from Argentina
@PeterParker-vn2hv 5 лет назад
This is one of my favourite videos on RUclips.
@kalleguld 9 лет назад ⁺²
Great explanation of some fairly difficult subject matter. Looking forward to the next part.
@batman3698 4 года назад ⁺¹³⁰
Jpeg is like a fastfood worker who drops the bun on the floor and picks it back up "they won't notice"
@cancername Год назад ⁺²
And you won’t.
@batman3698 Год назад
@@cancername true
@stevesynan3910 8 лет назад ⁺⁴⁰
It blows my mind how some people can just rattle this stuff off like it's nothing, meanwhile if you asked me what I ate for dinner last night I'd probably have to think for two solid minutes.
Tons of great information! RGB hurts my brain much less than YCbCr..
@yashdeephinge 7 лет назад ⁺¹²
Dude may be the guys teaching in the video doesn't now what he eat yesterday but its the passion that helps people store this much info in brain.
@DarthZackTheFirstI 5 лет назад
its just practise and interest. its like learning a language. after some time (and work, most want to skip *g* ) you get there usually.
@sanjayreddy3295 4 года назад
Sir, you have way too much of knowledge. Thanks a lot for such super high-quality knowledge resources that you are providing for free.
@AvZNaV 9 лет назад ⁺¹
Ingenious method to remove quick changes in a channel!
@Mohamed.wael7 3 года назад
This was very straightforward to understand although I am a Mechanical Engineer !
@GardenStateDigital 9 лет назад
this application is cool. now I have a better and concrete appreciation for the cosine wave
@DanielBeecham 9 лет назад
One of the best videos on computerphile. Thanks for this.
@FortuneRayzor 9 лет назад ⁺¹
I think that the next video should be about Haar Wavelet Transform and its superiority to the method in this video. It's a shame that newer implementations of JPEG are not more popular than this old method with DCT.
@pwlegolas3 4 года назад
Very Impressive Dr. Mike Pound..
@BenjaminWiberg 9 лет назад
This was a very good and thorough explanation of the encoding. I am currently studying transform theory and signal processing and this was a great complement for further understanding!
@Scalibq 9 лет назад
There is one step missing between zigzag-reorder and Huffman: the zeros are actually not compressed by Huffman at all, but the block is zero-length encoded first: instead of storing all the 0s, you just store the number of zero elements.
@josephpeters5681 6 лет назад
He is the coder that rips people off.
@jlinkels 6 лет назад
It is the first time in my life I actually understand what happens with DCT in JFIF compression. So I am grateful for this video. But WHY has everything to be shown so fast. Papers, figures, graphics, diagrams are all shown like flash-flash-flash as if this were an action movie. As far as I understand video time on RUclips does not cost anything, and neither does it require reels of celluloid or film processing. Like I said, I understand how DCT is used, but I had to replay the video several times and pause it often.
@dicegameuchiha 8 лет назад ⁺¹⁵
the greatest video ever created tbh
@charleslandry-forcier2231 9 лет назад
Probably one of the best explanation I have ever seen!
@rommix0 8 лет назад ⁺¹
DCT is very similar to FFT as it converts samples from a time domain to a frequency domain. MP3 also uses DCT but with windowing.
@sharks445 7 лет назад ⁺¹
Mr Spectacals Correct. MP3 uses a variant of the DCT-IV with overlapped window, called the MDCT. that's applied to the granules of each subband
@MBaadsgaard 7 лет назад ⁺¹
Okay okay, guys.. I don't understand mathses that well so help me out... "Windowing" is defining a finite domain of the series of functions, right?
And what is the difference between FFT and DCT exactly? They are both series of sine or cosine waves at different frequencies used to make an approximation of a signal or other function or such.
tried the wikipedia page, but that is not much helpful with those big words :/
@heaslyben 9 лет назад
Great explanation! All the 8x8 printouts worked well on my brain.
@mikael642 4 года назад ⁺¹
I'd really like to see a video like this one explaining how mp3 compression works
@squidcaps4308 9 лет назад ⁺⁴
Modified DCT is used to compress MP3, AAC etc. audio formats.. Didn't know that, read it in wikipedia but since it is essentially a frequency filter, i thought it could be used for audio too. Audio, of course is 1 dimensional and images are 2D so the exact same can't be applied, thus "modified" DCT or MDCT..
Quote " In MP3, the MDCT is not applied to the audio signal directly, but rather to the output of a 32-band polyphase quadrature filter (PQF) bank."
@matsv201 9 лет назад
SquidCaps Yepp, thats was why the Pentium MMX was made
@squidcaps4308 9 лет назад
matsv201
Ah, thanks for that tidbit. It was if i remember right it was marketed as MultiMedia Extension or something like that and it really was considerably faster than non-MMX machines.. Made my first album on 188MHz MMX :)
@StigHelmer 9 лет назад ⁺¹
SquidCaps Actually sound compression using DCT operate reversed to image compression. JPEG transform real color values into frequency domain as explained in the video but sound is already in frequency so the DCT transform them into real value domain before compression.
@squidcaps4308 9 лет назад
Stig Helmer
Again, makes sense, audio is serial data to begin with.
@Madsy9 9 лет назад ⁺¹
Stig Helmer That's not correct. Uncompressed PCM audio data as well as raw data fed to the audio card is in the time domain, not the frequency domain. That's why filtering of audio data is done with FIR and and IIR filters in the time domain with convolution. If audio generally was represented in the frequency domain, you would just do filtering with multiplication and an appropriate window.
@MorganEarlJones 9 лет назад
This guy is pretty good at drawing those waves.
@trudyandgeorge 9 лет назад
A lot of info compressed into 15 minutes, well done.
If PNG is equally fun to explain then to that next! It would be good to see something lossless in comparison.
Thanks.
@SerBallister 9 лет назад ⁺¹
George Edwards PNG is unfortunately not as interesting as JPEG.
@OmarChida 5 лет назад
Dr. Mike I love the way you explain things
@aswinpillai9777 8 лет назад
terrific video...cleared all the doubts in a flash...thank you sir
@danielg9275 9 лет назад
Damn, Mike Pound can really draw some freehand cosines
@markurban9113 9 лет назад
Great work, I have DCT on exam next week and I finally understand it. :)
@loopuleasa 9 лет назад
This guy is really good at explaining really complicated concepts.
College senior here, and I was fascinated by the gist of it. I always wondered how the hell are jpegs so small.
Will there be any talk about PNG files? I guess they have a different encoding dimension for the opacity/alpha levels.
@MaizumaGames 9 лет назад
Very good explanation. The sheets with tables helped a lot!
@magiccouponsREAL 9 лет назад
Thanks a bunch! Helped with my exam tomorrow
@jdgrahamo 9 лет назад
I actually understood most of that. Things are looking up.
@thevoid141 7 лет назад
Finally i knew the relation between waves and images. Thank you!
@TheRomichou 9 лет назад
Such a complex process but so well explained! awesome video
@bayraktarx1386 9 лет назад
Never thought it's so complicated, great video!
@sumejjaporca2231 8 лет назад ⁺¹
Awesome video... Thanks for pointing out the important stuff needed to understand how JPEG actually works. :)
@krabhisheksaharsa 3 года назад
Wow! I really very badly needed this explanation. Thanks a ton!
@stromboli183 5 лет назад ⁺¹⁸
At 6:58 “we calculate the DCT coefficients” which are the weights of each cosine wave, or the amount that each cosine wave contributes to the original image. But the actual calculation is not shown, suddenly a piece of paper with DCT II coefficients just appears with all the numbers.
How are these coefficients calculated??
@hazemkhairy8283 4 года назад ⁺²
you can see it at 6:26. Basically, each 8x8 block from the image has a contribution from all blocks "the ones that have blue borders". How do you find the contribution of each blue block to our 8x8 image block ?
Well, you correlate the 8x8 image block with a blue block and the result will be a number (coefficient). This coefficient is the "weight" of the blue block to our 8x8 image.
Correlation here means multiply each element in the 8x8 block with its corresponding element in the blue block and sum the result into one number.
Hope this helps
@skyjumper4097 Месяц назад
insane hand-drawn waves, wow.
@TheWyrdSmythe 9 лет назад
That was really good! I've never been clear on how JPEG did its magic -- now I know. Thank you!!
@yuxin7440 5 лет назад ⁺³
Great video. Can you also talk about JPEG2000 compression algorithm? I heard that it uses discrete wavelet transform to achieve even higher compression than DCT.
@derekprestegard9614 6 лет назад
FANTASTIC explanation.
@SimeTologist 9 лет назад
Very nice video. Mike has got some serious didactic skills. Plus he's prepared!

Следующие

Автовоспроизведение