I Tried Sorting Pixels

Поделиться
HTML-код
  • Опубликовано: 21 ноя 2024

Комментарии • 2,3 тыс.

  • @Acerola_t
    @Acerola_t  Год назад +4479

    the 30 second parallel bitonic merge segment took me 15 hours to edit
    anyways what was your favorite cat clip

    • @meqativ
      @meqativ Год назад +413

      the cat one definitely

    • @sans5955
      @sans5955 Год назад +281

      I actually wanted to listen to your optimization explanation cuz optimizing is something I struggle with immensely but your cat clips were too distracting :(

    • @SokoBuilds
      @SokoBuilds Год назад +43

      the one with the cat

    • @DashconroyAS
      @DashconroyAS Год назад +29

      gotta be cat clip #2

    • @caps_lock
      @caps_lock Год назад +29

      ill watch it again if it makes you feel better

  • @capsey_
    @capsey_ Год назад +7012

    Graphics Programmers: My program renders an image for 3 days, but I was able to shave off 4.3 hours, I'm so good at it
    Real-Time Graphics Programmers: I just wasted quarter of a millisecond, it was reserved for pathfinding calculations, my managers is going to kill me

    • @Numbabu
      @Numbabu Год назад +781

      “My optimized bogosort only takes one billion years to finish sorting now, down from 1.1 billion, it’s so much faster”

    • @thysem
      @thysem Год назад +363

      @@Numbabuto be fair that is 100 million years less than before

    • @ayaderg
      @ayaderg Год назад +269

      @@Numbabu the optimization? remembering not to choose the same random numbers as last time

    • @senzmaki
      @senzmaki Год назад +205

      @@Numbabu bogo sort lowkey underrated, literally the fastest sorting algorithm if you're lucky enough

    • @airplanes_aren.t_real
      @airplanes_aren.t_real Год назад +75

      ​@@senzmaki that's like saying "casting fireball on dnd is underrated, it creates a portal to the hell realm if you're lucky"

  • @Not_mera
    @Not_mera Год назад +6022

    In art class, I opened my photos in notepad++ and wrote some stories. My teacher was shockingly confused how I'd intentionally altered a pic to make it "glitched." They're just numbers in an order, and I changed it.

    • @Cyberfishofant
      @Cyberfishofant Год назад +70

      niice

    • @Cyberfishofant
      @Cyberfishofant Год назад +244

      did you just straight up type into it?
      Like Counter: 40 (in case yt hides it)

    • @roberine7241
      @roberine7241 Год назад +96

      A am wondering what that looks like. I'll have to try that out at some point.

    • @likemau5552
      @likemau5552 Год назад

      I did that too, way to go you can really get a variety of effects depending on type of file, bmp, png, jpeg, propably a bunch more interesting lossy and lossless compression algorithms, but jpg is really good if you want a lossy compression artifacts, and for lossless compression you can introduce major glitches into a png file, also you can mess with pure non compressed bitmap if you want more precise glitch manipulation, try them all out and remember that there are different types of .bmp to choose from

    • @gljames24
      @gljames24 Год назад +419

      @@Cyberfishofant Yep, you can open an image in a text editor and just start typing. Or even manually write an image, but most formats are compressed.

  • @Attemptedburger
    @Attemptedburger Год назад +674

    I would like to point out that the effect would be great for gameplay if it were only applied to certain objects in the game, via a two-color buffer that shows which areas are allowed to be affected by the effect before the contrast map is applied to those areas, allowing the effect to be limited to certain portions of the image.
    the uses would be awesome.
    you could have a character's skin glitching out, but their clothes stay stable, you could have a sword that leaves behind a glitchy trail in the air, and so on and so forth.

    • @BustyCatbot
      @BustyCatbot Год назад +39

      A game mechanic that allows you to interact with "glitched" objects in a specific way would be pretty damn cool, it'd definitely be an eye-catching indicator.

    • @krasistefanovlol
      @krasistefanovlol Год назад +26

      considering it has to sort less pixels too, if the screen isnt covered in areas to pixel sort im sure it'd get to 2 ms or less

  • @themilkguy9572
    @themilkguy9572 Год назад +725

    I would love to see a game made with this filter. something like psychological or surreal horror. I think this filter paired with good sound design and a decent art style would make for a really mind bending and fun game

    • @sgramstrup
      @sgramstrup Год назад +12

      I think he needs to blur the final stage a bit, but interesting..

    • @slimeball3209
      @slimeball3209 11 месяцев назад +4

      cyberpunk isnt having it?

    • @TRENTINATERx2000
      @TRENTINATERx2000 11 месяцев назад +10

      cyberpunk kind of implemented it when the relic effects affected the player.

    • @uh-ohspaghetti-os4118
      @uh-ohspaghetti-os4118 10 месяцев назад +4

      Not a horror game but Splatter uses effects like this very often

    • @spazzls4090
      @spazzls4090 10 месяцев назад +2

      Cyberpunk is your game if you like existential horror

  • @Mcbuzzerr
    @Mcbuzzerr Год назад +1800

    I'm not gonna retain any of this information, but I always feel like acerola got to learn so much in making each video! This was really interesting to watch!

    • @snooks5607
      @snooks5607 Год назад +19

      just yesterday Andreas Kling in his 1000th video talked about consumption patterns and how he found it hard to retain information from watching tech videos himself. he mentioned trying taking notes about the videos and how that led him to being more selective in what to watch. not sure I'd ever have the discipline but found it interesting anyway.

    • @SquidHominid
      @SquidHominid Год назад +7

      as a 3D graphics student, this comment is every day of my life lmao

    • @tobenamed610
      @tobenamed610 Год назад +1

      Me neither, especially because I was 100% focused on the cats

    • @ren_cool_guy
      @ren_cool_guy Год назад +2

      the cat cams were the best parts

  • @Manafii102
    @Manafii102 Год назад +704

    i have never and likely will never need to create graphics shaders, yet these videos are so endlessly entertaining and informative that i can't stop watching

    • @reed6514
      @reed6514 Год назад +6

      i love learning about other programming specialties through vids like this

    • @konstaConstant
      @konstaConstant Год назад +2

      But because of this vid, you were there to witness how someone else was! And it was *so* cool

    • @CatinaboxReal
      @CatinaboxReal Год назад

      hello fellow void pfp

    • @yan-amar
      @yan-amar Год назад +1

      I mean, cat videos, right?

    • @CatinaboxReal
      @CatinaboxReal Год назад

      i am no longer void pfp

  • @holo_fever
    @holo_fever Год назад +302

    I read the title of the video as “I tried snorting pixels” - so you’re welcome for your next video idea.

    • @hamzamotara4304
      @hamzamotara4304 5 месяцев назад +7

      They make me feel like... So digital... Like fingers, yo!

    • @neverstopdatta1402
      @neverstopdatta1402 5 месяцев назад +5

      I almost read the channel name as areola

  • @juliusnebulus702
    @juliusnebulus702 Год назад +275

    Came for the programming knowledge, stayed for the reminders that the monogatari series live forever in my heart

  • @MMYLDZ
    @MMYLDZ Год назад +105

    The editing is very spesifically cultured and I now want more.

  • @verybasic1836
    @verybasic1836 Год назад +141

    Actually one of the most beginner friendly descriptions of Compute Shaders i have ever heard.
    You should do more with them. I had a hard time when learning Compute Shader Concepts the first time.

  • @TheDrMike25
    @TheDrMike25 Год назад +149

    In animation, PixelSorters are a great way to create scene transitions with this glitch aestethic! Ist awesome to see how they work

    • @kindauncool
      @kindauncool Год назад +3

      What is "black scene"'s importance

    • @WinterNox
      @WinterNox Год назад +1

      ​@@kindauncoolI don't that think it is anything of importance.

    • @amruthchangappa
      @amruthchangappa Год назад +1

      @@kindauncool not sure if you still care, but it's a reference to the monogatari anime series

  • @dimon3271
    @dimon3271 Год назад +690

    Babe, wake up. New Acerola video just dropped

    • @lolcat69
      @lolcat69 Год назад +6

      Ok honey

    • @dimon3271
      @dimon3271 Год назад +4

      @@lolcat69 😘😘😘

    • @PropaneWP
      @PropaneWP Год назад +4

      Just 5 more minutes ok

    • @mech_electro
      @mech_electro Год назад +5

      Babe, wake up, this comment trend started years ago

    • @smilleur
      @smilleur Год назад +7

      Im so fucking sick of seeing this comment trend

  • @renkuroex3548
    @renkuroex3548 Год назад +98

    I thought the title said snorting pixels

    • @RivertedYT-
      @RivertedYT- 6 месяцев назад +1

      No Comments?

    • @zoranradakovic2199
      @zoranradakovic2199 5 месяцев назад

      ​@@RivertedYT-1 comment? (start a chain dont break)

    • @kraberap-sniper
      @kraberap-sniper 5 месяцев назад

      ​@@zoranradakovic2199 chain dont break

    • @SmileReal
      @SmileReal 5 месяцев назад +2

      new coke flavor (i broke the chain, watchu gonna do about it?)

    • @SoulikPoulik
      @SoulikPoulik 5 месяцев назад +1

      THATS WHY I CLICKED ON THE VIDEO, LIKE WTF IS PIXELS

  • @austinconner2479
    @austinconner2479 Год назад +12

    If the parallel bitonic merge sort was fast enough you can also use it in the span sorting case: Do a first pass to assign to each pixel the index of its corresponding span, and then sort the entire column lexicographically first by span index. Lexicographical sorting can be accomplished by putting span index into some bits higher than the highest data bits of the sort keys.

  • @Riccorbypro
    @Riccorbypro Год назад +120

    Oh boy time for another sorting rabbit hole.
    Also having Nekopara as the associated visual for a "video game" just killed me every time.

  • @The5thUSER
    @The5thUSER Год назад +153

    I think there IS an improvement on the shader code to be made: If your sortvalue buffer is of a known data type (e.g. uint8), I think you can use a radix sort - which should be a lot faster than your current alg.

    • @Acerola_t
      @Acerola_t  Год назад +87

      it would still be single threaded radix sort which would be a yikes but I should try it yeah

    • @Sloimay
      @Sloimay Год назад +58

      ​@@Acerola_t radix sort is like O(3n) space and O(7n) time, we won't even need to transfer the buffer to group memory as each value in the buffer is accessed only once
      Its only problem is that its slower than the current sorting alg for small spans.

    • @IgnacioLosiggio
      @IgnacioLosiggio Год назад +31

      ​@@Sloimay why not both then. The control mask already has the span lengths!
      Each group can decide it's preferred algo (AFAIK doing this means that you should not even try to do more than one thread per group, GPUs are SIMT machines, right?)

    • @GhostGlitch.
      @GhostGlitch. Год назад +20

      @@IgnacioLosiggio imo probably not worth the effort. As long as the short spans don't take longer than the long spans, it doesn't help to optimize them.

    • @Nitronoid
      @Nitronoid Год назад +11

      There exist efficient `stable_sort_by_key` algorithms for the GPU.
      The solution here would be to sort the original pixels using a key.
      We can calculate it by first running a prefix sum over the mask, giving us 'spans' each filled with a unique index.
      Multiply the prefix_sum by the maximum value that will be used in the sort (ensures a pixel can't 'escape' it's span).
      Then, calculate the sort key using (original value * mask + prefix_sum).
      All the masked-out values will have the exact same key within a span. All the masked in values will retain the same local delta within a span. Each span's maximum value is guaranteed to be less than the following spans minimum value.
      Using a stable sort ensures that although masked out values have the same key value, their order doesn't change.

  • @springheeledjacques
    @springheeledjacques Год назад +87

    As an engineer with over a decade of experience in a completely different part of the field (distributed systems), graphics and shader programming has always felt like magic to me. Very cool to see it broken down like this.

  • @WinderTP
    @WinderTP Год назад +31

    I'm a video editor who loves glitchy effects and learning about how pixel sorting actually works has been very entertaining. I love how the post-processing workflow is so similar to doing VFX and colour correction too lol

    • @linesinthedirt
      @linesinthedirt Год назад

      how would you implement this as a sort of Adobe or AE plugin? is it possible? Pleas get back to me

    • @gabgren1
      @gabgren1 10 месяцев назад

      There’s another one called PixSort as well

  • @CEOofGameDev
    @CEOofGameDev Год назад +15

    13:42
    Mr. Rola, as in, short for acerola.
    I legit fucking died here, omg.

  • @isaacbunsen5833
    @isaacbunsen5833 Год назад +41

    You would get an effect that looks very similar if you simply set a hard number of samples per span. Because they're sorted they tend to look like gradients, so even a simple min and max of a span would look very similar.

  • @dotdotmod
    @dotdotmod Год назад +383

    I'm pretty new to shader programming and have no idea of 90% of the shader optimizations you were talking about, but just so I didn't feel dumb, I covered the cat videos with my hands in order to avoid getting distracted!
    Also, instead of having groups that have already finished a thread wait for the others to finish, can't you divide the spans in 2s (two spans a thread) so you can roughly get them 2 spans done by one group at the same time as others? It may sound weird because I'm probably mixing up some of these terms

    • @sephdebusser
      @sephdebusser Год назад +28

      That would make the worst case be twice the amount of time that it is now(bc you have no way to prevent a group of 2 spans to not be 2 worst-case ones). Right now we have the same amount of active groups than there are spans in the image. Reducing the number of groups is not a thing that's going to speed the whole thing up bc GPUs can easily do thousands of things in parallel. On a CPU implementation, this would be a better approach, bc CPUs can't do that many threads in parallel, so there'd be a queue of spans to still be processed.

    • @dotdotmod
      @dotdotmod Год назад +5

      @@sephdebusser So it essentially positively double the best case's performance and negatively doubles the worst case's performance?

    • @sephdebusser
      @sephdebusser Год назад +11

      @@dotdotmod no, bc best case scenario of the original is still the time of one group processing a single small span. In your case, the best case is one group processing two small spans. Still double the time

    • @SimonBuchanNz
      @SimonBuchanNz Год назад +5

      The trouble is knowing you need to do that, because it's the CPU that starts threads, but the GPU that knows how many threads are needed.
      It is doable, I'm pretty sure, but I think a better approach would be to try to figure out how to apply quicksort, O(n log n) is a lot better than O(n²). I think you could maybe pull it off by coloring the entire span in the thread mask with a span id instead of the start index having the length? Damn, now I want to try and figure this out.

    • @error.418
      @error.418 Год назад +7

      @@SimonBuchanNz "Damn, now I want to try and figure this out." This video is such a nerd snipe, yeah.

  • @Blockmaster987
    @Blockmaster987 Год назад +23

    Thank you for providing a full, in-depth explanation of how everything works. Graphics programming is a niche field and basic theory is hard to come by when searching the internet.

  • @supplechap5429
    @supplechap5429 Год назад +8

    I read the title as “I tried snorting pixels” and thought this was going to be a trip report on some new compound

  • @kaviprashanna
    @kaviprashanna Год назад +30

    I didn't understand 95% of the video.. but i feel smarter somehow.

  • @Dumdumshum
    @Dumdumshum Год назад +29

    I think they used an effect like this in cyberpunk. It would actually be useful for small segments of play in a sci-fi game where the matrix is glitching.

  • @BebxOfficial
    @BebxOfficial Год назад +66

    FINALLY THERE'S A VIDEO ON THIS!!!!!! LET'S GOOOOOOOO!!!!!!!!!
    this effect is such a banger. they should implement it into every competitive game ever made.

  • @NBDbingo5
    @NBDbingo5 Год назад +14

    you're so professional for displaying the other videos for each technique you applied toward the end of the video

  • @jamcdonald120
    @jamcdonald120 Год назад +20

    you may be able to speed up the algorithm by using the parallel prefix sums algorithm to calculate the span of each row. If you have enough compute cores, it can drop the time for creating this mask from O(n) to O(logn). Also, once you build the span mask using prefix sums, there are some handy parallel sorting algorithms that should let you divide up the sorting of each sub section without having to ask the CPU for guidance. If you want I can see if I can find my psudocode for this (im pretty sure we solved basically this problem in my parallel algorithms class), but I havent worked much with shaders, so your on your own translating it

    • @unflexian
      @unflexian Год назад +2

      heya, this is crazy interesting.
      do you have the source?

    • @jamcdonald120
      @jamcdonald120 8 месяцев назад

      @ianI think cs.wmich.edu/gupta/teaching/cs5260/5260Sp15web/lectureNotes/thm14%20-%20parallel%20prefix%20from%20Ottman.pdf covers prefix sums well and maybe www.dcc.fc.up.pt/~ricroc/aulas/1516/cp/apontamentos/slides_sorting.pdf for sorting?

  • @mistikalcanavarlarparlamen3265
    @mistikalcanavarlarparlamen3265 Год назад +18

    Hey! I like the fact that you spend time explaining us the history behind it all!!!

    • @strongmungus
      @strongmungus Год назад +1

      I agree. I also thought it was interesting that a woman was an early pioneer of sorting algorithms. The world was a lot more sexist in the 50s, so it's pretty impressive that she made the contributions that she did.

    • @celeron55
      @celeron55 11 месяцев назад

      @@strongmungus Except that it really isn't that surprising when you know women ended up working in the early programming profession a lot, as back then it was kind of an extension of a data entry job or similar. Then it transitioned into an engineering job, which in turn was more popular among men.

  • @theprobplays5986
    @theprobplays5986 Год назад +11

    13:10 where tf did my cats go >:(

  • @NeatNit
    @NeatNit Год назад +7

    17:09 In the end, the real time pixel sorter is the friends we made along the way.

  • @kachelstacktus
    @kachelstacktus Год назад +12

    I absolutely love the video and the idea. The one thing I have to ask for the next one is a short explanation at the start with example images. I wasn't sure what you were going for with the pixel sorting, since I thought you would do so over the entire image, thus rendering it entirely unrecognizable.
    It wasn't until 4:42 that I finally understood what the goal was.

  • @JohnDoe-hk6fe
    @JohnDoe-hk6fe Год назад +3

    That tony the tiger picture is 100% certified cursed.

  • @DavidHidde
    @DavidHidde 9 месяцев назад +1

    Came here to say that pixel sorting is something academic research has sort of looked at, it's just generalized into 2D grid sorting for visualizations (and def not real time lol). Pixel sorting is actually the use case I'm currently testing for my research into these methods 😅

  • @Tealsquiddev
    @Tealsquiddev Год назад +6

    Great video. All it needs is a quick historical recap of all sorting algorithms

  • @musicstreams3479
    @musicstreams3479 Год назад +44

    I think they may have used pixel sorting shaders to do some of the effects in Cyberpunk 2077. The effect at 16:15 looks really similar to the effect when you're on the Net or while viewing the edges of braindances

  • @TheSpyFishMan
    @TheSpyFishMan Год назад +95

    Idea for making it possible to parallelize the pixel sort algorithm (keep in mind I have no idea what I'm talking about):
    Instead of generating a texture with the start of each span encoded by the position of pixels, generate a texture where the value of each pixel in a column represents what segment of the column it's pixel is in. Say there are two spans in a particular column, then the values of pixels in the texture from top to bottom would be a block of pixels with value 1, then for the extent of the first span, pixels would have a value of 2, then a value of 3 between the spans, 4 within the bounds of the second span, and 5 to the end. Also, if the first pixel is part of a span, then the first block of pixels should be even, so it can start at 0, to maintain the relationship that even numbers represent pixels within a span, and odd numbers represent pixels outside a span.
    Then during the sort phase, instead of using a thread for every pixel, use a group of n/2 threads for every column of pixels, where n is the height of the columns. Then sort the entire column using the parallel bitonic merge sort algorithm, except make sure to first multiply the pixels sort value by 255 times the value from the corresponding texture location, or use a second comparison between the two texture values. Either way, the sorting algorithm will sort the entire column, and the increasing index will prevent mixing of spans and inter-span regions by making each region's values all greater than the previous region's values, and all smaller than the next region's values, or else achieve the same by some other logic. Then, either within the sort logic, or on a separate pass, take the original (unsorted, or just don't swap in the sort algorithm if both tex values are odd) value for pixels with an odd numbered texture value.
    So, in the end, the CPU can dispatch the same number of thread to every column of the image regardless of the number of spans in each column, and you can do the parallel sort algorithm instead of single threading them.

    • @telaferrum
      @telaferrum Год назад +9

      Would love to see Acerola try this. I saw someone else also suggest this same idea after reading all 300-something comments as of now.

    • @iskierka8399
      @iskierka8399 Год назад +15

      For someone who has no idea what they're talking about, this is pretty spot-on to what I was going to suggest as a graphics programmer.
      You would probably keep the ranges that don't want sorting by simply modifying the swap logic, so that either even or odd spans (depending on code design and/or artistic choice) evaluate as their pixel position instead of value (easiest way to preserve position). This would be preferable to a second access since you're already loading the texture and writing anyway, so copying the original values in would be less efficient by adding more accesses.

    • @charleslambert3368
      @charleslambert3368 Год назад +1

      is parallel bitonic an in-place sort algorithm? If so, i'd not worry about spans per-se but just play around with functions you AND with the comparison logic to see what happens. Maybe only swap two values if their luminance has the same first 3 bits or if their original positions are within n pixels of each other.

  • @Opal_Sand
    @Opal_Sand Год назад +2

    Your videos are amazing. You’re good enough at coding to talk about it in a easy to understand way, you structure your videos in easy to digest and entertaining way, and your editing complements it all perfectly.

  • @williammanning9323
    @williammanning9323 9 месяцев назад +1

    If you think the techniques used in this video make you uncomfortable, check out per-pixel linked lists! The shaders for those look like they were written by somebody who doesn't quite understand shaders yet. Including 'for' loops over every "fragment"

  • @almoni127
    @almoni127 Год назад +10

    Never did graphics programming, but what about the following solution?
    Calculate for each pixel whether it is inside a span and if so, what span number.
    Then use the bitonic sort with the following comparator:
    If pixels are both in a span and have the same span number then compare them by value. Otherwise, compare them by index.

    • @thechurro3920
      @thechurro3920 Год назад +2

      That was my thought as well! We could even combine the process of computing the simplified sorting value into span marking to avoid excess data writing.

    • @almoni127
      @almoni127 Год назад

      @@donaldhobson8873 right

    • @kiyu3229
      @kiyu3229 Год назад

      I haven't seen the video yet but can't you just get the RGB average of every pixel and then just sort that

    • @telaferrum
      @telaferrum Год назад

      Would love to see a follow up with this approach. Someone else suggested this in another comment too after reading through all 300-something

  • @ainsel98
    @ainsel98 Год назад +6

    7:12 you can't keep teasing me with that tiger

  • @psychwardescapee420
    @psychwardescapee420 Год назад +3

    14:43 waittt i was literally thinking "hey this looks like a serial experiments lain edit" this ENTIRE time

  • @Moulk
    @Moulk Год назад +2

    Instructions unclear: I ended up sorting kitties by cuteness, but they were all so cute that my GPU melted.

  • @Riazey
    @Riazey Год назад +2

    "tasteful chromatic aberration" was up there in hype quotes for this video, just below "lets switch over to FFXIV"

  • @benmcreynolds8581
    @benmcreynolds8581 Год назад +6

    I really dig this aesthetic that you created from this. It's really a awesome vibe. The amount of creativity that could be utilized by this is, is huge. Especially with Analog Horror, and other Avant guard type art styles.

    • @Nerdule
      @Nerdule Год назад +1

      I'm sorry, but this kind of algorithmic manipulation of discrete image pixels is exactly the opposite of analog horror. It literally could not be more digital. Like I know this is a nitpick, but not everything glitchy is analog horror!

  • @whiteeyedshadow8423
    @whiteeyedshadow8423 Год назад +7

    0:06 man of culture I see

  • @Lol_low
    @Lol_low Год назад +3

    10:23 no way💀 bro really pulled the subway surfers and family guy combo on us

  • @nxone9903
    @nxone9903 Год назад +1

    that cat trick to retain my attention and not abandon the video worked very well

  • @shimlaDnB
    @shimlaDnB Год назад +2

    genious, i used to work a lot with pixel sortring processing scripts back in the day but this is kind of a game changer, to run it on the GPU in realtime is quite a challenge you overcame, love the content and thanks for sharing the code!

  • @filiformis
    @filiformis Год назад +8

    As much as I love your cat, it's getting in the way of all the interesting graphics programming optimization visuals.

  • @Gymomanen
    @Gymomanen Год назад +5

    I crack up completely from these videos. Love how you manage to keep it entertaining while still being so informative.
    Considering you set a cap on the span length anyway, would it make sense to make an indirect dispatch with one threadgroup per span to bitonic sort them instead of just running one thread per span?

    • @Acerola_t
      @Acerola_t  Год назад +3

      In the absolute worst case yes since you know the whole image is being sorted but like you could just do a full bitonic merge sort if you want to sort the whole image where as I wanted to only sort the spans in the mask, which the cpu can't optimally dispatch groups for.

    • @Gymomanen
      @Gymomanen Год назад +5

      I may have missed the point, but my idea wasn’t that you would dispatch the groups from the cpu. You would do an indirect dispatch based on what you get from the pass that generates the span-mask you had. Instead of writing that mask, you could count the number of spans (using some atomic counter) and store that in an indirect-buffer. For each unique span you also store the start pixel and the number of pixels that span covers. Then you can just dispatch the number of groups (one per span) indirectly using the indirect buffer. So the cpu does not need to know anything about spans, and that data is kept on the GPU :)
      Are there reasons that would not work?

    • @calvin2307
      @calvin2307 Год назад +1

      I was looking for the comments if anyone had mentioned this. IndirectDispatch would work, you can even bucket a few fixed span lengths and dispatch shader variations. 1 threadgroup per span so you could do the bitonic sort in parallel. The problem is not dissimilar to tiled classification deferred shading .
      First video I've seen from this channel, but lots of fun :)

  • @Not_Pulsus
    @Not_Pulsus Год назад +4

    I was so damn happy watching this, I never thought I'd see an in depth exploration of my niche interest like this! I do Glitch Photography and I'm so passionate; I almost cried when you mentioned Kim Asendorf!

  • @pachete.
    @pachete. Год назад +1

    my brain stopped working after you put 4 cat videos on the screen, congratulations you overloaded my brain with a cat

  • @nathanchapin8445
    @nathanchapin8445 Год назад +1

    The thumbnail made me think that this was gonna be a deep dive into the human psyche and the limitations of the human brain but this is cool to

  • @zloboslav_
    @zloboslav_ Год назад +4

    I've seen similar looking effects in some glitch art communities, both in image and video, but never in a live rendering shader! Very cool video!

  • @kaidwyer
    @kaidwyer Год назад +25

    Incorporating this into a game itself instead of as a filter would be interesting. Imagine if you're playing as a character who's had a memory-altering implant for something like schizophrenia, and slowly start to see phantoms appear from a malfunction of the memory device.
    The character could slowly begin to realize the device was hiding the true reality all along, and fight to discern what the world is really like beyond the deep-fried digital veil. There could be quests exploring previously invisible areas, and a neverending struggle to balance the mental illness and the glitchy device.

  • @Pizhdak
    @Pizhdak Год назад +9

    13:36 I was already enjoying the video, and then the MGMT reference made it even better

  • @SiobhanYmeow
    @SiobhanYmeow Год назад +2

    every time i see that thumbs up tony the tiger image my mind numbs a little

    • @Acerola_t
      @Acerola_t  Год назад +2

      it's like a painkiller for the math parts of the videos

  • @TheDonsky
    @TheDonsky Год назад

    Great video! I think I know of an actual efficient solution:
    0. You create your mask as you do;
    1. You allocate a buffer of integers with elements for each pixel and assign 1 where there's a transition between neighboring segments; (1 operation per pixel; single dispatch)
    2. For each row, you create a segment tree of sums from the buffer above; (Roughly log(row/column) size dispatches; can be reduced with shared memory)
    3. For each pixel, you evaluate the segment tree from row/column start to the pixel index in that axis and store it in a buffer; this will give you "segment index" for each pixel in your desired direction (O(log(n)) operations per pixel; single dispatch)
    4. Do the parallel bitonic merge sort per row/column as usual, but replace 'less' operator with (a.groupId < b.groupId || (a.groupId == b.groupId && a.value < b.value)); (Exact same O(N * log^2(N)) you had);
    Don't think I'll be doing that, but have done enough stuff like this to believe this'll likely give you decent enough performance. Not sure about 2ms
    Update: I think someone already mentioned something very similar, I have not really looked into the comments before commenting. Not entirely sure if using segment trees is new or not...

  • @Vallee152
    @Vallee152 Год назад +26

    1:40 I wanted history tho ;-;

  • @dylangergutierrez
    @dylangergutierrez Год назад +12

    I think it would be really interesting to separate the chroma and luminance channels, and sort only one of them before recombining.

  • @mikeciul8599
    @mikeciul8599 Год назад +3

    1:39 - video skips past Betty Holberton
    *pauses video to google Betty Holberton*
    She was one of the ENIAC developers!

    • @Acerola_t
      @Acerola_t  Год назад +1

      betty holberton is one of the greats

  • @animalcrossingfan113
    @animalcrossingfan113 Год назад +1

    didn’t understand literally anything in this video, i loved it

  • @Wato-1876
    @Wato-1876 Год назад +2

    the amount of buff cereal tiger is crazy

  • @Not_Pulsus
    @Not_Pulsus Год назад +7

    Acerola, this channel is so damn cool. I feel like you're pioneering the "white paper" of the modern age.

  • @SupperGammer
    @SupperGammer Год назад +17

    The cat videos did not help my ADHD

    • @jaywalmoose9623
      @jaywalmoose9623 18 дней назад +1

      Acerola is farming watch time from how often I've had to rewatch parts after getting distracted by the cats

  • @pindebraende
    @pindebraende Год назад +7

    Your videos really helped my five year old learn about real-time image rendering, it's way better than the cocomelon version. she always asks for "the long hair man" (: thank you so much

    • @Acerola_t
      @Acerola_t  Год назад +4

      my true target audience

  • @yaojo27
    @yaojo27 Год назад +1

    Never thought Ted could be used to depict the raw emotion in regards to the CPU and GPU having communication issues with each other

  • @spataruandrei8888
    @spataruandrei8888 Год назад +1

    You won me at the moon album cover. Great video and amazing explanation. I like a lot the way you explain aspects in an academic manner while still keeping it funny and entertaining. Great video and great skills!

  • @Keriously
    @Keriously Год назад +15

    they did sorting on a pixel

  • @Request_2_PANic
    @Request_2_PANic Год назад +4

    I think a sorting algorithm for image data could be an interesting way to compress its size, even if it's not as efficient as other methods. Go through the image, list the colors as 24bit values, and use the numbers they correspond to and how many of each are along the lines.

    • @gljames24
      @gljames24 Год назад +4

      I'm pretty sure PNG already does something similar with its lossless compression.

  • @vintage08
    @vintage08 Год назад +3

    9:27 what if i wanna watch both the explanation and the cat!! they're BOTH cool!!

  • @ItsGBMoD
    @ItsGBMoD 10 месяцев назад +1

    i love the monogatari style editing you have for some scenes

  • @Killerkraft975
    @Killerkraft975 7 месяцев назад

    I like how simple yet in depth the explanations were. Im at university wanting to do this and thankfully this was easy enough for me to understand but no so simple to know without any context. Thanks !

  • @Aragubas
    @Aragubas Год назад +11

    That effect looks amazing! I can already think of usecases for it!

  • @Mo-rc6ph
    @Mo-rc6ph Год назад +5

    “Snorting Pixels”

  • @luismarioramirezperalta8045
    @luismarioramirezperalta8045 Год назад +6

    I think you could use simple quicksort or mergesort (no tread division) to make the sorting of the spans significantly faster, maybe not 2ms fast but way faster

    • @BrooksMoses
      @BrooksMoses Год назад +1

      The sort "keys" are small one- or two-byte integers, so you could also try a radix sort. That's O(N), whereas the quicksort or mergesort algorithms are O(N log N).

  • @bonehead537
    @bonehead537 Год назад +1

    Honestly blows my mind seeing how effects like these work behind the scenes. Loved playing around in after affects using datamosh 2 and ae pixel sorter 2, but to see a pixel sorter in real time in a game engine while also being open source is mind blowing.
    Been recently been trying to create a little suite of shaders in unity of the majority of effects that I use in After Effects, the main ones being datamoshing, pixel sorting and ae's colorama effect (like a custom heat map effect, annoying me atm 💀). This pixel sorting video and the rest of your series of creating these custom shaders for FF14 has given me a hell of a lot of motivation and i cant thank you enough for that. 🙏

  • @SorcererDragon
    @SorcererDragon 11 месяцев назад +1

    I do QA for a rendering software and this is genuinely so much more interesting than I imagined. It’s facinating to take a look under the hood even if it’s not something the devs I work with would ever do.

  • @nxone9903
    @nxone9903 Год назад +6

    11:37 he he he cat-egorised

  • @drolemem
    @drolemem Год назад +7

    12:06 sus

  • @Nielk1
    @Nielk1 Год назад +3

    What if you use the mask's X (or Y) coordinate as a high significance value added to the value you're sorting. You'd use the start of the span as the number, so on black areas it would keep counting up but on white areas it would stall and create a span that all has the same high sig value added to it. Then you can use the Parallel Bitonic Merge Sort again.

  • @Colin__
    @Colin__ Год назад

    Thank god he put another video of his cat on the other side of the screen! That way my eyes were drawn to the middle of the screen, so I could actually follow what was happening, truly a life saver!

    • @Acerola_t
      @Acerola_t  Год назад

      finally someone got the vision

  • @CompilerStuck
    @CompilerStuck Год назад +1

    I haven't thought of sorting in that way.. great video!

  • @AlexandreCoulonOldChannel
    @AlexandreCoulonOldChannel Год назад +5

    I watched this video and I will never be the same ! Wow !

  • @Krono159
    @Krono159 Год назад +10

    0:29 ... bro what's up with that tony the tiger cropped yiff?

    • @hhjpegg
      @hhjpegg Год назад +1

      howd you know what it was? 🤨🤨🤨🤨🤨🤨🤨

  • @bottlekruiser
    @bottlekruiser Год назад +4

    oh hey a video that is not a short

    • @Acerola_t
      @Acerola_t  Год назад +2

      lol I made the shorts cause those topics won the patron poll

  • @sacordovaplata
    @sacordovaplata Год назад +1

    This looks a lot like the Cyberpunk 2077 Braindance effect and Relic malfunction, but realtime.

  • @Hwyadylaw
    @Hwyadylaw Год назад +2

    This was beginning to lose hold of its position as my #1 favourite RUclips subscription, but they you rolled out the cat clips
    Well done!

  • @zekiz774
    @zekiz774 Год назад +6

    Did... Did you make a Monogatari reference throughout the video with the different colored scenes and chapters?

    • @Acerola_t
      @Acerola_t  Год назад +6

      next you'll wonder where my name comes from!

    • @zekiz774
      @zekiz774 Год назад +2

      @@Acerola_t ohhh. Didn't even think of that. Lol

  • @miwelysuscosas1507
    @miwelysuscosas1507 Год назад +8

    10:55 help, I'm trying to pay attention but the cat cam's are distracting me

  • @asddsaasdfg2846
    @asddsaasdfg2846 Год назад +4

    10:27 omg I just wanted to watch the video 💀💀 bro went full on z gen

  • @JeoshuaCollins
    @JeoshuaCollins Год назад +2

    I actually used this shader before finding this video. I had no idea what it was supposed to be for.
    Now, after watching this video, I have even less of an idea about what it's supposed to be for.
    Great job!

  • @Folly_Inds
    @Folly_Inds Год назад +1

    eyyy, Paradise killer song. my favorite!.
    I appreciate the cats too. they helped with the whole paying attention thing.

  • @AxWarhawk
    @AxWarhawk Год назад +13

    Out of curiosity, have you looked at odd-even transposition as an alternative to the Bitonic merge sort? Their parallel runtime complexity should be about the same, but the implementation is simpler (more structured) and might therefore be more cache friendly.

    • @Acerola_t
      @Acerola_t  Год назад +2

      They both have their uses
      developer.nvidia.com/gpugems/gpugems2/part-vi-simulation-and-numerical-algorithms/chapter-46-improved-gpu-sorting
      here's a cool article on it

  • @noThankyou-g5c
    @noThankyou-g5c Год назад +3

    10:44 What toy is that? (in the bottom left. the rod or whatever sticking out of the blue ball rolly toy) i think my cats would like it 🥺🙌

    • @Acerola_t
      @Acerola_t  Год назад +1

      lol I just stuck one of the rod toys into the hole that was in the blue ball toy, so it's not an official thing you can buy.

  • @Posty-vw9jc
    @Posty-vw9jc Год назад +4

    16:15 legit looks like an effect in cyberpunk

    • @sacordovaplata
      @sacordovaplata Год назад +1

      That's what I thought! It look like when they are *SPOILERS* inside Mikoshi and talking with Alt

  • @Majromax
    @Majromax Год назад

    You don't need to sort each span of "glitched" pixels separately. If you modify the sort key to impose a mostly-monotonic order on pixels, you can globally sort the row (with massive parallelism) and preserve the relative locations of each span. Consider:
    A masked pixel (not sorted) at index i will have sort key 256*i. An un-masked pixel (to be sorted) at index i will have a sort key of (val) + 256*(largest j < i such that pixel[j] is masked). This value can be found via binary search in log(rows) steps, using conditional moves that should still play nicely with a shader.
    Separate runs of to-be-sorted pixels within a row will occupy a disjoint interval of sort keys, such that sorting the whole row will preserve pixel placements within their respective runs. The massively parallel sorting network can be used without further modification.
    The overall runtime for an image is then bound by the O(log(rows)^2) time required for the bitonic sort.

  • @insurrxct
    @insurrxct 5 месяцев назад +1

    i don't understand a single thing but i love this video