What Decides a CPU Tourney's WINNER?

  • Published: Sep 29, 2024

Comments • 316

  • @Casual_PKBeats 1 year ago +440

    12:53: *Level 8, I would NEVER root for level 9
    Also, at 10:42 I’m only just now seeing the red color on the tech ring, which means it COULD’VE been an untechable situation, but… I also don’t think that’s reliable, since there’s a red ring at 11:55 where level 9 still teched it. Honestly the mechanics of what is and isn’t techable has always been weird to me lol

    • @kwisowofer9872 1 year ago +29

      About the red ring, it’s the outer ring that shows if it’s techable. When the red ring appears you can see the big blue/green ring around it. If that outer ring was red, then it would be untechable.
      Besides, that red ring only appears when a move does a lot of knockback, so considering that purple Mario was at 0%, then that up-b wouldn’t deal that much knockback

    • @ayoraster2970 1 year ago +16

      I don't think it's untechable. Usually with untechables they have a red outer ring, and the threshold to get an untechable is really high. I don't think an up b at 0 could ever be untechable.

    • @jefferydavidson5347 1 year ago +5

      I didn’t see any red ring at 11:55

    • @beeftips1628 1 year ago +7

      It’s most likely what other ppl said but from what I’ve heard, it decides if it’s techable or not based on original knockback but hitting a wall can lower your speed just enough to allow it to be techable, allowing a techable red ring.

    • @KainYusanagi 1 year ago +4

      Can confirm that it's the stats that matter with the spirit combos, with a high defense making a big difference. If you match the stats around the same values and then try again, the type matchup might actually factor, though.

  • @LordPhilipJFry 1 year ago +1182

    So basically, CPU matches are like quantum mechanics: observing them changes the outcome.

  • @lamergamer8211 1 year ago +325

    I love using ways of doing things that are definitely objectively worse than alternatives, but are just more funny looking

    • @MrPenguineo 1 year ago +4

      So true

    • @neoxus30 1 year ago +4

      The beyblading of Iceack)

    • @TheThirdPrice 1 year ago +9

      You should watch the video 'harder drives', it's all about that sort of thing. One of my favorite videos on YouTube

    • @lamergamer8211 1 year ago +3

      @@TheThirdPrice I have. It's great

  • @_vapor_3866 1 year ago +5

    19:50 Hello, statistics student here. I did the math on the Kazuya vs. Isabelle match-up and it turns out that the results were well within the realm of possibility for a 50/50 chance.
    Technically, I should say that "There isn't enough evidence to reject the claim that the odds are 50/50," but even a 99% confidence interval for the win rate comfortably includes 50%, so the odds are plausibly even, if not extremely close to even.
    Great video btw, thanks for all the amazing content you put out.

    • @Casual_PKBeats 1 year ago +1

      I appreciate your feedback, thank you for the info and for watching the video! :)

  • @pauldavison2856 1 year ago

    I recently discovered a kind of glitch or maybe just a weird mechanic thing that happened: if mii brawler does a flaming drop kick at a specific moment on piranha plant when a ptooie is just coming out, the mii will somehow take double damage from it. This happened when I was playing plant and I watched it back multiple times, and I couldn't figure out what happened.

  • @supermaximglitchy1 1 year ago +2

    i once had a cpu tournament and I skipped a match between a level 1 vs level 8 and the level 1 won

  • @thrillhouse4151 1 year ago

    When I was young I’d have teams of 9s fight each other in Smash 64, and make up little storylines for em.

  • @Mad_T3 1 year ago

    Honestly tho. I love doing Iron Man Level 9 CPU matches with the full roster with every Smash game. Even the modded ones.

  • @Jjaeger_H 1 year ago

    Glad to hear I'm not the only one that leaves Lady Luck to make my decisions for me lmao, I'm way too indecisive by myself. Though I personally like using Wheel Decide when it comes to multiple options

  • @Sampson-9091 1 year ago

    I once had a level 9 cpu Sephiroth get destroyed by a level 4 cpu Peach, and a different time a level 9 cpu Game & Watch lose to a level 5 Dedede. We all were watching and were like …..HUH

  • @landronsc 1 year ago +1

    OH MY GOD IM NOT CRAZY. SOME CPUS ARE JUST BETTER.

  • @larryflaco8320 1 year ago

    alpharad should watch this vid

  • @dropfish3109 1 year ago

    galore galore cool cool

  • @sparklyskittles6367 1 year ago +388

    sometimes when we are bored, my group of friends will run lvl 9 cpu tournaments, but once during a birthday party we accidentally set a luigi to lvl 6 instead
    he won the tournament

    • @WeegeeDX 1 year ago +106

      only Luigi could pull off such a feat

    • @bukenator0788 1 year ago +74

      he probably did nothing

    • @MeesterTweester 1 year ago +21

      Melee Luigi CPU is cracked

    • @Stoopid420 1 year ago +17

      I did this but it wasn't a single match with a level 7 luigi versus a level 9 ridley WITH SPIRITS and Luigi completely wrecked him to death it was very funny.

    • @nickdavis27 1 year ago +3

      Did he win by doing nothing?

  • @fernandobanda5734 1 year ago +212

    I think we can safely say that the skipped matches use a random outcome, with weighted bias towards higher CPU level and spirit power (and maybe character handicap?), but the devs couldn't be bothered to accurately calculate these odds. They programmed better things to be better, but not in a way that really reflects a played-out match.

    • @palidopali 1 year ago +32

      Of course. I mean I wouldn't see anyone complaining at all if this algorithm was even way worse

    • @hunterculpepper1973 1 year ago

      So basically anything can happen

    • @Pikana 1 year ago +1

      Yeah. The skip feature feels more like a thing that just exists because of the fringe cases it's useful to have. They figured everyone would just watch matches between CPUs as any serious use of the system would be for purely player vs player matches. Meaning anything with CPUs are likely there to watch CPUs beat each other up.

  • @attackboss6 1 year ago +153

    Stage selection doesn't matter, you can make a custom stage that has no "end" and it still ends a cpu match with a winner if you skip

    • @snivyboy168 1 year ago +17

      You gotta wonder though... what *did* happen in that skipped match?

    • @ToaderTheToad 1 year ago +35

      @@snivyboy168 It's like Mario Kart 64, they used noclip while nobody was watching

  • @foxbra7258 1 year ago +163

    I love doing CPU tournaments, I have multiple Google sheets of data for them, so I absolutely love seeing this kind of content.

    • @lol1013 1 year ago +5

      What's the appeal of watching cpu? They aren't even good; I can understand the appeal of watching bad players fighting it out, but cpu?

    • @foxbra7258 1 year ago +28

      @@lol1013 what I mostly enjoy is the statistical side of it, I like finding out which CPUs do well in which scenarios or matchups, it's all just stuff I do for fun

    • @Greenglower2012 1 year ago +3

      What r ur findings

  • @Sabagegah 1 year ago +62

    19:51, 20:56 *I DID THE MATH!* (lol)
    Null hypothesis: both characters will have equal win rates (of .5).
    Alternative hypothesis: the win rates will differ.
    The two-tailed 95% confidence interval is bounded by 99/211±1.96*sqrt((99/211)(112/211)/211), which simplifies to a win rate between .4019 and .5365 for Isabelle. This includes the null hypothesis of .5, so we fail to reject the null. It is feasible that the true win rates are equal.
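
For anyone who wants to reproduce the interval above, here is a minimal Python sketch of the same normal-approximation (Wald) formula, using the 99 wins out of 211 quoted in the comment. The function name `wald_interval` is made up for illustration.

```python
import math

def wald_interval(wins, total, z=1.96):
    """Normal-approximation (Wald) confidence interval for a win rate."""
    p_hat = wins / total
    margin = z * math.sqrt(p_hat * (1 - p_hat) / total)
    return p_hat - margin, p_hat + margin

# 99 Isabelle wins out of 211 matches, as quoted in the comment
low, high = wald_interval(99, 211)
print(f"Isabelle win rate: {low:.4f} to {high:.4f}")
print("0.5 inside interval:", low <= 0.5 <= high)
```

Passing `z=2.576` instead gives the wider 99% interval.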

    • @pgmonkeh2166 1 year ago +13

      For anyone that doubts this man, can confirm he did this right.
      This is known as hypothesis testing, something you learn during college (high school, to some of you) in statistics, or perhaps a bit earlier.

    • @freddiesimmons1394 1 year ago +1

      What about a one tailed hypothesis, that kazuya will win more?

    • @MesserTAMU 1 year ago +1

      Not all heroes wear capes

    • @Sabagegah 1 year ago +2

      @@MesserTAMU I should get a cape...

  • @Skypatroller_BenCD 1 year ago +95

    15:48 There's another problem: you aren't using the strongest spirit from each category (except for Akuma). The strongest spirit in the shield category is Galacta Knight, and in the grab category it's Big Boss. (Also, Galeem/Dharkon are the strongest in neutral, but since their special effects are only usable in story mode, Soma Cruz is the strongest.)

    • @riley3087 1 year ago +10

      I thought the Akuma equivalent was the Absolutely Safe Capsule, considering the lack of spirit slots on both.

    • @Skypatroller_BenCD 1 year ago +16

      @@riley3087 well no, the top 4 are
      1 Galeem/Dharkon with 13640 power each
      2 Galacta Knight with 13345 power
      3 Akuma with 13235 power
      And 4 Big Boss with 12727 power
      Soma is 12110 power (with a reminder that he is the second strongest neutral spirit in the game) and the Absolutely Safe Capsule is only 10000 power. For comparison, it's weaker than Eggman (10118) and Protoman (10029), and Protoman and the ASC are the 28th and 29th strongest shield spirits (also, all stats are given with ALL spirits at level 99, so the max)

    • @guymadeofbees8464 1 year ago +5

      @@Skypatroller_BenCD also you have to account for the fact that the ASC has all 10,000 in defense which gives it no damage

    • @Skypatroller_BenCD 1 year ago +3

      @@guymadeofbees8464 defending wise he has the MOST defence of them all and the LEAST attack which is a reference to Earthbound 2 cause it is indestructible and can't attack you but still the ASC isn't that strong he only has the same power as Geno also the overall power are just an addition of their defencing and attacking states for example Super Sonic is 5537 in attack and 4009 in defense, 5537+4009=9927 which is Super Sonic overall power or Rayman is 3391 atk+3748 def=7567 power. And the ASC is only 10000 def+0 atk=10000 power unlike Geno who's 5250 atk+4750 def for the same result

    • @leaffinite2001 1 year ago +2

      I use only james mccloud because he is fox with sunglasses

  • @LinearAztec 1 year ago +145

    There are a few factors I’m aware of. Once they’re sponsored, they’ll never win another tourney, and if Blood Falcon beats you in the dark realm you die for real

  • @hurikain213 1 year ago +18

    "Mom can I get switch online to use Spectate?"
    Mom: "we have Spectate at home."
    The spectate at home:

  • @UnknownCleric2420 1 year ago +37

    4:05 Fun fact! This is also the odds of the Icon of Sin spawning an Archvile in Doom 2/Final Doom.

    • @demi-femme4821 1 year ago +1

      Imagine that sequence happening several times in a row.
      Those are the odds needed to beat the Icon of Sin in Doom 2 without directly attacking it, and that's if you get the execution right.
      Yeah, there's a reason nobody has done this without tools.

    • @UnknownCleric2420 1 year ago +2

      @@demi-femme4821 Or you could just get a Cyberdemon to shoot at a Lost Soul in Plutonia Map 30 :P

  • @BainesMkII 1 year ago +16

    The chance of starting with seven red Mario wins is 1/128, but the chance of simply starting with seven consecutive wins of one color is 1/64. If you take away that "starting with" condition, the chance increases with the number of total matches. Probabilities are highly dependent on just how the scenario is defined, which is where a lot of people get tripped up in over or underestimating chances. Add into this the human tendency to look for (or outright create) patterns, and it arguably was likely that you'd find *something* that looked "suspicious" over multiple tournaments.
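
To make the "how the scenario is defined" point concrete, here is a small Python sketch that computes the three probabilities exactly, assuming every match is an independent 50/50 coin flip. The function name `prob_streak` and the 100-match example are illustrative choices, not figures from the video.

```python
from fractions import Fraction

def prob_streak(n_matches, streak=7):
    """Exact chance that a fair 50/50 series of n_matches contains at least
    one run of `streak` consecutive wins by the same side."""
    if n_matches < streak:
        return Fraction(0)
    # state[k] = probability that the current run has length k and no
    # qualifying streak has happened yet
    state = {1: Fraction(1)}  # after the first match the run length is 1
    hit = Fraction(0)
    for _ in range(n_matches - 1):
        nxt = {}
        for length, p in state.items():
            if length + 1 == streak:
                hit += p / 2  # same side wins again and completes the streak
            else:
                nxt[length + 1] = nxt.get(length + 1, 0) + p / 2
            nxt[1] = nxt.get(1, 0) + p / 2  # other side wins, run resets
        state = nxt
    return hit

first_seven_red = Fraction(1, 2) ** 7    # red wins matches 1 through 7
first_seven_same = 2 * first_seven_red   # either color wins matches 1 through 7
print(first_seven_red, first_seven_same)
print(float(prob_streak(100)))           # a 7-streak anywhere in 100 matches
```

The gap between the first two numbers and the last one is the "starting with" condition the comment describes.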

  • @dazedheart9006 1 year ago +18

    Omg I loved to do “Tourney Roulettes” on SSBB. Everything was automatic and you couldn’t see the AI’s difficulty. A level 2 ICs somehow beat a level 7 Ganon I think once.

  • @LaughingThesaurus 1 year ago +18

    I theorize that there's probably a behind the scenes Power formula that weighs CPU level and spirit power and possibly other factors like handicaps or starting stamina, maybe spirit abilities or individual stats but I sort of doubt it on that front.
    There has to be some math going on to calculate the odds, because I wouldn't believe a supercharged level 1 could possibly beat a level 9 in a match, without the match being purely about Math.

  • @diamondmemer9754 1 year ago +81

    Plot twist:
    When you skip a battle the game plays a very quick and texture-less sudden death between the two characters to see who wins

    • @329link 1 year ago +20

      I could believe that actually happening tbh. When you take video rendering out of the equation, a computer can often simulate a game at an insane speed.

    • @diamondmemer9754 1 year ago +28

      @@329link yep but the inaccuracies shown here tell us this isn't the case

    • @nightmare3642 1 year ago +16

      @@diamondmemer9754 It simply immediately drops bob-ombs.

    • @diamondmemer9754 1 year ago

      @@nightmare3642 that would be too random

  • @manupm9161 1 year ago +17

    Quick question: are the odds of a level 7 to beat a level 8 also 4%?

    • @randomguy6680 1 year ago +2

      Was wondering about level 1 and 2, but testing this would be interesting.

  • @Amirrorofmirrors 1 year ago +12

    This must have been so tedious to set up. Thank you for doing the research so that we don’t have to

  • @dabiskitt 1 year ago +11

    1:27 I would totally watch a full video of you cracking jokes and mentioning random smash facts over a CPU tournament!!

  • @Liggliluff 1 year ago +17

    Considering how good the CPU can be in Ultimate compared to previous games, I get the feeling that they have made special AI code specifically for these characters. Because there's sometimes a huge difference between two level 9 of different characters. So terrible CPUs like Sheik, I think is because Sheik has the generic AI, while other characters got specially designed AIs.

    • @byzantine5761 1 year ago +4

      cough kazuya being a level 12 CPU at level 7 or smth

    • @SentientPulse 1 year ago

      And considering certain characters will just use random moves. I’ve had a Fox and a Mewtwo level 9 kill themselves with up b on stage>off stage. Like what option would mewtwo ever want to TP offstage into free fall lol. Could have been smash 4 though, but I also agree that there is most likely a general AI and something else for certain characters

  • @michaels.4527 1 year ago +7

    We used to drink to CPU tourneys when we were getting too drunk to play. Each person picks a character, yours dies, you take a drink. Your character loses take an extra drink after the drink for the lost stock... I've spent many hangovers cursing random smash characters. SDs are 2 drinks.

    • @ashmyoshi3912 1 year ago

      I cant legally drink yet

    • @SGEnjoyer 1 year ago +1

      @@ashmyoshi3912 just be older 5head

  • @Rawls9805 1 year ago +6

    I’m curious if going with lvl 2s vs lvl 3s would change the result of skipped battles since you’d be pitting two “bad” AIs against each other rather than two “good” ones

  • @NinJaguar 1 year ago +9

    Cool experiments, I had known that tourney mode skips were RNG when it came to characters at the same level but never considered to what extent; interesting to see how CPU levels, stage/format selection and spirits come into play.
    Also glad you found your way to the level 9 CPU tier list, worth noting that the one labelled as being from 2021 is actually from 2022 (graphic was a typo that never got fixed) and still holds up largely!

  • @LilyLambda 1 year ago +15

    I've been looking through the code a little bit, and I've found a list of parameters related to tournaments-
    Apparently the game uses a "score" system, where points are applied based on certain metrics, and the cpu with the higher score wins
    -10 points are added for every level after 1
    -10 points are added for every 1500 team power for 1500-9000 (starting at 20 points for 1500), then 10,000 adds 80 points, and everything higher is 90
    -amiibo use a similar system, where between 0 and 90 points are added based on level
    -10 points are randomly given (I'm not sure what the chance to give 10 points is)
    There's also three point systems I'm not sure about, as I couldn't find any code, only a list of numbers
    -"score_reversal" can either give 5 or 50 points
    -"score_unique" adds 25
    -"score_attribute" adds 20
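
A rough Python sketch of how the level and random-bonus parts of that score system could decide a skipped match. Several things here are assumptions rather than facts from the game's code: the bonus chance (the comment says it is unknown, so 10% is only a placeholder), that each CPU rolls for the bonus independently, and that ties are settled by a coin flip. Spirit, amiibo and the three unexplained systems are left out.

```python
from fractions import Fraction
from itertools import product

def level_points(level):
    # 10 points for every CPU level after 1 (level 1 -> 0, level 9 -> 80)
    return 10 * (level - 1)

def win_probability(level_a, level_b, bonus_chance=Fraction(1, 10)):
    """Chance that CPU A wins a skipped match under the assumed rules."""
    total = Fraction(0)
    # Enumerate every combination of "got the random 10 points or not"
    for bonus_a, bonus_b in product((0, 10), repeat=2):
        p_a = bonus_chance if bonus_a else 1 - bonus_chance
        p_b = bonus_chance if bonus_b else 1 - bonus_chance
        score_a = level_points(level_a) + bonus_a
        score_b = level_points(level_b) + bonus_b
        if score_a > score_b:
            outcome = Fraction(1)
        elif score_a == score_b:
            outcome = Fraction(1, 2)  # assumed coin flip on equal scores
        else:
            outcome = Fraction(0)
        total += p_a * p_b * outcome
    return total

print(win_probability(8, 9))  # level 8 vs level 9
print(win_probability(7, 9))  # level 7 vs level 9
```

Under these assumptions a one-level gap can only be closed by the random bonus, and a two-level gap cannot be closed at all.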

    • @halt1931 1 year ago +1

      so a level 7 has 0 chance against a level 9, barring the last three unknown systems?

    • @LilyLambda 1 year ago +4

      @@halt1931 with no spirits, yes

    • @Liggliluff 1 year ago +1

      I was confused at first when you said -10 points are added for each level.
      So for a random spirit-less match, their scores are 10*(LVL-1), and if the random 10 points are given with a chance of something like 10%, that results in:
      a 10% chance that the level 8 CPU gets 10 extra points, with 1% out of that being when the level 9 CPU also gets 10 extra points. So 9% of the time, only the level 8 CPU gets the extra points, making both have 80 points, and when they have the same points, the winner is randomly selected. That makes the level 8 CPU win only 4.5% of the games.
      It's a simple system, but it still makes me want to improve it. Step 1 is to determine the CPU skill of each character; Kazuya is at least 3 times better than Sheik, so taking the 13 step tier list, top tier would be 38 and bottom tier is 26. Multiply by the CPU level. So at level 9, Kazuya is 38*9=342, Isabelle is 35*9=315, Mario is 29*9=261, and Sheik is 26*9=234. Kazuya level 6 is 38*6=228, around the level of Sheik. Then add a random number from 0 to 100. Kazuya 9 vs Sheik 9 is 100 to 0. Kazuya 9 vs Isabelle 9 is 73 to 27, Kazuya 6 vs Sheik 9 is 44 to 56. The strength of the tierlist and the strength of the CPU levels can further be adjusted. Then the spirits have to be added too.
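
The proposed weighted system can be checked with a few lines of Python. This sketch assumes each CPU adds its own independent random number drawn uniformly (and continuously) from 0 to 100; the tier weights are the ones quoted in the comment, and the function names are made up for illustration.

```python
TIER_WEIGHT = {"Kazuya": 38, "Isabelle": 35, "Mario": 29, "Sheik": 26}

def base_score(character, level):
    # Tier weight multiplied by CPU level, as proposed in the comment
    return TIER_WEIGHT[character] * level

def win_chance(score_a, score_b, spread=100):
    """P(score_a + U1 > score_b + U2) for independent U1, U2 ~ Uniform(0, spread)."""
    d = score_a - score_b
    if d >= spread:
        return 1.0
    if d <= -spread:
        return 0.0
    # Probability that the trailing side makes up a gap of |d|
    tail = (spread - abs(d)) ** 2 / (2 * spread ** 2)
    return 1 - tail if d >= 0 else tail

for a, b in [(("Kazuya", 9), ("Sheik", 9)),
             (("Kazuya", 9), ("Isabelle", 9)),
             (("Kazuya", 6), ("Sheik", 9))]:
    chance = win_chance(base_score(*a), base_score(*b))
    print(a, "vs", b, f"{chance:.1%}")
```

Changing `spread` or the tier weights is how "the strength of the tierlist and the strength of the CPU levels" would be tuned in this model.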

    • @Greenglower2012 1 year ago

      Wow

  • @tyler9425 1 year ago +4

    Lmao I was rewatching Alpharad’s cpu cs for the millionth time when this video popped up

  • @alexv0714 1 year ago +6

    my brain works in a very statistical manner and this is something I've thought about for a long time due to Alpharad's old CPU tournaments. thank you for doing the testing for me, it was really fun and interesting!!

  • @Sabagegah 1 year ago +3

    Resisting the urge to calculate a confidence interval.
    Edit: I did not resist.

  • @quantwomwhale5984 1 year ago +7

    20:50 I have never seen a Steve win a single battle in a CPU tournament in my entire life

  • @THEDKA3 1 year ago +8

    Thanks for the video! Really interesting analysis to see how skipping battles affects the CPU tournament results based on different attributes. I am also glad you were able to reference the Level 9 CPU tier lists by the SmashCPU community at 18:58.

    • @Greenglower2012 1 year ago +1

      Theres a community? Nice. Can you please explain why ganon is so much better in melee as a cpu?

    • @THEDKA3 1 year ago

      @@Greenglower2012 Thanks for the question! So in Melee CPU vs. CPU matches, Level 9 Ganondorf poses as the most dominant character in the meta for many reasons. These reasons can all be traced back to how incredibly underdeveloped the game's AI was overall. (The AI programming was noticeably poor until Brawl came out).
      In Melee, the CPUs had very slow movement and commonly relied on a few moves, mainly throwing out jabs when standing next to the opponent, neutral air when hit in the air (a direct counter to jabbing), and down air when the character is launched high enough and fall directly onto their opponent.
      Trying to approach Ganon as a CPU is an absolute nightmare. Since the CPUs could not dash unless they were using dash attack or dash grab, most characters could get destroyed at early percents by a raw forward smash, up tilt, or Warlock Punch. At close range, his jab is fast, did high damage, and has enough knockback to evade getting hit by a stray nair most of the time. Ganon's aerials are very strong, particularly down air which kills off the top very early with a large hitbox. His Up Special is his most infamous move, perhaps the best programmed move of all of the Melee AI's. It has absurd knockback and Ganon will abuse it as very simple edgeguarding tool, something that most other CPU characters lack.
      Combine that with Ganon's great survivability with his heavy weight, decent horizontal recovery, and SDs rarely, he destroys just about every Melee AI matchup. Whenever we run CPU tournaments for vanilla Melee we nerf Ganon to Level 7 to keep him balanced to most of the characters (along with Link and Luigi at Level 8), and yet he's still considered "top tier".

  • @PJWooperIsBack 1 year ago +2

    PKBeats can take the most boring topic in smash ultimate and make it into an interesting video

  • @TaleOfTheToaster 1 year ago +3

    This is the video I’ve wanted since my school days watching Brawl CPU tournaments

  • @legosiw 1 year ago +4

    Cpu tier list is interesting. Im doing an amiibo tourney (currently paused for the lack of mythra/pyra and sora) and i wonder if the amiibo would follow roughly the same list.
    And before anyone asks, i cant sit and train each one so they all trained by fighting a cpu9 of themselves and have no boosted stats.
    Either way, its entertaining to watch

    • @taboo4011 1 year ago

      fairly sure amiibos have the same ai as cpus

    • @LilyLambda 1 year ago

      Fun fact! The only thing different between cpus and amiibo once you take away spirits is that cpus use moves at a determined probability, while amiibo can change probability based on a number of factors, so if you train amiibo with cpus, all you have is a cpu :)

  • @Cooldudecrafter 1 year ago +3

    This is a question I've been waiting for an answer for years. Thank you PKBeats

  • @amoura39 1 year ago +1

    THERE'S CPU TIER LISTS!?!?!?!?!?! THAT'S SO WILD LOL
    I think I looked into that before but couldn't find anything I really liked. I love how ordinary tiers don't necessarily apply in CPU things because the CPU's need good enough AI to properly play their characters lol

  • @Shinigami-Cat 1 year ago +1

    Man I loved watching this- I was always curious about skipped tourney matches. I was quite shocked seeing only a 4% chance for a level 8 to win against a level 9. What I've personally seen would contradict that, but maybe I've just gotten lucky. Kinda curious what the difference of a level 4 and a level 5 might be, but I can't ask you to test everything. Thank you for making this, it was incredible to watch.

  • @solveforx314 1 year ago +1

    19:54 I'm a little rusty at AP Stats myself, but I remembered enough to figure out how to do a z-test on my calculator and got a p-value of about 0.37. What this means is that, assuming there is a 50/50 chance Isabelle wins, the chances of a result at least this lopsided in either direction (99 or fewer wins out of 211, or the mirror image for Kazuya) are about 37%, which is not a statistically significant result. Usually, you need a p-value of less than 0.05 to say that your results are statistically significant.
    In summary, it's entirely plausible that Isabelle had a 50/50 chance at winning.
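
The z-test described above can be reproduced without a calculator. This is a minimal Python sketch of a two-sided one-proportion z-test with no continuity correction, which is an assumption about what the calculator did; the function name is made up for illustration.

```python
import math

def two_sided_z_test(wins, total, p0=0.5):
    """One-proportion z-test against p0; returns (z, two-sided p-value)."""
    standard_error = math.sqrt(p0 * (1 - p0) / total)
    z = (wins / total - p0) / standard_error
    # Standard normal CDF at -|z|, written with the error function
    lower_tail = 0.5 * (1 + math.erf(-abs(z) / math.sqrt(2)))
    return z, 2 * lower_tail

# 99 Isabelle wins out of 211 matches, as quoted in the comment
z, p_value = two_sided_z_test(99, 211)
print(f"z = {z:.3f}, p = {p_value:.3f}")
print("significant at 0.05:", p_value < 0.05)
```

Halving `p_value` gives the one-sided chance of Isabelle doing this badly or worse under a true 50/50 split.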

  • @darioschottlender 1 year ago +1

    Maybe it's based on power level, and the CPU level adds a power level modifier, so even though the lvl 9 CPU might have a great power level modifier, the spirit gives a much more important power boost and the lvl1 ends up having a superior power level

  • @4xzx4 1 year ago +1

    Hi PKBeats, please read this: quite a while ago, I battled a Hero CPU as Hero myself, aka a mirror matchup. I use the purple alt (the same one as you - Hero is my main btw) all the time, and what struck me the most was that whenever I faced the purple alt CPU Hero, it played much more like me, whereas the other alts didn't. To make sure this wasn't just a coincidence, I battled every alt in the game several times, including the purple one, but the purple one was always the toughest to beat; it felt sometimes like playing against a human. (It played how I play but much weaker.) It has been a mystery ever since Brawl that the CPUs might learn inputs from human players (if that's the case then Nana can be trained like crazy). I have even lost to the purple alt Hero but not to the other alts. Can you please look into this phenomenon?
    I have even saved videos of this, so if you want them I can send them to you.

  • @EvelynFTTE 1 year ago +1

    About spirit numbers (around 16:00)
    It's possible that the result is due to scale. Grapple was half of Defense, whereas Defense was 2/3 of Attack. Even though the difference was 4000 points each time, the percentage is way different

  • @laffypanda6822 1 year ago +1

    I have my theory on what happened when testing the types of spirits. The difference between attack and defense is 4255 and the difference between grapple and defense is 3510. At first glance, this makes it seem like grapple is stronger compared to defense than defense was to offense. However, what if instead of the difference of the two spirits, it uses the ratios? Doing so gives that defense is approximately 67.85% as strong as attack and grapple is 60.91% as strong as defense. This approach makes the difference between grapple and defense larger.
    Why do I think this may be the case? Because in video games, buffs such as a super effective type of stat boost are often applied in the form of a multiplier. If this potential multiplier was set at let’s say 1.5, then defense would manage to beat out attack but grapple would not manage to beat out defense.
    I can explain further if need be but tldr is that I think you may be underestimating the effect of the difference in spirit levels. I would very much like to see the results of spirits that are much closer in levels
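
Here is the difference-versus-ratio argument as a short Python sketch. The three power values are reconstructed from the differences and ratios quoted in the comment, and the 1.5 multiplier is the comment's own hypothetical; none of it is confirmed game data. The function name is made up for illustration.

```python
# Spirit power values implied by the comment's differences and ratios
attack = 13235
defense = attack - 4255    # 8980
grapple = defense - 3510   # 5470

print("defense / attack  =", round(defense / attack, 4))
print("grapple / defense =", round(grapple / defense, 4))

def effective_power(power, has_type_advantage, multiplier=1.5):
    """Hypothetical type-advantage boost applied as a multiplier."""
    return power * multiplier if has_type_advantage else power

# Defense has the type advantage over attack, grapple over defense
print("boosted defense beats attack: ", effective_power(defense, True) > attack)
print("boosted grapple beats defense:", effective_power(grapple, True) > defense)
```

With a multiplier, the smaller absolute gap still leaves grapple behind because its ratio to defense is worse than defense's ratio to attack.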

  • @jonathon422 1 year ago +1

    I can't imagine all this research is much fun, but I can't help but be captivated by the varied outcomes from seemingly random changes.
    I'd love to know more about how the CPUs function but if it's too agonizing getting all the data, then I'd rather you do something more enjoyable. 😅

  • @TheAweDude1 1 year ago +1

    I think something like this would be great for a 24/7 livestream. That way you can have hundreds of matches, all automated, with automatic data recording, with a variety of different factors.

  • @Clemehl 1 year ago +1

    Do Characters have an affinity toward Offense, Defense, and Grapple in the code? Could Mario be a Defense character, therefore gaining "more" strength from Defense spirits? Would Luigi have different results as he might be considered as a Grapple character instead?

  • @KiraSlith 1 year ago +1

    Gotta give Nintendo credit, a perfect coinflip ratio from a straight random number generator is very hard to do. There's usually a LOT more wander than 5% either direction in a 100 test pool.

  • @therealohead 1 year ago +1

    My college has a casino night with poker, roulette, the works. They also have CPU betting. One huge tourney, and you bet on each CPU. Very fun stuff

  • @whiz8569 1 year ago +1

    For why the Grappler spirit didn't do as well, it's possible the way the game determines which spirit is more powerful is based on a threshold system. So, for example, the game considers spirits to be level 8000 and higher to be in the same tier, so in that case, the defensive spirit is roughly equal to the offensive one, but in a higher tier than the grappler one.

  • @FireflyMykah 1 year ago +1

    As a math/stat major with a bio background, I am now interested in creating a machine learning algorithm with the data to predict CPU fighting outcomes based on input parameters for the "skip battle" setting

  • @cms_bb8817 1 year ago +1

    Pkbeats will be like “someone sent me this new glitch where toon links down taunt will move him into the left platform of battlefield. So anyway I taped the down taunt button down for 38 years and…”

  • @Ithaca-vv5dy 1 year ago +1

    God dammit! You just HAD to bring up the 1/128 item drops in earth bound! I’m trying to get one from ghost of star man right now and I wanna cry

  • @simplyfrozenwater 1 year ago +1

    [defense mario walks into a room where grapple mario and attack mario are fighting]
    defense mario: i have 8980 POWER-

  • @ripstick45 1 year ago +1

    I think it would have been interesting to test the difference between a level 1 and a level 2 as well. While the level 8 and 9 matchup was 4-96, that win rate doesn't explain if it was because of the level DIFFERENCE or the levels themselves. I think a comparison could be drawn to pokemon, where the importance of level difference decreases over time (think: lvl 5 vs lvl10 and lvl95 vs lvl100). I think in the case of Ultimate, it might be the inverse.
    also going to try and make a hard (and likely incorrect) callout here based on a hunch I have, the lvl1 vs lvl2 battles will be a 25%/75% split.

  • @Bunaxy_alt 1 year ago +1

    I tested for hours straight and my results may shock you. I did a full tourney with every other being level 1 around 1000 times and got only 1 win from a level 1

  • @vara202 1 year ago +1

    One thing I really want to see is if a level 1 CPU has a better chance against a level 2 CPU than a lvl 8 does against a lvl 9. Is the 4% consistent all the way up the levels or does the power difference increase?

  • @roniemena 1 year ago +1

    It's funny how pkbeats says he isn't charismatic yet i still keep watching his videos because i find him likeable

  • @femain1788
    @femain1788 1 year ago +2

    I believe the best way to do this would be to use your clout (the people who follow you, mostly on Discord) to run the test with you, amplifying the number of possible tests. That is, of course, if you wanted to do this. Adding more people would heavily increase the number of tests run, giving a much higher confidence level to your results.

    • @lumbajackthumbs7755
      @lumbajackthumbs7755 1 year ago

      But people can lie about data

    • @femain1788
      @femain1788 1 year ago

      @@lumbajackthumbs7755 assuming people are actually trying to help, or he does it with only his most trusted Discord users, then the data should be reliable. As far as lying goes, yeah, people do, but if you go in with the mindset that everyone cheats and lies, nothing gets done

  • @medjaimcgraw5632
    @medjaimcgraw5632 1 year ago +2

    Oh dear gosh what happened to that kid's face in the beginning

  • @dabiskitt
    @dabiskitt 1 year ago +2

    I wonder if the 4% level 8 vs level 9 win is the same percent as a level 1 vs level 2 win

    • @Liggliluff
      @Liggliluff 1 year ago +1

      According to one user, each level is worth 10 points (level 1 is 0 points, level 9 is 80 points), and 10 points are awarded randomly. I suspect each player independently has a 10% chance of getting those points (so both or neither can gain them), meaning that 9% of the time (10% × 90%) only CPU 8 gets them, leaving both at 80 points, and at that point the game just randomly picks a winner. Half of 9% gives CPU 8 a 4.5% win rate, close to what we see in the video.
      Then spirits also add to this system in some way.
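The arithmetic in this guess can be checked with a small sketch. Everything here is an assumption taken from the comment above, not a confirmed game mechanic: 10 base points per level above 1, an independent 10% chance per CPU of a 10-point bonus, and a coin flip on ties. The function name `skip_win_chance` is made up for illustration.

```python
from fractions import Fraction


def skip_win_chance(level_a, level_b, bonus_chance=Fraction(1, 10)):
    """Chance that CPU A wins a skipped match under the guessed points model."""
    # Assumed model: (level - 1) * 10 base points, plus a 10-point bonus
    # that each CPU gains independently with probability bonus_chance.
    base_a = (level_a - 1) * 10
    base_b = (level_b - 1) * 10
    outcomes = ((10, bonus_chance), (0, 1 - bonus_chance))
    total = Fraction(0)
    for bonus_a, p_a in outcomes:
        for bonus_b, p_b in outcomes:
            score_a = base_a + bonus_a
            score_b = base_b + bonus_b
            if score_a > score_b:
                total += p_a * p_b
            elif score_a == score_b:
                total += p_a * p_b / 2  # assumed tie-break: fair coin flip
    return total


print(skip_win_chance(8, 9))  # 9/200, i.e. 4.5%
print(skip_win_chance(1, 2))  # 9/200 as well under this model
```

Under this model only the level difference matters, so level 1 vs level 2 would come out the same as level 8 vs level 9, and any gap of two or more levels would give the lower level no chance at all. Both are predictions that could be tested in-game.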

  • @SportsGaming-ll3ru
    @SportsGaming-ll3ru 1 year ago +2

    I wish you included the dk guy screaming in the intro that’s my favorite :P

  • @TheRealBatabii
    @TheRealBatabii 1 year ago +2

    You're right about shiny hunting being braindead easy nowadays. It really takes the appeal out of it for me. Now they're just palette swaps that are marginally different to obtain rather than trophy pokemon.

  • @justsans.
    @justsans. 1 year ago +1

    Why have I never thought of this lmao this is actually really cool.

  • @amoura39
    @amoura39 1 year ago +1

    LOL LEVEL 9'S REALLY TECH THAT MUCH?
    THAT MUST BE SO FRUSTRATING TO FIGHT LOL

  • @fiqures1529
    @fiqures1529 1 year ago +1

    You say that the CPU alts don't change the outcome (when skipping them), but back when the CPUCS was still going there were 2 incineroars, PG Incin and Blue Incin, that were frequently placing high. While they were sometimes similar, Blue Incin seemingly put it all on the line more often than PG Incin. I genuinely have no idea why this is and I'm also pretty sure only the smash devs could explain this, I don't want to think that it's just chance because it felt a bit too consistent to be chance.

    • @Casual_PKBeats
      @Casual_PKBeats  1 year ago +1

      Honestly? I could see there being some really, REALLY weird, obscure algorithm that gives certain things a better chance. It was nothing I was able to pinpoint in this video, as my testing and science was pretty basic as I pointed out, but I would NOT be surprised to find out that the RNG leaned towards something really weird

  • @TragicalLog
    @TragicalLog 1 year ago +1

    Cheers on you for not deadnaming alpha's second channel

  • @rubixtheslime
    @rubixtheslime 1 year ago +1

    I'm quite a bit into math, and honestly, this was very well done. I only really caught two issues and they were remarkably unimpactful.
    When collecting the results, including anything beyond the first 16 matches of a given tourney _could_ lead to bias, as it means that anyone more likely to win is more likely to be sampled a second time. Of course, in this case it's pretty reasonable to assume that the game doesn't pull a Mario Kart and make some CPUs randomly better for the scope of a tourney. And even if it did, I don't have to run any calculations; those results are definitive enough.
    For the case of everyone being equal, it's actually not a 1/128 chance that you'd get 7 in a row during the tourney. If there were exactly 7 matches it would be, but when you have more matches the odds go way up (because there are more opportunities for it to happen), and even more when you realize that 7 of the other side winning in a row would also raise the same alarm. I ran the numbers (brute force script): in the case of 16 matches, there's an 8.54% or 1/11.7 chance that the tourney will contain at least one instance of the same result 7 times in a row (4.29% or 1/23.3 if you're only looking at e.g. "left side" victories).
    The script (python3): sum(int('1111111' in f'{x:016b}' or '0000000' in f'{x:016b}') for x in range(2**16))/2**16
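For anyone who wants to rerun this, here is a longer, more readable sketch of the same brute-force count. It assumes 16 independent 50/50 matches, and the names `has_run` and `run_chance` are made up for illustration. One detail matters: each number has to be zero-padded to 16 bits, because a plain `f'{x:b}'` drops leading zeros and so misses runs of "0" results at the start of the tourney.

```python
def has_run(bits, length=7, symbols="01"):
    """True if bits contains `length` identical results in a row."""
    return any(symbol * length in bits for symbol in symbols)


def run_chance(matches=16, length=7, symbols="01"):
    """Fraction of equally likely outcome strings that contain a run."""
    total = 2 ** matches
    hits = sum(
        # Zero-pad so that every outcome string has exactly `matches` bits.
        has_run(format(x, f"0{matches}b"), length, symbols)
        for x in range(total)
    )
    return hits / total


print(f"{run_chance():.2%}")             # 8.54% (either side, 7 in a row)
print(f"{run_chance(symbols='1'):.2%}")  # 4.29% (one side only)
```

With exactly 7 matches, `run_chance(matches=7)` gives 2/128, which matches the 1/128 per side figure from the video.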

    • @rubixtheslime
      @rubixtheslime 1 year ago +1

      Please RUclips don't delete my comment just because i included the script

    • @Casual_PKBeats
      @Casual_PKBeats  1 year ago +1

      These points make a ton of sense, and honestly I probably should've thought of them. Or at least, the point with the "1/128" chance. Though, in the context of the video, the 7-in-a-row being the FIRST 7 rounds was what especially stood out, but you're right that the odds for ANY 7-in-a-row to happen aren't quite that bizarre. I appreciate the extra information/expertise! Glad you enjoyed the video :)

    • @rubixtheslime
      @rubixtheslime 1 year ago +1

      @@Casual_PKBeats yeah of course! Once again I want to stress that you did a very good job. I only really leave these sorts of comments when I can tell that the creator really cares about accuracy. I actually really liked the part where you just started doing coin flip simulations to get a feel for how much variance you might have, that was pretty smart!

  • @TheSaxRunner05
    @TheSaxRunner05 1 year ago +1

    Do you have a link to that CPU tier list page? It looked like something I’d want to look over. I have done CPU 3 stock tournaments before and I want to see how my data lines up so far.

    • @Casual_PKBeats
      @Casual_PKBeats  1 year ago

      Sure thing: amiibodoctor.com/the-cpu-tier-list/

  • @amoura39
    @amoura39 1 year ago +1

    PKBeats saying a video should wrap up due to its length
    Me: NOOOO PLEASE KEEP TALKING FOREVER

    • @amoura39
      @amoura39 1 year ago +1

      YAY MY FIRST HEARTED COMMENT FROM YOU

  • @BabuBabu-ip6oh
    @BabuBabu-ip6oh 1 year ago +1

    You are correct that the defense stat takes slightly more priority than the offense stat, aiding survivability. I tested this with a spirit that had equal attack and defense on two characters, and despite the attack stat being equal to the defense stat, each character was taking at least 1-3% less damage than it would without spirits on either character. The attack stat needs to be higher than the opponent's defense stat to really make a sizable difference; even with a type disadvantage you can tank some hits with high enough defense.

  • @ShurikanBlade
    @ShurikanBlade 1 year ago

    So from this video alone, with no research on my part, what determines a victory goes in this order:
    Whether you skip or watch > Level difference > spirit or no spirit > spirit kind > character, items, stage, final smash (minimal to no effect)

  • @MEDYuzu
    @MEDYuzu 1 year ago

    Whenever you skip a match in which a Mii character fights a non-Mii, the Mii will never win. Under no circumstances will the game allow a Mii to advance through a skipped match over any of the regular characters, which has been kind of irritating to me

  • @KazefuYousomo
    @KazefuYousomo 1 year ago +2

    This was actually a really fun video! I mean I expected it to be interesting, but it was even more interesting than I thought it would be

  • @Liggliluff
    @Liggliluff 1 year ago +1

    (2:20) Melee introduced tournaments with 64 players
    Brawl reduced it to 32
    3DS/Wii U removed it from local play
    Ultimate brought it back with 32
    Due to Ultimate having more than 64 fighters, it really should have been 128 at max.

  • @thebananaspeedruns9275
    @thebananaspeedruns9275 1 year ago +1

    How would this work with amiibo fighters?

  • @rtyuik7
    @rtyuik7 1 year ago

    I feel like Skipping the match just rolls the RNG-Dice once (does CPU-A or CPU-B win?)...whereas Watching the match has to roll the dice Multiple times (does CPU-A start with an Attack or a Grab? does CPU-B Jump, Shield, or Dodge? do either of them grab Items? etc)...by simply increasing the Number of Chances, you add opportunities for Flukes and other Anomalies to influence the results (if the Lvl1 grabs more Pokeballs than the Lvl9, then some of those Pokeballs are bound to add some KOs for the Lvl1)

  • @TheRealBatabii
    @TheRealBatabii 1 year ago +1

    I wasn't sure if "skip" just still did the match, but didn't have to render anything, and as a result, all the calculations were done super-fast behind the scenes. Probably a silly way to do it though.

  • @amoura39
    @amoura39 1 year ago +1

    This sounded very painful to test and stuff

  • @ValToadstool
    @ValToadstool 1 year ago +1

    I would guess there's some kind of formula that calculates a cpu's "strength" which when compared to other cpus gives the chance for each to win

  • @DarkoGameplayer
    @DarkoGameplayer 1 year ago +1

    This was a really interesting video, I really liked this kind of "experimenting video"

  • @Jestiphur
    @Jestiphur 1 year ago

    I doubt this would change anything, but I'm curious whether the CPU starting with Pyra or Mythra would skew the results of which CPU won, or Pokemon Trainer starting with Ivysaur or Charizard instead of just Squirtle. Doubt this would affect anything in the skipping tourney, but I'm more interested in how it plays out, I suppose.

  • @oygemprime3864
    @oygemprime3864 1 year ago +2

    Did you test level 1 cpus vs level 2 cpus? to see if it's just level difference or something else?

  • @kenjib88
    @kenjib88 1 year ago +1

    you did a great job showing the statistics

  • @banif1
    @banif1 1 year ago +2

    Nice spy plays

  • @AgentQuacc
    @AgentQuacc 1 year ago +2

    What an interesting video! This sounds like torture to make though 😅

  • @momothemagecat
    @momothemagecat 1 year ago

    This reminds me of one time where me and my brother had this drinking game where we got like 32 bots of random levels and we had to guess which bot won what match. Every time we failed to guess, we took one shot of this absolutely devilish mix of alcohols I liked to call "The Bog Water".

  • @GaybrohamStinkton
    @GaybrohamStinkton 1 year ago +1

    Shoulda done Alph vs Olimar

  • @hunterculpepper1973
    @hunterculpepper1973 6 months ago

    The current strongest spirit in the game dwarfs Akuma in raw strength only, with 8421 offense versus 7941. Or there's Absolutely Safe Capsule, whose defense is 10000, meaning it takes low damage and hits like normal. Also, spirits can make Mario's fireball do 36 percent with Evil Ryu at max. Also, your murder spirit is, stat-wise, only good at Lv. 99 because it has no stat buffs and no support slots, which also contribute to the randomizer factor. Don't worry, I did the work for you: defense is a 30 times multiplier over attack type and vice versa. Give the losing character a buff and they will go back to random, or make the stats bigger. Yeah, being weaker doesn't matter with power and a 30 percent buff: 5800 power beats 6440, if you maxed him out for the full 13941 power.

  • @prizm9515
    @prizm9515 1 year ago

    Yeah, the tourney odds in Ultimate are much more balanced than in Melee. One time in Melee I had to fight a level 3 in grand finals in a 64 entrant tournament of random CPU levels.

  • @Kachop05
    @Kachop05 1 year ago

    I remember the day I realized just how stupid the Bayonetta CPU is
    We let a friend (who knew nothing about the game) play, and, to prank him, we put him against a level 9 CPU. Our thought process was "yeah, she was broken in Sm4sh, I reckon she'll be good in Ultimate too"
    My friend SWEPT that Bayo
    We've never told him of course

  • @Oswerb
    @Oswerb 1 year ago

    Genuinely surprised at CPU tier lists existing.
    Also surprised none of them have Little Mac at the bottom. This isn't a joke about Mac being "low tier"; I know that he's good when a skilled player uses him. However, the CPU just doesn't know how to move around the stage, recover to ledge, or combo with him.

  • @megamudkip5913
    @megamudkip5913 1 year ago

    Alright, Mr. PKBeats, what about amiibo? Their battles can also be skipped, so surely the same kind of logic would also be followed by them in terms of determining randomness when skipping their battles, right? Obviously observing them would, but what about skipping them?

  • @haniyasu8236
    @haniyasu8236 1 year ago +1

    Ok, ngl, this would be a superb test-case for showing off some statistics tools. For having a math degree, I'm admittedly a little rusty at stats, but the binomial distribution, p-values, confidence intervals, and t-tests all seem to be pertinent here. It could make a pretty sick educational video actually to teach these concepts using the cpu tourneys as a launching off point.
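As a taste of what those tools look like on this data, here is a minimal sketch using only the Python standard library. It assumes the 4-96 level 8 vs level 9 result quoted elsewhere in the thread came from 100 independent skipped matches, which is an assumption about how the test was run. The helper names are made up for illustration, and the Wilson score interval is just one common choice of confidence interval.

```python
from math import comb, sqrt


def binom_pmf(k, n, p):
    """Probability of exactly k wins in n independent matches."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def two_sided_p_value(k, n, p=0.5):
    """Exact binomial test: total probability of outcomes no likelier than k."""
    observed = binom_pmf(k, n, p)
    return sum(
        binom_pmf(i, n, p)
        for i in range(n + 1)
        # Small tolerance so symmetric outcomes count despite float rounding.
        if binom_pmf(i, n, p) <= observed * (1 + 1e-9)
    )


def wilson_interval(k, n, z=1.96):
    """Approximate 95% confidence interval for the true win rate."""
    p_hat = k / n
    denom = 1 + z**2 / n
    centre = (p_hat + z**2 / (2 * n)) / denom
    half = z * sqrt(p_hat * (1 - p_hat) / n + z**2 / (4 * n**2)) / denom
    return centre - half, centre + half


wins, matches = 4, 100  # assumed: the 4-96 result came from 100 matches
print(two_sided_p_value(wins, matches))  # chance of a result this lopsided if it were 50/50
print(wilson_interval(wins, matches))    # plausible range for the true win rate
```

With these inputs the p-value is vanishingly small, so a 50/50 coin is ruled out, and the interval comes out at roughly 1.6% to 9.8%. That is still wide enough that 100 matches cannot distinguish a true 4% win rate from something like 4.5%.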