How Canva Saved Millions in AWS S3 Costs
HTML-код
- Опубликовано: 9 фев 2025
- In this video we will explore how Canva managed to save million of dollars from their AWS bills.
References:
Canva Article: www.canva.dev/...
Videos:
Video by Pressmaster from Pexels: www.pexels.com...
Video by Produtora Midtrack from Pexels: www.pexels.com...
Music:
♪ [non copyright music] Lofi Type Beat - Saturday Morning | aesthetic lofi music / Lofiru Link : [![](www.gstatic.co...[non copyright music] Lofi Type Beat ...]( • [non copyright music] ... )
Hello!! 👋 Wanted to share how Canva managed to save millions of dollars 💵💵 from their AWS S3 bills.
Hope you get to learn something new today!
That line after the zeros at 4:25 to make it easier to compare is such a beautiful idea. Appreciate the attention to detail
Lmao the captions at 1:11. "Just don‘t tell Bezos you want to leave."
LOOOL you found my hidden joke!!
And now their CEO can buy twice as much Lamborghinis per year. Nice one DevOps team ❤
Right? CEO is gonna get a fat bonus and the engineers two pizzas of their choice 😅
Animations be smoother each time. Great video!
Thank you for tuning in as always! 😉
@@kikisbytes what software you use to create this animations?
@@FitraRahimCanva
@@Colin_McPhail fr? dang i thought i was good at making canva presentations
@@FitraRahim I use motion canvas to make these animations
Those opptimal calculation though then again kudos on aws team that making the lifecycle feature and it works like a charm
i love how you gave credits to free pexel creators . surprised to see after this great video ,you have only this much of views and subscriber .all the best . loved this video
Thank you I really appreciate that!! Yeah, I give credits to all videos I use online since they worked hard for it :)
This video was great never knew data storage was that expensive for large companies!
High quality video and great animations! learned a lot from this video. Thanks.
That's awesome! I'm so happy you were able to learn something from this video.
Super cool to see how companies manage their cloud infrastructure to serve this much data. Those hyper scalers get expensive quickly!
Thanks for the vid, great job!
This is incredibly well made, keep making more videos!!
Thank you for watching!
I was expecting some amazing compression, but this works too
good vid. one day it’s gonna resurface on my homepage with 10M views 💪💪
I was even thinking about backing up my NAS to AWS Glacier for extra safety. The pricing is just so tempting
new to your channel. thank you for the dense value provided about to binge the rest of your content 🙏
absolute banger content, keep it up bro
Thank you so much!
wow this is info and value packed, insane how only 10k subs! going through rest of vids
Thank you!! Hope you enjoy some of my other system design videos :)
broo you should have millions of views
Awhh thank you, that means a lot!! 😊
bro saved millions in youtube views
@@astenium0have me dying 😂
@@astenium0lmao
@@astenium0😂😂😂😂😂😂😂
Hey Kiki. I have just discovered your channel and this is my first video. It is awesome, and I am sure I will be binging your catalog for quite some time.
Awesome and thank you for stopping by! The pinterest video is a good watch :)
This video is great! The only fault I can find is your mic quality, but a really well made and interesting video!
Thank you!! Yeah I should really upgrade my mic soon, I'm using a $30 one to record all my videos.
You should’ve included on site self storage, which is a fraction of the cost AWS has. When you get very large, it makes sense to have your own data center.
Datacenter engineer here: Not for a globally consumed application. It would take a minimum of 30 or so datacenters to store the same data in the same geographical regions. A single 1PB storage server at this tier is about $800k before support contracts, and you’re kidding yourself if you think they only need one. Let’s pretend there’s no maintenance or staff needed for each DC, as soon as it starts to pay for itself the hardware reaches end of support, and you have to purchase all new hardware and migrate, which also costs millions.
Now please explain to your boss that they need to deal alllll that instead of typing “terraform apply”.
@@TheOfficialOriginalChad thanks for the reply. tough argument but when your as large as Netflix or Spotify it make sense to go on premise, it a lot of work but money saving make companies take on hard task!
@@TheOfficialOriginalChad why do you need multiple servers for something that takes a couple seconds to load anyway? No user will tell the difference between their project taking 1 second to load or 1.5s because it has to go all the way from Australia to the US or something.
@@theairaccumulator7144network costs also add up over distance.
@@WhyInnovate i think you might be right or close to right. Netflix and Spotify would save money on STORAGE on-premise. Though storage is not their highest cost; data transfer is. I can’t estimate their data transfer numbers, but I can say with reasonable confidence it’s much cheaper through cloud providers.
I use to build out big DC's & Nwks. Your data is 100% on target. We called AWS/GCP/Azure the "Hotel California" Model. You can go there, but you can Never Leave. :-) Big DC's get huge discounts on hardware, but managing it is costly.
Crazy how this was done with a policy and not application code.Very simple cost optimisation tbh.
This is high quality stuff!
Yes that true, show the log of the buckets which object is accessed nearly over 1 year ago then should be placed to s3 glacier
Thanks for putting together the video. Been sharing it internally within Canva :)
That's awesome thank you for sharing!
And here I am trying to dynamically adjust storage tiers for files on my 3 MAU app. People are really out here just going with the flow huh.
Hey Kiki’s bytes, I love the format of how you explain things, my ADHD brain managed to sit through the entire thing and stay engaged, can I request that you teach programming in a similar format?
Thank you! I'm so glad that this format was helpful.
What kind of programming format are you thinking? Algo / data structure or general programming? I do want to make programming videos but also don't want to just share my screen and do tutorials since there are so many videos out there already. I usually find that using animations and visuals really help when I'm trying to learn something new.
I would consider doing some higher level topics. So let me know what you have in mind!
@@kikisbytes not to sound too greedy, algo and data structures yes please! Even going through the languages like python, c, ruby will be super useful, at least for me. Hahaha. Thank you for taking the time to make these videos that impact others positively.
Alternative title: how canva burned millions to AWS for years
Ouuuu that’s actually sooo goood!!!
This is how CFOs look at things constantly. You know, because the CFO would have deployed properly with all future considerations perfectly handled of course.
great content
So S3 standard for the win 😮
haha yeah for smaller object size
Damn.. AWS is GOATed
Great and insightful video! 👍 At 2:20, I am curious as to why you changed "207 PB (90% of 230 PB)" to "270 PB (90% of 300 PB)" during the transition. An oversight, perhaps? The math checks out either way.
bleh I originally had it as 270 but during editing I noticed it was wrong and changed it to 203. Guess I didn't cover the whole thing on Davinci Resolve :p. But man good eyes!!!
They are big enough to maintain their own cloud/datacenter, which would save them much, much more money. Stay away from vendor lock in kids, even when the cloud makes sense.
These insights are really valuable! You did great research. Great Video
Thank you! But have to give the credits to the Canva team for their amazing article.
Wow, Great Analysis! Kiki I think you are a High Scalability site Fan Just like me right?!? ;)
ahhaha of course! Are you the author?
@@kikisbytes Hehe, no of course!
But I still very vividly remember when back in 2009 read the story of this Mega Chad that created the Plenty of Fish site with millions of users!
One-man band!
You can read his story in All Time Favorites Architectures - it is Worth it! :)
Awesome
Awesome video! Judging by fluidity, was this video created in Canva?
hahaha only use Canvas for thumbnail!
The biggest benefit could probably be with migrating off the cloud at this point
This is an amazing video. I am sure this channel will blow up in popularity soon. Subscribed!
That means so much thank you!!!
What do you use to edit your videos? They're great
Amazing piece of content! What do you use to edit videos?
Ty!! I use Motion Canvas!
Amazing video, subbed !
Thank you so much!!
I'm guessing from the fact it wasn't addressed that data retrieval fees on Glacier IA were a negligible factor?
Think if would still cost quite a bit, but these data typically won't be requested too often when they are there.
I was thinking that was a video about how to stop using AWS, most more useful. For that money per month the can create a new company just serve the content with less security issues than S3...
Same here. I thought they moved to data centre. But that also comes with a ton of headaches. 😅
You can buy a petabyte storage solution for every month you're running your aws services, just saying
Nerrrrrd! 🤓🤓🤓
LOOL
Hi how do you make these videos? Like the format and would love to get started with this format to make some math education videos
You should check out Motion Canvas! It's amazing for making math videos, they also have latex component!
@@kikisbytes thank you so much. God bless you
@@abhardwaj02 absolutely! Please let me know if you ever end up making the video. I would love to tune in :)
Storage costs are much higher than I would have guessed.
I thought so as well originally and had to look up how much Canva was actually worth.
Interesting video. Btw, which aws service do you think would be the best to host an open source ai model? I mean could i do that on S3?
hmm maybe something like SageMaker?
@@kikisbytes Thanks
Great video gives me some idea of the scale.
Thank you for taking the time to watch!
can you tell more about your company or website which you said that cost $30k per month
Today, maybe just start with Intelligent Tiering for S3. Does this all for you.
If you're Canva and it's the before times and IT isnt available yet, sure do some analysis and curate the storage tier individually. But for everyone else in 2024 use Intelligent Tiering as your _default_ storage class.
Thank you for mentioning this! I should've of taken the time to briefly mention this in the video.
Great great great work on this video. Been using canva for work for years now thanks to my s.o but I never thought about if they use AWS 😧 seems everyone is these days
you are going to become famous guaranteed along as you stay consistent and relevant. You explained this topic so well that it makes me want to steall all of canvas data so I can work out how to solve this storage problem.
Hahaha thank you so much!
I was trying to take a rest from studying for my god damn radiology physics just to be tricked into more math in this video, no thank you I will pass, maybe return to it again later
hahaha thank you for stopping by anyways and good luck on with your study!
Did they consider tar'ing up a bunch of the small objects to pass the minimum size overhead?
dope
Didn't mention Storj/IPFS. Which is cheaper: Storj or AWS S3 Glacier?
The fun thing is. These policies exists in Azure, AWS and pretty much every cloud provider and are part of basic training. And yet people get layed off because of 'lack of experience'. Yeah sure m8.
Thank you for another interesting video!
Thank you Too
Thank you! I'm glad you liked it! 😊😊
So I'll summarise this vid, basically they left the server on
Why don't they just switch to R2? You only considered the storage pricing. The egress costs most likely make up a major portion of their storage and delivery budget. In the grand scheme of things and the scale at which Canva is operating, recurring revenue of >$2B/yr, saving $3.6Mn/yr is a drop in the bucket.
Instead of investing so much time to save this money, they could have saved possibly 10s of millions if they moved off of S3 to something like R2 or built their own custom CDN.
I don't know why people still use S3 when it is one of the most expensive storage services out there. I'm just talking about storage here, egress is another beast.
Cloudflare doesn't make new features into R2 currently (I'm assuming) because not a lot of people are using it. AWS has a lot of clients using their storage products. If cloudlfare gets 10-20% of those clients, it would do them wonders and they would start building features of S3, R2 currently doesn't support versioning, they are not investing a lot of money in it because it isn't a big enough revenue driver for them currently.
I’m wondering how much they paid those developers for this huge save on monthly costs
Amazing video, I learned a lot! Subbed!
Thank you! I'm glad you were able to learn something, stay tune for the next video!
Nice animation! Subscribed
Thank you!
Maybe they could have gone with a hybrid model or manage there own storage setup like base camp
hahaha yeah they probably just want to focus on their own things instead of having to worry about storage.
Clear and concise, loved it!
How did canva redirect their savings? What would AWS have done with those gains instead? Where was the increase in quality and on which side of this transaction?
Till AWS will reconfigure it 😂
Wow! The quality of your videos are awesome.
woah so underrated!! awesome and educational vid
Thank you for watching!
If it is expensive to move many files cant they just put many files together and move them as one file?
Loved the video. Thank you!
Thank you!
Moral of the story: don't use AWS, they're expensive as hell.
Expensive? Yes, but also can easily save money in running and maintaining an on prem data center
Expensive compared to what? Building your own data center? If you have the money to pay for humongous upfront costs and considerable running costs, sure. Other data centers generally don't offer much better pricing than aws
@@vinylSummer no competitor can offer everything AWS does, but many data centers can charge less for some services. You're right in the sense that it depends on your use case. I've had costly experiences with AWS, and found better prices elsewhere.
For example, I still left some servers in AWS, and this year they decided to charge more for IPv4 addresses to "stimulate adoption" of IPv6. We allocated IPv6 addresses, but most software is not ready to use them, so it's pointless, we can't stop using IPv4, that means our costs went up for no reason, and there's not much we can do about it. We could try to leave AWS, but would have to perform costly address changes for many thousands of clients, so we just accept this arbitrary cost.
@@vinylSummer I answered you, but it got removed. no competitor can offer everything AWS does, but many data centers can charge less for some services. You're right in the sense that it depends on your use case. I've had costly experiences with AWS, and found better prices elsewhere.
What are you going to use then Einstein
Clicked they made their own server or implemented some client side processing stuff. Disappointed. Good video though
great content!! subscribed 💯
Thank you for subbing!!! ❤️
Keep it up! I don’t think I can allow myself to not watch more of these 😂
Wow thank you!! That means a lot! Working on a new video right now, hopefully you’ll tune in later :)
Nice video but I just wonder with that talent and price tag why they wouldn't consider moving to on prem
I'm guessing resources and complexity issues
Do you think there’s a way to optimize that one time cost even further? That’s the one thing holding my company back
Oh wow that's really cool that your company is experiencing the same issue. IMO it would be best to find that break even point and slowly migrate objects over so that the transition fee is offset by the lower storage fee. Let me think more about this. Will let you know if I can think of anything else!
@@kikisbytes Thank you, we work with a lot of simulation data and have some petabytes in S3. We add TBs per day and we're using glacier for a bit of our storage so far but I think we can have better lifecycle policies in place. I actually came up with this idea too a week or 2 ago but the cost savings wasn't that attractive because of that initial cost.
I wasn't able to see how long our data was being frequently accessed for in the storage analysis or maybe I was looking at the wrong tool but I know it can be improved. I can double check on Monday though. Calculating the costs was a part of the issue for me. Definitely appreciate it if anything more comes to mind!
You have to wonder how much of that 100 PB of content has been abandoned by users. If I was Canva, I would create my own offline storage solution. I estimate you could create an on-prem 130 PB storage array for about $1.2-$1.3 million USD. Perhaps cheaper. I would institute a policy that basically anything not touched for ~24 months will be moved to offline archival, and could be restored with a 3-5 day SLA. You don't even need to keep all of that array powered up at all times to save on electrical costs. Why pay Amazon all of this S3 storage fee for something that may never be accessed again?
until you realize it its best you just abandon the whole on prem storage idea entirely and give in to s3 glacier deep archive ( which is a 12 hour retrieval vs that 3-5 day SLA ) which is only 1/4th of the cost of s3 glacier instant retrieval and 23 times less expensive than s3 standard, while having better reliability, uptime and security canva could ever dream of. lose-lose with on-prem offline storage. even with (possibly) never again accessed content being 10-20 petabytes (which is probably not even close), the cost of still having your customers data in s3 glacier deep archive pales in comparison to having your own on-prem data center.
The drives themselves would be ~1$ million and that's not even counting the servers. I would just automatically delete data that hasn't been accessed for more than a year instead of archiving it. A large part of Canva users are kids in school and they're rarely using the same account twice much less accessing old projects.
@@theairaccumulator7144 Deleting data is unacceptable, greetings from r/datahoarder
This idea is so bad I can't even bother to rebuke it given it will result in a massive wall of text
@@AQHackAQcan you do a TLDR on why this is a bad idea?
If you’re moving the object from standard to glacial, how do you maintain the object link?
i Imagine the public facing links would not be the internal s3 object link. Otherwise it precludes you from making any storage layer changes without breaking stuff at user level.
How do you make such smooth animations?
I use motion canvas to make my animations :)
What tool was used for animations?
Slick af
Thank you! I use motion canvas to create my animations :)
Great video as always! Keep it upp bro. I wanna see more videos from you. Thank you for making such masterpiece
Thank you and stay tune for the next video!
so in short we can say they applied lifecycle rule mgmt for their buckets to save costs. Am i missing something here?
@@rohanrustagi7857 the lifecycle rule isn’t the bit saving money, it’s the storage classes. Infrequent access (and similar) are charged cheaper rates. But yeah, that was their way to save costs, you got it! 👏
That’s exactly it, thank you this!
To add to this, it’s about them figuring out their data usage pattern and aws releasing a new s3 class that just happened to fit their exact use case with cheaper storage cost.
@@oSpam thank you 🙏
@@kikisbytes thank you 🙏
@@rohanrustagi7857 no probs. Glad you’re learning 😄
If you have objects that are 20kb, can't you just group them up into single objects? And then splice the part you care about upon retrieval?
hmm I think if you were to group into a single object then it might be hard to know exactly where to look. Like for example, where exactly is that 20kb piece located and in which file.
What are you thoughts?
@@kikisbytes Probably a SQL for CRUD, to maintain the object.
E.g.,
"Object" table has 5 columns, Obj1 Obj2 Obj3 Obj4 Obj5
Each one has an associated size so the offset for Obj4 is the sum of lengths of Obj1-Obj3.
A bit of a pain, but not the end of the world. At that point it's almost like handling object storage yourself though.
This was great! Very interesting thank you
Appreciate you taking the time to watch this video!
Great video! Definitely sharing this!
straight to point nice loved it ❤
Or use a alternative provider like Wasabi and you can Safe much more.
First time hearing about Wasabi, are their apis developer friendly?
@@kikisbytes We have switched from AWS S3 and Azure Blob storage to Wasabi and to IONOS. The APIs do not change as the storage is S3 compatible. Also fully compatible with AWS IAM.
We also switched all servers from AWS and Azure to IONOS. The cost saving per year, for server and S3 is about 62%. But the Server move was of course much more complex.
Have you noticed that their image quality is not as good as Adobe? I use Adobe express and once I export it, it can go to 2 MB or more depending on the assets in there. But in Canva it's less than that and once you upload it on social media, you can clearly see it by the ppi. Adobe has better ppi so the post on Instagram is better using Adobe. I am wondering if it is because they are saving cost?
all whilst the engineers get to keep their salary with no bonus
I love your content. It's amazing
Thank you really appreciate!
I just discovered this channel, the quality is insane for 10k subscribers
Wow I really appreciate that!!! Thank you for being part of the 10k subs.
❤❤❤
haha thank you!
i feel like cloud solutions are intentionally made to be difficult to uderstand cost-wise
❤
I've never used S3 due to the costs and wow is it ever expensive
Poor people also had to use the AWS web interface which is impossible to navigate
it can get unwieldy if you don't manage their data correctly at that scale
Should just host their own data