S3 🪣 - All the File Storage You’ll Ever Need!


Hi Reader 👋🏽

we are at the end of our database & storage section. Let's finish with one of the most well-known service in AWS ... 🥁 S3.

Even developers who don’t work with AWS know and use S3.

We’ll cover some of the main concepts

  1. Bucket & Objects 🪣
  2. Storage Classes 🏛️
  3. Event Notifications ⚡
  4. Versioning ⏪
  5. Pricing 💵

Let’s go 🚀


Infographic S3

Here you’ll find the infographic for S3 to get a head start.

Buckets & Objects 🪣

One of the main concepts of S3 is buckets & objects.

One bucket contains multiple objects. S3 is a global service (you don't need to choose a region in the management console). However, your bucket still lives in a region. Weird, I know.

Storage Classes

A fantastic feature of S3 is the ability to set storage classes to your bucket. Amazon offers 8 different storage classes depending on your needs.

The goal of changing a storage class is to save money. It is always a trade-off for other factors such as:

  • How long does S3 need to return objects
  • How safe is your data
  • How much effort do you have to retrieve your data

Here is an overview of all storage classes:

Event Notifications ⚡

Building event-driven systems are a huge part of AWS. There are whole applications that are built on the basis of S3.

One good example is the creation of thumbnails. Imagine having a YouTube channel. A YouTube channel needs thumbnail pictures.

You can configure S3 to receive images and let it call lambda to create thumbnails out of them.

S3 can trigger SQS, SNS, or Lambda. We would use Lambda for this use case.

S3 triggers the Lambda function with an event for each image upload. You can also execute notifications on change or deletion events. Lambda can go ahead, take the file, and create the correct sizes. This is a very common pattern.

Versioning ⏪

Often, you don’t want to override your data but you want to generate a new version of your S3 object. S3 allows you to version your objects. With that, you can keep older versions and you can roll back to a previous state.

This reduces your risk of accidentally overriding objects.

The main benefit here is that it is fully managed. That means you don’t need to do anything other than activate the actual versioning.

Once you've enabled versioning you can show your versions in the S3 console by switching the button Show versions.

I uploaded a file with the same name bucket-objects.png twice. You see that two different versions of that file exist now.

If you want to restore one you can either remove the old one. Or you can download and upload the file again.

Versioning doesn’t cost anything. You only pay for the additional storage you need.

Pricing 💸

This brings us to costs. S3 is a serverless service. That means its pricing is 100% usage-based. You don’t pay for any computing.

The pricing in S3 is mainly based on storage and on the number of API calls.

The main pillars of pricing are:

  1. Storage: Depending on the storage class. About $ 0.023 per GB
  2. Requests against your buckets: Depends also on the storage class. Infrequent ones are more expensive. About $ 0.005 / per 1,000 requests.
  3. Data Transfer: Transfer coming in and out of S3. From the internet to S3 is free. From S3 to the internet cost about $ 0.09 / GB. That is a huge cost driver.
  4. Advanced Features: Some advanced features like Intelligent Tiering are charged on top.

Final Words

That's it for today! We hope you enjoyed this issue about S3. Check out some of the related blog posts about S3.

Thank you and see you in 2 weeks 👋🏽
Sandro & Tobi ✌🏽

Related Posts

AWS for the Real World

Join our community of over 8,800 readers delving into AWS. We highlight real-world best practices through easy-to-understand visualizations and one-pagers. Expect a fresh newsletter edition every two weeks.

Read more from AWS for the Real World

⌛ Reading time: 13 minutes 🎓 Main Learning: How to Run Apps on Fargate via ECS 👾 GitHub Repository ✍️ Read the Full Post Online 🔗 Hey Reader 👋🏽 When building applications on AWS, we need to run our code somewhere: a computation service. There are a lot of well-known and mature computation services on AWS. You’ll often find Lambda as the primary choice, as it’s where you don’t need to manage any infrastructure. You only need to bring your code - it’s Serverless ⚡️. However, more options can be...

⌛ Reading time: 10 minutes 🎓 Main Learning: Running Postgres on Aurora DSQL with Drizzle 👾 GitHub Repository ✍️ Read the Full Post Online 🔗 Hey Reader 👋🏽 With re:Invent 2024, AWS finally came up with an answer to what many people (including us) asked for years: "What if there were something like DynamoDB but for SQL?" With Amazon Aurora DSQL, this is finally possible. It’s not just a “scales-to-zero” solution like Aurora Serverless V2. It is a true distributed, serverless, pay-per-use...

⌛ Reading time: 12 minutes 🎓 Main Learning: CloudWatch Launches re:invent 2024 ✍️ Read the Full Post Online 🔗 Hey Reader 👋🏽 re:invent happened already two weeks ago and there were some amazing launches 👀 CloudWatch got a lot of love at that re:invent. This is why we are showing you our top CloudWatch launches for this year. We've worked through all of them, tried to get them working with our example application of the CloudWatch Book, and are now busy updating the book ✍🏽. Let's dive into...