I built a Claude skill that replaced CloudWatch.


AWS FOR THE REAL WORLD
⏱️
Reading time: 6 minutes
🎯
Main Learning: Three levels of AWS observability β€” from raw console.log to a Claude Code agent that queries CloudWatch for you.
πŸ“
🎬

Hey Reader πŸ‘‹πŸ½
Quick check β€” when did you last close the CloudWatch console and feel like that was time well spent?
Yeah, me too. And I'm done with it.

In this week's video, I will show you how to never open the CloudWatch console again.

As a good alternative, check out our sponsor of this week!

Sponsored

Catch cloud cost problems live. Not in 30 days.

The end-of-month AWS bill versus Datadog's real-time cost view

For years, I found out about cost problems the way everyone does β€” when the AWS bill arrived. By then, the meeting where someone asks "can we cut cloud costs next sprint?" is already on the calendar.

Datadog Cloud Cost Management treats cost like any other signal. Same dashboard as your latency and error rates.

What I like about it:

  • πŸ“ˆ Real-time, telemetry-driven cost. Not the 30-days-late billing export.
  • πŸ€– AI & GPU cost visibility. Inference workloads are the new surprise line item.
  • πŸ“¦ Deep K8s and container allocation. Finally know which service is burning money.

And if you flip it on and discover you're spending way too much on containers β€” they have a free eBook on exactly that.

Sponsored by Datadog.

πŸ“š This Week's Deep Dive

Most AWS debugging time goes into the console.

Click into a log group. Pick a stream. Squint at plain text. Filter by string match. Tab over. Compare timestamps. Repeat.

Three years ago that was my whole life. Then I went through three levels of observability β€” and stopped opening the console entirely.

Level 1 β€” Log Hell

console.log("Processing order " + orderId + " for customer " + customerId) scattered across 15 log groups. No correlation IDs. No log levels. When a customer reports a missing order, you grep.

It works on a Tuesday afternoon. It breaks at 2am.

Level 2 β€” The Three Pillars

Structured JSON logs (AWS Powertools), EMF metrics, X-Ray traces. Every log carries the same correlationId from API Gateway β†’ Lambda β†’ SQS β†’ Lambda β†’ DynamoDB. Logs Insights actually works β€” one query, both log groups, all events for one customer.

This is what AWS docs recommend. Most teams stop here. It's the prerequisite for what comes next.

Level 3 β€” A Claude Code Agent

One markdown file in .claude/agents/ that knows your log groups, knows your query templates, and talks to CloudWatch via the AWS CLI.

I type: "find any orders that timed out yesterday."

It picks the right log groups. Writes the Logs Insights query. Polls for results. Groups errors by type. Tells me which customers were affected. No console. No query syntax.

The full agent file is in the free GitHub gist. Copy it, swap your log group prefix and AWS profile, you're running in 20 minutes.

Watch the video for the full walkthrough β€” the e-commerce order-processing demo, real seeded timeouts, and why Level 2 is the prerequisite that makes Level 3 work.

That's it for today

If you're unhappy with your observability -> Check out Datadog.

If you're using CloudWatch -> Use AI for it.

See you next week!

Sandro & Tobi

AWS for the Real World

We teach AWS for the real world - not for certifications. Join more than 10,500 developers learning how to build real-world applications on AWS.

Read more from AWS for the Real World
AWS DevOps Agent β€” your AI SRE is now on call

AWS FOR THE REAL WORLD ⏱️ Reading time: 12 minutes 🎯 Main Learning: AWS DevOps Agent investigates incidents autonomously across CloudWatch, CloudTrail, and your code. It surfaces evidence brilliantly β€” but can confidently point at the wrong root cause, so don't apply its fixes blindly. πŸ“ Blog Post Hey Reader πŸ‘‹πŸ½ I was in Portugal for the past week. 10 days of tennis, padel, sun and waves 🎾 Highly recommended place! Our daily lives as software developers really changed since we started using...

AWS FOR THE REAL WORLD ⏱️ Reading time: 10 minutes 🎯 Main Learning: S3 Files gives POSIX access at S3 prices: 13x cheaper than EFS for large files, but the 60-second write-back delay silently breaks coordination patterns! πŸ“ Blog Post πŸ’» GitHub Repository 🎬 Watch on YouTube Hey Reader πŸ‘‹πŸ½ Sandro is currently traveling through Portugal - work and fun combined! If you happen to be around, reach out. Would be great to meet up! 🀝 This week we're digging into S3 Files: a POSIX file system backed by...

AWS FOR THE REAL WORLD ⏱️ Reading time: 6 minutes 🎯 Main Learning: 5 common AWS account mistakes and how to fix each one in under 10 minutes 🎬 Watch on YouTube Hey Reader πŸ‘‹πŸ½ New week, new AWS deep dive 🐠 In this one, we'll show you the 5 most common mistakes we've seen in almost every AWS account we've looked at. Yes, there are more out there. But these are the ones you'll see everywhere. And they're pretty simple to fix! The good news? Most of these fixes take under 10 minutes. Rather watch...