Kafka Consumer Container Restarts in Kubernetes: A Production Case Study
A prod debugging story you’ll probably relate to

Search for a command to run...

Series
Bits and Pieces I learned on how to build and scale a distributed system. All the posts in the series belong to the real problems I faced during my 9-5 and the solutions we went through with.
A prod debugging story you’ll probably relate to

The $0 solution to a Distributed Cache Invalidation

Introduction At my current org, we have many applications running behind the API gateway that interacts with the outside world using REST APIs. To track server-side failures we built an alert system that nudges every time we get any 5XX error. As soo...

Introduction Any application that communicates with other resources over a network has to be resilient to transient failures. These failures are sometimes self-correcting. For example, a service that is processing thousands of concurrent requests can...

Important points to consider, Relational vs Non-relational Database, CAP theorem, and more!

Low Altitude Flight Over the Messaging Queue Landscape
