Leveraging Running Stateful Workloads on Kubernetes for the Benefit of Developers

Data on Kubernetes Day EU 2022

Kubernetes comes with valuable features such as Volumes and StatefulSets, which simplify running stateful workloads. Interestingly, when combined with the right tools, these features also make Kubernetes very useful for developers who want to run massive production databases in development! This is precisely what happened at Extendi.

The developers at Extendi deal with a large amount of data in their production Kubernetes clusters. But when developing locally, they had no easy way of replicating this data. They needed that replica because it let them test new features right away, without worrying whether those features would work as expected once pushed to production. But replicating a 100 GB+ production database for development was not turning out to be an easy task!

This is where combining Kubernetes with remote development environments came to the rescue. Running data on Kubernetes turned out to be much faster than any of the traditional approaches because Kubernetes handles stateful workloads exceptionally well. And since Extendi already used Kubernetes in production, the setup process was fairly straightforward.

This talk by Lapo, Arsh, and me will cover the practical steps: how Kubernetes-based development environments allowed dev teams at Extendi to run production data on Kubernetes during development using features like Volume Snapshots, and how this significantly improved developer productivity.
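
To give a rough idea of what the Volume Snapshots approach can look like, here is a minimal sketch, not Extendi's actual setup. It builds two manifests: a VolumeSnapshot taken from an existing database PersistentVolumeClaim, and a new PersistentVolumeClaim restored from that snapshot for a development environment. All resource names, the snapshot class, the storage class, and the size are hypothetical placeholders. The sketch also assumes a CSI driver with snapshot support is installed and that the snapshot and the restored claim live in the same namespace.

```python
import json

# API group of the Kubernetes VolumeSnapshot resources.
SNAPSHOT_API_GROUP = "snapshot.storage.k8s.io"


def volume_snapshot(name, source_pvc, snapshot_class):
    """Build a VolumeSnapshot manifest that snapshots an existing PVC."""
    return {
        "apiVersion": f"{SNAPSHOT_API_GROUP}/v1",
        "kind": "VolumeSnapshot",
        "metadata": {"name": name},
        "spec": {
            "volumeSnapshotClassName": snapshot_class,
            "source": {"persistentVolumeClaimName": source_pvc},
        },
    }


def pvc_from_snapshot(name, snapshot_name, storage_class, size):
    """Build a PVC manifest whose volume is pre-populated from a snapshot."""
    return {
        "apiVersion": "v1",
        "kind": "PersistentVolumeClaim",
        "metadata": {"name": name},
        "spec": {
            "storageClassName": storage_class,
            "dataSource": {
                "name": snapshot_name,
                "kind": "VolumeSnapshot",
                "apiGroup": SNAPSHOT_API_GROUP,
            },
            "accessModes": ["ReadWriteOnce"],
            "resources": {"requests": {"storage": size}},
        },
    }


def render(*manifests):
    """Wrap manifests in a v1 List and serialize them as JSON."""
    bundle = {"apiVersion": "v1", "kind": "List", "items": list(manifests)}
    return json.dumps(bundle, indent=2)


if __name__ == "__main__":
    # Hypothetical names: adjust to the cluster's real PVC and classes.
    snap = volume_snapshot("prod-db-snapshot", "prod-db-data", "csi-snapclass")
    dev_pvc = pvc_from_snapshot("dev-db-data", "prod-db-snapshot", "standard", "100Gi")
    print(render(snap, dev_pvc))
```

Since kubectl accepts JSON as well as YAML, the printed output could be piped into `kubectl apply -f -`. A development database pod that mounts the restored claim then starts with a copy of the data as it was when the snapshot was taken, rather than needing a fresh dump and restore.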