Building Scalable Data Platforms on GCP

Most data platforms don’t fail because of bad code — they fail because of bad structure.
In this talk, we will look beyond pipelines to explore what truly makes a data platform
sustainable on Google Cloud Platform (GCP). Drawing on real-world projects, we’ll discuss
how ownership, Infrastructure as Code (IaC), environment isolation, and governance shape
the long-term reliability and scalability of data systems.
You’ll see how platform thinking — layering ingestion, processing, and serving — helps
teams reduce duplication, clarify responsibilities, and evolve their architecture as the
organization grows. The session blends practical design choices with lessons learned from
multiple GCP implementations, offering a realistic path from “just pipelines” to “platforms that
scale and last.”

Vorkenntnisse

This talk is ideal for Beginner Data Engineers, BI professionals, and Developers looking to optimize data workflows in a cloud-native environment using modern ELT techniques.

Lernziele

By the end of this session, attendees will:

  • Understand the key structural pitfalls that cause data platforms to degrade over time.
  • Learn how to design clear ownership, environment, and IaC boundaries on GCP.
  • See how layering ingestion, processing, and serving leads to more maintainable architectures.
  • Discover patterns for scalable governance and automation in modern data platforms.
  • Take away actionable lessons for evolving from data pipelines to platform architecture thinking.

Speaker

 

Olivier Bénard
Olivier Bénard is a Data Software Engineer, actively involved in the migration and maintenance of operations to Google Cloud Platform (GCP), combining architectural design with hands-on implementation to ensure a smooth and scalable transition. While his primary focus is on data platform migration and system architecture, he also integrates DevOps and software development practices to enhance scalability, automation, and long-term maintainability.