Lead SRE - BeReal

 France
  
5-10 years experience

AI Summary

Define and drive SRE practices, including SLIs, SLOs, and incident management, to ensure platform reliability and scalability. Lead infrastructure automation and FinOps initiatives to optimize cloud costs and performance on GCP.

About BeReal

At BeReal, we are dedicated to authenticity in social media. By encouraging users to share unfiltered moments, we foster genuine connections and celebrate real life. We are now an international team of 100+ and have 40M+ monthly active users. Backed by Voodoo, our team is fully focused on scaling BeReal into an iconic social network used by hundreds of millions.

The Infrastructure team provides the backbone that powers the company’s growth, ensuring the scalability, efficiency, and reliability of our platform. We design and operate our infrastructure on GCP. Working hand in hand with developers, we enable teams to ship fast and efficiently while maintaining a strong focus on costs and performance. Our mission is to create a developer-friendly, cost-effective, and highly automated infrastructure that supports innovation at scale.

Role

  • Define and drive SRE practices across the organization, including SLIs, SLOs, error budgets, incident management, postmortem processes, and long-term reliability improvements across the platform

  • Design, implement, and optimize infrastructure for availability, scalability, reliability, and cost efficiency

  • Own and evolve our observability stack, improving monitoring, alerting, logging, and distributed tracing

  • Drive automation of infrastructure and operational workflows (e.g., Terraform, Terragrunt, Kubernetes)

  • Lead FinOps initiatives, developing tools and insights to optimize cloud costs

  • Partner closely with development squads to improve service reliability, performance, and operational excellence

  • Influence architectural decisions and establish best practices for building resilient distributed systems

  • Mentor and support Infrastructure engineers, helping raise the bar on reliability, operational excellence, and technical execution

  • Analyze performance bottlenecks and work on solutions such as scaling strategies, service optimizations, and system debugging

Profile

  • Strong knowledge of Kubernetes

  • Experience with high-traffic distributed systems architectures and related tooling (service discovery, config/secret management, etc.)

  • Strong knowledge of one Cloud provider (AWS or GCP preferred)

  • Proven experience defining and operating SRE practices (SLOs, incident management, observability, reliability engineering)

  • Strong operational mindset with experience managing production incidents and driving reliability improvements

  • Leadership and mentoring experience, with the ability to influence technical decisions across teams

  • Ownership-driven: if something isn’t working, you don’t wait for instructions; you improve it

  • Pragmatic and impact-oriented: you balance reliability, delivery speed, and business priorities

  • Performance- and cost-conscious: you make decisions that align with both technical excellence and financial sustainability

Our Stack

  • Orchestrator: Kubernetes

  • CI/CD: Argo CD, GitHub Actions

  • Cloud provider: GCP

  • Monitoring: Datadog

  • Infra as code: Terraform / Terragrunt

  • Languages: Go / Node.js

  • Datastores: Spanner / PostgreSQL / Redis

Benefits

  • Competitive salary based on experience

  • Swile lunch vouchers

  • Gymlib (100% covered by Voodoo)

  • Premium healthcare coverage with SideCare, 100% covered for you and your family

  • Wellness activities in our Paris office
