Senior Site Reliability Engineer - Remote EST

 Posted 2 hours ago
     
 $190K - $235K per year
  
5-10 years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

Design and operate production-grade Kubernetes infrastructure on AWS while developing AI agents for incident and root cause analysis. You will also build GitOps-based CI/CD pipelines and maintain observability using Datadog.
Join us as a Senior SRE where you’ll bridge the gap between cutting-edge AI innovation and rock-solid production stability. Working independently from the East Coast, you will collaborate with our global DevOps teams to automate 70% of your workload while owning the reliability of our AWS/Kubernetes environment. This is a role for a production-hardened engineer who wants a strong voice in technology decisions and the opportunity to build the future of AI-driven operations.

This is a fully remote role, however, you must be physically located in EST and be willing and able to work EST hours Monday-Friday and participate in on-call rotations. 

Base salary for this role ranges from $190,000 - $235,000 per year. 

  •  5+ years of experience as a SRE or DevOps Engineer (this is a hard requirement).

  • Deep Production Expertise: You must have extensive experience managing live, high-traffic SaaS environments; developer-only backgrounds without ops experience will not be a fit.

  • Cloud & Orchestration: Proven mastery of Kubernetes and AWS in production settings.

  • Coding/Scripting: Advanced proficiency in Python (preferred) or Go for automation; we need more than just Bash skills.

  • AI Knowledge: A strong understanding of or direct experience with AI/LLM technologies.

  • Observability: Hands-on experience with Datadog for monitoring and incident response.

  • Autonomy: Ability to work independently without direct daily oversight, managing production incidents and on-call responsibilities.

  • Time Zone: Located in the East Coast time zone to provide coverage overlap with our global teams.

  • Design, build, and operate production-grade Kubernetes infrastructure on AWS

  • Developing Ai Agents to handle incidents and root cause analisys 

  • Build and maintain GitOps-based CI/CD pipelines using GitHub Actions and ArgoCD

  • Develop internal DevOps tooling and developer self-service platforms

  • Own monitoring, observability, and operational excellence using Datadog

  • Collaborate with engineering teams to improve delivery speed and reliability 

HiBob is a village filled with amazing people and we’re especially proud of that. It’s a place where Bobbers can be themselves. We’re about fun, dreams, hopes and ambition, just as much as we are about precision, growth, and top performance. Becoming a Bobber means you’ll receive competitive compensation, benefits, and pre-IPO equity alongside all of this:

  • Stock options at a high-growth unicorn startup

  • 100% subsidized medical, dental, and vision coverage for employees

  • 401(k) with a 3% company match starting from Day 1

  • Hybrid working model for bobbers in the NY metro area

  • Work from home allowance to get your home office set up!

  • Temporary remote work-from-anywhere in the world for up to 2 months after 6 months of employment

  • Annual Headspace subscription and wellness benefits

  • Two social impact days per year for volunteering

  • Bob balance days - 4 additional days within a calendar year - Enjoy a company-wide long weekend at the beginning of each quarter

  • Employee referral program - $2,500 bonus for each successful referral with an additional ambassador bonus

  • Fun and frequent social events (in-person and virtual)

  • We love birthdays - take the day off and receive a special gift 

  • Dog-friendly office


If this sounds like something you’ve been looking for, we’d love to have you. Come on, join our village!


Location Eligibility: While this is a remote position, HiBob is currently authorized to hire in the following states: CA. CO, CT, DC, FL, GA, IL, IN, KS, MA, MD, MN, NC, NH, NJ, NV, NY, OH, OK, OR, PA, RI, SC, TN, TX, UT, VA, WA.

Similar Jobs

See all Remote Software Development jobs →

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Site Reliability Engineer

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified