SRE/DevOps

5-10 years experience

AI Summary

Own the health, performance, and delivery of client infrastructure by automating manual work and optimizing CI/CD pipelines. Define and track reliability metrics like SLIs and SLOs while utilizing AI/AIOps for intelligent monitoring and incident response.

Mobile Wave Solutions is a professional services company specializing in software development as a service. We are committed to delivering scalable, high-quality software solutions that meet our clients' evolving needs. With a growing team of over 120 engineers and a mission to empower businesses globally, we provide expert teams to deliver robust solutions and drive innovation.

Role Overview

We're looking for a Site Reliability Engineer to own the health, performance, and delivery of the infrastructure of one of our clients.

You'll sit at the intersection of engineering, operations, and quality: measuring how their systems actually behave, automating away manual work, and making their path from commit to production fast and dependable.

This is a hands-on role for a seasoned engineer who treats operations as an engineering problem and sets the bar for others. You'll define what "reliable" means for the systems in concrete, measurable terms, establish the practices the team operates by, and use modern tooling, including AI/AIOps, to monitor, alert, and respond intelligently rather than reactively.

Key Responsibilities

  • Own the infrastructure. Build, maintain, and scale the systems our product runs on, with reliability and cost-efficiency as first-class concerns

  • Measure the non-functional. Define and track SLIs, SLOs, and error budgets for availability, latency, throughput, and scalability. Make system behavior visible and quantifiable to the whole team

  • Automate relentlessly. Identify and eliminate toil. Replace manual operational work with code, infrastructure-as-code, and self-healing systems

  • Build a seamless delivery process. Design, maintain, and improve CI/CD pipelines so engineers can ship safely and frequently with fast feedback

  • Collaborate across functions. Partner closely with software engineers and QA to embed reliability and quality early - through testing strategy, deployment practices, and shared ownership of production

  • Apply AI to operations. Use AI/AIOps to automate remediation, surface anomalies, reduce alert noise, and improve the signal quality of monitoring, reporting, and on-call

  • Lead incident response. Drive blameless postmortems and turn incidents into systemic improvements

  • Set the standard. Define reliability and operational best practices, and mentor engineers across the team to raise the bar

Qualifications

  • 5+ years of experience in an SRE, DevOps, or Platform Engineering role, with a track record of owning systems end to end

  • Strong grasp of reliability engineering fundamentals: SLIs/SLOs, error budgets, and reducing toil

  • Hands-on experience designing, modernising, and operating CI/CD pipelines

  • Solid infrastructure-as-code skills (Terraform / Pulumi / CloudFormation)

  • Experience with cloud platforms (GCP / AWS / Azure) and container orchestration (Kubernetes / Docker)

  • Proficiency in at least one programming/scripting language for automation (Python / Go / Bash)

  • Strong observability experience: metrics, logging, tracing, and alerting (New Relic / Prometheus / Grafana / Datadog / OpenTelemetry)

  • Applied experience using AI/AIOps to automate, measure, report, and alert - anomaly detection, intelligent alerting, noise reduction, or automated remediation

  • A collaborative mindset and comfort working alongside engineers and QA toward shared reliability goals

  • Demonstrated technical leadership: mentoring engineers, driving cross-team initiatives, and influencing engineering practices

You would impress us if you have

  • Experience introducing AIOps tooling into an existing observability stack

  • Background in performance and load testing

  • Familiarity with security and compliance practices (SOC 2 / ISO 27001 / GDPR)

Our Benefits

  • Remote Office – Option to work remotely or hybrid

  • Parking Space – Free parking available

  • Fun Office Space – Game zone and relaxation area

  • Health Insurance – Private health insurance, including dental care

  • Holidays – 5 extra days after your 1st and 5th year with us

  • Personal Development – Company-sponsored training and development

  • Employee Referral Programme – Competitive bonus for successful referrals

  • Social Events – Celebrating success together

  • Family Insurance – Add insurance coverage for a family member

  • Multisport Card – Fully covered sports pass
