DevOps Lead Engineer

5-10 years experience

AI Summary

Own and manage the cloud infrastructure, security, and CI/CD pipelines for AI-driven projects using AWS and Terraform. Lead observability strategies and incident management to ensure resilient, scalable, and secure system performance.

About us:

Spectrum.Life is a whole-of-health digital partner that guides organisations and their people to thrive, delivering clinically backed digital health, mental health and wellbeing solutions.  
 
Our HealthTech delivers digital transformation for insurers, educators and employers through co-creation or seamlessly integrated out-of-the-box solutions that decrease digital fragmentation and engage, empower, and transform their people’s lives.
 
Established in 2018 by Stuart McGoldrick and Stephen Costello, Spectrum.Life provides services internationally to over 7.2m insurance members, 3,000 corporate clients, 60 universities and 650,000 university students. Spectrum.Life currently employs over 450 people.
 
Our vision is to change and save as many lives as possible.

Role Brief:

We are seeking an experienced and forward-thinking DevOps Lead to own the infrastructure, security, and continuous delivery pipelines that form the backbone of our AI-driven projects. This is a critical, hands-on role where you will be responsible for building and maintaining a resilient, secure, and highly automated environment across our cloud platforms. 

 

You will be the designated expert for all things related to infrastructure and DevOps. Your deep understanding of cloud architecture, security principles, and CI/CD will ensure our engineering teams can build and release software quickly, safely, and efficiently. If you are passionate about automation and leveraging AI to create intelligent, self-healing systems, we want to hear from you.

Responsibilities:

Cloud Infrastructure & Automation 
  • Architect, build, and manage scalable and secure infrastructure on AWS using Infrastructure-as-Code principles, primarily with Terraform.
  • Develop and maintain bespoke automation scripts to accelerate project setup, on-demand environment creation, and other operational tasks.
  • Champion and implement solutions like LocalStack to streamline local development and testing workflows for engineers.
  • Provide expert guidance on systems architecture, ensuring our infrastructure is designed for performance, scalability, and resilience.
  • Collaborate with engineering teams to manage and automate the infrastructure for our services, including APIs and databases, ensuring their performance and reliability.

CI/CD & Release Management
  • Develop and improve CI/CD pipelines using GitHub Actions, from code commit to production deployment.
  • Integrate and manage automated testing, dependency updates, and security scans within the pipelines to ensure code quality and security.
  • Empower engineers with the tools and automation needed to reduce friction, manage technical debt, and focus on building great products.
  • Define and continuously improve our release processes, ensuring smooth and predictable deployments.
 
Security & Compliance 
  • Act as the subject matter expert for security, compliance, and data flows within our cloud infrastructure.
  • Implement and manage security best practices and automated tooling (SAST/DAST, dependency scanning) to protect our applications and data.
  • Oversee the security and compliance of AI-related data flows, ensuring that any data sent to third-party services is minimized, anonymized, and explicitly not used for external training purposes.
  • Ensure all infrastructure and processes adhere to legal and regulatory requirements, maintaining customer trust and data privacy.

Observability & Incident Management
  • Implement and manage a robust observability strategy using tools like Sentry, Datadog, and native cloud services.
  • Configure critical alerting and monitoring (e.g., AWS CloudWatch Alarms) and integrate them with notification services to ensure rapid response.
  • Lead the incident management process for infrastructure-related issues, with a focus on root cause analysis and proactive prevention to minimize hotfixes.
  • Champion the use of AI in operations, exploring and implementing tools for anomaly detection, predictive analysis, and automated remediation.
  • Collaborate with our existing Core infrastructure engineer on business-as-usual projects to ensure strategic alignment across the company, while maintaining a primary focus on the AI project initiatives.
 

Requirements:

  • Proven experience in a DevOps/Infrastructure Engineering role with a focus on automation.
  • Proficiency in managing cloud infrastructure on AWS.
  • Experience supporting infrastructure for ML/AI projects (MLOps) is highly desirable.
  • Deep, hands-on experience with Infrastructure-as-Code using Terraform.
  • Hands-on experience with containerization technologies (Docker, Kubernetes) and networking (VPCs, Load Balancers).
  • Expert-level knowledge of building and managing complex CI/CD pipelines, with a strong preference for GitHub Actions.
  • Strong understanding of system architecture, security best practices, and compliance standards.
  • Comfortable with scripting languages (e.g., Python, Bash) to build automation and tooling.
  • Experience with the operational lifecycle of APIs and databases from an infrastructure perspective.
  • Hands-on experience with modern observability and error tracking tools such as Sentry, Datadog, Prometheus, or Grafana.
  • You have a deep technical curiosity and a passion for automation.
  • You act as a force multiplier, empowering the engineering team with the tools and processes they need to succeed.
  • You take complete ownership of your domain and are a reliable partner to the engineering teams you support.
  • You are a strategic thinker who can balance speed and safety, enabling developers to move fast without compromising on security or stability.
  • You are a strong communicator who can explain complex technical concepts to a variety of audiences.
  • You are proactive in identifying potential issues, reducing technical debt, and improving the overall development lifecycle.

What are the benefits of working at Spectrum.Life?

  • Full-time permanent contract
  • Work from home
  • Competitive salary (dependent on experience) + employee benefits
  • Continuous professional development and training opportunities
  • 25 days of annual leave
  • 24/7 EAP and a wide range of health and wellbeing supports
  • Extensive list of employee perks and benefits: https://app.box.com/s/6wwkvowbev6cn7tlvq9yz32amnpmnvcl
