Site Reliability Engineer | Dayshift | Remote

2-5 years experience

ZigZag is looking for a Site Reliability Engineer to join our team!

As a Site Reliability Engineer, you’ll design, build, and maintain the infrastructure and automation that power our platform. Working closely with software engineering teams and SRE peers, you’ll embed reliability, performance, and compliance into the development lifecycle. Your focus will be on scalability, resilience, security, and operational efficiency across all environments.

Key Responsibilities

Infrastructure and Platform Engineering

  • Design, build, and maintain scalable and reliable infrastructure and platform services.

  • Develop and maintain infrastructure-as-code (e.g., CloudFormation, Terraform).

  • Develop custom automation workflows and internal tools to support infrastructure provisioning, monitoring, and incident response (e.g., Python with libraries such as boto3 for AWS automation).

  • Liaise with vendors to assess and implement third-party solutions.

  • Maintain well-documented system configurations to support maintainability and compliance.

Reliability and Operations

  • Monitor system performance, availability, and capacity using observability tools (e.g., SumoLogic, AWS CloudWatch).

  • Create and maintain dashboards and monitoring solutions that offer deep insight into platform health and support rapid incident diagnosis.

  • Automate operational processes (e.g., deployments, failovers, scaling) to reduce toil and enhance system resilience.

  • Participate in incident response activities, including postmortems and root cause analysis, to drive continual improvement.

  • Continuously evolve and maintain SLOs and SLIs, ensuring a balance between development velocity and system reliability.

  • Work as part of a highly engaged team of SREs to ensure the stability, performance, cost-effectiveness, and observability of all environments.

Build, Deploy, and Development Enablement

  • Design and implement robust CI/CD pipelines and zero-downtime deployment strategies.

  • Build efficient and reliable build systems to empower development teams with self-service deployment capabilities.

  • Collaborate with engineering teams to embed reliability, scalability, performance, and security best practices into the SDLC.

Security and Compliance

  • Maintain and monitor vulnerability scanning systems (e.g., Tenable Nessus, Lacework, Snyk), and work closely with Software Engineering teams to ensure the platform remains secure and up to date.

  • Perform recurring security tasks such as reporting, maintaining security registers, and ensuring compliance with internal standards.

  • Support the organisation in maintaining PCI-DSS certification by ensuring infrastructure is securely configured and well-documented.

Skills & Experience

Essential

  • 2+ years of experience in an SRE role or similar (e.g., DevOps Engineer)

  • Experience managing an AWS environment and working in a SaaS business.

  • Strong knowledge of, and experience with, infrastructure-as-code

  • Experience with building and supporting robust CI/CD pipelines

  • Strong problem-solving and analytical skills

  • Excellent communication and collaboration skills.

  • Ability to work in a fast-paced, agile environment

Desirable

  • Experience with Buildkite

  • Experience with distributed systems and microservice architecture

  • Exposure to compliance frameworks (PCI-DSS, ISO 27001).

ZigZag is committed to building a diverse, inclusive, and equitable workplace. We believe that talent knows no borders, and we welcome individuals from all backgrounds to help us shape the future of work. Guided by transparency and agility, we foster an environment where everyone is valued and empowered to thrive.

By submitting this application, you acknowledge that you have read and agree with the company’s Privacy Policy.
