Senior Site Reliability Engineer - Remote

 Posted 3 hours ago
     
⭐ 5-10 years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

Design, develop, and operate application and infrastructure deployment and orchestration for the Akamai Cloud. Focus on enhancing reliability, scalability, and performance through automation and observability solutions.

Do you enjoy collaborating with teams to solve complex challenges?

Do you have a passion for cutting edge technologies?

Join our highly skilled Site Reliability Engineering team!

Our team designs, develops, and manages applications and infrastructure that support Akamai Cloud's products and services. Our SRE teams solve reliability, security, and usability at scale for our global fleet while maintaining Akamai's mission at the forefront of what we do: make life better for billions of people, billions of times a day.

Partner with the best

As a member of our ACDC SRE team, you will design, develop, and operate application and infrastructure deployment, configuration, and change orchestration for the Akamai Cloud.

As a Senior Site Reliability Engineer, you will be:

  • Designing, developing, testing, and operating essential services to enhance the reliability, scalability, and performance of infrastructure systems.
  • Designing and implementing observability solutions, including monitoring, logging, alerting, and telemetry, to identify and address issues before customer impact.
  • Enhancing reliability via automation, minimizing operational toil, and boosting resilience within engineering processes.
  • Developing extensive expertise in ACDC systems and acting as a reliable resource, guiding engineers and sharing effective practices team-wide.
  • Collaborating closely with software engineering, infrastructure, and platform teams to resolve complex production issues, determine root causes, and implement solutions.
  • Participating in an on-call rotation and delivering technical expertise during incidents, ensuring prompt restoration, clear communication, and post-incident enhancements.

Do what you love

To be successful in this role you will:

  • Demonstrate expertise in Ansible through playbook development, role creation, automation workflows, and enterprise-scale configuration management processes.
  • Manage Infrastructure as Code solutions utilizing tools like Terraform, SaltStack, Ansible, Chef, Puppet, or comparable technologies effectively and efficiently.
  • Design, develop, and deploy software and infrastructure at scale within a Linux environment with advanced-level expertise.
  • Demonstrate advanced experience in a site reliability or software engineering role, working with large-scale distributed systems.
  • Have great communication and interpersonal skills
  • Demonstrate accountability for reliability, develop automation and monitoring, and collaborate effectively with an engineering team unfamiliar with SRE practices.

About us

At Akamai, we make life better for billions of people, trillions of times a day.
Whether you're streaming live events, scrolling social media, watching your favorite series, or managing your savings, we're the engine behind the scenes. We provide the world's most distributed platform from Cloud to Edge to help the giants of the digital world work faster and stay more secure, making the internet a better experience for everyone.

Our focus is simple:
Cloud and Edge: Running apps closer to users for instant performance.
Security: Neutralizing threats before they ever reach your data.
Content Delivery: Scaling the world's biggest moments without a glitch.
AI: Enabling our customers to build, secure, and scale AI apps on the world's most distributed cloud platform.

At Akamai, we don't just support the internet; we power and protect it, because behind every great digital experience is a massive hidden challenge. And we're the ones who solve it. When millions of people hit play or pay, Akamai ensures it just works.

Benefits at Akamai: We support your health, well-being, finances, and life beyond work. See our benefits.

FlexBase adapts to your job's needs

Akamai's FlexBase program is yet another way we show our commitment to providing employees with an exceptional workplace experience. It's not about telling employees where to work; it's about supporting employees to do their best work.

We trust our incredible employees to work in ways that suit them best: at home, in an office, or a combination of both.

Connect with us on social and see what life at Akamai is like!

Similar Jobs

See all Remote Software Development jobs β†’

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Site Reliability Engineer

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified