Senior Site Reliability Engineer

 Published 9 days ago
    
 Portugal
Apply Now Please mention DailyRemote when applying

Disclaimer: Before you apply, please make sure the job is legit.

Attempting to apply for jobs might take you off this site to a different website not owned by us. Any consequence as a result for attempting to apply for jobs is strictly at your own risk and we assume no liability.

LastPass is looking for a Senior Site Reliability Engineer: 

As a Senior Site Reliability Engineer (SRE) you will be responsible for keeping all user-facing services and other LastPass production systems running smoothly. LastPass is a unique product that requires single-minded focus on building the best practices in security and automation. The team’s experience feeds back into other Engineering groups within the company, as well as to LastPass customers. 

If you are passionate about complex problem solving and motivated by scale, then this is the role for you!

Who will you work with?

You will be part of a dedicated engineering community in a dynamic and fast-paced environment to improve the observability and reliability practices of LastPass. You will also work alongside the Product Teams to help define their SLI/Os and support in instrumenting their services to track and monitor these metrics, share LastPass data (reporting metrics) on those SLOs/SLIs and improve their reliability.   

What are some of the exciting challenges you will be working on?

  • Define and build Service Level Objectives and Indicators for Cloud Platform and assist product teams in building SLOs and SLIs for our product
  • Build proactive monitoring (react on symptoms and not on outages) and observability best practices to run our LastPass product
  • Coding automation with Chef, Ansible, CDK, Terraform, and LastPass CI/CD
  • Define and build Service Level Objectives and Indicators for Cloud Platform and assist product teams in building SLOs and SLIs for our product
  • Document every action so your findings turn into repeatable actions and then into automation
  • Participate in the Incident Response team as an incident commander, in order to mitigate any incident impacting the normal operation of LastPass’ services

What does it take to work at LastPass?

  • Know your way around Linux and Unix Shell
  • Know what the use of configuration management systems like Chef and Ansible are
  • Have strong programming skills in one or more of the following: Python, Typscript, or Ruby and/or Go
  • Have deep technical knowledge of Observability tooling (DataDog, NewRelic, AppDynamics or other) and Reliability concepts
  • Strong technical skillset in AWS
  • Have experience with Nginx, Docker, Kubernetes, CDK, Terraform, or similar technologies

Ace Your Job Interview

Read our advice on how to answer the most common interview questions.