Senior Site Reliability Engineer

 Posted an hour ago
  
 Poland
  
 26 000 - 34 000 PLN gross per month
  
5-10 years of experience

AI Summary

Own and evolve production infrastructure, leading the migration from Docker Swarm to Kubernetes. Maintain high availability across hundreds of servers and drive observability and IaC practices.

We’re a team of 500+ professionals who develop cutting-edge proxy and web data scraping solutions for thousands of the world’s best-known businesses, including Fortune 500 companies.


What’s in store for you:

You’ll solve complex challenges and maintain our own infrastructure. Here is its scale and maturity in numbers:


- 6PB+ Ceph storage

- 60PB+ monthly data traffic through our systems

- 300k+ service requests/sec processed

- 500k+ Kafka messages/sec streamed



In this role, you’ll:
  • Own and evolve Webshare's production infrastructure - lead the migration from Docker Swarm to Kubernetes (or hybrid K8s + Ansible).
  • Maintain high availability across hundreds of servers and ~50 services.
  • Drive observability in cooperation with the development team.
  • Establish and enforce IaC practices, CI/CD pipeline reliability, and change management processes.
  • Participate in the on-call rotation alongside backend developers.
  • Respond to and lead incident resolution, run post-mortems and drive systematic remediation.
  • Contribute platform tooling that improves developer experience and reduces infrastructure toil.
  • Keep backend engineers informed and capable — no silos, shared infrastructure ownership.


Your skills & experience:
 
  • Have built and operated highly available infrastructure at comparable scale — hundreds of servers, dozens of services, real production load.
  • Hands-on K8s in self-hosted / bare-metal environments.
  • Confident with Infrastructure as Code.
  • Have owned CI/CD pipelines end-to-end (GitLab CI or equivalent).
  • Have been on call in a production environment.
  • Proactive: surfaces problems before being asked and keeps the team informed without prompting.
  • Scripting and development skills.


Nice to have:
  • Led at least one major infrastructure migration - planned, executed, and stabilised it.
  • Python and/or Go familiarity - backend is Python, edge services are Go.
  • Exposure to proxy or other networking-heavy infrastructure.
  • Previous experience in a small team where developers shared infrastructure responsibility.
  • Familiarity with edge clusters or split compute/edge architectures.


Salary & Benefits:
  • Gross salary: 26 000 - 34 000 PLN/month. We are open to discussing a different salary based on your skills and experience.
  • Growth & Learning: 40+ internal learning options, external conferences, mentorship, and year-round knowledge-sharing.
  • Health & Well-being: Private health insurance, gym allowance and a wellness app.
  • Celebration & Community: Team events, an overseas workation and plenty of ways to mark milestones together.

