Please mention DailyRemote when applying
Agile Lab is a company founded in 2014 with the mission to create value for its customers in data-intensive environments through customisable solutions that establish performance-driven processes, sustainable architectures and automated platforms based on data governance best practices.
Having delivered over 100 successful Elite Data Engineering initiatives, we have used this experience to create Witboost: a modular, technology-agnostic platform that enables modern organisations to discover, value and produce their data in both traditional environments and fully compliant Data Mesh architectures.
With a highly skilled team of over 260 data engineers based in Europe, Agile Lab helps organisations with their data-driven transformation.
Take a look at our handbook to discover our core values and processes.
We are looking for a Site Reliability Engineer II (SRE II) to join our growing team. You will play a key role in maintaining the reliability, observability, and operational efficiency of enterprise-level distributed systems.
In this role, youβll coordinate a small technical team (3β4 people) in managing microservices in complex production environments. You will be involved in monitoring, incident management, release coordination, and performance tuning, with a strong focus on OpenShift platforms.
Youβll also work closely with multiple cross-functional teams to ensure high availability and performance of our cloud-native services.
This role includes on-call availability.
40K-50K
Ensure high reliability of microservices running in OpenShift environments
Lead and coordinate a technical team of 3β4 engineers for operational excellence
Manage incident resolution and ticketing workflows via ServiceNow
Collaborate with development teams to drive performance optimization and tuning
Design, configure and maintain monitoring dashboards (Grafana, Prometheus, etc.)
Coordinate with Service Control Room to maintain effective alerting and response
Oversee release processes of new features, hotfixes, and updates in production
Proven experience in Application Maintenance Services (AMS): minimum 2 years
In-depth knowledge of OpenShift and microservices in cloud-native environments
Ability to technically and operationally lead a team of 3β4 people
Experience in release management, monitoring, and incident resolution
Excellent communication and cross-functional coordination skills
Strong initiative, operational autonomy, and results-oriented mindset
Fluency in Italian
Monitoring & Observability: Grafana, Prometheus,
Cloud/DevOps: OpenShift, GitLab, Jenkins
Ticketing & ITSM: ServiceNow
Degree in Computer Engineering, Computer Science, or a related field
Data & Messaging: Kafka, MongoDB, Ignite
Monitoring & Observability: Kibana, Jaeger, Datadog, OpenTelemetry
ππ» We offer:
Full Remote or hybrid working in our offices: Milan, Turin, Padua, Bologna, Catania and Rende;
Real work life balance;
Training monthly budget (time and money);
Support of a buddy in the first week of work;
Benefits and corporate welfare programs: company prizes and welcome pack with all the equipment you need to work;
Agile Nomads Experience: opportunity to work for 2 weeks abroad;
Referral bonus, if you bring people as talented as you;
The opportunity to attend one conference per year;
A company rated 4.8 out of 5 for employee satisfaction on Glassdoor and certified as a Great Place to Work
Inclusive environment where you can be who you really are;
Stimulating environment oriented to growth, both professional and personal.
π How we work:
We don't like hierarchies: we work as a team;
We don't like bureaucracies, we prefer sense of responsibility;
We like data, certainly, so anything that is measurable;
We want to make a positive change in our industry;
Empathy, humility, collaboration, and willingness to challenge ourselves are the basis of our work.
Please note:
Only candidates based in European time zones (CEST or similar) will be considered for this position;
Stop the endless job search. Our AI finds and applies to the best jobs for you.
Discover remote opportunities in Support
Answer easy questions
200,000+ jobs across 15+ categories
Get your best job matches
Only hand-screened, legit jobs
Find a remote job faster
No ads, scams, or junk
“ I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!