The DevOps Automation Engineer Intermediate is responsible for supporting cloud and operational engineering functions by performing automation, system management, and incident resolution activities in production environments. The role helps ensure system reliability, scalability, and operational efficiency through effective execution of automation tasks, monitoring, troubleshooting, and collaboration with cross-functional teams.
This position works with development, operations, and network teams to support day-to-day operations, enhance CI/CD processes, and contribute to continuous improvement initiatives. The role plays a key part in improving system observability, reducing alert noise, and maintaining stable and high-performing systems.
|
SPECIFIC DUTIES AND RESPONSIBILITIES
|
- Manage and automate cloud infrastructure to ensure scalability, reliability, and performance
- Collaborate with ITSM platforms to improve incident management and integrate monitoring and alerting workflows
- Maintain and improve CI/CD pipelines to ensure efficient and reliable software delivery
- Automate system administration and operational tasks to improve efficiency and reduce manual effort
- Monitor systems and applications to proactively identify issues and reduce noise alerts
- Troubleshoot and resolve production issues across cloud, application, and network layers to ensure system stability
- Support Agile teams by delivering operational enablers and improving deployment and release processes
- Implement and enhance monitoring, logging, and observability solutions to improve system visibility
- Contribute to continuous improvement initiatives to enhance automation, operational efficiency, and system reliability
Core Competencies (Must-have Competencies)
- (AIOps and Intelligent Operations - Ability to apply monitoring and automation practices to improve system reliability and reduce alert noise
- ITSM Platform Utilization - Ability to use ServiceNow or similar platforms to support incident, change, and problem management processes
- Cloud and DevOps Engineering - Ability to manage cloud infrastructure and implement CI/CD workflows to support scalable and automated delivery
- System Administration and Automation - Ability to administer Linux systems and automate tasks using scripting to improve performance and reliability
- Networking Fundamentals - Ability to troubleshoot network and connectivity issues to support stable system operations
Complementary Competencies (Good-to-have Competencies)
- Cross-functional Collaboration - Ability to work effectively with development, operations, and network teams to achieve shared goals
- Problem-solving and Critical Thinking - Ability to analyze complex production issues and implement effective solutions
- Communication Skills - Ability to clearly communicate technical concepts and coordinate across distributed teams
- Ownership and Accountability - Ability to take responsibility for production systems and drive resolution of issues
- Agile and DevOps Mindset - Ability to operate effectively in Agile environments and support continuous delivery practices
- Automation Mindset - Ability to proactively identify and implement automation opportunities to improve operational efficiency
Educational Qualification/s
- Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent practical experience
Professional Qualification/s
- 3+ years of experience in IT operations, including experience in SRE or similar roles
- Experience with cloud platforms such as AWS, Azure, or Google Cloud
- Proficiency in infrastructure-as-code tools such as Terraform and containerization technologies such as Docker and Kubernetes
- Experience with CI/CD tools such as Jenkins or GitLab CI
- Strong Linux system administration experience with scripting skills in Bash or Python
- Familiarity with AIOps concepts such as anomaly detection, predictive alerting, and automated remediation
- Experience with ServiceNow or similar ITSM platforms
- Strong troubleshooting, analytical, and problem-solving skills in production environments
- Experience working with distributed or offshore teams and collaborating across time zones
Work Conditions
- Work Schedule: 9-hour shift (8 hours work, 1 hour break)
- Initial Schedule: Mid Shift (4:00PM to 1:00 PM during onboarding)
- Regular Schedule: Day Shift (8:00 AM to 5:00PM) to provide extended operational coverage outside Canadian work hours
- Overtime/On call: May be required based on operational needs