Sr. Production Support Engineer

Posted 2 months ago
     
5-10 years experience
AI Summary

The role involves providing L2/L3 production support for AI-driven applications, data pipelines, and cloud-based infrastructure. Responsibilities include monitoring system performance, performing root cause analysis, and collaborating with engineering teams to ensure system reliability.

This is a remote position.

We are seeking a Senior Production Support Engineer to support and maintain AI-driven applications, data platforms, and client-facing solutions in a production environment. This role is responsible for ensuring system stability, performance, and reliability across AWS, Azure, Tableau, Power BI, and DealCloud CRM integrations.

The ideal candidate brings strong troubleshooting skills, experience with cloud and data ecosystems, and the ability to support complex, integrated systems in a fast-paced, AI-focused environment.

Key Responsibilities

  • Provide L2/L3 production support for applications, data pipelines, and AI-driven solutions
  • Monitor system performance and respond to incidents, alerts, and service disruptions
  • Perform root cause analysis (RCA) and either implement fixes directly or coordinate them with engineering teams
  • Support data pipelines (ETL/ELT) and ensure accuracy of data feeding into reporting tools (Tableau, Power BI)
  • Troubleshoot and resolve issues related to API integrations and microservices
  • Support CRM integrations (DealCloud) and related data workflows
  • Maintain and improve monitoring, logging, and alerting systems
  • Execute runbooks and standard operating procedures (SOPs) for issue resolution
  • Collaborate with development, QA, and data teams to ensure smooth deployment and production readiness
  • Participate in on-call rotations and provide after-hours support as needed
  • Identify opportunities for automation and process improvement within support operations


Requirements

Required Qualifications

  • 5+ years of experience in Production Support, Application Support, or Site Reliability Engineering (SRE)
  • Strong experience supporting systems in AWS and/or Azure environments
  • Experience troubleshooting data pipelines, ETL/ELT processes, and data-related issues
  • Strong SQL skills for data investigation and validation
  • Experience with monitoring and observability tools (e.g., Datadog, Splunk, New Relic, CloudWatch, Azure Monitor)
  • Experience with API troubleshooting and microservices-based architectures
  • Familiarity with incident management and ticketing systems (e.g., ServiceNow, Jira)
  • Basic scripting or programming experience (e.g., Python, Bash, or PowerShell)

Key Traits for Success

  • Strong analytical and troubleshooting mindset
  • Ability to remain calm and effective under pressure
  • Proactive approach to identifying and preventing issues
  • Strong collaboration skills across technical teams
  • Ownership mentality and commitment to system reliability

