Infrastructure & Cloud Operations Engineer (Remote)

 Posted 16 hours ago
     
⭐ 10+ years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

The engineer is responsible for administering and modernizing enterprise cloud and hybrid infrastructure, with a heavy focus on Splunk observability and logging platforms. They will ensure system reliability and security through automation, monitoring, and the migration of legacy services to AWS.

GovCIO is currently hiring for a Senior Cloud Infrastructure Engineer to support and maintain enterprise cloud and hybrid infrastructure environments supporting critical federal operations. This role is responsible for administration, maintenance, monitoring, automation, troubleshooting, and modernization of enterprise infrastructure platforms spanning AWS cloud services, Linux systems, observability platforms, and enterprise logging solutions. The position supports operational continuity, system reliability, security monitoring, and infrastructure transformation initiatives in a shared-services team environment. This position will be fully remote within the United States.


Responsibilities

The Infrastructure & Cloud Operations Engineer is responsible for supporting and administering enterprise observability, logging, monitoring, and security analytics platforms, with a primary focus on Splunk and related technologies. This role supports the operation, maintenance, and modernization of enterprise cloud and hybrid infrastructure environments, including AWS, Linux systems, automation platforms, and data ingestion services. Working within a shared-services team, the engineer collaborates across multiple technical disciplines to ensure the reliability, performance, security, and availability of mission-critical systems while supporting operational initiatives, platform enhancements, and cloud transformation efforts.

 

  • Support and maintain enterprise cloud infrastructure environments, primarily within AWS.
  • Provide operational support for hybrid infrastructure spanning cloud-hosted and on-premises enterprise systems.
  • Administer, maintain, and troubleshoot enterprise observability, logging, and monitoring platforms, including Splunk Enterprise, Splunk Enterprise Security (ES), Splunk IT Service Intelligence (ITSI), and successor technologies.
  • Manage log ingestion, forwarding, indexing, retention, and troubleshooting across distributed systems and enterprise environments.
  • Support installation, configuration, and maintenance of Splunk Universal Forwarders and related data collection components.
  • Support enterprise security monitoring, analytics, alerting, and operational visibility capabilities through Splunk and related observability platforms.
  • Support evaluation, migration, and modernization efforts involving enterprise logging and observability platforms, including potential transitions to Elastic or similar technologies.
  • Perform Linux/Unix systems administration, including server provisioning, patching, upgrades, maintenance, and operational support.
  • Develop, maintain, and execute infrastructure automation and configuration management processes using Ansible and related automation tools.
  • Support enterprise data ingestion workflows, platform integrations, certificate management processes, and operational data pipelines.
  • Troubleshoot infrastructure, network, platform, and application performance issues across multiple environments.
  • Support cloud-hosted applications and enterprise infrastructure services to ensure reliability, availability, and operational continuity.
  • Administer and support monitoring, alerting, analytics, and security visibility capabilities across enterprise platforms.
  • Participate in cloud transformation and modernization initiatives, including migration of services from legacy on-premises environments to cloud-based architectures.
  • Support decommissioning of legacy systems and transition of workloads to modernized infrastructure platforms.
  • Develop and maintain operational documentation, standard operating procedures, implementation plans, and technical runbooks.
  • Collaborate with engineers, administrators, and stakeholders in a shared-services operating model where work assignments are distributed based on operational priorities and Jira-managed tasking.
  • Participate in rotational on-call support for production systems and incident response activities.
  • Ensure system reliability, performance, scalability, security, and operational continuity across supported environments.

Qualifications

Required Skills and Experience:

  • Bachelor's with 12+ years (or commensurate experience)
  • Experience supporting enterprise Splunk environments, including administration, troubleshooting, data ingestion, monitoring, and operational support.
  • Experience supporting enterprise observability, logging, monitoring, or SIEM platforms.
  • Experience supporting enterprise cloud environments, preferably AWS.
  • Experience administering Linux/Unix operating systems in enterprise environments.
  • Experience with infrastructure automation and configuration management tools such as Ansible.
  • Experience supporting data ingestion, log forwarding, indexing, and operational monitoring processes

Clearance Required: Ability to obtain and maintain a Suitability/Public Trust clearance.


Preferred Skills and Experience:

  • Experience supporting customers at the Department of Veterans Affairs
  • AWS certifications such as Solutions Architect, SysOps Administrator, or Cloud Practitioner.
  • Experience with Splunk Enterprise Security (ES), Splunk IT Service Intelligence (ITSI), or other advanced SIEM platforms.
  • Experience with Elastic Stack, OpenSearch, Dynatrace, or cloud-native observability platforms.
  • Experience supporting enterprise security operations, analytics, and monitoring functions.

Posted Salary Range

USD $125,000.00 - USD $130,000.00 /Yr.

Similar Jobs

See all Remote Software Development jobs β†’

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Cloud Operations Engineer

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified