Senior DevOps Engineer

 Posted a month ago
     
5-10 years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

Ensure the reliability and security of Hedera consensus node environments through automation and infrastructure-as-code. Reduce operational toil and improve release safety across globally distributed networks.

About Hashgraph:

Hashgraph is a fast-growing software company committed to supporting, developing and servicing Hedera, an open source, proof-of-stake platform. Hedera is EVM-compatible and has been specifically built to meet the needs of enterprise and web3 applications, which require speed, security, stability and sustainability. Hedera’s public network is governed by industry-leading organizations, spanning 11 sectors and 14 regions who oversee the development and direction of the decentralized platform.

The role:

We are hiring a Senior DevOps Engineer (Node Operations) to ensure the reliability, security, and operational excellence of Hedera consensus node environments. This role exists to reduce operational toil, strengthen infrastructure automation, and improve release and preproduction readiness across a globally distributed network. Without this role, we risk increased availability incidents, slower recovery times, and delays in delivering against the product roadmap.

The impact you'll have:

In this role, you will:

  • Operate and improve Hedera consensus node environments across testnet, previewnet, and preproduction
  • Design and implement automation-first workflows for release and preproduction environments
  • Build and maintain Infrastructure-as-Code (Terraform) on GCP
  • Improve change management, release safety, and operational predictability
  • Participate in on-call rotation, incident response, and RCA, driving corrective actions into automation
  • Partner with internal engineering teams and external stakeholders, including Hedera Governing Council members, to support operational requirements

What success looks like in 6-12 months:

  • Operational toil is significantly reduced through durable automation and standardization
  • Node environments are more reliable, with fewer incidents and faster recovery times
  • Release and preproduction workflows are predictable, repeatable, and automated
  • Infrastructure changes are consistent, testable, and auditable through IaC best practices

What you bring:

Core capabilities:

  • Strong systems reliability mindset with experience in incident response and RCA
  • Proven ability to automate operational workflows and reduce manual toil
  • Clear communicator with the ability to work across engineering, security, and external partners
  • Deep ownership mentality with a bias toward preventative engineering over reactive fixes
  • Strong Linux and networking troubleshooting in production environments

Functional expertise:

  • Infrastructure-as-Code with Terraform (module design, state management)
  • Configuration management with Ansible
  • CI/CD automation (Jenkins or equivalent pipeline tooling)
  • Experience operating distributed systems or production infrastructure at scale
  • Familiarity with Kubernetes fundamentals

Nice to haves:

  • Observability stacks (e.g., Grafana, Loki, Tempo, Mimir)
  • Programming/scripting (Go, Python, Bash)
  • GitHub/GitHub Actions experience

Similar Jobs

See all Remote Software Development jobs →

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in DevOps Engineer

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified