Staff Platform Engineer

 Posted 7 hours ago
     
10+ years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

Own and evolve the AWS infrastructure, networking, and data services to support the Manifest product. Design platform guardrails and improve CI/CD workflows to enhance developer experience and production reliability.

Staff Platform Engineer

About the Role


We’re hiring a Staff DevOps Engineer to join Manifest, a new product being built in a high-autonomy, fast-moving environment.

This is a hands-on, staff-level role for someone who can own critical infrastructure, improve the developer experience, and partner closely with product engineers, DevOps leadership, and technical leads. We’re looking for someone who can operate production systems, but also design the guardrails, patterns, and platform capabilities that allow the team to move faster and more safely over time.

This role is a strong fit for someone who enjoys working close to the product team, understands the realities of building in a startup-like environment, and can bring structure, reliability, and technical depth to a fast-moving team.

What You’ll Do

  • Work on a team with two other platform engineers.
  • Own and evolve the infrastructure that supports Manifest, including AWS environments, networking, compute, data services, observability, CI/CD, and operational tooling.
  • Work with Pulumi and TypeScript to define, maintain, and improve infrastructure as code across the platform.
  • Support and improve our containerized application platform, including deployment pipelines, rollback mechanisms, and runtime configuration.
  • Help operate and harden our data infrastructure, including connection pooling, backups, disaster recovery, replication, and safe schema-change practices.
  • Partner with engineers to improve the reliability and safety of releases, including database migrations, deployment workflows, environment management, and production readiness checks.
  • Improve CI/CD workflows so that builds, tests, infrastructure changes, and deployments are fast, reliable, and easy for engineers to understand.
  • Lead observability and incident readiness work, including alerting, dashboards, SLOs, runbooks, incident response practices, and post-incident follow-up.
  • Help ensure the platform is secure, cost-conscious, and maintainable as the product scales.
  • Mentor engineers on infrastructure, operations, reliability, and production ownership.

What We’re Looking For

We’re looking for someone who has operated meaningful production systems and can bring staff-level judgment to infrastructure, reliability, and developer experience.

Strong candidates will have:

  • Deep production experience with AWS, especially services such as ECS/Fargate, RDS/Aurora PostgreSQL, VPC networking, load balancing, IAM, KMS, Secrets Manager, CloudFront, WAF, and related managed services.
  • Experience designing and operating systems that serve a global user base, seamless multi-region availability, and disaster recovery procedures.
  • Treats reliability, scalability, performance, and observability as a first-class design constraint, building these into designs from the start rather than bolting them on later.
  • Strong infrastructure-as-code experience. Pulumi with TypeScript is ideal, but deep experience with Terraform or another mature IaC approach is also valuable.
  • Strong operational knowledge of PostgreSQL, including performance investigation, connection pooling, backups, replication, locking, migrations, and safe schema-change patterns.
  • Experience designing and maintaining CI/CD systems, ideally with GitHub Actions, OIDC-based cloud authentication, container builds, environment promotion, required checks, and deployment gates.
  • Experience supporting containerized production workloads and improving deployment safety, rollback strategies, and runtime reliability.
  • Strong observability and incident response experience, including metrics, logs, traces, alerting, dashboards, runbooks, and post-incident learning.
  • The ability to work effectively in ambiguity, make pragmatic tradeoffs, and communicate clearly with both infrastructure specialists and product engineers.
  • A track record of raising the engineering bar through reusable patterns, documentation, automation, mentoring, and thoughtful technical leadership.


Our Environment

Manifest operates with a lean process and a high degree of ownership. Engineers are expected to work effectively in ambiguity, clarify requirements, collaborate directly across functions, and ship pragmatic, high-quality solutions.

The DevOps function is critical to that operating model. Resilient, well-planned infrastructure is critical, but we also do not want speed to come at the expense of reliability, security, or maintainability. This role exists to help Manifest find that balance as the product moves toward launch and scale.

You’ll work closely with product engineers, technical leads, DevOps leadership, and other stakeholders to ensure the platform is ready for real customers, real traffic, and real operational demands.

 

Why Join

 

This is an opportunity to help shape the foundation for a new product at an important stage.

You’ll be joining early enough to have real influence over how Manifest operates, deploys, scales, and responds to incidents. You’ll work on meaningful infrastructure problems, partner with a highly autonomous engineering team, and help define the standards that will carry the product into production and beyond.

If you’re excited by the combination of hands-on infrastructure work, production reliability, developer experience, and staff-level technical leadership, we’d love to talk.


 

Similar Jobs

See all Remote Software Development jobs →

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Platform Engineer

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified