Rezdy

Staff Platform Engineer

Posted a month ago

United States

⭐ 10+ years experience

Apply Now

Please mention DailyRemote when applying

AI Summary

Own and evolve the AWS infrastructure, networking, and data services to support the Manifest product. Design platform guardrails and improve CI/CD workflows to enhance developer experience and production reliability.

Staff Platform Engineer

About the Role

We’re hiring a Staff DevOps Engineer to join Manifest, a new product being built in a high-autonomy, fast-moving environment.

This is a hands-on, staff-level role for someone who can own critical infrastructure, improve the developer experience, and partner closely with product engineers, DevOps leadership, and technical leads. We’re looking for someone who can operate production systems, but also design the guardrails, patterns, and platform capabilities that allow the team to move faster and more safely over time.

This role is a strong fit for someone who enjoys working close to the product team, understands the realities of building in a startup-like environment, and can bring structure, reliability, and technical depth to a fast-moving team.

What You’ll Do

Work on a team with two other platform engineers.
Own and evolve the infrastructure that supports Manifest, including AWS environments, networking, compute, data services, observability, CI/CD, and operational tooling.
Work with Pulumi and TypeScript to define, maintain, and improve infrastructure as code across the platform.
Support and improve our containerized application platform, including deployment pipelines, rollback mechanisms, and runtime configuration.
Help operate and harden our data infrastructure, including connection pooling, backups, disaster recovery, replication, and safe schema-change practices.
Partner with engineers to improve the reliability and safety of releases, including database migrations, deployment workflows, environment management, and production readiness checks.
Improve CI/CD workflows so that builds, tests, infrastructure changes, and deployments are fast, reliable, and easy for engineers to understand.
Lead observability and incident readiness work, including alerting, dashboards, SLOs, runbooks, incident response practices, and post-incident follow-up.
Help ensure the platform is secure, cost-conscious, and maintainable as the product scales.
Mentor engineers on infrastructure, operations, reliability, and production ownership.

What We’re Looking For

We’re looking for someone who has operated meaningful production systems and can bring staff-level judgment to infrastructure, reliability, and developer experience.

Strong candidates will have:

Deep production experience with AWS, especially services such as ECS/Fargate, RDS/Aurora PostgreSQL, VPC networking, load balancing, IAM, KMS, Secrets Manager, CloudFront, WAF, and related managed services.
Experience designing and operating systems that serve a global user base, seamless multi-region availability, and disaster recovery procedures.
Treats reliability, scalability, performance, and observability as a first-class design constraint, building these into designs from the start rather than bolting them on later.
Strong infrastructure-as-code experience. Pulumi with TypeScript is ideal, but deep experience with Terraform or another mature IaC approach is also valuable.
Strong operational knowledge of PostgreSQL, including performance investigation, connection pooling, backups, replication, locking, migrations, and safe schema-change patterns.
Experience designing and maintaining CI/CD systems, ideally with GitHub Actions, OIDC-based cloud authentication, container builds, environment promotion, required checks, and deployment gates.
Experience supporting containerized production workloads and improving deployment safety, rollback strategies, and runtime reliability.
Strong observability and incident response experience, including metrics, logs, traces, alerting, dashboards, runbooks, and post-incident learning.
The ability to work effectively in ambiguity, make pragmatic tradeoffs, and communicate clearly with both infrastructure specialists and product engineers.
A track record of raising the engineering bar through reusable patterns, documentation, automation, mentoring, and thoughtful technical leadership.

Our Environment

Manifest operates with a lean process and a high degree of ownership. Engineers are expected to work effectively in ambiguity, clarify requirements, collaborate directly across functions, and ship pragmatic, high-quality solutions.

The DevOps function is critical to that operating model. Resilient, well-planned infrastructure is critical, but we also do not want speed to come at the expense of reliability, security, or maintainability. This role exists to help Manifest find that balance as the product moves toward launch and scale.

You’ll work closely with product engineers, technical leads, DevOps leadership, and other stakeholders to ensure the platform is ready for real customers, real traffic, and real operational demands.

Why Join

This is an opportunity to help shape the foundation for a new product at an important stage.

You’ll be joining early enough to have real influence over how Manifest operates, deploys, scales, and responds to incidents. You’ll work on meaningful infrastructure problems, partner with a highly autonomous engineering team, and help define the standards that will carry the product into production and beyond.

If you’re excited by the combination of hands-on infrastructure work, production reliability, developer experience, and staff-level technical leadership, we’d love to talk.

Automatically Apply to the Best Remote Jobs

Stop the endless job search. Our AI finds and applies to the best jobs for you.

Try it Now

Rezdy

Staff Platform Engineer

AI Summary

Staff Platform Engineer

About the Role

What You’ll Do

What We’re Looking For

We’re looking for someone who has operated meaningful production systems and can bring staff-level judgment to infrastructure, reliability, and developer experience.

Strong candidates will have:

Automatically Apply to the Best Remote Jobs

Ace Your Job Interview

How to Answer "How Do You Handle Criticism"?

How to Answer "Tell Me About Yourself?" in an Interview

How to Answer "What is your Experience with Customer Service?"

How to Answer "Describe Your Experience Working With Diverse Teams Or Different Cultures?"

How to Answer The Interview Question "What Sets You Apart From Other Candidates?"

How to Answer "Why Are You The Best Person For This Job"?

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Why Should We Hire You?"

How to Answer "What Areas Need Improvement?"

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Tell Me About a Time You Received Constructive Feedback"

How to Answer "What Is Your Greatest Accomplishment?"

Similar Jobs

Principal Solutions Architect - Data Engineering

Survey Programmer - ConfirmIT

Principal Software Engineer - Browser Guard

Senior AI Engineer

Quality Engineer Intern

Cloud Solution Architect – Dynamics 365 ERP Technical Architect

Rezdy

Staff Platform Engineer

AI Summary

Staff Platform EngineerAbout the Role

What You’ll Do

What We’re Looking ForWe’re looking for someone who has operated meaningful production systems and can bring staff-level judgment to infrastructure, reliability, and developer experience.

Strong candidates will have:

Automatically Apply to the Best Remote Jobs

Share This Job:

Similar Jobs

Principal Solutions Architect - Data Engineering

Survey Programmer - ConfirmIT

Principal Software Engineer - Browser Guard

Senior AI Engineer

Quality Engineer Intern

Cloud Solution Architect – Dynamics 365 ERP Technical Architect

Personalize your Remote Job Search in 3 Easy Steps!

Staff Platform Engineer

About the Role

What We’re Looking For

We’re looking for someone who has operated meaningful production systems and can bring staff-level judgment to infrastructure, reliability, and developer experience.