Technical Program Manager, Data Center Operations

 Posted an hour ago
     
 $200K - $270K per year
  
10+ years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

Own the end-to-end site handover framework and develop SOPs to govern critical data center operations across a global fleet. Lead incident management, stability improvement programs, and provide leadership visibility through stability metrics and reporting.

About Fluidstack

We exist to make humanity more free. For most of human history, you farmed or you starved. Technology gave people more time for the things they wanted to do, instead of things they had to do. Powerful AI will be the biggest lever for human choice we've ever built - but only if models are aligned with what humanity actually wants. There are groups building AI who don't share these goals. Whoever deploys frontier compute infrastructure fastest will decide whether AI expands human freedom or shrinks it.

We're singularly focused on delivering 10 to 100s of GWs of compute faster than anyone else, rethinking every layer of the stack. We acquire power, design and build data centers, and operate them - with teams spanning hardware and software. Speed and scale are our key differentiators. Come be a part of building civilization-scale infrastructure for AI.


We hire people who care deeply about this problem space. If that is you, please apply!

How We Operate

  • Extreme ownership. Full autonomy. Own things end to end often taking on scope outside your core role without being asked to get things done.

  • Velocity. We drive everything forward as fast as possible.

  • First principles. Challenge every assumption. Zero analogy thinking, no egos, the best idea wins.

  • Love of the game. The frontier of AI is the most interesting problem of our time. We put in long hours at high intensity to push the frontier forward.

The Data Center Operations Team

Examples of key problems the team is working on

  • Operate at the scale of a nation, not a building. The fleet you run will draw more power than some countries, on the way to 100 GW.

  • Fly the plane while it's being built. Sites come online in pieces, and you keep the live ones running flawlessly while construction continues around them.

  • Write the playbook, don't inherit it. No prior operations org has run at this speed and scale, so the standards you set become the standard.

Role Scope

  • Own the end-to-end site handover framework: define the gates, acceptance criteria, and sign-off procedures that move a new facility from construction to live operations without dropped terms or late surprises.

  • Embed into design, construction, and due diligence teams early enough to shape maintainability requirements before they become field problems.

  • Drive the cross-functional handover rhythm across training, documentation, systems access, and knowledge transfer, surfacing blockers weeks before they hit the go-live schedule.

  • Build and maintain the SOPs that govern critical datacenter operations across the fleet, with metrics that track adoption, execution quality, and efficiency at each site.

  • Lead incident management and stability improvement programs, including post-incident reviews with root cause analysis, corrective action tracking, and preventive maintenance oversight that reduces unplanned outages across the global footprint.

  • Produce the dashboards and reporting that give leadership visibility into stability metrics and incident trends, and run the CAPA programs that turn that data into durable fixes.

What We're Looking For

The below is a starting point. We always make space for exceptional people, so if you don't fit this role exactly, tell us where you would.

  • You have run program management in mission-critical environments where a delayed handover or missed SOP had real operational consequences, not just schedule slippage.

  • You have designed operational frameworks from scratch: handover gates, SOP libraries, incident management programs built without a legacy system to copy from.

  • You quarterback across design, construction, supply chain, and site ops teams simultaneously, and other teams call you when a cross-functional workstream is stuck.

  • You write clearly enough to distill a complex operational issue into a decision and a next action for a site lead, an executive, or a counterparty who was not in the room.

  • You track incident trends and CAPA status in live dashboards and follow corrective actions through to closure, not just to initial assignment.

  • You have personally built or maintained SOPs and measured whether they were actually followed, not just whether they existed.

  • Bonus: ITIL, PMP, or PgMP certification. Hyperscale or large colo operator experience. Familiarity with ASHRAE, Uptime Institute, or TIA-942 standards. Exposure to datacenter construction and commissioning processes.

Salary & Benefits

  • Competitive total compensation package (salary + equity)

  • Retirement or pension plan, in line with local norms

  • Health, dental, and vision insurance

  • Generous PTO policy, in line with local norms

The base salary range for this position is $200,000 - $270,000 per year, depending on experience, skills, qualifications, and location. This range represents our good faith estimate of the compensation for this role at the time of posting. Total compensation may also include equity in the form of stock options.

We are committed to pay equity and transparency.

Fluidstack is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Fluidstack will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

You will receive a confirmation email once your application has successfully been accepted. If there is an error with your submission and you did not receive a confirmation email, please email careers@fluidstack.io with your resume/CV, the role you've applied for, and the date you submitted your application-- someone from our recruiting team will be in touch.

Similar Jobs

See all Remote Product jobs →

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Technical Program Manager

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified