Unstructured Technologies Inc.

Site Reliability Engineer

Posted 2 months ago

Worldwide

⭐ 5-10 years experience

Apply Now

Please mention DailyRemote when applying

AI Summary

Own production reliability and performance for a Kubernetes-based document processing platform. Focus on observability, capacity planning, and automating fleet operations to ensure system stability.

Unstructured is defining the standard for enterprise data transformation in the age of LLMs and generative AI. In just two years, we've raised over $65M from world-class investors, including Menlo Ventures, Bain Capital, Databricks, NVIDIA, Microsoft, and IBM.

Our open-source toolkit has been downloaded 61M+ times and is used by 90% of the Fortune 1000. We power production AI workflows across commercial and federal sectors — transforming PDFs, HTML, Word docs, images, emails, and more into AI-ready data pipelines that scale.

We're not just building tools, we're building the backbone of generative AI and the infrastructure that unlocks intelligence across industries.

About The Role

The infra team is small, technically deep, and owns the full stack from cloud provisioning and k8s operators to workflow orchestration and core services. We ship frequently and operate at a scale that makes reliability a first-class engineering problem.

As part of the Infra team, you'll work on the reliability and performance of our platform end-to-end. You'll work closely with infra and product engineers to instrument systems, establish SLOs, tune autoscaling, and drive incident process maturity. The work is technical and hands-on: you'll write code, dig into k8s internals, and hold yourself accountable to positive production outcomes.

What You'll Own and Drive

Own production reliability across our Knative, KEDA, and Kubernetes-based document processing platform. Proactively detect degradation, diagnose root causes, and ship fixes
Work on observability: end-to-end tracing, latency SLOs, capacity dashboards, and alerting that finds problems before customers do
Load testing and capacity planning: establish throughput benchmarks, detect performance regressions before they reach production
Support fleet operations: contribute to the safe, automated upgrade process for our growing fleet of production systems

What We’re Looking For

4+ years of SRE, platform engineering, or infrastructure engineering in a production Kubernetes environment
Deep operational knowledge of Kubernetes. You've written HPA configs, KEDA ScaledObjects, PodDisruptionBudgets, preStop hooks, and PriorityClasses; you understand pod lifecycle and scheduler behavior.
Demonstrated experience diagnosing and resolving real production performance issues: resource saturation, timeout failures, scheduling problems, graceful shutdown gaps
Enough Python or Go to read service code, trace a bug to root cause, and write a targeted fix

Why You'll Love It Here

You'll be surrounded by smart, kind, low-ego people who genuinely enjoy building together. We invest in our team with company offsites, best-in-tech swag, and the tools you need to do your best work, wherever you're based.

We support you holistically, not just at work. From medical, dental, and vision coverage effective the 1st of the month following your start date, life and disability insurance, unlimited PTO, and flexible parental leave, to a 401(k) with company match, equity, a $500 work from home stipend, $70/month internet reimbursement, and team/company offsites throughout the year - we want you focused on building, growing, and staying energized for the long haul.

If you're excited about what we're building, we'd love to meet you.

Automatically Apply to the Best Remote Jobs

Stop the endless job search. Our AI finds and applies to the best jobs for you.

Try it Now

Unstructured Technologies Inc.

Site Reliability Engineer

AI Summary

About The Role

What You'll Own and Drive

What We’re Looking For

Why You'll Love It Here

Automatically Apply to the Best Remote Jobs

Ace Your Job Interview

How to Answer "How Do You Handle Criticism"?

How to Answer "Tell Me About Yourself?" in an Interview

How to Answer "What is your Experience with Customer Service?"

How to Answer "Describe Your Experience Working With Diverse Teams Or Different Cultures?"

How to Answer The Interview Question "What Sets You Apart From Other Candidates?"

How to Answer "Why Are You The Best Person For This Job"?

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Why Should We Hire You?"

How to Answer "What Areas Need Improvement?"

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Tell Me About a Time You Received Constructive Feedback"

How to Answer "What Is Your Greatest Accomplishment?"

Similar Jobs

Staff Detection Engineer

SOFTWARE DEVELOPER II SR, FCH - IT - BUSINESS APPLCATNS

Ruby on Rails Engineer

Partner Operations & AI Automation Manager

Full Stack Software Engineer (Tech Lead)

Infrastructure Operations Engineer

Unstructured Technologies Inc.

Site Reliability Engineer

AI Summary

About The Role

What You'll Own and Drive

What We’re Looking For

Why You'll Love It Here

Automatically Apply to the Best Remote Jobs

Share This Job:

Similar Jobs

Staff Detection Engineer

SOFTWARE DEVELOPER II SR, FCH - IT - BUSINESS APPLCATNS

Ruby on Rails Engineer

Partner Operations & AI Automation Manager

Full Stack Software Engineer (Tech Lead)

Infrastructure Operations Engineer

Personalize your Remote Job Search in 3 Easy Steps!