AI Engineer - Data Intelligence

 Posted a day ago
     
 $150K - $180K per year
  
0-2 years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

Build and maintain the master data enrichment pipeline, focusing on classification and entity resolution workflows using deterministic logic and LLMs. Develop evaluation harnesses and regression suites to ensure high data quality and pipeline reliability.

Why Clarium?

The healthcare industry overspends on its supply chain by over $25B each year, the result of fragmented data, inefficient workflows, and wasted supplies. Clarium is fixing that. Our AI-powered platform, Astra OS, gives hospitals end-to-end visibility into their supply chain operations, automating workflows and surfacing actionable insights so supply chain teams can focus on what matters most: patient care. We're trusted by some of the world's leading health systems, including Yale New Haven Health, Stanford, Geisinger, Cleveland Clinic, and Kaiser Permanente.

Founded in 2020, Clarium has raised $43M in total funding. Our Series A was led by Northzone, with participation from General Catalyst, AlleyCorp, Kaiser Permanente Ventures, Texas Medical Center Ventures, and 1984 Ventures.

The Opportunity

AI-powered platforms, like Clarium’s, deliver the highest impact when they are supported by high-quality data. As we scale to more health systems and deepen our offering of intelligent, data-driven workflows, the master data enrichment pipeline (the system that classifies and contextualizes every product flowing through a hospital's supply chain) has become a critical growth lever. We're investing in the team and infrastructure to make that layer faster, smarter, and more reliable.

You'll join the Data Products team, a small, unusually senior group responsible for the data assets, data science, and analytics that drive measurable value for our clients. Day-to-day, you'll build and own components of our enrichment pipeline: classification workflows, entity resolution systems, evaluation harnesses, and the production tooling that keeps it all running. You'll work closely with engineers and data scientists who've shipped real ML systems at scale, and your work will feed directly into decisions made by supply chain teams at some of the country's leading health systems.

A rare early-career opportunity to learn fast and own real work from day one. As the first junior hire on the team, you won't be buried under layers of abstraction. You'll work directly alongside people who've done this before, on problems that actually matter. Short feedback loops, real stakes, and the kind of hands-on growth that's hard to find this early in a career. It's the opportunity many of us wish we'd had starting out.

In This Role You Will

  • Build and maintain components of Clarium's master data enrichment pipeline, the system that classifies and enriches every product flowing through our platform

  • Design and own classification and entity resolution workflows that combine deterministic logic and LLMs for production data processing

  • Build and operate evaluation harnesses, label sets, and regression suites (we use Braintrust) to measure and improve pipeline quality with confidence

  • Write production Python and SQL; the majority of your time will be spent in code, not in configuration tools

  • Analyze complex datasets using statistics and ML to surface actionable insights and inform pipeline improvements

  • Proactively audit data for quality issues; find the problems no one else has noticed yet, diagnose root causes, and ship fixes

What You'll Bring

  • Strong Python skills and a track record of writing production code, not just scripts or notebooks

  • Strong SQL, including complex joins, window functions, performance tuning, and data modeling

  • Comfort working in ambiguous environments; you can scope a problem, make a plan, and execute without hand-holding

  • A genuine, non-negotiable commitment to data quality; you treat silent bugs as real failures

  • Ability to go deep on an unfamiliar domain and develop meaningful expertise over time

Nice to Have

  • Experience with LLM integrations, prompt evaluation, or classification at scale

  • Familiarity with eval frameworks such as Braintrust, Promptfoo, or equivalent

  • Prior work in healthcare, supply chain, or another domain where data quality has direct operational consequences

Skills & Tools You'll Use

Need to Know: Python · SQL · PostgreSQL · CI/CD · Production observability

Nice to Know: Temporal · Braintrust · Snowflake · AWS · Sigma

What You Get at Clarium

Target Base Salary Range: $150K - $180K

The base salary Clarium offers may vary depending upon the ultimate scope and responsibilities of the position and on the candidate’s job-related knowledge, skills, and experience. The total package will include equity, in addition to a full range of medical and/or other benefits, depending on the position offered. Pay and benefits are subject to change at any time, consistent with the terms of any applicable compensation or benefit plans.

Incentive Stock Options proportionate to your salary

Fully remote, with a NYC co-working space available; distributed team across multiple time zones with opportunities for in-person time

Unlimited PTO

Top-tier health, vision, and dental benefits

401K

The opportunity to build on a strong foundational team with deep data and engineering roots at a stage where your work genuinely shapes the product

Equal Opportunity Statement

Clarium is committed to promoting an inclusive work environment free of discrimination and harassment. We value a diverse and balanced team where everyone can belong.

Similar Jobs

See all Remote Software Development jobs →

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in AI Engineer

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified