Please mention DailyRemote when applying
This role builds and owns the AI and data systems at the core of Tern's product. You'll set the standard on evals, pipeline reliability, and advisor reporting on a small team where the scope is real and the ownership is yours. If you've been waiting for AI problems worth owning, this is that job.
Tern is a venture-backed software company on a mission to reshape the $127B travel agency industry by giving power back to the entrepreneurs who built it.
Nearly 98% of travel agencies are small businesses. These businesses have been chronically underserved by technology. We're here to change that. Our platform helps travel advisors run more efficient, professional, and profitable operations, giving them the modern infrastructure they need to lead the next chapter of travel.
But the impact goes beyond business. Travel advisors help clients move more intentionally through the world. When a traveler works with an advisor, they're more likely to avoid overtouristed hotspots and more likely to spend their dollars in places where they can do real good. That's the kind of travel we want more of.
At Tern, we believe in small business. We believe in the power of travel. And we're building the future of both.
AI is becoming central to Tern's product, and the quality of those AI features lives or dies on the data behind them. This role sits right at that intersection. You'll be a senior builder on our data team, working closely with our Data / AI Lead, and your north star is making Tern's AI features trustworthy. That's an engineering problem, and you'll solve it by building: the systems those features run on, the eval harnesses that tell us whether they're actually good, and the monitoring that catches quality and drift in production.
That work rests on a data foundation you'll also own. Tern is the system of record for every advisor and agency on the platform, so our data is one of the most valuable things we have, and it has to do double duty; feeding the AI features that are becoming core to the product, and powering the reporting that advisors and agency owners rely on to run their businesses. You'll own that data end to end, from the pipelines that move it through to the reporting built on top of it, so the AI work always has solid ground to stand on.
Tern is building the platform travel advisors run their entire business on, and we believe the next great aggregator in travel gets built right now, on AI. The window to win this market is open, and we intend to take it. With capital and real momentum behind us, we're growing the engineering team this summer to put fuel behind what's already working.
This role owns a real part of how Tern works, the kind of work the rest of the team depends on. We hire engineers with high agency and the trust to execute independently, set direction through their work, and lift the standard of everyone around them. We hire for proven execution, so we want to see specific evidence of what you've shipped and the impact it had, in your resume and in the room.
Ship regularly. Every engineering pair ships a working, tested feature every week. Fast and good are not in tension. Testing is part of the build.
We own our code and how users experience it. We watch what we ship in production, stay close to support, monitoring, and user feedback, and own our fixes end to end.
We make the people around us faster and better, through code and design reviews that teach, through clear written work, and by unblocking our colleagues quickly.
Every team is an AI team. We use AI fluently in daily work, from design review to test generation to debugging. We run Claude Code with a deep library of custom skills and agents, unlimited token usage, and we're actively building agentic and MCP-based tooling on top of our own systems. This is how we move fast and build well.
Tern is a Ruby on Rails application with a Hotwire front end, backed by a Postgres database and hosted on Heroku, though we are migrating to Google Cloud Platform. Our data flows into BigQuery, where we model it with dbt and build reporting in ThoughtSpot and Hex. Claude Code is part of daily development. You don't need to have used every piece, but you should be fluent enough to be productive quickly and excited to work this way.
Build and ship the systems behind Tern's AI features: the data infrastructure, services, and agentic tooling they run on. The output of this role is working software in production, not decks or recommendations.
Build the evaluation systems that answer whether an AI feature is good enough to ship and good enough to keep- eval harnesses, datasets, and production monitoring that run as software, not one-off analyses. Where no quality bar exists yet, build the thing that sets it.
Own data quality as a first-class concern across ingestion, modeling, and reporting. Catch problems before they reach a model, a dashboard, or a user. Fix them end to end.
Build and maintain the ETL systems that move and shape data from our application and third-party sources. Keep them reliable as volume grows.
Work alongside product squads to build the reporting that gives advisors and agency owners real visibility into how their business is performing.
Make the people around you faster and better. Share context early and write clearly so others can build on your work.
Production AI/ML experience- the must-have: You've built and shipped AI and/or ML systems that real users depended on in production, and you owned what happened after launch. Watching quality, debugging bad outputs, and making the system better over time. This matters more to us than any specific tool or title.
Evals as engineering: You treat evaluation as something you build, not a report you write. You have a real point of view on what to measure, how to catch drift and regressions, and when a metric is lying to you ideally from building evals for a production system.
Data pipeline and service ownership: You've personally built and owned pipelines and services that move data from application sources into a warehouse. You know what breaks, when and why, and you own the fix.
High agency: You've taken ambiguous, under-specified problems and driven them to a working outcome. You don't need a fully-scoped ticket to start.
Experience with Ruby on Rails or working directly from an application database rather than just downstream data
Hands-on experience with LLM evaluation and observability tooling
Experience with MCP-based tooling or agentic data workflows
π± We're always leveling up. Whether you're deepening your craft, learning from a teammate, or embracing a new challenge, growth is core to our identity.
π§ We act with optimistic agency. We take initiative, seek clarity, and move forward, even when the path isn't obvious. Through every peak and valley, we lead with curiosity, laughter, kindness, and resolve.
πͺ We expect operational excellence. We ship value to our users every single week. We believe that compounding habits lead to sustainable productivity, consistency, and mutual trust.
Operating Principles:
π We deeply understand our users. At every level of the organization, we obsess about understanding those we serve and the industry we operate in.
β We embrace the power of and, and not now. We challenge trade-offs by asking better questions. We break hard problems into small pieces and tackle them with intention. We also know to make the hard call to say "not right now".
π£ We speak up and move forward. Everyone at Tern has a voice and a responsibility to use it. We invite healthy tension, share dissenting views early, and challenge each other with curiosity, not ego.
π We move fast and sweat the details. Velocity matters and relentless progress beats perfection every time. But speed isn't chaos: we stay aligned, own our outcomes, and care deeply about quality.
π€£ We take the work seriously, but not ourselves. We hire kind, driven people who elevate the room. If you've got a big ego or take yourself too seriously, you won't last.
Our interview is built to surface demonstrated execution, and we keep it deliberately light rather than a long gauntlet
We screen resumes for specific, verifiable things you shipped and the impact they had, not responsibilities or titles
We go deep on one or two things you personally built and shipped. We want the real story: the ambiguity, the dead ends, what broke, and how you owned the fix.
We give a practical exercise that reflects the actual work, with AI tools available and expected
And throughout, we look for evidence that working with you made other engineers better at their jobs
We take the work seriously, not ourselves. If you're here to grow, build something industry-changing, and raise the bar for the people around you, we'd love to meet you.
Be part of a mission-driven team transforming the travel planning space
Work with a supportive, curious, and creative team
Influence and shape Tern's data and AI foundation from early days
Competitive salary, equity, and benefits package
Tern is committed to building a team that represents people from many different backgrounds and life experiences, reflecting our worldview coinciding with the users and customers we serve across the world. We prefer that you apply, so think of our postings as the start of the conversation. Take the chance, you may be a wonderful add to our Tern team, even if you don't fully match every requirement on the job description.
Stop the endless job search. Our AI finds and applies to the best jobs for you.
Discover remote opportunities in Data Engineer
Answer easy questions
200,000+ jobs across 15+ categories
Get your best job matches
Only hand-screened, legit jobs
Find a remote job faster
No ads, scams, or junk
“ I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!