Principal AI Data Engineer

 Posted a month ago
     
5-10 years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

Design and build data product solutions that enable AI agents to reliably access and reason over enterprise and clinical data. Focus on transforming raw data into use-case-ready products using Snowflake, SQL, and modern AI tooling.

Work Schedule

Standard (Mon-Fri)

Environmental Conditions

Office

Job Description

Thermo Fisher CRG has an exciting opportunity within our CRG Digital group. We are looking for a Principal AI Data Engineer to join our team.

In this role, you will work directly with stakeholders to understand their needs, and design and build data product solutions that enable a range of use cases, including for AI agents, to reliably access, reason over, and act on enterprise and clinical data

The focus is on turning raw data into use case-ready data products using Snowflake, SQL and modern AI tooling. This is not a model development or research position. The ideal candidate enjoys working deeply with data, profiling, shaping and structuring it, so that downstream systems, including AI agents, can use it effectively

Essential Functions

• Experienced in working directly with non-technical stakeholders and leading technical projects.

• The successful candidate will need to be data proficient - likes to use data to make decisions as well as technically intuitive and enjoy learning new technology.

• User-focused - Keeps user needs in mind. Thoughtful about what’s intuitive for others, what’s not and why.

• Thorough knowledge of SQL and a range of technologies including one or more of the following: Snowflake, Databricks, MCP Servers, Microsoft Power Platform (e.g. Power Apps, Power BI), ETL, Python, and/or R.

• Understands how data structure, quality and context impact downstream AI/LLM use cases

• Direct knowledge of LLM, Agentic AI, Machine Learning and AI Engineering techniques an advantage - Familiarity with AI/LLM concepts is a plus, particularly in how data preparation and structure influence AI outputs (no model development required)

• Generates and tests hypotheses and analyzes and interprets the results.

• Navigates large, complex datasets for shaping, profiling, and curation, as well as identifies related data that is fundamental to successfully applying predictive techniques.

• Understanding of relational database and experience working with complex data systems.

• Designs, develops and programs methods, processes, and software programs to consolidate, cleanse, and analyze unstructured, diverse data sources to recognize patterns, identify opportunities, and generate actionable business insights and solutions.

• Understands how data structure, lineage, and quality impact downstream AI/LLM use cases, Collaborates with AI/ML teams but does not focus on model development.

• Identifies meaningful insights from large data and metadata sources in support of continuous improvement efforts and business process upgrades through exploratory data analysis.

• Ability to work on a multi-disciplinary project team

• Excellent problem solving and innovative skills

• Excellent written and verbal communications skills

• Ability to effectively organize multiple assignments with challenging timelines

• Ability to adapt and adjust to changing priorities

• Demonstrated positive attitude, enthusiasm toward work, and the ability to work well with others

Education and Experience:

• Bachelor's degree in computer science, statistics, biostatistics, mathematics or related field or equivalent and relevant formal academic / vocational qualification, and at least 5 years of experience that provides the knowledge, skills, and abilities to perform the job requirements.

Required Knowledge, Skills and Abilities:

  • Data proficient - likes to use data to make decisions
  • Technically intuitive – likes learning new technology tools and able to do so quickly
  • Thorough knowledge of SQL and a range of technologies including one or more of the following: Snowflake, Databricks, MCP Servers, Microsoft Power Platform (e.g. Power Apps, Power BI), ETL, Python, and/or R.
  • Understands how data structure, quality and context impact downstream AI/LLM use cases
  • User-focused - Keeps user needs in mind. Thoughtful about what’s intuitive for others, what’s not and why
  • Experience demonstrating a strong attention to detail
  • Ability to work on a multi-disciplinary project team
  • Excellent problem solving and innovative skills
  • Excellent written and verbal communications skills
  • Ability to effectively organize multiple assignments with challenging timelines
  • Ability to adapt and adjust to changing priorities
  • Demonstrated positive attitude, enthusiasm toward work, and the ability to work well with others
  • Understanding of relational data base and experience working with complex data systems

Preferred Knowledge, Skills and Abilities:

  • Clinical trial experience
  • Ability to train, mentor and supervise others

Similar Jobs

See all Remote Software Development jobs →

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Data Engineer

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified