Avenga

Senior Machine Learning Research Engineer – Benchmarking & Paper Replication

Posted 22 days ago

Poland

⭐ 5-10 years experience

Apply Now

Please mention DailyRemote when applying

AI Summary

The role involves analyzing academic ML papers to reproduce and validate cutting-edge methodologies through PoCs. You will build benchmark datasets and evaluation pipelines to measure model performance against baselines.

This is us

At Avenga, we believe that human creativity empowers technology that matters. Operating globally, our 6000+ specialists provide a full spectrum of services, including business and tech advisory, enterprise solutions, CX, UX and Ul design, managed services, product development, and software development.

This is the job

At the intersection of applied ML research and real-world AI product development, we are looking for a Senior Machine Learning Research Engineer to help explore, reproduce, benchmark and validate cutting-edge ML methodologies.

You will work in an R&D / PoC environment where the main focus is not only to build models, but to understand research papers, replicate promising approaches, evaluate them against strong baselines, and prepare reliable ground truth data for validation.

This role is a strong fit for someone who enjoys reading academic papers, debugging open-source research repositories, building benchmark datasets from scratch, and turning research ideas into working, measurable prototypes under practical compute constraints.

This is the team

You’ll join a senior AI / ML research and engineering environment working on early-stage applied AI initiatives. The team operates in a fast-moving R&D setup where scientific thinking, rigorous evaluation, and practical engineering are equally important.

In this role, you’ll collaborate with AI engineers, ML researchers, and technical stakeholders to test research hypotheses, validate model performance, and understand which approaches are technically feasible for further product development.

This is you

Strong machine learning fundamentals and hands-on experience with ML research or applied research projects
Experience reading scientific / academic ML papers and understanding the methodology behind them
Ability to dissect academic papers, debug open-source repositories, and replicate research results
Experience identifying, curating, and constructing benchmark datasets for testing specific ML methodologies
Ability to prepare ground truth data from large volumes of raw or semi-structured data
Strong Python skills and hands-on experience with modern ML / deep learning frameworks
Practical understanding of model evaluation, baselines, metrics, error analysis, and reproducibility
Comfortable working under compute constraints and adapting research methods to practical limitations
Able to work independently in an ambiguous R&D / PoC environment

Nice-to-have skills:

Experience building rigorous evaluation pipelines from scratch
Experience comparing original paper claims against baselines and alternative methods
Experience with data annotation, dataset quality control, or benchmark design
Experience with LLMs, agentic systems, or tool-based AI workflows
Experience converting research workflows into reusable components, tools, or deployable skills for LLM-based systems
Experience with MLOps, experiment tracking, model versioning, or reproducible ML pipelines
Publications, PhD / research background, or strong open-source research contributions are a plus

This is your role

Read and analyze scientific / academic ML papers with understanding
Identify promising methodologies and assess whether they are worth reproducing
Build PoCs based on academic papers and open-source research implementations
Debug, adapt, and reproduce research repositories under practical compute constraints
Identify, curate, and construct benchmark datasets required to test specific methodologies
Prepare ground truth data from large volumes of data to validate and test PoCs
Build evaluation pipelines to compare replicated approaches against baselines
Measure model performance critically and verify original paper claims
Document findings, limitations, experiment results, and recommendations for next steps
Collaborate with AI engineers and technical stakeholders to turn validated research into practical project direction

What awaits you at Avenga?

At Avenga, everyone matters. We provide equal opportunities in recruitment, career development, and leadership, regardless of race, ethnicity, gender identity, sexual orientation, disability, age, religion, or any other characteristic. We are committed to fostering a work environment where our diverse community of employees, candidates, and business partners actively shapes our growth. By bringing together people from different backgrounds and experiences, we build a workplace where everyone feels free to be themselves while honoring the boundaries of others.

Automatically Apply to the Best Remote Jobs

Stop the endless job search. Our AI finds and applies to the best jobs for you.

Try it Now

Avenga

Senior Machine Learning Research Engineer – Benchmarking & Paper Replication

AI Summary

Automatically Apply to the Best Remote Jobs

Ace Your Job Interview

How to Answer "How Do You Handle Criticism"?

How to Answer "Tell Me About Yourself?" in an Interview

How to Answer "What is your Experience with Customer Service?"

How to Answer "Describe Your Experience Working With Diverse Teams Or Different Cultures?"

How to Answer The Interview Question "What Sets You Apart From Other Candidates?"

How to Answer "Why Are You The Best Person For This Job"?

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Why Should We Hire You?"

How to Answer "What Areas Need Improvement?"

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Tell Me About a Time You Received Constructive Feedback"

How to Answer "What Is Your Greatest Accomplishment?"

Similar Jobs

Senior AI Full-Stack Staff Engineer (TypeScript/Node.js)

Full Stack Engineer - Magento / L2

Salesforce Administrator Developer

C# / VB.NET Developer - WFH

Full-stack Engineer (Python, TDD)

Desarrollador móvil iOS (MI)

Avenga

Senior Machine Learning Research Engineer – Benchmarking & Paper Replication

AI Summary

Automatically Apply to the Best Remote Jobs

Share This Job:

Similar Jobs

Senior AI Full-Stack Staff Engineer (TypeScript/Node.js)

Full Stack Engineer - Magento / L2

Salesforce Administrator Developer

C# / VB.NET Developer - WFH

Full-stack Engineer (Python, TDD)

Desarrollador móvil iOS (MI)

Personalize your Remote Job Search in 3 Easy Steps!