Senior Machine Learning Research Engineer – Benchmarking & Paper Replication

 Posted 4 hours ago
  
 Poland
  
5-10 years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

The role involves analyzing academic ML papers to reproduce and validate cutting-edge methodologies through PoCs. You will build benchmark datasets and evaluation pipelines to measure model performance against baselines.

This is us

At Avenga, we believe that human creativity empowers technology that matters. Operating globally, our 6000+ specialists provide a full spectrum of services, including business and tech advisory, enterprise solutions, CX, UX and Ul design, managed services, product development, and software development.

This is the job

At the intersection of applied ML research and real-world AI product development, we are looking for a Senior Machine Learning Research Engineer to help explore, reproduce, benchmark and validate cutting-edge ML methodologies.

You will work in an R&D / PoC environment where the main focus is not only to build models, but to understand research papers, replicate promising approaches, evaluate them against strong baselines, and prepare reliable ground truth data for validation.

This role is a strong fit for someone who enjoys reading academic papers, debugging open-source research repositories, building benchmark datasets from scratch, and turning research ideas into working, measurable prototypes under practical compute constraints.

This is the team

You’ll join a senior AI / ML research and engineering environment working on early-stage applied AI initiatives. The team operates in a fast-moving R&D setup where scientific thinking, rigorous evaluation, and practical engineering are equally important.

In this role, you’ll collaborate with AI engineers, ML researchers, and technical stakeholders to test research hypotheses, validate model performance, and understand which approaches are technically feasible for further product development.

This is you

  • Strong machine learning fundamentals and hands-on experience with ML research or applied research projects

  • Experience reading scientific / academic ML papers and understanding the methodology behind them

  • Ability to dissect academic papers, debug open-source repositories, and replicate research results

  • Experience identifying, curating, and constructing benchmark datasets for testing specific ML methodologies

  • Ability to prepare ground truth data from large volumes of raw or semi-structured data

  • Strong Python skills and hands-on experience with modern ML / deep learning frameworks

  • Practical understanding of model evaluation, baselines, metrics, error analysis, and reproducibility

  • Comfortable working under compute constraints and adapting research methods to practical limitations

  • Able to work independently in an ambiguous R&D / PoC environment

Nice-to-have skills:

  • Experience building rigorous evaluation pipelines from scratch

  • Experience comparing original paper claims against baselines and alternative methods

  • Experience with data annotation, dataset quality control, or benchmark design

  • Experience with LLMs, agentic systems, or tool-based AI workflows

  • Experience converting research workflows into reusable components, tools, or deployable skills for LLM-based systems

  • Experience with MLOps, experiment tracking, model versioning, or reproducible ML pipelines

  • Publications, PhD / research background, or strong open-source research contributions are a plus

This is your role

  • Read and analyze scientific / academic ML papers with understanding

  • Identify promising methodologies and assess whether they are worth reproducing

  • Build PoCs based on academic papers and open-source research implementations

  • Debug, adapt, and reproduce research repositories under practical compute constraints

  • Identify, curate, and construct benchmark datasets required to test specific methodologies

  • Prepare ground truth data from large volumes of data to validate and test PoCs

  • Build evaluation pipelines to compare replicated approaches against baselines

  • Measure model performance critically and verify original paper claims

  • Document findings, limitations, experiment results, and recommendations for next steps

  • Collaborate with AI engineers and technical stakeholders to turn validated research into practical project direction

 

What awaits you at Avenga?

At Avenga, everyone matters. We provide equal opportunities in recruitment, career development, and leadership, regardless of race, ethnicity, gender identity, sexual orientation, disability, age, religion, or any other characteristic. We are committed to fostering a work environment where our diverse community of employees, candidates, and business partners actively shapes our growth. By bringing together people from different backgrounds and experiences, we build a workplace where everyone feels free to be themselves while honoring the boundaries of others.

Similar Jobs

See all Remote Software Development jobs →

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Software Development

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified