Preference Model is hiring for work from home roles

Preference Model

2 Remote Job Openings at Preference Model

Preference Model is hiring for remote RL Environments Engineer Summer Intern

RL Environments Engineer Summer Intern

Preference Model · Internship · 3 months ago
Preference Model
🌎 United States ⭐ 0-2 yrs exp 💼 Software Development
The intern will be responsible for designing and building Reinforcement Learning (RL) environments specifically tailored to test Large Language Model (LLM) reasoning across machine learning, systems, and research problems. This involves writing production-grade Python code and translating complex ML concepts from research papers into concrete, reproducible training tasks using tools like Docker.