Machine Learning Engineer, RL Environments - Internship
Preference Model · Internship · 18 hours ago
Design and implement RL training environments to test LLM reasoning on machine learning and systems problems. Translate ML research papers into concrete training tasks and deliver work into production training runs.