White Circle is an AI Safety company building the safety, reliability, and optimization layer for AI systems. At the core of our platform are policies – simple natural-language rules that define what an AI model should and shouldn’t do. We automatically test, enforce, and continuously improve these policies at scale.
We’ve raised $11M from top funds, founders, and senior leaders at OpenAI, Anthropic, HuggingFace, Mistral, DeepMind, Datadog, Sentry, and others
We process over one hundred million API calls every month
We fine-tune and train our own LLMs so they run faster and cheaper than any open or proprietary model
We’re a small, highly focused team. If you want to work deeply on hard problems, see your work ship to production quickly, and influence how AI safety is actually built – you’re the one we need.
Review and evaluate AI conversations and model outputs
Assess responses for safety, quality, accuracy, policy compliance, and user intent
Identify harmful, unsafe, misleading, or low-quality behavior
Label and categorize model outputs according to internal evaluation frameworks
Moderate sensitive content and identify policy violations
Compare, rank, and score model responses
Investigate edge cases and ambiguous situations
Provide structured feedback to researchers and engineers
Help improve evaluation guidelines and annotation processes
Contribute to the datasets used to train and evaluate AI systems
Has exceptional attention to detail
Can make consistent decisions across large volumes of data
Enjoys analysing nuanced situations where there isn't always a clear answer
Can follow guidelines while exercising good judgment
Has strong written English skills
Communicates clearly and explains reasoning well
Is curious about AI and how these systems work
Have experience with content moderation, trust & safety, quality assurance, compliance, or policy enforcement
Have experience in data annotation, AI evaluation, RLHF, or model assessment
Have worked with AI tools extensively and understand their strengths and limitations
Enjoy finding edge cases and unusual model behavior
This role may involve reviewing content that is offensive, harmful, violent, sexual, or otherwise disturbing. We provide tooling and support, but candidates should be comfortable working with sensitive content when necessary.
Salary of $30,000 to $50,000 + equity
Paid time off in line with your local regulations, no matter where you work from
All the hardware, tools, and services you need
Intro call with HR (25 min)
Take-home assignment
Final conversation with our CEO (35 min)
Please submit your application in English.