ML Evaluation Designer - Expert
Mercor · Ireland
Job description
About the role
Mercor is seeking an experienced Machine Learning Engineer to design and refine evaluation tasks for large language models. This remote, contract position focuses on creating robust rubrics, metrics, and grading processes that directly improve model performance.
Key responsibilities
- Design ML/LLM evaluation tasks, rubrics, and metrics.
- Grade model and agent outputs, iterating to enhance evaluation quality.
- Apply training‑side judgment from supervised fine‑tuning (SFT), reinforcement learning from human feedback (RLHF), and reward modeling to shape evaluation design.
- Collaborate with AI research teams to refine evaluation signals and improve model outputs.
- Work independently and asynchronously, meeting deadlines while committing 30+ hours per week.
Required profile
- 5+ years of experience as a Machine Learning Engineer with hands‑on training and evaluation work.
- Strong written communication skills.
Required skills
- Proficiency with PyTorch.
- Proficiency with JAX.
- Experience using Hugging Face libraries.
What we offer
- Contract rate ranging from $45 to $140 per hour.
- Fully remote work environment.
- Opportunity to influence cutting‑edge AI research.
Published 11 hours ago
Expires 1 month from now