πŸ‡ΊπŸ‡Έ USAJobs.work

America's Job Portal

← Back to USA Jobs

Freelance Agent Evaluation Engineer

Company

Mindrift

Location

gua musang, gua musang

Posted

June 15, 2026

Position Overview

Please submit your CV in English and indicate your level of English proficiency.
Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.
What this opportunity involves
We're building a dataset to evaluate AI coding agents β€” how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments:
Build virtual companies following a high-level plan - codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history
Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair
Design tasks set in isolated environments - emulations of a developer's workstat...

Ready to Apply?

Join thousands of Americans building their careers

Apply Now