🇺🇸 USAJobs.work

America's Job Portal

← Back to USA Jobs

Freelance AI Evaluation Architect

Company

Reconocida empresa

Location

biobío, biobío

Posted

June 19, 2026

Position Overview

Empresa confidencial connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What this opportunity involves
  • We're building a dataset to evaluate AI coding agents — how well a model handles real-world developer tasks.
  • You’ll create challenging tasks and evaluation criteria within realistic simulated environments: Build virtual companies following a high-level plan - codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history.
  • Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair.
  • Design tasks set in isolated environments — emulations of a developer's workstation: a Linux machine with developmen...

Ready to Apply?

Join thousands of Americans building their careers

Apply Now