πŸ‡ΊπŸ‡Έ USAJobs.work

America's Job Portal

← Back to USA Jobs

Generative ai evaluator | $30/hr remote

Company

Crossing Hurdles

Location

Remote, Remote

Posted

June 19, 2026

Position Overview

Type: Hourly contract

Compensation: $20–$30/hour

Location: Remote

Commitment: 10–40 hours/week

Role Responsibilities Evaluate outputs from large language models and autonomous agent systems using defined rubrics and quality standards. Review multi-step agent workflows, including screenshots and reasoning traces, to assess accuracy and completeness. Apply benchmarking criteria consistently while identifying edge cases and recurring failure patterns. Provide structured, actionable feedback to support model refinement and product improvements. Participate in calibration sessions to ensure consistent evaluation alignment across reviewers. Adapt to evolving guidelines and ambiguous scenarios with sound judgment. Document findings clearly and communicate insights to relevant stakeholders.

Requirements Strong experience in LLM evaluation, AI output analysis, QA/testing, UX research, or similar analytical roles. Proficiency in rubric-based scoring, benchmarking frameworks, an...

Ready to Apply?

Join thousands of Americans building their careers

Apply Now