🇺🇸 USAJobs.work

America's Job Portal

← Back to USA Jobs

Pre-Training Text Data

Company

Microsoft Corporation

Location

Multiple Locations, United States

Posted

June 17, 2026

Position Overview

**Overview**

We are seeking engineers and researchers to join our Pretraining Text Data team, where we are building the next generation of foundation large language models. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you.

In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse text datasets critical to model development. You will lead efforts to:

+ Develop novel data collection strategies

+ Improve dataset quality and integrity

+ Understand data-driven model behaviors

+ Train models to understand the impact of data and data mixes

+ Align datasets with ethical and societal values

This is a cross-disciplinary, high-impact role ideal for engineers and researchers who want to push the boundaries of what AI can learn from data.

...

Ready to Apply?

Join thousands of Americans building their careers

Apply Now