Data Annotation Specialist for AI Model Alignment

agi, inc. United State
Relocation
Apply
AI Summary

We're seeking an experienced data annotator to help train and align our universal AI agents. You'll evaluate agent behavior, provide nuanced feedback, and work directly with ML researchers to refine guidelines and build datasets. This role requires strong judgment, attention to detail, and excellent communication skills.

Key Highlights
Evaluate agent trajectories and provide structured feedback
Collaborate with ML researchers to refine annotation guidelines and surface patterns in model failures
Build benchmark datasets to drive measurable performance improvements
Key Responsibilities
Evaluate agent trajectories: Review and annotate agent behavior across desktop, web, and mobile environments
Judge decision quality: Evaluate agent decision-making at each step
Provide structured feedback: Deliver qualitative feedback on agent behavior
Technical Skills Required
Data annotation Agent-based trajectory evaluation Reinforcement learning data Model alignment concepts Model quality evaluation
Benefits & Perks
Competitive company-sponsored medical, dental, and vision insurance
Top-tier relocation and immigration support
Ship by default - speed and polish can coexist

Job Description


Think Different. Build the Future. 🚀

Our Mission

Build everyday AGI. Trustworthy, consumer-grade agents that redefine human–AI collaboration for millions. Software shouldn’t wait for commands; it should partner with you, amplifying what you can do every single day.

Why AGI, Inc.

We’re a stealth team of elite founders and AI researchers, with backgrounds spanning Stanford, OpenAI, and DeepMind. We’re industry leaders in mobile and computer-use agents, bringing these capabilities to consumer scale.

Grounded in years of agent research, our AI is designed with trustworthiness and reliability as core pillars, not afterthoughts.

We are supported by tier-1 investors who funded the first generation of AI giants; now they’re backing us to build the next: everyday AGI. (Watch the demo)

If you see possibility where others see limits, read on.

About The Role

We're looking for experienced data annotators to help train and align our universal AI agents. You'll be evaluating agent behavior across computer and mobile interfaces, providing the nuanced feedback that shapes how our models learn and improve.

This is not a mechanical task. Your judgment defines what "helpful," "safe," and "aligned" mean in practice. You'll work directly with ML researchers to refine guidelines, surface failure patterns, and build the datasets that drive our next breakthrough.

You'll focus on quality, consistency, and insight, ensuring every annotation moves us closer to agents people can trust.

What You'll Do

Evaluate agent trajectories: Review and annotate agent behavior across desktop, web, and mobile environments — labeling actions, identifying failure modes, and assessing task completion quality.

Judge decision quality: Evaluate agent decision-making at each step: Was this the right action? Was it efficient? Did it align with user intent?

Provide structured feedback: Deliver qualitative feedback on agent behavior, including edge cases, reasoning errors, and UI misinterpretations.

Collaborate with researchers: Work directly with ML researchers to refine annotation guidelines, surface patterns in model failures, and inform training priorities.

Build benchmark datasets: Contribute to high-quality datasets that drive measurable performance improvements.

Minimum Qualifications

  • 1+ years of experience annotating agent-based trajectories, reinforcement learning data, or similar sequential decision-making tasks
  • Strong understanding of what constitutes model quality — you can distinguish between a correct action, a suboptimal action, and a hallucinated one
  • Familiarity with model alignment concepts: helpfulness, harmlessness, honesty, and how annotation choices influence model behavior
  • Track record of working alongside researchers — you're comfortable with ambiguity, can propose annotation schema improvements, and understand how your work feeds into training pipelines
  • Excellent attention to detail and ability to maintain consistency across high volumes of data
  • Clear written communication skills for documenting edge cases and providing actionable feedback

Why This Role Matters

Models learn from data. Data quality determines model quality. Your annotations are the ground truth.

You will directly shape how our agents behave — what they prioritize, how they reason, and whether users trust them. The patterns you identify and the feedback you provide will inform the next generation of training runs.

Our Culture

🏢 All in, in person — work moves faster face-to-face

🚀 Ship by default — speed and polish can coexist

🤝 One band, one sound — radical candor, zero politics

Perks

🏥 Competitive company-sponsored medical, dental, and vision insurance

✈️ Top-tier relocation and immigration support

How To Apply

Send us:

  • A link — or 60-second video — of something you built and why it mattered
  • Your resume or LinkedIn
  • Two sentences on the hardest challenge you've solved

Every exceptional candidate hears back within 48 hours.

If you see possibility where others see limits, we'd love to meet you.


Similar Jobs

Explore other opportunities that match your interests

Loads & Dynamics Engineer / Principal Loads & Dynamics Engineer

Programming
36m ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Northrop Grumman

United State

CPU Performance Management Firmware Developer

Programming
49m ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Qualcomm

United State

Senior Backend Software Engineer

Programming
1h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Flex

United State

Subscribe our newsletter

New Things Will Always Update Regularly