AI Engineer Evaluation Systems Developer

P-1 AI United State
Remote Relocation
Apply
AI Summary

We are seeking a talented AI Engineer to develop and maintain evaluation systems for our engineering AGI, Archie. The ideal candidate will design, implement, and operate evals that benchmark Archie against real-world engineering skill expectations. This role requires a brilliant, mission-driven individual with a thirst to learn and a passion for manifesting the future of physical engineering.

Key Highlights
Develop and maintain evaluation systems for AI engineer AGI
Design and implement evals that benchmark against real-world engineering skill expectations
Collaborate with AI researchers, software engineers, and domain experts
Technical Skills Required
Python Machine Learning Deep Learning Model-based Engineering
Benefits & Perks
$200-$250k annual salary
Significant equity component
Healthcare, dental, and vision insurance
401k with employer matching
Unlimited PTO
Remote work option
Relocation package

Job Description


About You

  • have done something remarkable, and have undeniable real-world proof-of-talent you can share with us
  • go from 0 → 1 on an idea before breakfast
  • always learning
  • believe in manifesting the future of physical engineering

About Us

We are building an engineering AGI. We founded P-1 AI with the conviction that the greatest impact of artificial intelligence will be on the built world. Our first product is Archie, an AI engineer capable of quantitative intuition over physical product domains and engineering tool use. Archie initially performs at the level of an entry-level design engineer but rapidly gets smarter and more capable. We aim to put an Archie on every engineering team at every industrial company on earth.

Our founding team includes the top minds in deep learning, model-based engineering, and industries that are our customers. We closed a $23 million seed round led by Radical Ventures that includes a number of other AI and industrial luminaries (from OpenAI, DeepMind, etc.).

In Summary

  • we are on a mission
  • multiple hats is the norm
  • no politics, low bureaucracy
  • fast, data-driven decision-making; velocity and agility are everything
  • believe in manifesting the future of physical engineering

About The Role

We are a small team tackling an ambitious problem. If we are successful, it will change the course of history. As such, we have a very high talent bar and are looking for people who have done something remarkable.

This role owns the testing and evaluation systems that define whether Archie is actually becoming a better engineer. You will design, implement, and operate the evals that benchmark Archie against real-world engineering skill expectations, ensure it is learning the right things, and prevent regressions as the system evolves.

You will work closely with AI researchers, software engineers, domain experts, and industrial partners to translate engineering judgment into scalable, automated evaluation frameworks. Your work will directly shape how we measure progress toward engineering AGI.

We don’t care if you’ve done it before. We just need you to be brilliant, mission-driven, and thirsty to learn.

This role can be either remote (based in the US or Canada and with existing work authorization) or based in our SF office. If you are remote, you should plan to spend one week out of six co-working with the rest of the company in our SF office. We will support relocation for candidates interested in moving to SF.

Compensation

$200 - $250k… for now. This role includes a significant equity component. We are an early-stage startup, so we favor equity over cash in our current compensation philosophy. You should too, or an early-stage startup might not be for you. That said, we expect cash compensation to progress quickly as the company matures.

Our benefits include healthcare, dental, and vision insurance, 401k with employer matching, and unlimited PTO.

Interview Process

  • Initial screening call (30 mins)
  • Biographical/behavioural interview (45 mins)
  • Technical interview (60 mins)
  • CEO interview (30 mins)

Similar Jobs

Explore other opportunities that match your interests

Senior Test Engineer

Testing
11m ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Jobs via Dice

United State

Principal Test Engineer / Senior Principal Test Engineer

Testing
14m ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

TalentAlly

United State

Manual QA Tester for Mobile Platforms

Testing
5h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Wiraa

United State

Subscribe our newsletter

New Things Will Always Update Regularly