P-1 AI

Software QA Engineer - AI

Posted 3 Days Ago
Remote
Hiring Remotely in United States
200K-250K Annually
Mid level
The Software QA Engineer - AI will design and implement evaluation systems for AI engineering performance, ensuring progress and preventing regressions while collaborating with cross-functional teams.
The summary above was generated by AI

About you:

  • have done something remarkable, and have undeniable real-world proof-of-talent you can share with us

  • go from 0 → 1 on an idea before breakfast

  • always learning

  • believe in manifesting the future of physical engineering

About us:

We are building an engineering AGI. We founded P-1 AI with the conviction that the greatest impact of artificial intelligence will be on the built world. Our first product is Archie, an AI engineer capable of quantitative intuition over physical product domains and engineering tool use. Archie initially performs at the level of an entry-level design engineer but rapidly gets smarter and more capable. We aim to put an Archie on every engineering team at every industrial company on earth.

Our founding team includes top minds in deep learning, model-based engineering, and the industries that are our customers. We closed a $23 million seed round led by Radical Ventures, with participation from a number of other AI and industrial luminaries (from OpenAI, DeepMind, etc.).

This role can be either remote (based in the US or Canada, with existing work authorization) or based in our San Mateo Bay Area office. If you are remote, you should plan to spend one week out of six co-working with the rest of the company in our San Mateo office. We will support relocation for candidates interested in moving to the Bay Area.

In summary:

  • we are on a mission

  • multiple hats is the norm

  • no politics, low bureaucracy

  • fast, data-driven decision-making; velocity and agility are everything

  • believe in manifesting the future of physical engineering

About the role:

We are a small team tackling an ambitious problem. If we are successful, it will change the course of history. As such, we have a very high talent bar and are looking for people who have done something remarkable.

This role owns the testing and evaluation systems that define whether Archie is actually becoming a better engineer. You will design, implement, and operate the evals that benchmark Archie against real-world engineering skill expectations, ensure it is learning the right things, and prevent regressions as the system evolves.

You will work closely with AI researchers, software engineers, domain experts, and industrial partners to translate engineering judgment into scalable, automated evaluation frameworks. Your work will directly shape how we measure progress toward engineering AGI.

We don’t care if you’ve done it before. We just need you to be brilliant, mission-driven, and thirsty to learn.

This role can be either remote (based in the US or Canada and with existing work authorization) or based in our SF office. If you are remote, you should plan to spend one week out of six co-working with the rest of the company in our SF office. We will support relocation for candidates interested in moving to SF.

Compensation:

$200k - $250k… for now. This role includes a significant equity component. We are an early-stage startup, so we favor equity over cash in our current compensation philosophy. You should too, or an early-stage startup might not be for you. That said, we expect cash compensation to progress quickly as the company matures.

Our benefits include healthcare, dental, and vision insurance, a 401(k) with employer matching, and unlimited PTO.

Interview process:

  • Initial screening call (30 mins)

  • Biographical/behavioural interview (45 mins)

  • Technical interview (60 mins)

  • CEO interview (30 mins)

Top Skills

AI
Automation
Evaluation Frameworks
Software Engineering
