xAI Logo

xAI

RL Environments Specialist

Reposted 4 Days Ago
Be an Early Applicant
Easy Apply
Remote
Hiring Remotely in USA
100-200 Hourly
Mid level
Easy Apply
Remote
Hiring Remotely in USA
100-200 Hourly
Mid level
Create full reinforcement learning environments, including UI and backend, and manage task creation and validation processes for training AI agents.
The summary above was generated by AI
About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the Role

We need talented engineers that will create full RL environments (UI, backend, programmatically generate tasks and validation) for training computer use agents. This means that we need you to take ownership of the entire task creation process for a given environment.

In this role, you will
  • Build sandbox UIs that our agents and RL actors will interact with.
  • Create tasks for built environments and programmatically validate task completion.
  • Enjoys working remotely
Qualifications
  • Strong professional experience with React.js (hooks, modern state management, TypeScript preferred) — required
  • Strong professional experience building backend services in Python (FastAPI, Flask, or Django) — required
  • Hands-on experience with containerization (Docker required; Docker Compose/Kubernetes a plus)
  • Strong front-end design skills and exceptionally high taste in UI/UX, polish, and visual detail
  • Proven ability to design a relational database schema in Python and populate it with large-scale, realistic mock data
  • Experience creating and exposing clean, well-documented API endpoints (REST or GraphQL)
  • Exceedingly high standards for code quality, readability, testing, and front-end craftsmanship
  • Extensive day-to-day experience using coding agents / AI assistants as a power user (Cursor, Claude, Copilot, Grok, Aider, etc.)
  • Good understanding of the Reinforcement Learning paradigm (RLHF, PPO, DPO, reward modeling, etc.)
Preferred Qualifications
  • Posses strong logical reasoning skills, is detail-oriented, and thrives in a fast-paced work environment.
  • Eager to teach to and learn from teammates.
  • Enthusiasm to collaboratively build the best truth-seeking AI out there!
Interview Process
  1. Technical hands-on live coding round
  2. Hiring Manager / Final interview round
Compensation and Benefits
  • The pay for this role may range from USD $35/hour - $100/hour.
  • Your actual pay will be determined on a case-by-case basis and may vary based on the following considerations: location, job-related knowledge and skills, education, and experience.
  • Top performers may be considered for MTS positions within xAI.

xAI is an equal opportunity employer.

California Consumer Privacy Act (CCPA) Notice

Top Skills

Containerization
Python

Similar Jobs

An Hour Ago
Remote or Hybrid
United States
181K-336K Annually
Expert/Leader
181K-336K Annually
Expert/Leader
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
The VP of Demand Programs will lead strategies to improve sales performance and productivity across SailPoint's global teams, aligning initiatives with corporate goals and fostering a high-performing team culture.
Top Skills: Analytics PlatformsExcelPowerPoint
An Hour Ago
Remote or Hybrid
United States
105K-195K Annually
Senior level
105K-195K Annually
Senior level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
The Vulnerability Management Team Lead will oversee vulnerability assessments, remediation processes, and collaboration across IT teams to enhance security practices in production environments.
Top Skills: AWSJIRAKubernetesPowershellPythonSIEMSoar
An Hour Ago
Remote or Hybrid
United States
129K-240K Annually
Senior level
129K-240K Annually
Senior level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
Seeking a Staff Backend Software Engineer to develop Java-based microservices for a cloud-based SaaS identity analytics product. Responsibilities include design, implementation, and testing of features and participation in on-call support.
Top Skills: AuroraAWSDynamoDBHibernateJavaKafkaMySQLPostgresRedisSQL

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account