Summa Linguae Technologies

Technical Project Manager - AI Data Processing

Posted 4 Hours Ago
Remote
Hiring Remotely in United States
Mid level
The Technical Project Manager will coordinate data deliveries, automate validation processes, and work closely with engineering and operations teams, ensuring quality compliance of datasets.
The summary above was generated by AI

Technical Project Manager - AI Data Processing 

Work model: Remote (covering PST hours) - Based in PST timezone
Employment type: Full-time (Contract or Employment)

About DATAmundi 

DATAmundi builds advanced software solutions that power our localization and data services. We support AI companies and research teams by delivering high-quality datasets, validation workflows, and scalable data processing. Our R&D initiatives explore how modern AI systems — including LLMs, speech models, and multimodal systems — can be evaluated, improved, and safely deployed through structured data and validation methodologies. 

We are expanding our R&D activities and seeking researchers to collaborate on applied research and technical outreach within the AI ecosystem. 

About the role 

We’re hiring a Technical Project Manager with a hands-on data engineering skillset to support Data Processing for AI. You’ll translate client requirements into executable validation logic for data processing workflows, support data post-processing, and help ensure delivered data is consistent, measurable, and within client metrics and guidelines.

This role sits at the intersection of technical delivery, data QA, and automation. You’ll work closely with development teams and support the Operations Team while also writing and maintaining lightweight scripts and queries that turn requirements into quality assurance checks.

 

What you’ll do 

  • Own the end-to-end coordination of data deliveries, from intake to validation and handoff. 
  • Work with client data delivered via S3 buckets or direct uploads; ensure correct structure, completeness, and readiness for downstream use. 
  • Translate client guidelines into automated validation using SQL, regex, and supporting scripts.
  • Create and maintain Python utilities to compute quality and consistency metrics such as:
      • WER (Word Error Rate)
      • IAA (Inter-Annotator Agreement)
      • Additional dataset-level metrics as required
  • Use Windows Command Prompt for bulk file operations (creating/moving/downloading folders and files) to support processing and delivery workflows. 
  • Partner with internal development teams by writing Jira tickets for platform improvements and bug fixes (requirements, steps to reproduce, acceptance criteria). 
  • Quickly ramp on internal platforms and configuration logic (e.g., worktypes / templates), advising on setup patterns and tradeoffs. 
  • Investigate issues by querying datasets through database tools and producing clear summaries of findings and next steps. 
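
As an illustration of the kind of Python utility described above, here is a minimal sketch of a WER calculation. It assumes whitespace tokenization and no text normalization (case and punctuation are compared as-is); the function name `word_error_rate` is hypothetical and not taken from any DATAmundi tooling.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count.

    Assumption: words are split on whitespace and compared exactly.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

In practice, client guidelines usually define how text is normalized before scoring, so that step would need to be added to match each project's rules.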

 

What we’re looking for 

  • Years of experience in technical delivery / project coordination in a data environment (data, analytics, ML, QA automation, or platform operations).
  • Practical comfort with:
      • Python (scripting for metrics and data validation workflows)
      • SQL (queries used in automated checks)
      • Regex (pattern-based validation)
      • Command line / Windows CMD (bulk file operations)
  • Strong written communication and the ability to convert fuzzy requirements into precise, testable checks. 
  • Experience working with engineering teams and using tools like Jira to drive execution. 
  • Ability to evaluate options and recommend an approach based on pros/cons, timelines, and maintainability. 
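
To make "converting fuzzy requirements into precise, testable checks" concrete, here is a small sketch of pattern-based validation. The naming rule it enforces (`spk<4 digits>_s<2 digits>_<locale>.wav`) is an invented example of a client guideline, and `invalid_filenames` is a hypothetical helper name.

```python
import re

# Assumed example guideline: files are named like "spk0001_s01_en-US.wav".
FILENAME_RE = re.compile(r"spk\d{4}_s\d{2}_[a-z]{2}-[A-Z]{2}\.wav")


def invalid_filenames(names):
    """Return the file names that do not fully match the expected pattern."""
    return [name for name in names if FILENAME_RE.fullmatch(name) is None]


if __name__ == "__main__":
    delivered = [
        "spk0001_s01_en-US.wav",
        "spk12_s01_en-US.wav",    # speaker ID too short
        "spk0002_s03_en-us.wav",  # region code not upper case
    ]
    print(invalid_filenames(delivered))
```

Using `fullmatch` rather than `search` means the whole name must conform, which avoids accepting files with unexpected prefixes or suffixes.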

 

Nice to have 

  • Experience with speech/audio or text datasets (given WER and annotation agreement use cases). 
  • Familiarity with cloud data workflows (especially AWS S3 concepts like buckets, prefixes, access patterns). 
  • Experience with data labeling/annotation workflows and quality frameworks. 
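
For the annotation agreement use case, one common IAA measure for two annotators is Cohen's kappa. The sketch below is one possible way to compute it; the posting does not say which agreement metric is used, so the choice of kappa and the function name `cohens_kappa` are assumptions.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labeling the same items.

    kappa = (observed agreement - chance agreement) / (1 - chance agreement)
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")

    n = len(labels_a)
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement from each annotator's own label distribution.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)

    if expected == 1.0:
        # Both annotators used the same single label for every item.
        return 1.0
    return (observed - expected) / (1 - expected)


if __name__ == "__main__":
    print(cohens_kappa(["y", "y", "n", "n"], ["y", "n", "n", "n"]))
```

With more than two annotators, a different statistic (for example Fleiss' kappa or Krippendorff's alpha) would be needed.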

Top Skills

AWS S3
Jira
Python
Regex
SQL
Windows Command Prompt
