AllCloud

LLM Architect

Reposted 12 Days Ago
Remote
Hiring Remotely in United States
Senior level
The LLM Architect will design and develop custom language models, optimize transformer architectures, implement training methodologies, and collaborate with engineers to create advanced AI models.
The summary above was generated by AI
Description

LLM Architect

Location: US / Canada (Eastern Time) - Home-based

Job Type: Full-time, Permanent 

About AllCloud

AllCloud is a global professional services company providing organizations with cloud enablement and transformation tools. As an AWS Premier Consulting Partner and audited MSP, a Salesforce Platinum Partner, and a Snowflake Premier Partner, AllCloud helps clients connect their front and back offices by building a new operating model to harness the benefits of cloud technology and data and analytics.

Job Summary

We are looking for an innovative LLM Architect to lead the design and development of custom language models at AllCloud. This role will be responsible for architecting, training, and optimizing large language models based on modified transformer architectures. The ideal candidate will have deep expertise in NLP, transformer model design, and efficient training methodologies. You'll work alongside GPU Engineers and ML Engineers to create state-of-the-art language models that meet our customers' specific requirements, pushing the boundaries of what's possible with generative AI.

Responsibilities

  • Design custom transformer-based language model architectures tailored to specific use cases
  • Develop and implement modifications to transformer architectures to enhance performance, efficiency, or capabilities
  • Create and execute model pre-training, fine-tuning, and evaluation strategies
  • Implement techniques like quantization, pruning, and knowledge distillation to optimize model size and performance
  • Design and implement training data pipelines, including data selection, cleaning, and augmentation
  • Establish rigorous evaluation frameworks to assess model performance, fairness, and safety
  • Research and implement state-of-the-art techniques in LLM development
  • Create detailed documentation on model architectures, training methodologies, and performance characteristics
  • Collaborate with GPU Engineers to implement efficient training strategies across distributed systems
  • Work with customers to understand their unique requirements and translate them into model design decisions

Requirements

Summary of Key Requirements

  • 4+ years of experience in deep learning research or development with a focus on NLP and transformer models
  • Strong understanding of transformer architecture and its variants (GPT, BERT, T5, etc.)
  • Experience designing and training large language models from scratch
  • Expertise in PyTorch or TensorFlow for implementing custom model architectures
  • Knowledge of distributed training approaches for large models (DeepSpeed, Megatron, etc.)
  • Experience with model compression techniques (quantization, pruning, knowledge distillation)
  • Strong background in mathematics, particularly linear algebra, differential equations, probability, and statistics
  • Familiarity with current research in LLM development, including attention mechanisms, mixture of experts, and efficient training methods
  • Master's or PhD in Computer Science, Machine Learning, or related field
  • Publication record in NLP, LLMs, or transformer architecture (strongly preferred)

Certifications

  • AWS Machine Learning Specialty (Strongly Preferred)
  • NVIDIA-Certified Associate - Generative AI Multimodal (Preferred)

Why work for us? 

Our team inspires progress in each other and in our customers through our relentless pursuit of excellence; you will work with leaders who promote learning and personal development.


AllCloud is an Equal Opportunity Employer and considers applicants for employment without regard to race, color, religion, sex, orientation, national origin, age, disability, genetics or any other basis forbidden under federal, provincial, or local law.


Top Skills

DeepSpeed
Megatron
NLP
PyTorch
TensorFlow
Transformer Models
HQ

AllCloud Denver, Colorado, USA Office

1624 Market St, Suite 226, Denver, Colorado, United States, 80202
