Required Technical Expertise
Core Mastery Areas
- Expert level with deep architectural knowledge of NVIDIA data center platforms, including HGX and DGX platforms.
- GPU-accelerated compute architecture for AI and HPC workloads.
- High-performance networking architectures, especially with Spectrum-X.
- Large-scale AI factory and HPC platform design.
Storage Expertise
- Hands-on architectural experience with high-performance parallel or scale-out storage systems.
- Deep understanding of storage performance characteristics relevant to AI and HPC workloads, including bandwidth, IOPS, latency, and metadata scaling.
- Proven experience integrating storage platforms such as VAST Data, Netapp, WEKA, DDN, or Lustre into GPU-accelerated environments.
Working Proficiency
- NVIDIA Base Command Manager (BCM) for cluster lifecycle management and operations.
- Slurm for HPC workload scheduling and resource management.
- Run:AI for GPU orchestration and multi-tenant AI workload optimization.
- Kubernetes administration including deploying and managing GPU-accelerated AI and HPC workloads.
- Linux systems administration in large-scale, performance-sensitive environments.
- Containerized AI workflows and their interaction with schedulers and storage systems.
Additional Experience
- Experience optimizing existing HPC or AI platforms for performance, utilization, and cost efficiency.
- Prior experience with multi-site, air-gapped, or regulated environments is beneficial but not required.
- Experience with liquid cooling, power/cooling design, and data center integration strongly preferred.
Leadership & Influence
- Senior individual contributor role with influence through technical authority rather than people management.
- Ability to mentor engineers and architects through design reviews, architectural guidance, and technical leadership.
- Comfortable operating autonomously in complex, high-impact technical environments.
Documentation & Repeatability Expectations
- Develop and maintain high quality architectural documentation, including design blueprints, configuration guides, deployment validation reports, and operational runbooks.
- Ensure all technical artifacts meet WWT’s One Voice standards for clarity, completeness, and technical accuracy, enabling consistent delivery across teams.
- Create reusable templates, reference architectures, and standardized design patterns that accelerate future projects and improve delivery quality.
- Drive a culture of documentation discipline, ensuring that every deployment is reproducible, supportable, and aligned with architectural intent.
Educational/Experience Requirements
- Bachelor’s degree in a technical field or equivalent hands-on experience architecting large scale HPC or AI systems.on experience architecting large scale HPC or AI systems.
- Advanced degree (MS/PhD) in relevant fields is a plus but not required.
- Experience: 10+ years in HPC, Data Center Architecture, and/or Systems Engineering.
- Bare Metal Focus: A fundamental preference for, and understanding of, on-premises hardware constraints (power, cooling, cabling).
- Proven experience as a Senior, or Lead Architect or equivalent experience in AI projects.
Want to learn more about Solutions Consulting & Engineering? Check us out on our platform: https://www.wwt.com/community/scande/about
Certain states and localities require employers to post a reasonable estimate of salary range. A reasonable estimate of the current base pay range for this position is $215,000.00 to $245,000.00 annually. Actual salary will be based on a variety of factors, including shift, location, experience, skill set, performance, licensure and certification, and business needs. The range for this position in other geographic locations may differ. Certain positions may also be eligible for variable incentive compensation, such as bonuses or commissions, that is not included in the base pay.
The well-being of WWT employees is essential. So, when it comes to our benefits package, WWT has one of the best. We offer the following benefits to all full-time employees:
- Health and Wellbeing: Health, Dental, and Vision Care, Onsite Health Centers, Employee Assistance Program, Wellness program
- Financial Benefits: Competitive pay, Profit Sharing, 401k Plan with Company Matching, Life and Disability Insurance, Tuition Reimbursement
- Paid Time Off: PTO and Sick Leave (starting at 20 days per year) & Holidays (10 per year), Parental Leave, Military Leave, Bereavement
- Additional Perks: Nursing Mothers Benefits, Voluntary Legal, Pet Insurance, Employee Discount Program
We strive to create an environment where all employees are empowered to succeed based on their skills, performance, and dedication. Our goal is to cultivate a culture of belonging that encourages innovation, collaboration, and respect for all team members, ensuring that WWT remains a great place to work for All!
If you have any questions or concerns about this posting, please email [email protected].
#LI-AF1
#LI-Remote
Preferred QualificationsWhy WWT
At World Wide Technology, we work together to make a new world happen. Our important work benefits our clients and partners as much as it does for our people and communities across the globe. WWT is dedicated to achieving its mission of creating a profitable growth company that is also a Great Place to Work for All. We achieve this through our world-class culture, generous benefits, and by delivering cutting-edge technology solutions for our clients.
Founded in 1990, WWT is a global technology solutions provider leading the AI and Digital Revolution. WWT combines the power of strategy, execution, and partnership to accelerate digital transformational outcomes for organizations around the globe. Through its Advanced Technology Center, a collaborative ecosystem of the world's most advanced hardware and software solutions, WWT helps clients and partners conceptualize, test and validate innovative technology solutions for the best business outcomes and then deploys them at scale through its global warehousing, distribution and integration capabilities.
With over 14,000 employees across WWT and Softchoice and more than 60 locations around the world, WWT's culture, built on a set of core values and established leadership philosophies, has been recognized 14 years in a row by Fortune and Great Place to Work® for its unique blend of determination, innovation and creating a great place to work for all.
Want to work with highly motivated individuals on high-performance teams? Join WWT today!
What is the Solutions Consulting & Engineering Team and why join?
Solutions Consulting & Engineering is an organization that is customer-focused and solutions-led. We deliver end-to-end and emerging solutions to drive customer satisfaction and increase profitability and growth. Our world-class management consulting, delivery excellence, and engineering brilliance enable our success. We embody the OneWWT mindset by bringing the right talent at the right time from anywhere within WWT to solve our customer’s problems. Our goal is to bring together business acumen with full-stack technical know-how to develop innovative solutions for our clients’ most complex challenges.
Overview
The Principal Architect leads HPC AI focused Professional Services delivery engagements and cross functional technical teams on customer programs or projects. They are responsible for technical communications with WWT Engineers, Architects, and the customer for AI-driven projects. The Principal Architect may participate in several Customer projects concurrently, integrating AI solutions with enterprise IT systems.
Role Summary
The Principal Architect will be at the epicenter of the AI revolution, working with the most advanced hardware on the planet. Whether you're helping a research facility unlock new scientific breakthroughs or an enterprise to build its first private AI cloud, your fingerprints will be on the infrastructure that defines the next decade of technology.
The right person for the job is a senior individual contributor responsible for designing, implementing, and optimizing large-scale High-Performance Computing and AI platforms centered on the NVIDIA data center ecosystem. This role operates in a hybrid capacity, combining hands-on technical architecture with selective customer-facing advisory responsibilities.
The architect serves as a technical authority across GPU-accelerated compute, high-performance networking, and modern parallel storage platforms, influencing architectural standards and delivery outcomes while ensuring successful, on-time, and on-budget customer deployments without escalations.
This is a remote work from home position, with an average travel expectation of approximately 10%, and a willingness for additional travel during peak project phases or critical customer engagements.
Key Responsibilities
Architecture and Design
- Lead the end-to-end architecture of GPU-accelerated HPC and AI platforms, including greenfield AI factory designs and optimization of existing HPC environments.
- Architect integrated solutions spanning Compute, Networking, and Storage using NVIDIA HGX and DGX platforms, Grace CPU architectures, Spectrum-X networking, and high-performance parallel storage systems.
- Design storage architectures optimized for AI training, inference, and HPC workloads, balancing performance, scalability, resiliency, and cost.
- Define reference architectures, design patterns, and best practices for repeatable and supportable customer deployments.
Platform Implementation and Optimization
- Provide hands-on technical leadership during implementation phases, including cluster bring-up, performance tuning, and workload optimization.
- Architect and integrate workload orchestration and scheduling platforms using NVIDIA Base Command Manager, Slurm, Kubernetes and Run:AI.
- Optimize end-to-end data pipelines, including GPU utilization, storage throughput, metadata performance, and job scheduling efficiency.
- Troubleshoot performance bottlenecks across Compute, Networking, and Storage.
Storage Architecture & Data Performance
- Design and validate high-performance storage solutions using modern parallel and scale-out storage platforms.
- Demonstrate hands-on experience with at least one of the following storage technologies
- VAST Data
- WEKA
- DDN
- Lustre
- Netapp
- Architect storage solutions that support demanding AI and HPC workloads, including high-throughput training pipelines, checkpointing, and large-scale shared datasets.
- Collaborate with compute and networking design to ensure balanced, bottleneck-free architectures.
Technical Authority and Advisory
- Act as a senior technical authority for HPC and AI architecture across internal teams and customer engagements.
- Participate selectively in customer-facing discussions to validate architecture and delivery plans, with a primary focus on design integrity and execution rather than pre-sales.
- Influence platform standards, architectural direction, and technical decision-making through expertise and demonstrated execution.
Delivery Excellence
- Identify technical risks early across Compute, Networking, Storage, and orchestration layers, and drive mitigation strategies.
- Partner with the PMO counterpart to resolve Risks and Issues upon identification and to ensure production-ready, supportable platforms.
- Ensure staff, contractors, and partners adhere to WWT best practices and templates for AI solution delivery.
- Review deployment documents, technical assessments, and other outputs to ensure consistency and accuracy, aligning with AI and "One Voice" standards.
Top Skills
Similar Jobs
What you need to know about the Colorado Tech Scene
Key Facts About Colorado Tech
- Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
- Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
- Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
- Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
- Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute


