Candidly Logo

Candidly

Senior Infrastructure Engineer

Posted 11 Hours Ago
Easy Apply
Remote
Hiring Remotely in USA
Senior level
Easy Apply
Remote
Hiring Remotely in USA
Senior level
Lead the design and evolution of cloud infrastructure, enforce best practices, automate SDLC processes, and improve systemic issues in a collaborative environment.
The summary above was generated by AI

Company Description

Candidly was founded in 2016 to flip the script on what it means to plan, borrow, repay, and save for college. Today, we’re the category leader with the market’s most comprehensive AI-driven student debt and savings optimization platform. We partner with hundreds of top employers, financial institutions, and retirement record keepers, positioning Candidly to serve more than 35 million Americans. 

We’re already achieving incredible results — to date, we’ve helped our users get on track to eliminate more than $1.8B in student debt and pay off their loans 175,000 years quicker — and we’re seeking movers, shakers, innovators, and problem solvers to help take our mission even further. 

Candidly is a high-growth, Series B startup, funded by leading investors including Altos Ventures, Aflac, Salesforce Ventures, UBS, Equal Opportunity Ventures, Impact Engine, Rethink Impact, Unum, and Cercano Management. Our fully remote, international team of 70 (and counting) includes alumni from Google, UBS, Twitter, Plaid, Prudential, LendingTree, Morgan Stanley, Deutsche Bank, and more.

Job Description

We're looking for a Senior Infrastructure Engineer to lead the design, implementation, and ongoing evolution of our cloud infrastructure. This is a role for someone who brings both technical depth and genuine curiosity — someone who sees an opportunity to improve something and goes after it, who takes problems end-to-end, and who makes the engineers around them better just by being present.

You'll partner closely with engineering leadership and cross-functional teams to ensure our infrastructure strategy stays ahead of our product ambitions. You'll also be a key voice in shaping how we build — our standards, our reliability posture, and our operational culture.

What You'll Do

  • Lead the design and implementation of scalable, secure, and resilient cloud infrastructure across AWS and Azure, supporting both Candidly's AI and SaaS products
  • Drive the architectural vision and strategy, ensuring alignment with long-term business goals and surfacing risks before they become problems
  • Own and enforce best practices for infrastructure as code (IaC), CI/CD, and automated deployments
  • Take the lead on automating and accelerating SDLC processes — identifying bottlenecks in how we build and ship, and designing solutions that make the whole pipeline faster and smoother, whether that involves AI-assisted tooling or traditional automation
  • Serve as a subject matter expert on cloud architecture, containerization, and observability
  • Lead incident response and post-mortems with a focus on systemic improvement, not just immediate fixes
  • Proactively identify and close gaps before they compound

Who You Are

  • You take ownership naturally — you see what needs to happen and you make it happen, and when you commit to something, it gets done
  • You think and communicate beyond your immediate scope. You invest in the people around you, take informed positions on technical direction, and make the team stronger — not just the codebase
  • You bring genuine energy and drive. You're energized by ambiguity, you raise the bar for yourself and others, and you don't settle for "good enough" when better is possible

Required Skills and Experience

  • 5–7+ years of hands-on experience in cloud infrastructure, DevOps, or Site Reliability Engineering (SRE)
  • Expert-level AWS knowledge: EC2, ELB, ASG, RDS, S3, SQS, Lambda, IAM, VPC, CloudFormation, CDK, and Route 53
  • Solid understanding of Linux administration and networking concepts (VPNs, VPC peering, NAT, DNS, firewalls)
  • Deep experience with Infrastructure as Code (IaC) for container orchestration at scale using CloudFormation, AWS CDK, Docker, and Kubernetes
  • Excellent written and verbal English communication — you can translate tradeoffs for engineers and stakeholders alike
  • Comfortable working remotely and independently
  • Moderate-level Azure experience, with comfort operating across multi-cloud and multi-region environments
  • Expertise in CI/CD pipelines (AWS CodePipeline, GitLab CI, or similar)
  • Strong proficiency in scripting and automation using Python, Bash, and Ansible
  • Deep understanding of monitoring and logging with systems and strategies (AWS Cloudwatch, Datadog, Azure Monitor)
  • Hands-on experience with high-availability architectures and auto-scaling strategies
  • Solid grasp of AWS security best practices: IAM, encryption, Secrets Manager, and security auditing
  • Experience with databases (MySQL, Postgres, Redshift)
  • Familiarity with serverless architectures (AWS Lambda, Fargate)
  • Knowledge of database replication strategies

Preferred Skills and Experience

  • Experience in a high-growth, regulated industry (Fintech), specifically in architecting and scaling infrastructure to maintain reliability and compliance under rapid user and transaction volume growth
  • Proven technical leadership experience, including improving infrastructure processes
  • AWS Professional-level certifications (Solutions Architect Professional, DevOps Engineer Professional)
  • Knowledge of event-driven architectures (SNS, SQS, EventBridge)
  • Experience in cost optimization strategies for AWS environments
  • Experience in compliance frameworks (SOC, NIST, ISO, CCPA, GDPR)
  • Experience integrating AWS services with third-party tools for observability and security
  • Experience with single-tenant and multi-tenant architectures, as well as client on-premises deployment systems
  • Familiarity with scalable, reproducible ML pipelines, tools, and frameworks (Kubeflow, MLflow, Amazon SageMaker)
  • Data engineering experience and large-scale data processing and storage
  • Knowledge of Jira and Confluence, including best practices for ticket management and KPI tracking

Background and EEOC

Candidly offers for employment are conditioned upon satisfactory completion of our employment screening process (including, but not limited to, a review of past employment and education records, background investigation, and/or credit check & fingerprints).

Candidly strives to foster an environment where every employee can succeed. As an Equal Opportunity Employer we do not discriminate on the basis of race, religion, color, sex, sexual orientation, gender identity, gender expression, national origin, age, non-disqualifying physical or mental disability, veteran status, or any other basis covered by applicable law. All employment is decided on the basis of qualifications, merit, and business need.

Top Skills

Ansible
AWS
Aws Cdk
Aws Cloudwatch
Aws Codepipeline
Azure
Azure Monitor
Bash
CloudFormation
Datadog
Docker
Gitlab Ci
Kubernetes
Python

Similar Jobs

4 Days Ago
Remote
USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Artificial Intelligence • Information Technology • Marketing Tech • Software • SEO
The Senior Infrastructure Engineer will design, build, and operate crucial cloud infrastructure and developer tools, focusing on observability, reliability, and performance in a high-velocity software environment.
Top Skills: GCPPythonTerraform
5 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will lead security design and implementation for cloud infrastructures, mentor teams, and automate security solutions.
Top Skills: AnsibleAWSAzureCloud Security ToolsCloudFormationGCPGoTerraform
5 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
The Security Software Engineer will design and implement security controls for MongoDB Atlas, collaborating across engineering teams and ensuring adherence to high security standards.
Top Skills: ApparmorC/C++CgroupsEbpfGoGrafanaJavaKubernetesPythonRustSeccompSelinuxSplunkTerraformVictoria Metrics

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account