The Senior Site Reliability Engineer keeps SaaS products fast and stable through automation, collaboration, monitoring, and AI tools that enhance performance and reliability.
Summary:
We are looking for technical team members at all levels who want to push themselves to deliver best-in-market SaaS solutions. We offer a challenging environment where you will have to grow, adapt and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise.
The Senior Site Reliability Engineer is responsible for ensuring our SaaS products are fast, stable and optimized for our customers. SREs at DFIN take on availability, performance, change management, monitoring and incident response, and are guardians of non-functional requirements.
You either have a SaaS infrastructure background with a programmatic, automated mindset, or you come from a software engineering background with SaaS infrastructure experience. The SRE goal is to build automated systems that reduce or eliminate manual work to keep our products up and running and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and can operate independently to deliver solutions.
Responsibilities:
• Champion and implement a culture of SRE to maintain a high-quality platform infrastructure in DFIN SaaS products
• Leverage AI tools to enhance system reliability, including intelligent observability, incident prediction and automated remediation across cloud infrastructure
• Evaluate and implement emerging AI-powered operations and observability solutions to proactively improve system performance, reliability and scalability
• Champion and implement application and infrastructure monitoring and alerting to prevent client-impacting issues by ensuring system availability, performance and scalability to maintain SLOs and SLAs
• Optimize application performance at scale
• Automate everything, including system operational runbooks
• Define and support continuous integration and deployment (CI/CD) pipelines aligned to branching and quality assurance strategies
• Dive deep into technology and stay at the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into work processes
• Perform with broad independence and deliver on project milestones and tasks on schedule while communicating progress regularly
• Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations
• Learn continuously and apply lessons learned
• Evangelize best practices, eliminate bottlenecks, and improve processes
• Participate in 24/7/365 on-call duties and lead the triage and RCA of production incidents
Qualifications:
• 5+ years of experience designing, building, securing, monitoring and maintaining cloud infrastructure in Azure or AWS
• Experience applying AI capabilities within CloudOps operations
• Relevant certifications or training in AI, Cloud AI services or AIOps platforms are a plus
• 5+ years of experience writing software in any modern software language such as C# .NET or Java
• 5+ years of experience creating automated deployments with tools such as Harness, Azure DevOps, Ansible or Jenkins to manage Infrastructure as Code and software build and deployment in a continuous integration (CI) / continuous delivery (CD) environment
• 5+ years of experience implementing production performance, availability, and scalability monitoring and alerting using a tool such as New Relic, Dynatrace, Datadog or AppDynamics
• 5+ years of experience writing scripts in PowerShell or Python/Bash to automate system operations as runbooks for Windows or Linux environments
• 5+ years of experience supporting public, client-facing, revenue-generating systems
• Strong DevOps focus and experience building and deploying Infrastructure as Code with Terraform or similar technology
• Experience monitoring and preventing issues with databases and database queries (SQL, Cosmos) using tools like SolarWinds Database Performance Analyzer, Idera SQL Diagnostic Manager, or Redgate SQL Monitor
• Experience planning, coordinating, developing and executing all stages of post-deployment verification test scripts
• Experience securing Windows or Linux systems in a 24x7 production environment
• Experience with containerization and managing Kubernetes clusters (AKS or EKS)
• Experience with common cloud networking, firewall and load balancing configuration
• BS in Computer Science or equivalent work experience
Top Skills
AI Tools
Ansible
AppDynamics
AWS
Azure
Azure DevOps
Bash
C# .NET
Cosmos
Datadog
Dynatrace
Harness
Java
Jenkins
Kubernetes
New Relic
PowerShell
Python
SaaS
SQL
Terraform