Site Reliability Engineer IV - ePay
The Site Reliability Engineer IV is responsible for ensuring the ePay application is highly available, resilient, secure, and scalable. Our ideal candidate is well-versed in modern cloud-based architecture and experienced in designing systems for reliability as well as implementing monitoring, alerting, and ops automation. The candidate will have proven experience in change management and emergency response, as well as experience working with development teams to help create the automated pipelines and solutions required for continuous delivery in an Agile DevOps environment.
Principal duties and responsibilities:
- Automate anything and everything! (Infrastructure build-out, testing, deploying, monitoring, etc.)
- Design and assist in the authoring of software tools that reliably manage application delivery
- Implement proactive monitoring, alerting, trend analysis, and self-healing systems. Perform quality reviews and manage operational issues
- Partner with development team in defining and implementing improvements in service architecture
- Ensure services are designed for 24/7 availability, operational readiness, and rigor
- Improve predictability and reliability of software releases, workflows, and operating software
- Collaborate with Product and Support teams to plan and deploy frequent product releases
- Reduce application deployment windows by leading the company toward the automated pipelines and solutions required for continuous delivery in an Agile environment
- Reduce mean time to recovery (MTTR) by helping to troubleshoot, monitor, alert, and automate recovery
- Implement SRE tools, processes, and best practices
- Interface with Dev/Product/Ops teams to perform root cause analysis and re-instrument triggers to prevent future network degradation and outages
- Define non-functional requirements as part of the product lifecycle to influence the new designs, standards, and methods for scalable, highly available distributed systems
- Explore and innovate with new cloud and HA technologies, features, and tools
Required Experience:
- Bachelor's degree in Computer Science or equivalent practical experience
- Solid knowledge of Linux and experience in production support activities
- Fundamental understanding of TCP/IP, load balancing, routing, firewalling, clustering basics, DNS, and HTTP/HTTPS
- Fluency with at least one current generation scripting language used by DevOps professionals (Bash, Python)
- Experience with Continuous Integration and Continuous Delivery concepts and best practices, including Infrastructure as Code, using tools such as Terraform, CloudFormation, Ansible, Chef, Puppet, or an equivalent
- Proven experience with DevOps log/monitoring/metric collection toolsets such as ELK, Thanos/Prometheus, etc.
- Deep understanding of the software delivery process with the ability to implement and enforce that process across the organization
- Hands-on experience with AWS
- Development experience is a must
- Experience in taking application code and third-party products and building full end-to-end pipelines to build, test, and deploy complex systems
- Ability to containerize an application and build a process around creating containers and pushing them to an artifact repository
- Understanding of general networking concepts, connectivity, systems architecture, and disaster recovery
Estimated salary range for this position: $120,000-$140,000
The base salary range represents the anticipated low and high end of GHX's salary range for this position. Actual salaries will vary and will be based on various factors, such as the candidate's qualifications, skills, competencies, and proficiency for the role. The base salary is one component of GHX's total compensation package for employees. Other rewards and benefits include health, vision, and dental insurance, accident and life insurance, 401k matching, paid time off, and education reimbursement, to name a few. To view more details of our benefits, visit us here: https://www.ghx.com/about/careers/
Disclaimer
GHX provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, national origin, age, disability, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state and local laws. GHX complies with applicable state and local laws governing non-discrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.
GHX expressly prohibits any form of unlawful employee harassment based on race, color, religion, gender, sexual orientation, national origin, age, disability, or veteran status. Improper interference with the ability of GHX's employees to perform their expected job duties is absolutely not tolerated.