Senior Site Reliability Engineer
Department Summary
DISH is a Fortune 200 company that continues to redefine the communications industry. Our legacy is innovation and a willingness to challenge the status quo, including reinventing ourselves. We disrupted the pay-TV industry in the mid-90s with the launch of the DISH satellite TV service, taking on some of the largest U.S. corporations in the process, and grew to be the fourth-largest pay-TV provider. We are doing it again with the first live, internet-delivered TV service - Sling TV - that bucks traditional pay-TV norms and gives consumers a truly new way to access and watch television.
Now we have our sights set on upending the wireless industry and unseating the entrenched incumbent carriers.
We are driven by curiosity, pride, adventure, and a desire to win - it's in our DNA. We're looking for people with boundless energy, intelligence, and an overwhelming need to achieve to join our team as we embark on the next chapter of our story. Opportunity is here. We are DISH.
Job Duties and Responsibilities
Objectives of this Role
- Understanding and documenting the performance and scalability non-functional requirements including SLI/SLOs. Validating requirements with Business Stakeholders
- Manage SLI/SLOs of customer facing interfaces as well as backend services and provide improvement plans for non-compliance
- Develop custom dashboards in New Relic to represent a holistic view of system operational health
- Improve reliability, quality, and time-to-market of our suite of software solutions
- Support release engineering by providing automation support as well as push changes to production when manual intervention needed
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
- Provide primary operational support and engineering for multiple large distributed software applications
Daily and Monthly Responsibilities
- Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
- Partner with engineering teams to improve services through rigorous testing and release procedures
- Participate in system design consulting, platform management, and capacity planning
- Modelling areas of risk to estimate latency characteristics and capacity requirements. Typically, this will either be refining the workload and modelling how it applies to a set of components, or working with component suppliers to estimate capacity requirements.
- Create sustainable systems and services through automation and uplifts
- Balance feature development speed and reliability with well-defined service level objectives
Skills, Experience and Requirements
Required Skills and Qualifications
- Bachelor's degree in computer science or other highly technical, scientific discipline
- Ability to program (structured and OO) with one or more high level languages, such as Go, Java, C/C++, Ruby, React Native and JavaScript
- Experience AWS cloud services like EC2, S3, Cloud Front, EKS as well as dynamic resource management frameworks (Kubernetes)
- Experience in any one of the application performance management tool (preferably New Relic), EFK stack and log analysis
- A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
- Ability to drive a collaborative approach across business functions, and external partners
Preferred Qualifications
- Previous success in technical engineering
- Coding experience beyond simple scripts
#LI-JW3
#LI-EC1
#LI-KT2#LI-JW3
#DICE_KTO
Salary Range
Compensation: $161,500.00/Year - $218,500.00/Year
Benefits
From versatile health perks to new career opportunities, check out our benefits on our careers website . Successful completion of a pre-employment screen, to include a drug test and criminal background check.