Sr Site Reliability Engineer (Hybrid)
JOB SCOPE
As a Senior Site Reliability Engineer in Infrastructure as a Service for on-premises cloud, you will be responsible for ensuring the reliability, availability, and scalability of our, IaaS platforms, cloud infrastructure, automation, and tooling. You will work closely with our engineering, development, operations, and security teams to design, implement, and maintain a highly available and secure IaaS platform. You will be responsible for designing, building, and maintaining the infrastructure, monitoring and alerting systems, and ensuring high availability of our services. Additionally, you will play a key role in automating processes and developing tools to improve operational efficiency.
DUTIES AND RESPONSIBILITIES
- Design, build, and maintain highly available, scalable, and secure infrastructure for our on-premises private cloud IaaS offering.
- Develop and implement monitoring and alerting systems to ensure the reliability and availability of our services.
- Automate processes and develop tools to improve operational efficiency and reduce manual intervention.
- Collaborate with development and operations teams to troubleshoot and resolve infrastructure issues.
- Conduct regular performance testing and capacity planning to ensure optimal performance of the infrastructure.
- Ensure that our PaaS and IaaS platforms meets the needs of our customers, including internal and external stakeholders.
- Develop and implement processes for incident management, problem management, and change management in alignment with Charter Incident and Change Management Polices
- Continuously monitor and analyze the performance of PaaS and IaaS services to identify and resolve issues proactively.
- Develop disaster recovery and business continuity plans and perform regular testing to ensure their effectiveness.
- Participate in capacity planning and performance optimization efforts.
- Mentor junior engineers and provide technical leadership to the team.
- Participate in an On Call rotation to ensure 24x7 support of Cloud Services.
- Perform other duties as requested.
BASIC/ MINIMUM QUALIFICATIONS
- Bachelor's degree in Computer Science, Engineering or related field, and/or equivalent work experience.
- Minimum of Six (6) years of experience in site reliability engineering, systems engineering, or software engineering.
- Minimum five (5) years of experience designing and operating Infrastructure as a Service in on-premises cloud.
- Minimum of five (5) years of experience with infrastructure as code tools such as Morpheus, Terraform, Ansible, or Chef.
- Minimum of five (5) years of experience with IaaS platforms and related technology for virtualization, compute, storage, and network.
- Minimum of five (5) years of experience with monitoring, logging, and alerting tools such as VROPS, Nagios, Prometheus, Grafana, Log Insight, and Splunk
- Minimum of five (5) years of experience scripting or development with languages such as Python, Ruby, or Bash
ADDITIONAL JOB REQUIREMENTS
- Knowledge of PaaS, IaaS, Kubernetes, Docker, Rancher, GIT, Repository Management, Server Compute, SAN Storage, Virtualization, IP Networks, Data Center Operations, Linux and Windows Systems Administration.
- Ability to handle multiple projects and tasks.
- Ability to mentor junior engineers and lead technical teams and programs.
- Strong decision making and problem-solving skills while working under pressure.
- Strong communication and collaboration skills.
- Ability to use personal computer and software applications.
- Knowledge of all FCC compliance reports and other rules and regulations.
- Knowledge of Cable Television or related technologies.
PREFERRED QUALIFICATIONS
- Experience working in a DevOps or Site Reliability Engineering role.
- Experience with Infrastructure as a Service technologies.
- Experience with Infrastructure as Code, scripting and development.
- Experience with virtualization platforms such as VMware, Nutanix or OpenStack.
- Experience with Unix/Linux or Windows systems administration.
- Experience with Compute in a Cloud Environment using Rack Mount and Blade Servers.
- Experience with Storage in a Cloud Environment using SAN, HCI, or Software Defined block, file, and object.
- Certifications in Virtualization, Kubernetes, Docker, Containers, Compute, Storage, Networking, Public Cloud and Operating System technologies
ISY313 336032 336032BR
Here, employees don't just have jobs, they build careers. That's why we believe in offering a comprehensive package that rewards employees for their contributions to our success, supports all aspects of their well-being, and delivers real value at every stage of life.
The pay for this position has a salary range of
$88,200.00 to $156,600.00. The actual salary offer will carefully consider a wide range of factors, including your skills, qualifications, experience and location. Also, certain positions are eligible for additional forms of compensation such as bonuses.