NOC Engineer
Zoomies help the world connect — and deliver happiness while doing it. We set out to build the best video conferencing product for the enterprise, and today help people communicate better with products like Zoom Phone, Zoom Rooms, Zoom Video Webinars, Zoom Apps, and OnZoom.
We’re problem-solvers and self-starters, working at a fast pace to design solutions with our customers and users in mind. Here, you’ll work across teams to dig deep into impactful projects that are changing the way people communicate, and enjoy opportunities to advance your career in a diverse, inclusive environment.
NOC Engineer
Job Description:
Develops, documents, and monitors Global NOC processes, procedures library as well as standards configurations and versioning controls
Responsible for the monitoring of the ZOOM global production environment, overnight on-call support as necessary in the event of a major incident
Assist in develops and publishes metrics and dashboards demonstrating the effectiveness of the regional and Global NOC functions
Contributing to the maintenance of the production environment of ZOOM global real-time online conference system, a high-availability target of 99.99%, continuously find and fix problems, and ensure the stable operation of the business.
Accountable and responsible for handling incidents and events on the ZOOM global Production infrastructure, ensuring ZOOM’s services delivery and performance
Monitoring and providing timely response and escalation to all incidents, outages, and performance alerts on ZOOM global Production environment, following our Global NOC Incident Management playbook
Recognize, identify and prioritize incidents following the business requirements, organizational policies, and operational impact
Create and track operational issues tickets and improve operational efficiency
Takes ownership of Incidents/Events ticket(s) creation, following up with the respective Business or Technical Owners for troubleshooting and resolution
This role is part of the 24x7 follow the sun Global NOC teams between US and India
Required:
5+ years or more years experience in a NOC or monitoring organization, with at least 3 years experience specifically in an infrastructure support role
Bachelor's degree in IT relevant fields, or equivalent; or an additional 4 years of relevant IT experience
Demonstrate Strong ability to diagnose server or network, compute and storage, application service alerts, events, or issues
In-depth practical knowledge of infrastructure management, access, and monitoring systematic flows
Practical Linux knowledge in a physical, virtual, or public cloud environment
Practical understanding of network infrastructure and architecture
Exceptional verbal and written communication skills necessary to effectively collaborate with peers and to present and explain highly technical information to stakeholders who may have limited technical knowledge
Must be available for occasional after-hours tasks
Ability to understand common information architecture frameworks
Strong communication skills with the ability to communicate clearly and calmly with team members and technical personnel in high-stress situations
Good knowledge of ZABBIX, Python, API, Ansible, Terraform, and AWS cloud
Any IT industry certifications, particularly in network, systems, and cloud computing
Desired:
Experiences with Disaster Recovery plans and related technologies
Excellent time management and team leadership skills, and ability to handle multiple concurrent tasks
Ability to recognize anomalies from alerts-based plots
Network engineering and system administration experience
An analytical and problem solver
Knowledge of the OSI stack, routing, switching, security, compute, and storage fundamentals
Networking, compute, database administration certifications