Senior Cloud Operations Engineer

Sorry, this job was removed at 7:17 p.m. (MST) on Monday, March 27, 2017
Find out who's hiring in Greater Denver Area.
See all Developer + Engineer jobs in Greater Denver Area
Apply
By clicking Apply Now you agree to share your profile information with the hiring company.

Leveraged by millions of user every day, EMS Software manages some of the highest profile spaces in the world (including the NASDAQ bell). We are consistently delivering new features to our suite of products. We want to tackle bigger challenges and accomplish some truly amazing things. Our team is always improving our codebase and operations footprint and we have amassed a sizeable backlog of interesting challenges and product initiatives. Our team needs to grow to enable even greater success in our industry, and that is where you come in.

As our third Operations Engineer, you will help continue to define our transformation and on-premise enterprise software solution to that of a SaaS solution, hosting a number of our current Fortune 100 clients. We’re looking for a seasoned operations leader with expertise spanning high availability systems in a production environment, a DevOps mentality of continuously improving the system, and a firm grasp on cloud architectures. You must have extensive experience supporting applications developing in at least 3 of the following: .NET, Java, JavaScript, Python, Node, GO or Ruby. You should also be passionate about solving problems and developing creative solutions leveraging automation.

Your First Three Months

In your first month, as your familiarity with the product grows, your responsibilities and influence will grow as well. You will collaborate with other members of the team in established patterns and continue to hone your skills as you push the design, architecture and implementation of our production environments to their next phase for general adoption.

Within two months, you and your team will have a well-tested, low-latency and highly available hosted environment. Additionally, you will have helped to create the process of onboarding customers into this new environment and continue to support them throughout their transition.

Within three months, you will have played an instrumental role in growing your team, helping to hire both your direct manager (our next Director of Cloud Operations) as well as another Operations Engineer. Further, you will have helped to drive changes to the operational and development roadmap as we inch closer to onboarding 40% of our customer base into hosted production environments by the end of 2017.

Other Responsibilities

  • Design, provision, configure and maintain the platform operations to handle the scale of running several application stacks in the cloud that will be consumed worldwide
  • Automate the deployment and maintenance of cloud platform technologies
  • Oversee production operations, log management, data warehouse, and database operations, including management of Splunk services
  • Ensure all monitoring systems (IT, development, service management, Apdex) are in place
  • Enforce consistency of monitoring, reporting, and alarming systems
  • Drive process improvements for service management, including: outage/incident management, rollbacks and reporting
  • Research emerging virtualization techniques and advise management
  • Perform capacity management, load and scalability planning
  • Ensure compliance with deployment and operations documentation
  • Assist management in development and optimization of operational cost models
  • Design cloud infrastructure for high reliability and availability
  • Build strategic and tactical plans for continued improvement of cloud architecture and operations
  • Manage, maintain, and enforce service level agreements
  • Provide operational reports (e.g., Service Level, Usage, Cost) to upper management
  • Optimize cost structure for cloud operations
  • Assist in the establishment of 24x7 performance monitoring and response protocols
  • Provide on-call support outside of normal work hours/days

About You

  • You’re driven, humble, and autonomous
  • You’re a quick study, a strong communicator, and you’re able to adapt to a fast-paced environment
  • You have a working knowledge of Agile Development practices (e.g., SCRUM, TDD)
  • You’re an expert in Linux (Ubuntu / Debian) and Windows (IIS, SQL Server)
  • You have at least 3 years of hands-on production experience with Amazon Web Services (AWS), Google Cloud or Microsoft Azure. This includes:
    • Configuration of VPCs, with VPN to corporate network
    • Experience setting up, maintaining and monitoring global production environments, QA and staging environments, with a strong understanding of the differing needs of such environments
    • At least 3 years of experience in a professional production environment
    • At least 2 years of experience managing networking infrastructure and monitoring at the application level
  • Performance optimization experience, including: troubleshooting and resolving network and server latency issues; performing hardware evaluation/selection tasks; performance vs cost vs time analysis
  • At least 2 years of experience with automation or scripting tools (e.g., Python, Shell, Powershell)
  • At least 1 year of experience with Ansible, Jenkins, Capistrano, GIT/TFS
  • You’re detail-oriented, with excellent documentation skills, and you’re someone who can successfully manage multiple priorities
  • Troubleshooting skills that range from diagnosing hardware/software issues to large scale failures within a complex infrastructure

Other Things We Hope You Have

  • Bachelors in Computer Science or equivalent work experience
  • Experience with Tomcat, Rails, Mongo, MySQL, Splunk, Nagios, Haproxy/Stunnel and Maven
  • Hands-on experience with performance, load and security penetration testing
  • Hands-on experience with building out a continuous integration and delivery pipeline
  • Experience working with Docker, Kauai, Kubernetes

The Team

Ultimately, you will be part of a 4-person team of operational engineers (but you’ll be the third hire). While assisting us hire our next Director of Cloud Operations, you will report directly to the VP of Development, Assad Jarrahian.

The bigger team consists of 12 Developers, 10 Quality Engineers, 4 Product Owners, and 3 UX Designers. We have an open and collaborative environment where everyone works together to deliver what is needed, from product features to operations needs (e.g., health checks).

We value open and direct communication, taking calculated risks that will push us forward, and investing in our people.

Our Stack

  • We have current Production and Continuous Integration footprints in Google Cloud, AWS, and Azure
  • Our front end applications leverage React and React Native, Redux, Node, C#, and Knockout
  • Our APIs comprises of Golang, .NET and .NET core
  • Our backend comprises of MS SQL Server

EMS Software is proud to be an Equal Opportunity Employer. Applicants are considered for all positions without regard to race, color, religion, sex, national origin, age, disability, sexual orientation, ancestry, marital or veteran status. We are committed to hiring a diverse and talented workforce, supporting integrity in the workplace and an entrepreneurial atmosphere.

Read Full Job Description
Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.

Location

Two blocks from the Arapahoe Village Center RTD Station, walking distance to numerous bars, restaurants and coffee shops.

Similar Jobs

Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.
Learn more about EMS SoftwareFind similar jobs