Veritone: AI that makes you even better

Veritone

Site Reliability Engineer I

In-Office
London, England
Senior level
Deploy and maintain SaaS applications, automate management processes, troubleshoot stability issues, and collaborate with teams to enhance features and reliability.
The summary above was generated by AI
POSITION SUMMARY

The ideal candidate will have 5+ years of experience in Linux systems and software management, along with expertise in Terraform, Ansible, and cloud platforms such as AWS, Azure, and GCP. Experience with large-scale distributed systems, monitoring/alerting systems (Prometheus, Grafana), CI/CD pipelines, container orchestration (Docker, Kubernetes), and programming languages (Go, Java, Python) is essential. A background in implementing security controls, automating deployments, and troubleshooting complex systems is also required.

WHAT YOU'LL DO

  • Deploy and maintain a resilient, secure, and efficient SaaS application platform to meet established SLAs.

  • Automate monitoring, management, and incident response to work toward an auto-remediation system.

  • Monitor site stability and performance and troubleshoot site issues.

  • Participate in on-call rotation to ensure stability and uptime for our platforms.

  • Scale infrastructure to meet rapidly increasing demand.

  • Collaborate with cross-functional teams across Engineering, Product, Services, and other departments.

  • Collaborate with developers to bring new features and services into production.

  • Independently design and develop tools to aid operations and automation, and work jointly with other team members to deliver innovative solutions to complex business and technical challenges.

  • Provide deployment and operations support for multi-tiered distributed software applications.

  • Estimate engineering effort, plan implementation, and roll out system changes that meet requirements for functionality, performance, scalability, reliability, and adherence to development goals and principles.

  • Collaborate in a fast-paced, dynamic, entrepreneurial organization with multiple teams (software development, release management, build and release, etc.).

  • Define how the behavior of large-scale systems can be achieved.

  • Measure and achieve reliability through engineering and operations work.

  • Develop, document, and manage monitoring and alerting, with the goal of creating an auto-remediation system.

  • Adapt security controls that are not typically native to GA releases to the product.

  • Develop automation methods to extend standard deployment pipelines for bespoke implementations.

  • Patch production systems, enforce policy on them, and audit them.

  • Drive the Disaster Recovery process.

WHAT YOU'LL NEED

  • Expertise with Infrastructure-as-Code such as Terraform.

  • 5+ years of professional Linux systems and software management experience 

  • Knowledge of programming languages including Go, Node.js, and Java

  • Experience managing infrastructure in Azure, GCP, and AWS

  • Expertise with monitoring and alerting systems including Prometheus, Grafana

  • Strong scripting skills for systems and data-driven solutions

  • JIRA experience for project/task management

  • Extensive experience in troubleshooting large-scale distributed systems.

  • Comprehensive background in monitoring and alerting for auto-remediation systems, including Prometheus and Grafana

  • Proven examples of standardizing security controls across large-scale systems

  • Comfort working within project/task management platforms.

Systems and Tools
  • Cloud platforms including: AWS, Azure, and GCP. 

  • Infrastructure coding languages: Terraform, CloudFormation, Ansible, Puppet, Python

  • CI/CD: experience working with and supporting build and deploy pipelines and tools, including Jenkins, ArgoCD, GitHub Actions, and Rundeck

  • Datastore management and query skills: Postgres, MySQL, MongoDB, MSSQL, Elasticsearch, Solr

  • Container orchestration platforms: Docker, Kubernetes, EKS, AKS

  • Familiarity with coding languages including: Go, Node.js, Java, Python

  • Monitoring/Alerting Tools: Prometheus, Grafana, VividCortex, Runscope, CloudWatch, Monitor, VictorOps

  • OS and Container Hardening: STIG, CIS, SELinux, iptables, FIPS 140-2, FIPS 140-3

  • JSON data structures and database schemas

  • API styles and query languages: REST, GraphQL (GQL)

Bonus Points If
  • You hold a Bachelor’s degree in Computer Science or a related field

  • You have worked in regulated or public sector environments, developing and assessing cloud-based solutions

  • You have worked with, developed, or supported continuous integration/continuous deployment systems

  • You have concrete examples ready to present of auto-remediation systems you have created

Veritone is a leading provider of artificial intelligence (AI) technology and solutions. The company's proprietary operating system, aiWARE, orchestrates an expanding ecosystem of machine learning models to transform audio, video and other data sources into actionable intelligence. We love to continuously grow while staying ahead of trends and creating structure in an unstructured world. 

If you’ve made it this far and align with our goals, we look forward to reviewing your qualifications!

DISCLOSURE

Our company provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability or genetics.

Candidates must possess the right to work in the UK and be able to provide the necessary documentation to verify this as required by UK immigration laws.

Top Skills

Ansible
ArgoCD
AWS
Azure
Docker
Elasticsearch
GCP
GitHub Actions
Go
GQL
Grafana
Java
Jenkins
Kubernetes
Linux
MongoDB
MSSQL
MySQL
Postgres
Prometheus
Python
REST
Rundeck
Solr
Terraform
