Data Engineer
What we do:
Recognized as one the Top 100 Tech Companies by Builtin.com and over 4.4-star review on Glassdoor, SambaSafety® is the pioneer of driver risk management software in North America. Trusted by over 2 million subscribed drivers; thousands of businesses look to Sambasafety to provide the most powerful, advanced, intuitive, and impactful risk solution platform on the market. SambaSafety is growing at an incredible rate with high employee engagement. It’s an exciting time to be at Samba. Now is the right time to join our high performing culture. We hope to see you here!
What You’ll Do:
- Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs
- Implement data solutions using scalable cloud-based data services and pipeline
- Support legacy infrastructure either in place or by transitioning to current architecture
- Ability to analyze disconnected data sources to determine logical relationships based on business terms, cardinality, and data quality
- Assemble large, complex data sets that meet functional / non-functional business requirements
- Collaborate with cross-functional, often remote teams including BI / analytics, software engineers, data architects, data scientists, product management, and IT
- Build ETL and/or ELT solutions to and from a variety of data sources including SQL, NoSQL, AWS ‘big data’ technologies, and others (e.g., Salesforce, Snowflake)
- Keep our data separated and secure across national boundaries through multiple data centers an AWS regions
- Work with analytics and data scientist team members to assist them in building and optimizing our product into an innovative industry leader
- Provide data for embedded analytics solutions such as Looker, Power BI
- Ability to write clear and complete documentation regarding database design and processes
What you'll need:
-Relational SQL and NoSQL databases: Microsoft SQL Server, Postgres, Cassandra, MongoDB, etc.
-AWS cloud services: EMR, Glue, S3, Aurora, RDS, SQS, Lambda, Fargate, EC2, Redshift,
-Athena, Kinesis, Step Functions, DynamoDB, CloudFormation, CloudWatch, etc.
-Big data tools: Hadoop, Spark, Kafka, etc.
-Message and Stream-processing systems: RabbitMQ, Kafka Streams, Storm, Spark-Streaming, etc.
-Graph databases: Neo4J, Neptune, GQL, etc.
-Data pipeline and workflow management tools: NiFi, Azkaban, Luigi, Airflow, etc
-Object-oriented/object function scripting languages: Python, Java, C++, Scala, etc.
- B.Sc. in Computer Science (or equivalent)
- 5 years’ experience in a data engineering position