Data Engineer
As the newest member of our data pipeline team, you have a gift for building tools that consume and transform ginormous data sets into readily available information for your data science group. You understand the fundamentals of the statistical methods that underlie machine learning and relish the opportunity to put them into practice. Standardizing, normalizing, testing, and optimizing data is your passion. You are talented at helping set the direction for new patterns as well as identifying improvements to our existing work that will make Gloo’s products even more valuable for clients! We have worked hard to build a healthy and transparent team that truly enjoys working together. Want to play?
What you'll be doing:
- Assembling large, complex data sets that meet functional and non-functional business requirements
- Identifying, designing, and implementing internal process improvements – everything from automating manual processes to optimizing data delivery to redesigning infrastructure for greater scalability
- Effectively collaborating with data team members to develop the Insights Platform and its data pipeline
- Designing, constructing, installing, testing, and maintaining highly scalable data pipelines
- Developing data set processes for data modeling, mining, and production
- Integrating new data management technologies and software engineering tools into existing structures
- Brainstorming with your team to continually support scalable architecture
What you'll bring with you:
- BS in Computer Science, Engineering or a related discipline
- About 4 years of experience with data models, data analysis, and data engineering where terabyte-volume data sets are commonplace
- Expert-level experience working with MapReduce
- Advanced knowledge of AWS services including EC2, EMR, and Redshift, as well as Elasticsearch and NoSQL databases
- Comfort with Git, Kafka, Jenkins, Hive, Oozie, Cascading
- Demonstrated experience building and optimizing ‘big data’ pipelines, architectures, and data sets
- Ability to build processes supporting data transformation, data structures, metadata, dependency, and workload management
- Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores
- Creative, articulate, and competent communication style – you deliver your message clearly, concisely, and confidently every time
- You thrive in an environment where “what if” is commonplace
- Desire to contribute to a team doing great things
Our team members enjoy:
- Compensation and bonus commensurate with experience
- Plenty of time off to keep you balanced
- Medical with HSA contribution
- A dynamic, talented team, dedicated to changing the world and building an incredible business
- Beautiful office space in downtown Boulder on Pearl Street, steps from coffee shops and blocks from hiking trails
- Company Happy Hour Fridays
- Fresh fruit, snacks, coffee and sodas