Data Scientist I
Description
In partnership with the Data Science team, the Data Scientist I supports various data science projects through development and execution of case studies, requirements definition, software design, model development, model validation, results visualization and presentation to stakeholders. This role will work cross-functionally to support data science initiatives leading to key insights and identification of opportunities for both internal and external customers.
Key Responsibilities
- Work with internal and external stakeholders to identify data science opportunities and define objectives
- Mine and analyze data from company and third-party databases to drive optimization and improvement of product development and business strategies
- Develop custom models and algorithms utilizing internal and external data sets to deliver insights toward the design of new and improved processes and solutions
- Assess the effectiveness, accuracy, and limitations of machine learning methods and techniques to achieve desired outcomes
- Adhere to sound software engineering practices
Key Duties
- Collect use case examples and requirements
- Develop robust reusable software infrastructure
- Implement integration and unit testing
- Develop processes and tools to build, monitor, and analyze model performance and data accuracy
- Share results with stakeholders and broader organizational community in an easily accessible and understandable way
Key Competencies
- Demonstrated ability to translate high-level sketch of problem requirements into a robust extensible solution
- Strong problem-solving skills with an emphasis on reusable software development
- Experience using statistical and data-oriented programming languages (R, Python, SLQ, etc.) to manipulate data and draw insights from large data sets
- Experience using version control systems such as Git
- Experience querying databases and using SLQ, Python, R, etcetera
- Experience working with and creating data structures and abstract data types
- Knowledge of a variety of machine learning techniques (e.g., clustering, decision tree learning, artificial neural networks) and their real-world advantages and drawbacks
- Excellent written and verbal communication skills for coordinating across teams
- A drive to learn and master new technologies and techniques
- Able and willing to travel up to 10% of the time.
Required Education, Certifications, and Experience
- BS in computer science or related technical field
- Coding experience with several languages: Python, R, C/C++, Java, etcetera
- 1-2 years of applicable experience
Preferred Qualifications
- Knowledge and experience in statistical and data mining techniques, including linear regression, neural networks, Random Forest, boosting, decision trees, text mining, social network analysis
- Experience using web services such as Redshift, S3, Spark
- Experience analyzing data from 3rd party providers
- Experience visualizing/presenting data for stakeholders using D3, Shiny, ggplot, or other web app framework.
Key Differentiators
- Knowledge of advanced statistical techniques and concepts, including properties of distributions, statistical tests, and proper usage
- Demonstrated ability to work both independently and collaboratively
Equal Opportunity Employer/Protected Veterans/Individuals with Disabilities
The contractor will not discharge or in any other manner discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay or the pay of another employee or applicant. However, employees who have access to the compensation information of other employees or applicants as a part of their essential job functions cannot disclose the pay of other employees or applicants to individuals who do not otherwise have access to compensation information, unless the disclosure is (a) in response to a formal complaint or charge, (b) in furtherance of an investigation, proceeding, hearing, or action, including an investigation conducted by the employer, or (c) consistent with the contractor’s legal duty to furnish information. 41 CFR 60-1.35(c)