Data Scientist

CV Project LLC

Job Overview

This position is 100% remote. An analytics platform client that delivers intelligence is looking to hire a full-time Data Scientist, with great full-time benefits.

Data Scientists are responsible for cleaning, transforming, enriching, and analyzing vast amounts of raw data from various systems using Apache Spark and other analytics packages, both to develop valuable features and to provide ready-to-use data to stakeholders for robust downstream analysis. They analyze data for correlations to identify trends and predictive power, and they build, maintain, and deploy predictive models. Data Scientists work with analysts to understand business needs and requirements, and with data engineers to implement scalable pipelines for ETL, model training, and scoring. They handle both ad-hoc requests and core pipeline development. The ideal candidate has a passion for discovering insight hidden in large data sets and for working with stakeholders to improve business outcomes. They keep up with the latest technology, including the latest versions of Apache Spark and new analytical packages. They must have a proven ability to drive business results with data-based insights and be comfortable working with a wide range of stakeholders and functional teams.
Responsibilities

Collaborate with product management and engineering departments to understand company needs and devise possible solutions
Mine and analyze data from company databases to drive optimization and improvement of product development, marketing techniques, and business strategies
Communicate results and ideas to key decision makers
Research and develop statistical learning models for data analysis
Keep up to date with the latest technology trends
Implement new statistical or other mathematical methodologies as needed for specific models or analyses
Optimize joint development efforts through appropriate database use and project design
Assess the effectiveness and accuracy of new data sources and data gathering techniques
Develop custom data transformations, models, and algorithms to apply to data sets
Use predictive modeling to increase and optimize customer experiences, revenue generation, ad targeting, and other business outcomes
Develop processes and tools to monitor and analyze model performance and data accuracy
Create high-performance data processing pipelines in Apache Spark for data transformation, aggregation, and model training
Produce unit tests for Spark transformations and helper methods
Write documentation with all code

Basic Qualifications

Excellent communication and interpersonal skills
Knowledge of agile methodologies and tools (e.g. Scrum, JIRA)
Basic system administration skills in both Windows and Linux environments
Bachelor's degree in Computer Science, Statistics, Applied Math, or a related field
3+ years of practical experience with Apache Spark, ETL, machine learning, data processing, and data analytics
Strong Python and Bash shell scripting experience
Experience training, deploying, monitoring, and updating machine learning models
Knowledge of a variety of machine learning techniques (clustering, decision trees, random forests, boosting, GLM/regression, text mining, social network analysis, neural networks, etc.) and their real-world advantages and drawbacks
Experience working with and creating data architectures
The ability to teach and train others in the methodologies and practices used in data science
Familiarity with Git and code versioning practices
A drive to learn and master new technologies and techniques

Preferred Qualifications

Strong experience with Scala
5+ years of practical experience with Apache Spark (Scala and Python), ETL, machine learning, data processing, and data analytics
Strong experience with Apache Spark 2.x, including query tuning and performance optimization
Master's or Doctoral degree in Computer Science, Statistics, Applied Math, or a related field
AWS Cloud experience, including Glue and Athena
