Immediate Opening – Data Engineer (Kafka, Spark, and Hive) – Irving, TX – Full-time

Photon Infotech

Job Overview

Greetings everyone, we hope you are staying safe. We are looking for a Data Engineer to join our new Data Analytics Initiatives. The role involves building unified data platforms for our clients. The candidate will work on data streaming, cleansing, and processing for a large Fortune 500 client.

Who are we?

For the past 20 years, we have powered many Digital Experiences for the Fortune 500. Since 1999, we have grown from a few people to more than 4000 team members across the globe who are engaged in various Digital Modernization efforts. For a brief 1-minute video about us, you can check https://youtu.be/uJWBWQZEA6o

What will you do?

- Prepare data for analytics, including transforming raw source data, cleaning missing data and outliers, and imputing values as necessary
- Create a streaming data platform for real-time consumption
- Work with the business to understand key goals and translate them into analytics requirements and implementations
- Work with the Data Scientist to understand the model and implement it so that it is production ready

What are we looking for?

- Bachelor's degree in Computer Science, Mathematics, or a related field
- At least 4 continuous years of experience with Spark, especially PySpark implementation
- Understanding of the Kafka-YARN-Spark-HDFS ecosystem for ingestion
- A working knowledge of streaming data infrastructure
- In-depth knowledge of preparing large-scale data analytics for consumption
- Key knowledge of Hive and query optimization in Hive
- Banking experience related to risk management and fraud analysis is a plus
- Career progression demonstrating early working experience with Hadoop, later extending to Kafka and Spark

