WHAT WILL FILL YOUR DAYS:
Moving between understanding the open and unanswered questions about Siri, defining new metrics and filters, and specifying the new logging needed, all with the high-level goal of using data to improve Siri.
Designing, creating, and maintaining data pipelines that populate a petabyte-scale data warehouse.
Working with data infrastructure teams, providing input to improve our platform.
Working with data-producing teams to specify requirements and to transparently provide rapid feedback.
Partnering with your teammates across Siri data to answer questions, to provide support, and to innovate in taking our data warehouse to the next level.
Education & Experience
Surprise us! Many will have an MS or BS in CS, Engineering, Math, Statistics, or a related field, OR equivalent practical experience in data engineering.
4+ years of industry experience working with distributed data technologies (e.g. Hadoop, MapReduce, Spark, Flink, Kafka) for building efficient, large-scale data pipelines.
Software engineering proficiency in at least one high-level programming language (Java, Scala, Python, or equivalent).
Required: experience building batch data processing pipelines that curate data for data science consumers.
Strongly preferred: experience building stream-processing applications using Apache Flink, Spark Streaming, Apache Storm, Kafka Streams, or similar.