Data Scientist - Drug Manufacturing

Panda Intelligence • Full-time • Massachusetts, United States • 2w ago

Panda Intelligence is pleased to present an exceptional Data Scientist opening at a prestigious Cell & Gene Therapy biotech firm. The role is pivotal to carrying out the company's AI strategy in manufacturing operations. With a leadership team rooted in Harvard Medical School, MIT, Amgen and GSK, the firm has won numerous awards for its employee experience and has one of the highest retention rates in the industry. This is an ideal opportunity for an ambitious Data Scientist with 2-5 years of experience in manufacturing data science, preferably in Cell & Gene Therapy or Biologics, who is passionate about AI's post-discovery potential.

Responsibilities:

Lead the analysis of diverse datasets (including manufacturing batch records, process characterization data, quality control and assurance data, and supply chain information) to develop innovative solutions to complex business challenges.
Partner with cross-functional teams of data scientists, engineers, strategists, and other stakeholders to design, implement, and optimize data science initiatives that address high-priority business needs. Conduct A/B testing to assess the effectiveness and value of these solutions.
Craft engaging data visualizations and presentations to communicate insights and recommendations to both technical teams and business leaders, ensuring clarity and actionable outcomes.
Stay actively involved in the broader data science community to keep up-to-date with emerging tools, methodologies, and advancements in data technologies, bringing fresh perspectives to the team.

Required Qualifications:

PhD in Mathematics, Statistics, Biostatistics, Epidemiology, Computer Science, Clinical/Biomedical Informatics or a related computational/quantitative discipline, OR a Master's degree in a scientific field of study and 3+ years of data science experience in cell & gene therapy manufacturing.
Deep domain knowledge in biopharmaceutical manufacturing data and supply chain data, ideally in cell & gene therapy.
Fluency in at least one programming language (ideally Python) and familiarity with related tools (e.g., AWS, dbt, Azure).
Extensive experience with statistical/analytical methodologies and ML algorithms (e.g., regression, classification, clustering, feature selection, deep learning).
Experience in LLM prompt engineering and developing LLM-based solutions and architectures, including Retrieval-Augmented Generation (RAG).
Proven ability to create advanced data visualizations and interactive dashboards in business contexts, using tools such as Dash, Streamlit, and R Shiny and frameworks such as Flask and Angular.
Background in operations research and manufacturing processes specific to the biologics industry.

Apply here if interested or contact me at e.noble@panda-int.com with an updated resume.