Data and Platform Engineer

about 2 months ago
Full time role
Hybrid · Remote · Emeryville, CA, US
Prolific Machines is ushering in a new era of biotechnology with light. We harness light to produce everyday essentials more efficiently, from food and lifesaving drugs to novel biosolutions. Our first-of-its-kind photomolecular biology platform delivers unprecedented control, precisely guiding cellular behavior when and where it matters most. Unlike existing tools used to control biology, our technology unlocks dynamic control over virtually any cellular function in any cell type. We're enabling partners to unlock robust efficiency, quality, reproducibility, and sustainability advantages across pharmaceuticals, nutritional and therapeutic proteins, cellular agriculture, and more. Prolific is supported by leading investors, including Bill Gates’s Breakthrough Energy Ventures, Mayfield, and SOSV.

Help us create a brighter tomorrow with biology. We're looking for mission-driven, talented, and kind people to join our team.

About this Job

We are seeking a passionate builder with experience in data pipeline development, platform integration, and data management to enhance Prolific Machines’ data infrastructure and build the foundations of our AI Platform. You will be responsible for developing and maintaining efficient data pipelines and processes to transform, migrate, analyze, and integrate data and metadata across various scientific systems. Your work will play a crucial role in ensuring the integrity, accuracy, and availability of our scientific and engineering data, directly impacting our core business operations and AI platform development.


Responsibilities

  • Design and develop scalable data pipelines for transforming and migrating data from diverse, multi-modal sources to target systems, ensuring data quality, accuracy, and consistency throughout the process.
  • Own and manage integrations with the Ganymede platform and other relevant systems (ELN: Benchling, IoT: Particle), ensuring seamless data flow and system interoperability.
  • Ensure structured, high-quality data accessibility to support model development, bioprocess capabilities and analysis, industrial-scale demonstrations, and customer-facing solutions.
  • Work closely with Biology, BPD, Engineering, and AI teams to understand data needs and provide solutions that enhance research, development, and commercialization efforts.
  • Provide infrastructure and development support for data analysis, model development, and hardware/instrument interaction.
  • Document data transformation and migration processes, including data mappings, transformations, and dependencies, and maintain comprehensive documentation for future reference.

About You

  • Excellent programmer, primarily in Python, with strong proficiency in SQL.
  • Excellent understanding of relational and non-relational databases, data modeling principles, and query optimization techniques.
  • Exposure to and good understanding of Biology / Bioprocess data and analysis.
  • Experience with scientific data management and documentation tools, including electronic lab notebook (ELN) systems (Benchling) and/or laboratory information management systems (LIMS).
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud, and their data services.
  • 2+ years in industry and a B.S. or M.S. degree in Computer Science, Engineering, or a related field.
  • Enjoy getting hands-on with data infrastructure and platform development in all stages of development.
  • Have strong technical communication skills, are excited about working with interdisciplinary teams, and are eager to work in a high-growth, fast-paced startup environment.
  • Experience with infrastructure as code (IaC) principles and technologies is a plus.
  • Knowledge of data governance, data security, and data privacy practices is a plus.
  • Experience with / understanding of AI is a plus.

Diversity: We value diverse people, perspectives, and knowledge. To this end, we aim to foster a sense of belonging that enables everyone to bring their whole, authentic self to work every day. We are committed to growing a diverse and inclusive team that encompasses all types of people. We believe this can only be achieved by acknowledging the ancestral, historic, and current systemic and communal inequities.

Location: The San Francisco Bay Area (Emeryville, CA)

Benefits: Outstanding health, vision, and dental insurance (including full spouse coverage), at least one all-expenses-paid team holiday per year, a flexible schedule, and unlimited vacation days. Free lunch every day!