Senior Data Engineer

about 3 years ago
Full time role
Emeryville, CA, US

Job Description

About us

At Shiru, we believe that food should be delicious and nourishing without negatively affecting our planet. Acknowledging our growing global population as well as the imminent effects of climate change, Shiru’s mission is to create better protein ingredients that will catapult us into a sustainable food future.
With our mission in mind, Shiru makes high-quality, functional food proteins by making better use of our precious environmental resources. To do this, we employ technologies originally created to solve problems in adjacent industries, including computational biology, machine learning, and industrial fermentation and bioprocessing.
We apply computational intelligence to find the most functional natural food proteins in the world, harnessing the inherent ability of microflora to produce them. We then partner with food and beverage companies to incorporate these unique protein ingredients into everyday products. Shiru is now expanding our team of dedicated professionals across multiple disciplines to make enhanced protein ingredients for a better world.
About the role

Shiru seeks an experienced programmer and engineer to develop the architecture and pipelines for ingesting scientific data. At Shiru, our business, R&D, and data science teams rely on robust access to lab-generated, multi-omics, and protein structure/function data to drive our core processes. You will create the data warehouse and ETL pipelines to ingest, store, and serve the data to all stakeholders. This role is highly cross-functional and will require strong collaboration with wet lab scientists, bioinformaticists, engineers, and business analysts. As a key member of an early-stage engineering team, you will wear many hats and have a great opportunity for growth.
About You

You are a detail-oriented engineer with deep expertise in data warehouses but also broad experience in programming, DevOps, and cluster computing. You have great communication skills and are able to gather requirements from different functional teams. Ideally you have worked in the biotech industry and have experience with LIMS or processing data from laboratory equipment. You constantly strive for quality and rigorous engineering processes.

Responsibilities

  • Build and maintain ETL pipelines to ingest data from a wide variety of public and proprietary sources.
  • Create data pipelines to capture, process, and store experimental design and data from the lab.
  • Design schemas that allow for efficient storage and retrieval of data.
  • Create tools that enable the company to turn data into actionable knowledge. 
  • Collaborate with laboratory and data scientists to enable analytics and reporting of scientific data.
  • Collaborate with software and machine learning engineers to enable quick and easy consumption of data.

Attributes

  • You write clean, modular, and maintainable code.
  • You are a continual learner and drive innovation by understanding new frameworks and technologies.
  • You are a self-starter, comfortable taking initiative without direct supervision.
  • You have excellent communication and stakeholder management skills and can relay technical information to non-technical audiences.
  • You expect your work to be meaningful and strive to be part of a business dedicated to having a positive impact on the planet.

Requirements

  • BS/MS/PhD in Computer Science or equivalent experience/training
  • 3+ years of experience building production data pipelines
  • Expertise in Python and working with large datasets in Pandas/Jupyter
  • Expertise with Docker and containerized workflows
  • Expertise in SQL 
  • Experience in the AWS ecosystem, with particular focus on Glue, Athena, Batch, ECS, and EKS
  • Experience with workflow managers such as Airflow, Luigi, or Snakemake
  • Experience working with distributed datasets (Spark, Dask)
  • Experience with modern testing and CI/CD frameworks
  • Proficiency with Unix, Git, and other command-line tools
  • Familiarity with genomics and/or proteomics is a plus
At Shiru, we're looking for people with passion, grit, and integrity. You're encouraged to apply even if your experience doesn't precisely match the job description. We’re expecting your skills and passion to stand out—and set you apart—especially if your career has taken some extraordinary twists and turns. Please join us in this singular opportunity to create the future of food!  
Shiru is an equal opportunity employer who values diversity. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Shiru offers competitive compensation and employee benefits along with an attractive equity package commensurate with candidate qualifications.
