This job is no longer active

Data Engineer (Remote)

over 2 years ago
Full time role
Remote · San Francisco, CA, US... more
Remote · San Francisco, CA, US... more

Company

Pachama is a mission-driven company looking to restore nature to help address climate change. Pachama brings the latest technology in ...

View Company Profile

Job Description

Our Verification team is looking for a senior data engineer to be responsible for laying the critical infrastructure for our mission to map and monitor the planet's forests. Verification covers everything from data ingestion and graphics to model training and inference. Each is necessary to produce high-accuracy project evaluations and monitoring.
You'll be working alongside a team of sustainability experts, forest scientists, machine learning experts, designers, and customers to understand stakeholder needs and deliver high-quality, scalable products. The systems you build will underpin the core technology of the company, creating the heartbeat by which our business operates. You will be responsible for the strategy and implementation of data ingestion, indexing, and storage for our researchers and engineers. You will design and implement the workflows responsible for ingesting large amounts of raster and vector geospatial data for efficient access and processing.
We're looking for engineers who find joy in the craft of building and want to make an impact. Engineers who push forward initiatives by asking great questions, cutting through ambiguity, and organizing to win. Engineers who are relentlessly detail-oriented, methodical in their approach to understanding trade-offs, place the highest emphasis on building, and building quickly.

You will:

  • Build new data sources for geospatial information such as satellite imagery, LiDAR, radar, and field plots,
  • Design and construct robust pipelines for the ingest of geospatial data,
  • Create a geospatial data warehouse and platform for scientists and machine learning engineers,
  • Implement cutting edge pre-processing algorithms for sensor data to produce quality features for ML,
  • Develop tools and infrastructure to increase iteration time and improve developer experience,
  • Design infrastructure to accelerate research and experimentation,
  • Develop plans for cross-team initiatives related to infrastructure and deliver on them.

We are looking for strengths with:

  • Building scalable and fault-tolerant distributed systems that process large amounts of data,
  • Geospatial data sources and storage,
  • Working with ML models in production systems,
  • Algorithms and data structures, domain-driven design, and event-based microservices.
  • Handling batch and event data for geospatial data and machine learning systems.

We expect you to:

  • Learn fast and be humble.
  • Own solutions end-to-end.
  • Take part in strategic thinking and take apart problems.
  • Ship high-quality software.
  • Communicate well and document better.
  • Have fun.

Similar jobs





Pachama is a mission-driven company looking to restore nature to help address climate change. Pachama brings the latest technology in ...

View Company Profile

Similar jobs