Coastal Carbon

Data Engineer

Coastal Carbon, San Francisco, California, United States, 94199


Who are we?

Coastal Carbon is a seed-funded startup on a mission to create positive impact through earth observation and AI. Founded at the University of Waterloo by a team of PhDs and engineers, we're backed by some of the best AI and climate tech investors like HF0, Inovia Capital and Propeller Ventures, angels like James Tamplin (cofounder Firebase) and Sid Gorham (cofounder OpenTable, Granular), and partners like Amazon AWS and the United Nations.

What do we do?

We're building multimodal foundation models for the natural world. We believe there's more to the world than the internet + more to intelligence than memorizing the internet. Our models are trained on satellite remote sensing and real world ground truth data, and are used by our customers in nature conservation, carbon dioxide removal, and government to protect and positively impact our increasingly changing world. Our ultimate goal is to build AGI of the natural world.

About the role

We are seeking a Data Engineer to join our team and help us build out a digital twin of the natural world. The successful candidate will be responsible for supporting the design, building, monitoring, and maintenance of the underlying database and related tooling.

The role will involve:

- Developing and maintaining AWS infrastructure to support a multi-Petabyte database
- Supporting upstream data pipeline design and implementation
- Heavy focus on scalability and optimization for performance
- Creating downstream applications to support machine learning and visualization

Requirements

- Bachelor's degree in engineering, computer science or a related field, or equivalent
- 5+ years of relevant experience
- Fluency with SQL programming
- Proficiency in Python
- Aptitude in parallel processing
- Demonstrated experience with managing, ingesting, and transforming geospatial data
- Knowledge of Earth observations and methods including satellite remote sensing and weather reanalysis data
- Familiarity with object store databases like Redshift/Snowflake
- Experience building data pipelines and tooling to support downstream applications
- Team player, willing to undertake various tasks to support our collective goals

Nice to have

- Proficiency in PyTorch or TensorFlow (with interest in learning PyTorch)
- Knowledge of AWS networking and security protocols
- Experience with containerization and orchestration technologies such as Docker, Airflow, and/or Kubernetes

Location-wise, there is a strong preference for in-person work in Waterloo; however, hybrid work is possible for exceptional candidates.