Logo
TRUNK LTD

Senior Data Engineer

TRUNK LTD, San Francisco, California, United States, 94199


At Trunk, we're on a mission to empower growing software organizations to deliver high-quality software quickly. We understand the challenges of merge conflicts, poor code quality or consistency, flaky tests, and other distractions that can drain productivity and morale. Our unique approach enables engineering teams to stay focused on designing, implementing, and delivering software, leading to the creation of magical, high-quality projects and happier teams.Our journey began in 2021, with our founders leveraging their experience from some of the world's largest and fastest-growing tech companies - Uber, Google, YouTube, and Microsoft. In 2022, we achieved a significant milestone by securing a $25M Series A funding led by Garry Tan at Initialized Capital (currently President of YC) and Peter Levine at a16z. This growth and recognition are a testament to our potential and the value we bring to the software development landscape.We know the frustration of trying to deliver code while constantly being interrupted by slow CI, flaky tests, and fragile processes. At Trunk, we’re building the tools to bring the joy back to software development. We’re looking for entrepreneurial people who are passionate about solving these problems.As a founding member of our Data Engineering team, you’ll leverage your technical expertise to build data pipelines for processing and storing the data generated by our customer's CI/CD and automated tests. You’ll also experiment with integrating AI models to drive analytics and insights for our customers. We're tackling challenging problems and need engineers who can operate well in ambiguity and develop great solutions.As an engineering team, we thrive on our ability to move quickly and adapt as we learn. Quickly delivering value to customers and getting their feedback is critical to our success. Engineers will be able to work closely with customers to understand the nuances of their use cases. We value empathy, hard work, and collaboration.Our data stack is constantly evolving, but built on the foundations of Python, PostgreSQL, Spark, TimescaleDB, AWS, Kubernetes, and AWS Glue.

What you'll do ‍

Build fault-tolerant and scalable data pipelinesDesign efficient data storage, collaborating with product engineers to create fast and reliable data-driven featuresDebug, profile, and optimize distributed data-intensive applications to improve their latency, accuracy, resource consumption, and throughputDesign and build observability of data quality and accuracyIntegrateML models like Llama to analyze data and create featuresWe're looking for

5+ years of experience as a software engineer with a strong understanding of key concepts in distributed systems3+ years of experience in building and deploying data applications, with a track record of regularly shipping new featuresFluency in at least two of these languages: Java/Scala/Kolin, Python, Go, Rust, or C++Good understanding and practical experience with partitioning, replication, map-reduce, indexing, and CAP theoremExperience with distributed storage systems (S3, HDFS, Hive, ClickHouse, Elastic, etc), distributed processing engines (Spark, etc), and message queues (Kafka, SQS, etc)Passion for building large-scale ML applications and improving software engineers' productivityUnderstanding of key concepts in natural language processing, machine learning, or statistical analysis

(Nice to have) Some experience with machine learning stack (pandas, PyTorch, numpy, sci-kit, transformers, etc)

What we offer

Unlimited PTOCompetitive salary and equityWork-life balanceFlexibility to be fully or partly remoteUp to $200/month stipend for coworking space for remote folksFew meetings, so you can ship fast and focus on buildingOne Medical membership on us!Top-notch medical, dental, vision, short-term disability, long-term disability, and life insuranceAll insurance is 100% company-paid ($0 premiums) for employees and highly subsidized for dependentsFSA, HSA with company contributions, and pre-tax commuter benefits401(k) planPaid parental leave ( up to 12 weeks)Our tech stack

Frontend: Typescript, React, Redux, Next.jsBackend: Typescript, Node, AWS, CDK, k8s, gRPCObservability: Prometheus, Grafana, Kiali, JaegerCI/CD: GitHub ActionsCLI/Daemon/LSP: C++20, BazelVSCode Extension: TypescriptGeneral: GitHub, Slack, Linear, SliteThe salary and equity range for this role are: $170K - $210K and .15% - .35%.Please note that the compensation range provided is a general guideline only and is subject to change based on location, qualifications, and experience.