Logo
Essential AI

Data Infrastructure Engineer

Essential AI, San Francisco, California, United States, 94199


Does building end-to-end data infrastructure excite you? We are looking for Infrastructure Engineers with experience designing, building, and optimizing scalable data infrastructure platforms to power our AI models.Essential AI’s mission is to deepen the partnership between humans and computers, unlocking collaborative capabilities that far exceed what could be achieved today. We believe that building delightful end-user experiences requires innovating across the stack - from the UX all the way down to models that achieve the best user value per FLOP.We believe that a small, focused team of motivated individuals can create outsized breakthroughs. We are building a world-class multi-disciplinary team who are excited to solve hard real-world AI problems. We are well-capitalized and supported by March Capital and Thrive Capital, with participation from AMD, Franklin Venture Partners, Google, KB Investment, and NVIDIA.The RoleThe Data Infrastructure Engineer will design, implement, and optimize a scalable infrastructure to prepare the data that powers our AI training. This infrastructure must be reliable and capable of efficiently processing petabytes of data. You will collaborate closely with the data research team and data crawling team when designing this system.What you will be working onBuilding petabyte-scale, high-throughput data processing systems for preparing and curating datasets for AI training.Orchestrating workloads across large clusters; architecting and maintaining distributed computing environments.Working directly with our data research team on implementing new methods of data preparation.Troubleshooting and resolving infrastructure-related issues in a timely manner.What we are looking forMinimum of 6+ years of experience in data-intensive applications and software development.Proficient with Kubernetes & containerization and with building cloud services using providers like AWS, GCP, etc.Ability to write, debug, and optimize distributed systems and understanding of data orchestration and automation tools (or strong willingness to learn).Proficient in high-performance programming languages like Go, Rust, or C++.You have previous experience in creating and maintaining infrastructure for processing datasets for ML model training and/or serving.We encourage you to apply for this position even if you don’t meet all of the above requirements but want to work on these techniques.We are based in-person in SF and fully onsite 5 days a week. We offer relocation assistance to new employees.The base pay range target for the role seniority described in this job description is up to $225,000 in San Francisco, CA. Final offer amounts depend on multiple factors such as candidate experience and expertise, geographic location, total compensation, and market data. In addition to cash pay, full-time regular positions are eligible for equity, 401(k), health benefits, and other benefits; some of these benefits may be available for part-time or temporary positions.

#J-18808-Ljbffr