Logo
Karkidi

Senior Software Engineer, AI Infrastructure

Karkidi, Berkeley, California, United States, 94709


THE ROLEThe AI Infrastructure team makes Covariant’s robot data accessible and easy to leverage to develop, debug, and deploy AI-based software. Our vision is to automate and refine every step of the AI lifecycle, from collecting, indexing, and annotating data to training and deploying new models and monitoring performance across our robot fleet. To this end we are designing a data platform that processes terabytes of robot telemetry data every day making it searchable and usable by the rest of the company. We build the core libraries, services, and tools that form the foundation of AI software development at Covariant and we are hiring senior engineers to help us achieve our vision.AREAS OF FOCUSBuilding services and APIs to search and annotate our rapidly growing robot datasetDesigning libraries to help us train, deploy, monitor, and understand our modelsFull stack development of tools that leverage our libraries and services to visualize and explore Covariant’s robot dataYOU WILLWork closely with the research and solutions teams to spec, develop, and ship features for our robot data platformLead and manage full-stack projects with cross-functional stakeholdersBuild tools to search and visualize robot telemetry data and facilitate fast performance iterationImplement scalable data pipelines to ingest and process robot telemetry dataDevelop and deploy distributed systems that span customer warehouses to the public and private cloudAdvocate for and facilitate quality software design principles including system observability and debuggabilityYOU HAVE4+ years of programming experience in modern programming languages such as Python4+ years of experience working on full stack, backend web development, or cloud infrastructureDesigned, built, and deployed modern web APIsDesigned and deployed solutions using public cloud providers like AWSExperience with containerization technologies like Docker and container orchestration platforms like Kubernetes and Amazon ECSStrong communication skills; able to efficiently communicate technical details to a varied audienceExperience with building model training infrastructure, libraries, and toolsThe ability to work independently on open-ended cross-functional projectsNICE TO HAVESExperience architecting data infrastructure for machine learning systemsExperience with Django and/or PostgresSAMPLE WEEK IN THE LIFEDevelop a scalable data pipeline leveraging services like Amazon SQS or KinesisDesign a new database model and corresponding API endpoints and viewsDeploy a service to Kubernetes and monitor its performanceTriage and debug a performance issue in PostgresAdd a feature to a computational graph libraryMeet with the research team to gather requirements and understand how we should support a new research project, such as training and deploying a new modelPrepare a technical deep-dive presentation on a project you recently completedIndependently run a meeting for your latest project to keep stakeholders on other teams up-to-date$145,000 - $225,000 a yearBase pay is one element of our total rewards package which may also include comprehensive benefits and equity etc., depending on eligibility. The annual base salary range for this position is from $145,000 to $225,000. The actual base pay offered will be determined on factors such as years of relevant experience, skills, education, etc. Decisions will be determined on a case-by-case basis.

#J-18808-Ljbffr