San Francisco Compute Co.
Software Engineer, Distributed Systems
San Francisco Compute Co., San Francisco, California, United States, 94199
About
We’re the San Francisco Compute Company. We’re building the first real-time compute trading platform. We think that over the next decade, thousands of startups and labs are going to be training and serving large models. They need compute to do this, and we’re building a platform on which that compute can be traded. If we’re successful, it will be possible to scale to tens of thousands of accelerators for hours at a time without having to build your own infrastructure. This will greatly increase the number of organizations that can afford to train large models, which will make the most important technology of our lifetime accessible to more people.
The Role
As a distributed systems software engineer, you’ll be working on our in-house resource orchestration system. This system coordinates state and access to hundreds (soon thousands) of GPU compute nodes in multi-tenant clusters spanning multiple data centers. Some responsibilities of the role include:
Design of distributed system architectures that enable highly available, fault-tolerant state management
Deployment automation and performance optimization of virtual machines running on bare metal that use GPU passthrough
Design and deployment of multi-tier, high-performance network-attached storage systems
About You
You have built fault-tolerant distributed systems that manage hardware resources at scale
You enjoy creating self-correcting systems that contribute to hardware health and reliability
You have experience with Linux virtualization (Cloud Hypervisor, QEMU, libvirt, virtiofs, SR-IOV, PCIe passthrough)
You appreciate and value good documentation
Some Nice to Haves
Experience with Rust (our VM orchestrator is written in Rust)
Experience with etcd
Experience with high-performance storage systems (WEKA, VAST, Ceph, etc.)
Benefits
Unlimited office book budget: You can buy as many books for the office as you want. You’re encouraged to spend time during the workday reading!
Generous equity grant: Team members are offered a competitive salary along with equity in the company.
Retirement matching: We match 401(k) contributions up to 4%.
Medical, dental & vision: We offer competitive medical, dental, and vision insurance for employees and dependents and cover 100% of premiums.
Time off: We offer unlimited paid time off as well as 10+ observed holidays.
Parental leave: We offer biological, adoptive, and foster parents paid time off to spend quality time with family.
Daily lunch: We cover lunch daily for employees.
Visa sponsorships: Yes, we sponsor visas and work permits.
The San Francisco Compute Company is committed to maintaining a workplace free from discrimination and harassment. We make employment decisions based on business needs, job requirements, and individual qualifications, without regard to race, color, religion, belief, national origin, social or ethnic origin, age, physical, mental, or sensory disability, sexual orientation, gender identity or expression, marital status, civil union or domestic partnership status, past or present military service, HIV status, family medical history or genetic information, family or parental status including pregnancy, or any other status protected by law.
We welcome the opportunity to consider qualified applicants with prior arrest or conviction records. Our commitment to diversity includes hiring talented individuals regardless of their criminal history, in accordance with local, state, and federal laws, including San Francisco’s Fair Chance Ordinance and California’s ban-the-box laws.
If you require reasonable accommodation for any reason, please reach out to us at team@sfcompute.com.