Nexusflow

Backend Engineer

Nexusflow, Palo Alto, California, United States, 94306

About Nexusflow.ai

Modern enterprise copilots & agents call for last-mile quality, enterprise-grade robustness and scalable operation costs, beyond simplified programming interfaces for generative AI. Nexusflow tackles this challenge, enabling enterprises to own their workflow copilots & agents stacked on top of powerful yet cost-effective, compact LLMs. We train large language models and build last-mile quality dev tooling for copilots & agents on your enterprise workflows. Our team has built the open-source LLM, NexusRaven-V2, rivaling GPT-4 in function calling with a 100X smaller model size. Our team members are also behind the scenes of Starling, the #1 ranked compact 7B chat model based on human evaluation in Chatbot Arena.

Position

Nexusflow is currently adding Backend Engineers to our team. Our Backend Engineers package up our technology in models and last-mile quality tooling. They will be the driving force to build our products and solutions, in extensive collaboration with our ML Engineers and Front-end Engineers.

Responsibilities

- API system development for copilot & agent quality tooling
- API system development for copilot serving and integration with a focus on enterprise-grade requirements in the following areas:
  - Integration with on-prem & cloud compute vendors
  - Integration with software tools required in customer-oriented solutions
  - Distributed system and optionally GPU performance optimization
- Wear many hats and collaborate with the whole team for product development, deployment, and customer success

Qualifications

Required

- Experience in ML model or ML data pipeline deployment (on-prem or on cloud)
- Experience in building backends for application or platform API systems

Preferred

- Working experience in a fast-paced team environment
- Experience in using or contributing to modern compute frameworks for LLMs (e.g. DeepSpeed, Hugging Face TGI, FSDP)
- Experience in projects involving LLMs
