OpenReq
Machine Learning Researcher
OpenReq, Cupertino, California, United States, 95014
Overview
With Etched ASICs, we have fundamentally different constraints than existing AI chips. We don'thave the same batch-size-latency tradeoff as GPUs. We can parallelize workloads and digestlarge contexts much more efficiently than GPUs.
Sohu enables entirely new research directions and products. When our chips come out, theseuse cases need to already be mature and visceral. Whether real-time video, agents, speculativedecoding, or new tree search algorithms, we must create the market for our hardware.In this role, you will lead efforts on research directions that trade compute for improvedunderstanding and speed, particularly with agents and new search techniques.
Representative projects• Combine multi-agent with RAG to improve the quality of QA• Evaluate new system on standard benchmark• Design new and verifiable benchmark for agent reasoning• Design LLM content understanding based recommendation systems
You may be a good fit if you have:• Past projects or publications with substantial impact in ML and/or CV (quality > quantity)• Strong hands-on engineering skills, particularly with python, pytorch, CUDA, DDP/FSDP• Deep understanding of open and closed source model architectures and open sourcelibraries for transformer training and inference• Familiarity with LlamaIndex, LangChain, MetaGPT, CoT, ToT, and beam search• Ability to think outside the box and make tradeoffs considering feasibility, quality, andtime-to-ship of a project
Strong candidates may also have experience with:• Knowledge graphs• Name entity recognition• Tool calling and coder LLMs• SWE bench & SWE agent
We encourage you to apply even if you do not believe you meet every single qualification.
With Etched ASICs, we have fundamentally different constraints than existing AI chips. We don'thave the same batch-size-latency tradeoff as GPUs. We can parallelize workloads and digestlarge contexts much more efficiently than GPUs.
Sohu enables entirely new research directions and products. When our chips come out, theseuse cases need to already be mature and visceral. Whether real-time video, agents, speculativedecoding, or new tree search algorithms, we must create the market for our hardware.In this role, you will lead efforts on research directions that trade compute for improvedunderstanding and speed, particularly with agents and new search techniques.
Representative projects• Combine multi-agent with RAG to improve the quality of QA• Evaluate new system on standard benchmark• Design new and verifiable benchmark for agent reasoning• Design LLM content understanding based recommendation systems
You may be a good fit if you have:• Past projects or publications with substantial impact in ML and/or CV (quality > quantity)• Strong hands-on engineering skills, particularly with python, pytorch, CUDA, DDP/FSDP• Deep understanding of open and closed source model architectures and open sourcelibraries for transformer training and inference• Familiarity with LlamaIndex, LangChain, MetaGPT, CoT, ToT, and beam search• Ability to think outside the box and make tradeoffs considering feasibility, quality, andtime-to-ship of a project
Strong candidates may also have experience with:• Knowledge graphs• Name entity recognition• Tool calling and coder LLMs• SWE bench & SWE agent
We encourage you to apply even if you do not believe you meet every single qualification.