OpenReq

Machine Learning Researcher

OpenReq, Cupertino, California, United States, 95014

Overview

With Etched ASICs, we have fundamentally different constraints than existing AI chips. We don'thave the same batch-size-latency tradeoff as GPUs. We can parallelize workloads and digestlarge contexts much more efficiently than GPUs.

Sohu enables entirely new research directions and products. When our chips come out, theseuse cases need to already be mature and visceral. Whether real-time video, agents, speculativedecoding, or new tree search algorithms, we must create the market for our hardware.In this role, you will lead efforts on research directions that trade compute for improvedunderstanding and speed, particularly with agents and new search techniques.

Representative projects• Combine multi-agent with RAG to improve the quality of QA• Evaluate new system on standard benchmark• Design new and verifiable benchmark for agent reasoning• Design LLM content understanding based recommendation systems

You may be a good fit if you have:• Past projects or publications with substantial impact in ML and/or CV (quality > quantity)• Strong hands-on engineering skills, particularly with python, pytorch, CUDA, DDP/FSDP• Deep understanding of open and closed source model architectures and open sourcelibraries for transformer training and inference• Familiarity with LlamaIndex, LangChain, MetaGPT, CoT, ToT, and beam search• Ability to think outside the box and make tradeoffs considering feasibility, quality, andtime-to-ship of a project

Strong candidates may also have experience with:• Knowledge graphs• Name entity recognition• Tool calling and coder LLMs• SWE bench & SWE agent

We encourage you to apply even if you do not believe you meet every single qualification.