Baidu
Machine Learning System Hardware Architect
Baidu, Sunnyvale, CA
Do you want to be part of the AI revolution? Do you want to think out of the box, thriving on challenges in the AI industry and the desire to solve them? Do you want to work with a world-class team to explore the fast-growing AI hardware opportunities and impact on the AI industry?We’re looking forward to you joining us to collaborate, contribute, and revolutionize AI silicon and system.DescriptionWe are looking for a world-class Machine Learning System Architect (HW) to join our SoC team at Baidu’s Sunnyvale office. The successful candidate will be a motivated self-starter who will thrive in this highly technical environment. Your job responsibilities as a Machine Learning System Architect will help the team to architect and create high-performance machine learning silicon and connect thousands of Kunlun Accelerators together for distributed AI training tasks.Create differentiated architectural innovations for Baidu’s Kunlun AI SoC roadmap. Architect, simulate, and design amazing machine learning solutions for our AI machine learning products.Develop system-level ML architectures that push the boundaries of performance, power, and latency; collaborate closely with many other teammates to ensure we design and optimize hardware and software for maximum performance.Monitor industrial and academic trends in artificial intelligence and determine where they should intersect our roadmaps. Drive partnerships for access to the most advanced AI technologiesEvaluate the power, performance, and cost of prospective architecture and subsystems. Build scalable tools for modeling and performance evaluation.Engage with system and application software engineers to ensure optimization of the entire hardware/software stack.Engage with SoC design, verification, and validation engineers to realize the architecture.QualificationsKnowledge of Machine Learning market, technological and business trends, software ecosystem, and emerging applications.Proven track record 5+ years architecting hardware solutions for Machine Learning, acceleration and optimization.Experience with deep learning frameworks including TensorFlow, PyTorch, PaddlePaddle, etc.Strong track record of outreach to ML researchers and application developers.Experience with CPUs, GPUs, memory systems, and accelerators.Experience with performance simulation and modeling in C++Experience with SoC interconnects and NoCsExperience with area, frequency, and power optimizationsFamiliarity with video, DSP, Ethernet, and PCIeMS or PhD in Electrical or Computer Engineering.Excellent communication skills in both English and Chinese.Culture Fit:Mission alignment: If you want to be part of a team to accomplish this great mission, we will provide you the best possible platform to do that.Self-directed: We work best with people that are driven, motivated, and aspire to greatness.Hungry to learn: We are eager to see you learn new skills and grow.Team orientation: We work in small, fast-moving teams. We watch out for each other and go after big goals together as a team.