Logo
Karkidi

Machine Learning Engineer Intern, Computer Vision Algorithm (Video Understanding

Karkidi, Cupertino, California, United States, 95014


The computer vision algorithm intern will work in a dynamic team as part of the Video Computer Vision org, which develops on-device computer vision and machine perception technologies across Apple’s products. We balance research and product to deliver the highest quality, state-of-the-art experiences, innovating through the full stack, and partnering with cross-functional teams to influence what brings our vision to life and into customers' hands.Minimum QualificationsDuring the internship, you must be enrolled in a M.S. or PhD program in Electrical Engineering/Computer Science or a related field (mathematics, physics, or computer engineering), with a focus on computer vision and/or machine learning.Rich experiences in video machine learning covering one of the topics: Video Understanding / Video Foundation Model / Multi-modal LLM.Proven prototyping skills and proficient in coding (C, C++, Python).Excellent written and verbal communication skills, be comfortable presenting research to large audiences, and have the ability to work hands-on in multi-functional teams.Preferred QualificationsPublication record in relevant venues (e.g., NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, SIGGRAPH).Industry experiences with multi-modal foundation models and frameworks.Knowledge and understanding of generative AI, multi-modal large language models, and video captioning.Solid understanding of the state-of-the-art in Video Understanding and familiarity with the challenges of developing algorithms that run efficiently on resource-constrained platforms.Team-oriented, result-oriented, and self-motivated.

#J-18808-Ljbffr