TikTok

Machine Learning Engineer - Data Curation - AIGC, TikTok Monetization GenAI

TikTok, San Jose, California, United States, 95199


Responsibilities

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. We are the Generative AI team under Monetization Technology. Our team focuses on developing cutting-edge Generative AI technologies across all modalities, including text, images, videos, and landing pages.

We are looking for infrastructure engineers who are excited to grow their business understanding, build highly scalable and reliable software/infrastructure, partner across functions with global teams, and make big impacts. If you are someone who welcomes challenges, we are eager to have you on the team!

Responsibilities:
- Collaborate with foundational model researchers, including specialists in Ads LLM, Text-to-Image, and Text-to-Video, to develop and maintain efficient, low-latency data pipelines.
- Design and implement robust, scalable systems for data curation and management, supporting the foundational training of models across various formats in distributed environments.
- Implement data insights and model evaluation pipelines to enhance user engagement and drive revenue growth.
- Develop caching mechanisms to improve data retrieval speeds and enhance model responsiveness.
- Stay abreast of the latest academic research and open-source advancements, integrating cutting-edge technologies to continuously improve our data operations and machine learning model performance.

Qualifications

Minimum Qualifications:
- B.S./M.S./Ph.D. in Computer Science, Computer Engineering, or a related field.
- Programming and Technical Proficiency: Expertise in Python, including asyncio, and a strong foundation in deep learning frameworks such as PyTorch and large model training libraries such as FSDP/DeepSpeed. A minimum of 3 years' experience with Linux, Docker, and Kubernetes is required.
- Data Engineering and AI/ML Knowledge: Demonstrated capability in data curation, management, and optimization within Generative AI ecosystems, encompassing both streaming and batch data processing.

Preferred Qualifications:
- Advanced Technical Expertise: Experience in CUDA optimization and a deep understanding of the application of Generative AI models across multiple domains.
- Cloud Computing and Distributed Systems: Significant experience in managing large-scale data systems, with a strong preference for those who have worked with vector database solutions.
- Interpersonal and Problem-Solving Skills: A demonstrated passion for technology, coupled with outstanding problem-solving capabilities.

Job Information: The base salary range for this position in the selected city is $145,000 - $250,000 annually. Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location.
