Logo
Genmo Replay

Research Scientist (diffusion)

Genmo Replay, San Francisco, California, United States, 94199


We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation.Role overview:

We are seeking an exceptional Research Scientist to join our team, focusing on developing cutting-edge diffusion models for text-to-video generation. In this role, you will be at the forefront of innovation, creating novel architectures and algorithms that transform written descriptions into stunning, coherent video content.Key responsibilities:

Lead research initiatives in advanced diffusion models for text-to-video generation, focusing on improving visual quality, temporal consistency, and semantic fidelityDevelop and implement state-of-the-art algorithms for translating textual descriptions into dynamic video contentDesign and conduct rigorous experiments to validate new ideas and evaluate model performanceCollaborate with cross-functional teams to integrate research breakthroughs into our production pipelineStay at the cutting edge of the field by regularly reviewing academic literature and attending top-tier conferencesContribute to the research community through high-quality publications and open-source contributionsMentor junior researchers and foster a culture of innovation within the research teamWork closely with product teams to align research directions with user needs and market opportunitiesQualifications:

Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a closely related fieldMust have:Strong publication record in top-tier conferences (e.g., CVPR, ICCV, NeurIPS, ICML) with a focus on generative models, particularly diffusion modelsExtensive experience implementing and optimizing large-scale generative models for image or video tasksDeep understanding of state-of-the-art techniques in text-to-image and text-to-video generationProficiency in Python and deep learning frameworks such as PyTorch or TensorFlowExcellent communication skills with the ability to explain complex technical concepts to diverse audiencesProven ability to work collaboratively in a team environmentIdeal candidate will have:Postdoctoral or industrial research experience in generative AI for videoHands-on experience with text-to-video generation projectsExpertise in other generative model architectures (e.g., GANs, VAEs) and their applications to videoExperience working with large-scale datasets and distributed computing environmentsTrack record of successful collaboration with product teams on technology transfersFamiliarity with video codecs, compression techniques, and perceptual quality metricsContributions to open-source projects in the field of generative AIAdditional information

The role is based in the Bay Area (San Francisco). Candidates are expected to be located near the Bay Area or open to relocation.Genmo is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law. Genmo, Inc. is an E-Verify company and you may review the

Notice of E-Verify Participation

and the

Right to Work posters in English and Spanish .

#J-18808-Ljbffr