Sr. Research Engineer/Scientist (all levels), World Models
Posted 3 hours ago
Job Description
About the Team
The Vision-Applied Research team focuses on applied research in Generative AI and CV/Multimodal Understanding, and delivering intelligent solutions to Tiktok, enabling users to make and share creative content in a much easier way. The team has research groups dedicated to generative models for content creation, image generation, video synthesis, intelligent image/video editing, and world models.
The team is looking for Research Engineer / Scientists who can take initiatives in building next-generation World Models. The candidate will work on developing methods and infrastructure to train large-scale generative models from massive simulated and real-world multimodal datasets. This role places a particular emphasis on ensuring long-horizon temporal consistency, realistic physics, complex dynamics from the model and enabling users and agents to interact with the model in real-time.
Responsibilities
- Develop large-scale, diverse, and interactive multi-modal data generation pipeline.
- Develop training pipeline for long-context interactive video generation models.
- Advance video generation models to capture long-horizon temporal consistency, realistic physical dynamics, object interactions, and causal relationships from large-scale multi-modal data.
Minimum Qualifications:
- M.S or Ph.D. in Computer Vision, Computer Graphics, Machine Learning, or equivalent experience.
- 3 years research experiences in broad GenAI, multimodal foundation models, or Embodied AI areas.
- Demonstrated ability to communicate complex technical concepts and collaborate effectively within cross-functional research teams
Preferred Qualifications:
- Proven experiences in at least one of the following areas: video generation and synthesis; efficient and real-time diffusion models; 3D/physics-based simulation; or reinforcement learning for agentic environment interaction.
- Proven track record of first-author publications in prestigious venues including CVPR, ICLR, NeurIPS, SIGGRAPH, and ICML
The Vision-Applied Research team focuses on applied research in Generative AI and CV/Multimodal Understanding, and delivering intelligent solutions to Tiktok, enabling users to make and share creative content in a much easier way. The team has research groups dedicated to generative models for content creation, image generation, video synthesis, intelligent image/video editing, and world models.
The team is looking for Research Engineer / Scientists who can take initiatives in building next-generation World Models. The candidate will work on developing methods and infrastructure to train large-scale generative models from massive simulated and real-world multimodal datasets. This role places a particular emphasis on ensuring long-horizon temporal consistency, realistic physics, complex dynamics from the model and enabling users and agents to interact with the model in real-time.
Responsibilities
- Develop large-scale, diverse, and interactive multi-modal data generation pipeline.
- Develop training pipeline for long-context interactive video generation models.
- Advance video generation models to capture long-horizon temporal consistency, realistic physical dynamics, object interactions, and causal relationships from large-scale multi-modal data.
Minimum Qualifications:
- M.S or Ph.D. in Computer Vision, Computer Graphics, Machine Learning, or equivalent experience.
- 3 years research experiences in broad GenAI, multimodal foundation models, or Embodied AI areas.
- Demonstrated ability to communicate complex technical concepts and collaborate effectively within cross-functional research teams
Preferred Qualifications:
- Proven experiences in at least one of the following areas: video generation and synthesis; efficient and real-time diffusion models; 3D/physics-based simulation; or reinforcement learning for agentic environment interaction.
- Proven track record of first-author publications in prestigious venues including CVPR, ICLR, NeurIPS, SIGGRAPH, and ICML
TikTok
81 jobs posted
About the job
Posted on
Apr 4, 2026
Apply before
May 4, 2026
Job typeFull-time
CategoryResearch Engineer
Location
San Jose, CA
Similar Jobs
Today
Research Engineer/Scientist (all levels), World Models
TikTok
San Jose, CAResearch Engineer/Scientist (all levels), World Models
TikTok
San Jose, CAToday1d
Research Engineer, World Models
Waabi
Remote$155K - $269KToronto, ON, CanadaSan Francisco, CAPittsburgh, PARemote US & CanadaResearch Engineer, World Models
Waabi
Remote$155K - $269KToronto, ON, CanadaSan Francisco, CAPittsburgh, PARemote US & Canada1d29d
Sr. Research Engineer
Yahoo
TaiwanSr. Research Engineer
Yahoo
Taiwan29d15d
Research Scientist / Research Engineer — Early Career Cohort
OpenAI
$295KSan Francisco, CAResearch Scientist / Research Engineer — Early Career Cohort
OpenAI
$295KSan Francisco, CA15d23d
Research Engineer/Scientist - Generative UI, Consumer Devices
OpenAI
$380K - $445KSan Francisco, CAResearch Engineer/Scientist - Generative UI, Consumer Devices
OpenAI
$380K - $445KSan Francisco, CA23d23d
Research Engineer/Scientist - Human Alignment, Consumer Devices
OpenAI
$380K - $445KSan Francisco, CAResearch Engineer/Scientist - Human Alignment, Consumer Devices
OpenAI
$380K - $445KSan Francisco, CA23d4d
Research Engineer
Graphcore
Bristol, UK; Cambridge, United KingdomCambridge, UKResearch Engineer
Graphcore
Bristol, UK; Cambridge, United KingdomCambridge, UK4d4d
Research Engineer
Graphcore
Bristol, UK; Cambridge, United KingdomCambridge, UKResearch Engineer
Graphcore
Bristol, UK; Cambridge, United KingdomCambridge, UK4d29d
Research Engineer, Multimodal
Character AI
Redwood City, CAResearch Engineer, Multimodal
Character AI
Redwood City, CA29d29d
Research Engineer, Agents
Anthropic
Remote$500K - $850KSan Francisco, CASeattle, WANew York City, NYResearch Engineer, Agents
Anthropic
Remote$500K - $850KSan Francisco, CASeattle, WANew York City, NY29d
Looking for something different?
Browse all AI jobsFree AI job alerts
Get the latest AI jobs delivered to your inbox every week. Free, no spam.