Multimodal Algo Researcher (AI Innovation Center) Intern - 2026 Soaring Star Talent Program
Posted 51 days ago
Job Description
This job posting has expired and no longer accepting applications.
About The Team:
The mission of the TikTok Eng-AI Innovation Center is to explore cutting-edge AGI technologies, including but not limited to LLM, multi-modal LLM (video/image/audio/text/code), etc., to let machine better understand user creations on Tiktok platform. Regarding to video/image/audio/text, enhanced content understanding can bring better user experience of searching, recommendation, and can more accurately identify and defend internet abuse and fraud on our platform. Regarding to code, our developed LLM aims to automatically re-organize/optimize Tiktok codebase, and make the code/coding become more accessible for Tiktok engineers.
Project Introduction:
Multimodal foundation large models (VLM) represent a research hotspot in the industry and a critical technology for TikTok's business scenario applications. In 2024, TikTok's Innovation Center developed VFM V1, a multimodal large model tailored for TikTok's business scenarios. It matches the performance of the best open-source model Qwen VL on public test sets, while significantly outperforming all other foundation models on TikTok's business test sets. In the future, we aim to continuously develop foundation models with efficient perception and reasoning capabilities, capable of handling multilingual and massive video content understanding algorithms to deliver a better content consumption experience for users.
Project Challenges:
Enhance the multimodal perception encoder: The current encoder uses a fixed frame rate. We need to explore more efficient adaptive frame rates while considering the integration of modalities such as audio and user behavior.
How to fuse multimodal perception and thinking capabilities to promote stronger comprehensive perception and cognitive abilities of the model.
Minimum Qualifications:
- A PhD degree with top AI conference papers in ML/CV/NLP are required;
- Excellent coding ability, data structures, and fundamental algorithm skills, proficient in C/C++ or Python, winners of competitions such as ACM/ICPC, NOI/IOI, Top Coder, Kaggle, etc. are preferred
- In-depth research experience in Machine Learning, with a particular emphasis on Large Language Models (LLMs) and Generative AI
- Excellent ability to analyze and solve problems, and be passionate about solving challenging problems
Preferred Qualifications:
- Passion for technology, good communication skills and team spirit
If you have any questions, please reach out to us at apac-earlycareers@tiktok.com
The mission of the TikTok Eng-AI Innovation Center is to explore cutting-edge AGI technologies, including but not limited to LLM, multi-modal LLM (video/image/audio/text/code), etc., to let machine better understand user creations on Tiktok platform. Regarding to video/image/audio/text, enhanced content understanding can bring better user experience of searching, recommendation, and can more accurately identify and defend internet abuse and fraud on our platform. Regarding to code, our developed LLM aims to automatically re-organize/optimize Tiktok codebase, and make the code/coding become more accessible for Tiktok engineers.
Project Introduction:
Multimodal foundation large models (VLM) represent a research hotspot in the industry and a critical technology for TikTok's business scenario applications. In 2024, TikTok's Innovation Center developed VFM V1, a multimodal large model tailored for TikTok's business scenarios. It matches the performance of the best open-source model Qwen VL on public test sets, while significantly outperforming all other foundation models on TikTok's business test sets. In the future, we aim to continuously develop foundation models with efficient perception and reasoning capabilities, capable of handling multilingual and massive video content understanding algorithms to deliver a better content consumption experience for users.
Project Challenges:
Enhance the multimodal perception encoder: The current encoder uses a fixed frame rate. We need to explore more efficient adaptive frame rates while considering the integration of modalities such as audio and user behavior.
How to fuse multimodal perception and thinking capabilities to promote stronger comprehensive perception and cognitive abilities of the model.
Minimum Qualifications:
- A PhD degree with top AI conference papers in ML/CV/NLP are required;
- Excellent coding ability, data structures, and fundamental algorithm skills, proficient in C/C++ or Python, winners of competitions such as ACM/ICPC, NOI/IOI, Top Coder, Kaggle, etc. are preferred
- In-depth research experience in Machine Learning, with a particular emphasis on Large Language Models (LLMs) and Generative AI
- Excellent ability to analyze and solve problems, and be passionate about solving challenging problems
Preferred Qualifications:
- Passion for technology, good communication skills and team spirit
If you have any questions, please reach out to us at apac-earlycareers@tiktok.com
This job posting has expired and no longer accepting applications. Please check out our latest AI jobs.
TikTok
46 jobs posted
About the job
Similar Jobs
TikTok
24 days agoMultimodal Algo Researcher (AI Innovation Center) Intern - 2026 Soaring Star Talent Program
SingaporeView detailsTikTok
3 days agoLLM Algorithm Research Scientist Code AI Intern (AI Innovation Center) - 2026 Start (PhD)
SingaporeView detailsTikTok
5 hours agoLLM Algorithm Research Scientist Code AI Intern (AI Innovation Center) - 2026 Start (PhD)
SingaporeView detailsSalesforce
22 days agoSummer 2026 Intern - Employee Success Reward Partner Strategy & AI Innovation (Dublin)
IrelandView detailsSalesforce
24 days agoSummer 2026 Intern - Talent Intelligence Data Analyst
Georgia - Atlanta$28 - $32/hrView details
NewsBreak
16 days agoVenture Builder, NewsBreak AI Innovation (Intern)
Mountain View, California$50 - $60/hrView detailsAMD
25 days agoSummer 2026 PhD HPC & AI GPU Performance Intern
Austin, TexasView detailsAmazon
18 days ago2026 Applied Scientist Intern, Amazon University Talent Acquisition
FranceView detailsCelonis
24 days agoSummernaut Program - AI & Management Consulting (Value Engineering) Summer Intern
United StatesView detailsCelonis
18 days agoSummernaut Program - AI & Management Consulting (Value Engineering) Summer Intern
Madrid, SpainView details
Looking for something different?
Browse all AI jobsNever miss a new AI job
Get the latest AI jobs delivered to your inbox every week. Free, no spam.
