TikTok
Company
Lead Researcher, Large Language Models/LLM, TikTok
San Jose
Job Description
Our Foundations and Intelligence Service R&D team is fast growing and responsible for building state-of-the-art foundation models, such as LLM, VLM and Omni Models. Our mission is to build a bridge for collaboration between foundation models and downstream business scenarios, and use foundation model powered world knowledge to enhance better user experiences across TikTok, including content moderation, search and recommendations, client AI, etc.
We are looking for researchers in LLM, VLM and Omni Model domain who are experienced in single/multi-modality LLM pre-training and applications, including evaluations, data processing and recipes for pre-training and post-training, reinforcement learning based alignment, efficient training and inference. There are no doubt a lot of unsolved problems in the LLM domain which could have a huge impact on industry and academia. In TikTok, we have real applications, resources and patience for technology incubation.
- Lead the incubation of next-generation, high-capacity LLM solutions for TikTok business, identify and define both short and medium term objectives;
- Design methods, tools, data recipes and experiments to push forward state-of-art in large language models;
- Explore new model architecture and inference-efficient model design for LLM applications to scale impact on business
- Work closely with cross-functional teams to plan and implement projects harnessing LLMs for diverse purposes and vertical domains
- Extend the insights and impact from industry to academia
Minimum Qualifications
- Ph.D in Computer Science, Data Science, Artificial Intelligence, or a related field
- Strong understanding of cutting-edge LLM research (e.g., long context, multi modality, alignment research, agent ecosystem, etc.) and possess practical expertise in effectively implementing these advanced systems as a plus
- Proficiency in programming languages such as Python, Rust, or C++ and a track record of working with deep learning frameworks (e.g., pytorch, deepspeed, megatron, vllm, etc.).
- Strong understanding of distributed computing framework & performance tuning and verification for training/finetuning/inference; Being familiar with PEFT, RL, MoE, CoT or Langchain is a plus.
Preferred Qualifications
- Excellent problem-solving skills and a creative mindset to address complex AI challenges. Demonstrated ability to drive research projects from idea to implementation, producing tangible outcomes.
- Published research papers or contributions to the LLM community would be a significant plus.
- Experience with inference tuning and Inference acceleration. Have a deep understanding of GPU and/or other AI accelerators, experience with large scale AI networks, pytorch 2.0 and similar technologies.
- Experience with evaluation of AI systems, LLM application & agent development is desirable.
We are looking for researchers in LLM, VLM and Omni Model domain who are experienced in single/multi-modality LLM pre-training and applications, including evaluations, data processing and recipes for pre-training and post-training, reinforcement learning based alignment, efficient training and inference. There are no doubt a lot of unsolved problems in the LLM domain which could have a huge impact on industry and academia. In TikTok, we have real applications, resources and patience for technology incubation.
- Lead the incubation of next-generation, high-capacity LLM solutions for TikTok business, identify and define both short and medium term objectives;
- Design methods, tools, data recipes and experiments to push forward state-of-art in large language models;
- Explore new model architecture and inference-efficient model design for LLM applications to scale impact on business
- Work closely with cross-functional teams to plan and implement projects harnessing LLMs for diverse purposes and vertical domains
- Extend the insights and impact from industry to academia
Minimum Qualifications
- Ph.D in Computer Science, Data Science, Artificial Intelligence, or a related field
- Strong understanding of cutting-edge LLM research (e.g., long context, multi modality, alignment research, agent ecosystem, etc.) and possess practical expertise in effectively implementing these advanced systems as a plus
- Proficiency in programming languages such as Python, Rust, or C++ and a track record of working with deep learning frameworks (e.g., pytorch, deepspeed, megatron, vllm, etc.).
- Strong understanding of distributed computing framework & performance tuning and verification for training/finetuning/inference; Being familiar with PEFT, RL, MoE, CoT or Langchain is a plus.
Preferred Qualifications
- Excellent problem-solving skills and a creative mindset to address complex AI challenges. Demonstrated ability to drive research projects from idea to implementation, producing tangible outcomes.
- Published research papers or contributions to the LLM community would be a significant plus.
- Experience with inference tuning and Inference acceleration. Have a deep understanding of GPU and/or other AI accelerators, experience with large scale AI networks, pytorch 2.0 and similar technologies.
- Experience with evaluation of AI systems, LLM application & agent development is desirable.
TikTok
223 jobs posted
About the job
Similar Jobs
Discover more opportunities that match your interests
- 21 days ago
LLM Optimization Lead
MongoDB
Austin; New York City; SeattleView details - 24 days ago
Data Science Lead - TikTok Ads- San Jose
TikTok
San JoseView details - 16 days ago
Data Science Lead - TikTok Ads- San Jose
TikTok
San JoseView details - 24 days ago
Large Model Algorithm Researcher(Multimodal & Code AI)
TikTok
SingaporeView details - 20 days ago
Ara Gamma – Data Engineer (LLM Data & Prompt Engineering) - English language
Welocalize
RemoteView details - 30 days ago
Lead Data Scientist
HubSpot
RemoteView details - 30 days ago
Lead Data Engineer
Mastercard
Dublin, IrelandView details - 28 days ago
Causal AI Researcher
Dell Technologies
Eldorado Do Sul, BrazilView details - 28 days ago
Causal AI Researcher
Dell Technologies
Eldorado Do Sul, BrazilView details - 27 days ago
Lead Data Engineer
Mastercard
Atlanta, GeorgiaView details
Looking for something different?
Browse all AI jobs