Data Engineer, AI Platform
Posted 102 days ago
Job Description
This job posting has expired and no longer accepting applications.
Team Introduction
The Intelligent Creation - AI Platform team is a team focusing on building advanced end-to-end AI production pipelines, including deep learning model training, optimization, deployment and applications. We provide AI capabilities to empower content creation and consumption on TikTok and serve billions of users.
Responsibilities
- Own the management of massive GenAI datasets (PB) scale, including storage for video/image data, data processing, and data validation.
- Build and evolve GenAI data-pipeline infrastructure with a focus on extreme processing speed and end-to-end throughput.
- Partner closely with ML researchers/engineers to accelerate training-data acquisition, improve data quality, support evaluation of model outputs, and enable a closed-loop data lifecycle.
- Lower the barrier to data acquisition and maximize the utility and value of data across use cases.
Minimum Qualifications
- 3+ years of experience building data services; strong proficiency in Python.
- Experience with high-concurrency and asynchronous programming is a plus.
- Hands-on with Hive, MySQL, MongoDB, and Elasticsearch; solid understanding of internals; capable of data abstraction and data modeling.
- Practical experience with large-scale data processing frameworks such as Hadoop, Spark, Flink, and Ray.
- Excellent communication and collaboration skills; detail-oriented; strong problem-solving and analytical abilities.
Preferred Qualifications
- 1–2 years of experience with the Ray framework, including proficient orchestration of GPU/CPU resources; deep understanding of Ray’s architecture and usage patterns; strong background in high-concurrency and async processing to boost overall throughput.
- Experience sourcing and curating training datasets for GenAI.
The Intelligent Creation - AI Platform team is a team focusing on building advanced end-to-end AI production pipelines, including deep learning model training, optimization, deployment and applications. We provide AI capabilities to empower content creation and consumption on TikTok and serve billions of users.
Responsibilities
- Own the management of massive GenAI datasets (PB) scale, including storage for video/image data, data processing, and data validation.
- Build and evolve GenAI data-pipeline infrastructure with a focus on extreme processing speed and end-to-end throughput.
- Partner closely with ML researchers/engineers to accelerate training-data acquisition, improve data quality, support evaluation of model outputs, and enable a closed-loop data lifecycle.
- Lower the barrier to data acquisition and maximize the utility and value of data across use cases.
Minimum Qualifications
- 3+ years of experience building data services; strong proficiency in Python.
- Experience with high-concurrency and asynchronous programming is a plus.
- Hands-on with Hive, MySQL, MongoDB, and Elasticsearch; solid understanding of internals; capable of data abstraction and data modeling.
- Practical experience with large-scale data processing frameworks such as Hadoop, Spark, Flink, and Ray.
- Excellent communication and collaboration skills; detail-oriented; strong problem-solving and analytical abilities.
Preferred Qualifications
- 1–2 years of experience with the Ray framework, including proficient orchestration of GPU/CPU resources; deep understanding of Ray’s architecture and usage patterns; strong background in high-concurrency and async processing to boost overall throughput.
- Experience sourcing and curating training datasets for GenAI.
This job posting has expired and no longer accepting applications. Please check out our latest AI jobs.
TikTok
52 jobs posted
About the job
Posted on
Nov 20, 2025
Apply before
Dec 20, 2025
Job typeFull-time
CategoryData Engineer
Location
San Jose, CA
Skills
Similar Jobs

Rocket Money
27 days agoSenior Data Engineer, Data Platform
San Francisco, CANew York City, NYDetroit, MIPhoenix, AZMiami, FLDenver, CO$160K - $200K/yrView detailsHitachi
21 days agoAI and Data Engineer
Santa Clara, California$29/hrView detailsDRW
9 days agoData Engineer, Unified Platform
LondonView detailsVisa
26 days agoData Engineer
Bengaluru, IndiaView detailsWix
26 days agoData Engineer
Tel Aviv-Yafo, Tel Aviv District, IsraelView detailsOddball
23 days agoData Engineer
RemoteView details
BJAK
18 days agoData Engineer
MalaysiaView details
BJAK
18 days agoData Engineer
IndonesiaView details
BJAK
18 days agoData Engineer
ChinaView detailsYahoo
19 days agoData Engineer - AI Semantic Analytics
United States$76K - $159K/yrView details
Looking for something different?
Browse all AI jobsNever miss a new AI job
Get the latest AI jobs delivered to your inbox every week. Free, no spam.
