CloudWalk
Company
Machine Learning Engineer (Distributed Training)
Remote
Job Description
About CloudWalk:
CloudWalk is building the intelligent infrastructure for the future of financial services. Powered by AI, blockchain, and thoughtful design, our systems serve millions of entrepreneurs across Brazil and the US every day.
Our AI team trains large-scale language models that power real products - from payment intelligence and credit scoring to on-device assistants for merchants.
About the Role:
We’re looking for a Research Engineer to design, scale, and evolve CloudWalk’s distributed training stack for large language models. You’ll work at the intersection of research and infrastructure - running experiments across DeepSpeed, FSDP, Hugging Face Accelerate, and emerging frameworks like Unsloth, TorchTitan, and Axolotl.
You’ll own the full training lifecycle: from cluster orchestration and data streaming to throughput optimization and checkpointing at scale. If you enjoy pushing the limits of GPUs, distributed systems, and next-generation training frameworks, this role is for you.
Responsibilities:
Requirements:
Bonus:
Our process is simple: a deep conversation on distributed systems and LLM training, and a cultural interview.
Competitive salary, equity, and the opportunity to shape the next generation of large-scale AI infrastructure at CloudWalk.
CloudWalk
14 jobs posted
About the job
Similar Jobs
Discover more opportunities that match your interests
- 21 days ago
Machine Learning Engineer
Twilio
RemoteView details - 29 days ago
Machine Learning Engineer, Safety
xAI
Palo Alto, CAView details - 28 days ago
Machine Learning Platform Engineer -
Synthesia
Amsterdam; Europe; Munich; UK; ZurichView details - 28 days ago
Machine Learning Engineer (GenAI)
GPTZero
New York HQView details
26 days agoMachine Learning Engineer (LLM)
BJAK
United KingdomView details
26 days agoMachine Learning Engineer (LLM)
BJAK
Hong KongView details
25 days agoMachine Learning Engineer (LLM)
BJAK
ThailandView details
25 days agoMachine Learning Engineer (LLM)
BJAK
VietnamView details
25 days agoMachine Learning Engineer (LLM)
BJAK
SingaporeView details
25 days agoMachine Learning Engineer (LLM)
BJAK
IndonesiaView details
View all ML Engineer jobs
Looking for something different?
Browse all AI jobs