Machine Learning Engineer (Distributed Training)
Posted 119 days ago
Job Description
This job posting has expired and is no longer accepting applications.
About CloudWalk:
CloudWalk is building the intelligent infrastructure for the future of financial services. Powered by AI, blockchain, and thoughtful design, our systems serve millions of entrepreneurs across Brazil and the US every day.
Our AI team trains large-scale language models that power real products, from payment intelligence and credit scoring to on-device assistants for merchants.
About the Role:
We’re looking for a Research Engineer to design, scale, and evolve CloudWalk’s distributed training stack for large language models. You’ll work at the intersection of research and infrastructure, running experiments across DeepSpeed, FSDP, Hugging Face Accelerate, and emerging frameworks like Unsloth, TorchTitan, and Axolotl.
You’ll own the full training lifecycle: from cluster orchestration and data streaming to throughput optimization and checkpointing at scale. If you enjoy pushing the limits of GPUs, distributed systems, and next-generation training frameworks, this role is for you.
Responsibilities:
Requirements:
Bonus:
Our process is simple: a deep conversation on distributed systems and LLM training, and a cultural interview.
Competitive salary, equity, and the opportunity to shape the next generation of large-scale AI infrastructure at CloudWalk.