Boson AI
Company
High Performance Computing Engineer
Santa Clara HQ
Job Description
This job posting has expired and is no longer accepting applications.
Boson AI is a startup building large language tools for everyone to use. Our founders (Alex Smola and Mu Li) and a team of Deep Learning, Optimization, NLP, AutoML, and Statistics scientists and engineers are working on high-quality generative AI models for language, audio, and entertainment.
About The Role
We are looking for a Senior High Performance Computing Engineer to help us operate the GPUs, network, and filesystem in our datacenter deployment in Toronto. The ideal candidate has strong problem-solving skills and the ability to learn new tools. Experience with Slurm, MAAS, Ceph, InfiniBand, NVIDIA DeepOps, Ethernet networking, and related tools is a big plus. You should be comfortable performing some amount of hardware configuration.
You will have the opportunity to work with NVIDIA H100 and A100 GPUs, over 20 PB of storage, terabit networking, and hundreds of computers. You will be responsible for deploying and operating a broad range of infrastructure technologies and hardware systems.
A day in the life:
You might be a great fit if you have:
The ability to solve problems and to learn new techniques is key.
