Generative AI Inference Engineer
Posted 26 days ago
Job Description
Generative AI Inference Engineer
<Remote>
About the role:
We are seeking passionate Machine Learning Engineers to join our Inference team, focusing on the creative applications of generative AI models. The ideal candidate will have substantial experience developing and running inference for multi-modal models. A deep understanding of diffusion model architectures and familiarity with workflow tools like ComfyUI are a big plus. You will be expected to leverage and push the boundaries of state-of-the-art inference optimization techniques for multi-modal generative models. This role offers the opportunity to work alongside top researchers and engineers, utilizing cutting-edge high-performance computing resources to make a significant impact in the rapidly evolving field of generative AI.
Responsibilities:
- Lead efforts to drive the design, development of customer-facing multi modal ML inference systems.
- Work with the Platform and Inference teams on building inference systems for the next generation of models, where you will work on areas such as optimization, model tuning and deployment.
- Partner with leading cloud providers to deliver hosted Stability AI inference solutions.
- Be a strategic thought partner for leaders across the organization on driving business impact through machine learning
- Be part of the team to bring new Stability models and pipelines into existence
- Prototype and productionize inference platform improvements and new features
Qualifications:
- 7+ years working on productionizing machine learning systems, including inference pipeline development
- Expert level knowledge on writing and running python services at scale
- 5+ years working on python scientific stack, pyTorch and at least one high-performance inference framework (e.g. Triton and TensorRT)
- Deep understanding of Diffusion Architecture
- Experience profiling and optimizing deep neural networks on Nvidia GPUs, using profiling tools such as NVIDIA Nsight
- Experience with python-based image manipulation/encoding/decoding frameworks, such as OpenCV
- Experience deploying to cloud orchestration systems such as Kubernetes and cloud providers such as AWS, GCP, and Azure
- Experience with Docker
- Ability to rapidly prototype solutions and iterate on them with tight product deadlines
- Strong communication, collaboration, and documentation skills
- Experience with the open-source ML ecosystem (HuggingFace, W&B, etc.)
Equal Employment Opportunity:
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.
Stability AI
1 job posted
About the job
Similar Jobs
28d
Senior Generative AI Engineer
Datadog
Paris, FranceSenior Generative AI Engineer
Datadog
Paris, France28d6d
Helix AI Engineer, Generative AI
Figure
San Jose, CAHelix AI Engineer, Generative AI
Figure
San Jose, CA6d29d
Sr. SW Engineer- Generative AI
Visa
$111K - $172KAustin, TXSr. SW Engineer- Generative AI
Visa
$111K - $172KAustin, TX29d22d
Sr. SW Engineer- Generative AI
Visa
$111K - $172KAustin, TXSr. SW Engineer- Generative AI
Visa
$111K - $172KAustin, TX22d15d
Software Engineer - Generative AI
C3 AI
$120K - $165KRedwood City, CaliforniaSoftware Engineer - Generative AI
C3 AI
$120K - $165KRedwood City, California15d22d
Generative AI Architect-63324
Hitachi
Hyderabad, Telangana, IndiaGenerative AI Architect-63324
Hitachi
Hyderabad, Telangana, India22d9d
Technical Director, Generative AI
Monks
$115K - $130KNew YorkTechnical Director, Generative AI
Monks
$115K - $130KNew York9d19d
Principal GenAI Inference Optimization Engineer
AMD
San Jose, CAPrincipal GenAI Inference Optimization Engineer
AMD
San Jose, CA19d8d
Product Manager, Generative AI
C3 AI
$155K - $193KRedwood City, CaliforniaProduct Manager, Generative AI
C3 AI
$155K - $193KRedwood City, California8d7d
Lead Gen AI / ML Engineer
AMD
Austin, TexasLead Gen AI / ML Engineer
AMD
Austin, Texas7d
Looking for something different?
Browse all AI jobsFree AI job alerts
Get the latest AI jobs delivered to your inbox every week. Free, no spam.