About the role:
We are looking for Machine Learning Engineers who are passionate about generative models and creative applications of AI to join our audio team. In particular, we are looking for people who have experience developing model serving pipelines that operate at scale and who know state-of-the-art techniques for optimization and feature development. We want highly creative ML engineers who are motivated to push the boundaries of generative audio models. You will have access to state-of-the-art high-performance computing resources and will work alongside top researchers and engineers to make an impact in the fast-growing world of generative AI.
Responsibilities:
- Lead the design, development and production of customer-facing ML music, speech and audio generation systems, with a focus on inference and API environments
- Work with the Audio, Platform and Inference teams on building pipelines for the next generation of models, where you may assist with areas such as optimization, model tuning and deployment, HPC clusters, and tooling
- Be a strategic thought partner for leaders across the organization on driving business impact through machine learning
- Work on the commercial side: productionizing generative models and building the infrastructure to serve them at scale
- Produce events and metrics in our data warehouse so that we can analyze critical business measures such as cost, performance, and reliability
- Be part of the team that brings new Stability audio models and pipelines into existence for API customers
- Prototype and productionize inference platform improvements and new features
Qualifications:
- 5+ years working on machine learning projects, including inference and pipeline development
- Solid knowledge of the Python scientific stack, PyTorch and at least one high-performance inference framework (e.g. TensorRT)
- Experience profiling and optimizing deep neural networks, including knowledge of GPU profiling tools such as NVIDIA Nsight
- Experience with Python audio processing libraries such as librosa, torchaudio, or similar
- Experience with cloud orchestration systems such as Kubernetes and cloud providers such as AWS, GCP, or Azure
- Ability to rapidly prototype solutions and iterate on them with tight product deadlines
- Experience training and/or deploying ML models on AWS (SageMaker a plus) or Google Cloud
- Strong communication, collaboration, and documentation skills
- Experience with Linux and command line tools
- Evidence of interest in music / audio projects is valued
Equal Employment Opportunity:
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.
Stability AI is a community- and mission-driven, open-source artificial intelligence company that cares deeply about real-world implications and applications. Our most significant advances grow from our diversity in working across multiple teams and disciplines. We are unafraid to go against established norms and explore creativity. We are motivated to generate breakthrough ideas and convert them into tangible solutions. Our vibrant communities consist of experts, leaders and partners across the globe who are developing cutting-edge open AI models for Image, Language, Audio, Video, 3D and Biology.