Deep Learning Performance Engineer
Engineering – Engineering
hybrid: San Francisco, CA
Salary range $170,112 - $237,000
added Sat Oct 14, 2023
Anyscale is based in San Francisco, CA. Employees are required to come into the office three days a week.
About the role:
We are particularly looking for motivated, game-changing Senior+ engineers who can drive performance optimization! We are open to both hands-on individual contributors and highly technical engineers with prior management experience.
This role suits passionate individuals who want to enable developers in the upcoming generative AI/LLM revolution. We’re hiring exceptional Software Engineers and Research Engineers (or hybrids of the two) to help us build out Anyscale’s LLM offering, which extends our work on high-performance LLM inference.
We're looking for THE BEST engineers, who are excited to build advanced LLM applications as well as the platform and infrastructure to enable them.

As part of this role, you will:

  • Build extensions to existing open-source LLMs, such as adding support for function templates
  • Push the boundaries of existing LLM applications (e.g. building cutting edge question answering applications)
  • Develop features to enable production deployment of LLMs (e.g. what does CI for LLMs look like? How do you do evals of LLMs?)
  • Work on systematically improving the quality of LLM applications
  • Jointly define your own projects as the ecosystem evolves
  • Work closely with the first 50 users of the things you build
  • Help us build a world class company

We'd love to hear from you if you have:

  • 3+ years of experience working as an applied scientist, research engineer or software engineer focused on LLMs
  • Enthusiasm for coding 50% or more of your time
  • Solid fundamentals in algorithms, data structures, system design
  • Domain expertise in LLMs and generative AI

Bonus points!

  • Experience working with systems engineering aspects of LLMs (e.g. distributed training, autoscaling inference, etc.)
  • Experience with approaches to LLM model improvement and fine-tuning (such as LoRA and RLHF)
  • Published research in the Gen AI space
  • Experience using Ray

Compensation

  • At Anyscale, we take a market-based approach to compensation. We are data-driven, transparent, and consistent. The target salary for this role is $170,112 - $237,000. As the market data changes over time, the target salary for this role may be adjusted.
This role is also eligible to participate in Anyscale's Equity and Benefits offerings, including the following:
  • Stock Options
  • Healthcare plans, with premiums covered by Anyscale at 99%
  • 401k Retirement Plan
  • Wellness stipend
  • Education stipend
  • Paid Parental Leave
  • Flexible Time Off
  • Commute reimbursement
  • 100% of in-office meals covered
Anyscale Inc. is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law. Anyscale Inc. is an E-Verify company, and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish.
At Anyscale, we're on a mission to democratize distributed computing and make it accessible to software developers of all skill levels. We’re commercializing Ray, a popular open-source project that's creating an ecosystem of libraries for scalable machine learning. Companies like OpenAI, Uber, Spotify, Instacart, Cruise, and many more, have Ray in their tech stacks to accelerate the progress of AI applications out into the real world.

With Anyscale, we’re building the best place to run Ray, so that any developer or data scientist can scale an ML application from their laptop to the cluster without needing to be a distributed systems expert.

We're a San Francisco-based company, proud to be backed by $250+ million from top-tier investors like Andreessen Horowitz, NEA, and Addition.