overlay
Researcher, Speech
ASR
remote
added Tue Oct 24, 2023
link-outApply to AssemblyAI

About the role:

AssemblyAI is growing quickly, and we’re searching for a Speech Researcher specializing in Speech Technology to join our Speech team! With significant investment and strong leadership to fuel our growth, it’s the perfect time to join AssemblyAI.

What You’ll Do:

  • Execute ambitious research projects that have the potential for a high impact on AssemblyAI’s products and services, as well as on external research and technology communities
  • Collaborate with senior researchers and the technology leadership team to effectively execute those research projects and communicate the progresses of the projects
  • Engage with research communities through activities such as publishing experimental results or participating in conferences
  • Stay up-to-date on the latest research in the speech domain and share this knowledge across the company
  • Replicate and examine state-of-the-art speech models based on conference/journal publications
  • Collaborate with the engineering team to deploy new models into production

What You’ll Need:

  • PhD in Computer Science or a related field; or Master's degree in Computer Science or a related field plus equivalent practical experience
  • 3+ years of experience in speech research based on deep learning or machine learning in either an academic or industry setting. Specific research topics include, but are not limited to:
    • ASR (e.g., model architecture, ASR robustness).
    • Language modeling
    • Speaker modeling (e.g., speaker verification, speaker identification, speaker diarization)
    • Self-supervised learning for speech or speech-text
    • Multilingual speech modeling (multilingual ASR, speech translation, language identification)
    • Applications of ASR (e.g., spoken dialogue modeling, speech summarization)
    • Paralinguistics (e.g., speech emotion recognition)
  • A track record of widely recognized research publications at leading conferences
  • Excellent verbal and written communication skills in technical matters

Nice to Have:

  • 2+ years experience in applied research in the speech or related industry
  • Demonstrated ability to commercialize research results

Pay Transparency:

AssemblyAI strives to recruit and retain exceptional talent from diverse backgrounds while ensuring pay equity for our team. Our salary ranges are based on paying competitively for our size, stage and industry, and are one part of many compensation, benefits and other reward opportunities we provide.

There are many factors that go into salary determinations, including relevant experience, skill level and qualifications assessed during the interview process, and maintaining internal equity with peers on the team. The range shared below is a general expectation for the function as posted, but we are also open to considering candidates who may be more or less experienced than outlined in the job description. In this case, we will communicate any updates in the expected salary range.

Lastly, the provided range is the expected salary for candidates in the U.S. Outside of those regions, there may be a change in the range, which again, will be communicated to candidates.

Salary range: $140K - $170K+

Working at AssemblyAI

We are a small but mighty group of problem solvers, innovators, and experienced AI researchers with over 20 years of expertise in Machine Learning, Speech Recognition, and NLP. As a fully remote team, we’re looking for people to join our team who are ambitious, curious, and self-motivated. We put a lot of trust and autonomy into everyone on our team and want to find people who will add to our culture, not just fit in.

We’re committed to creating a space where our employees can bring their full selves to work and have equal opportunity to succeed. So regardless of race, gender identity or expression, sexual orientation, religion, origin, ability, age, veteran status, if joining this mission speaks to you, we encourage you to apply!

Keep Exploring AssemblyAI:

Check us out on YouTube!

Learn more about AI models for speech recognition

Core Transcription | Audio Intelligence | LeMUR | Try the Playground

Our $30M Series B fundraise

AssemblyAI is a remote-first AI company building powerful deep learning models for developers, startups, and enterprises to transcribe and understand their audio data.

Our Automated Speech Recognition (ASR) models already outperform companies like Google, AWS, and Microsoft - which is why hundreds of companies and thousands of developers are using our APIs to transcribe and understand millions of videos, podcasts, phone calls, and zoom meetings every day. Our APIs power innovative products like conversational intelligence platforms, zoom meeting summarizers, content moderation, and automatic closed captioning.

AssemblyAI’s Speech-to-Text APIs are already trusted by Fortune 500s, startups, and thousands of developers around the world, with well-known customers including Spotify, Algolia, Dow Jones, Happy Scribe, BBC, The Wall Street Journal, and NBCUniversal. As part of a huge and emerging market, AssemblyAI is well on its way to becoming the leader in speech recognition and NLP.

We're growing at breakneck speed, and recently announced our Series B round. We've raised $63M in total funding, and are backed by leading investors including Insight Partners, Accel, Y Combinator, Patrick and John Collision (Founders of Stripe), Nat Friedman (Former CEO of GitHub), and Daniel Gross (Entrepreneur & Investor in companies including GitHub, Uber & SpaceX)!

Our ambition is to build an iconic AI company, making advanced deep learning technology accessible to everyday developers through a simple API, good docs, and a great developer experience.

Join our world-class, remote team and help us build an iconic deep learning company!