AssemblyAI has announced a series of new features and integrations designed to bolster the capabilities of speech AI applications. These enhancements include leveraging Large Language Models (LLMs) and partnerships with top platforms such as LangChain, LlamaIndex, Twilio, and AWS, according to AssemblyAI.
Utilizing Large Language Models with Voice Data
AssemblyAI is introducing new guides to help developers get more from their voice data using LLMs. These guides detail how to ask questions, summarize, extract, and generate content from audio data. The guides are part of AssemblyAI’s commitment to providing comprehensive resources for developers looking to enhance their applications with advanced AI capabilities.
Expanding Integrations for Enhanced Functionality
A key aspect of AssemblyAI's latest update is the introduction of integrations with leading platforms. Developers can now build LLM applications that handle audio data using LangChain, create searchable audio archives with LlamaIndex, and improve call transcription with Twilio. Detailed information on these integrations is available on AssemblyAI’s integrations page.
These integrations are designed to make it easier for developers to incorporate advanced speech AI capabilities into their applications, thereby enhancing the user experience and expanding the potential use cases for AssemblyAI’s technology.
New Tutorials and Resources
AssemblyAI has also released several new tutorials and resources to help developers make the most of its technology. These include:
- Creating Multi-Lingual Subtitles with AssemblyAI and DeepL: This guide teaches developers how to build a web app in Go that uses AssemblyAI to transcribe video files and generate subtitles.
- Building an AI-powered Video Conferencing App with Next.js and Stream: Learn how to create a video conferencing app that supports live transcriptions and an LLM-powered meeting assistant.
- Implementing Hotword Detection with Streaming Speech-to-Text and Go: This tutorial demonstrates how to create a hotword detection system using AssemblyAI’s Streaming Speech-to-Text API.
Trending YouTube Tutorials
In addition to written guides, AssemblyAI has also shared trending YouTube tutorials to help developers explore the full potential of its technology. Highlights include:
- Creating Speaker-Based Subtitles for Videos with AI (Python Tutorial): A tutorial on AI speaker diarization to create subtitles that dynamically change color based on the speaker.
- Building an AI Voice Translator (Python + Gradio Tutorial): Learn to build a voice translator that can translate voice into over 30 languages.
- Creating an AI Chat Bot in Java: This guide covers how to build an AI chat bot in Java that accepts real-time audio input as prompts using AssemblyAI and Claude.