🎙️

Audio & Voice

Text-to-speech, voice cloning, transcription, and music generation

10 tools

Deploy real-time AI agents that talk, type, and take action with voice synthesis, multimodal capabilities, and enterprise integration for customer support and business automation.

voice-agentsmultimodalcustomer-support

View tool →Visit

DeepgramFreemium

Audio & Voice

Deepgram helps developers build conversational AI using enterprise voice AI APIs for speech-to-text, text-to-speech, and real-time voice agent processing.

voice-aispeech-recognitiontext-to-speech

View tool →Visit

Cartesia SonicPaid

Audio & Voice

Cartesia Sonic is a real-time text-to-speech API with ultra-low latency (90ms) generating natural, expressive voices with laughter and emotion controls across 40+ languages for AI voice agents.

text-to-speechvoice-generationvoice-agents

View tool →Visit

Noiz AIFree

Audio & Voice

Noiz AI is a text-to-speech platform that generates emotionally expressive voice output using emoji-based tone control. It enables users to create natural, nuanced voices for storytelling and messaging.

text-to-speechvoice-generationstorytelling

View tool →Visit

Beatoven.aiFreemium

Audio & Voice

Beatoven.ai generates royalty-free background music and soundscapes for videos and podcasts using AI composition technology.

music-generationroyalty-freeai-composer

View tool →Visit

Play.htFreemium

Audio & Voice

Play.ht generates natural-sounding AI voiceovers from text, offering multiple voice options for content creators, marketers, and developers.

text-to-speechvoiceoverai-voice

View tool →Visit

KrispFreemium

Audio & Voice

Krisp uses AI to remove background noise and distractions from audio during calls and recordings, improving voice quality in virtual meetings.

noise-cancellationaudio-qualitymeetings

View tool →Visit

SynthwaveFreemium

Audio & Voice

Synthwave is an AI music creation platform that generates original compositions and soundtracks in various genres for creative projects.

music-generationai-musicsoundtrack

View tool →Visit

VocapiaFreemium

Audio & Voice

Vocapia uses AI to transcribe and index audio and video content for search and accessibility, helping organizations make media searchable.

transcriptionaudio-indexingaccessibility

View tool →Visit

UdioFreemium

Audio & Voice

Udio is an AI music generation platform that enables creators to compose, customize, and generate original music tracks using artificial intelligence.

music-generationaudio-creationai-music

View tool →Visit