Please enable JavaScript.
Coggle requires JavaScript to display documents.
topics, Audio - Coggle Diagram
topics
NLP
RAG + audio
Voice customization
Metadata GEN
Prompt to SFX/speech
Podcast
Full synthetic audio content
AI voice assistant
Music
Removing old recordings
Remixing
Fingerprinting
Syntesis
Per Instrument (stems) broadcast
Vocal/Instrument Timbre transfer
SFX
Hard FX
Folley
Ambiences
AutoSync
Speech
Speech Separation
Data cleaning
Data gathering
Noise removal
Speech Enhancement
Audio object broadcast
Diarization (who speaking when)
Speech Analysis
Speech to text
Metadata annotation
Data gathering
Translation
Codification/Compression
Captioning
Speech Syntesis
Voice cloning (<10s)
Sintetic media
Dubbing
Translation
Correction
Voice to voice
TTS
Audio
Understanding current state (Outsinde & inside)
Branches
Separation
Data Cleaning & gathering
General Speech dataset construction
Multi-class speech models
Virtual Humans
Automatic Overdub
speech correction
speech as a service
deepfake
Sintetic media
Entertainment
Learning (scripts, man + machine interations, etc)
Sports
Real time translation
Automatic captioning + translation
Denoising
Metadata generation
Data driven compression
Audio object coding solution
Introduction to broadcast standarts (MPEG-H, dolby atmos, etc)
At home AI interaction (globo talents)
Speech/ambiance enhancement
Multiple audio feeds
Audio target ad
Multiple languages automatic OTA
Remixing
Analysis
Syntesis
Voice cloning
Change speed/pitch
Voice assistants