- Видео 152
- Просмотров 707 710
Thorsten-Voice
Германия
Добавлен 12 ноя 2013
Guude! (hi, nice to see you) 👋,
i'm Thorsten 😊.
You like open source, privacy aware and local running voice technology? Me too 😎. You'll find cooking recipe like tutorials on TTS, STT, Voice Assistants, AI, ML and way more cool stuff here. So, hop on and join my amazing community 🥰.
#opensource #voice #cloning #technology #news #tutorial #local #privacy #tech #tts #stt #voiceassistant #raspberrypi #smarthome #homeassistant
* My project website: www.Thorsten-Voice.de
* Me on GitHub: github.com/thorstenMueller
i'm Thorsten 😊.
You like open source, privacy aware and local running voice technology? Me too 😎. You'll find cooking recipe like tutorials on TTS, STT, Voice Assistants, AI, ML and way more cool stuff here. So, hop on and join my amazing community 🥰.
#opensource #voice #cloning #technology #news #tutorial #local #privacy #tech #tts #stt #voiceassistant #raspberrypi #smarthome #homeassistant
* My project website: www.Thorsten-Voice.de
* Me on GitHub: github.com/thorstenMueller
Your AI Voice Sounds WRONG! Here's Why 🤖 → 🗣️
Learn how to dramatically improve your AI text-to-speech output through proper text cleaning and normalization techniques. In this tutorial, I'll show you:
✓ Common text issues that ruin TTS quality
✓ Step-by-step text cleaning process
✓ How to handle numbers, abbreviations, and special characters
✓ Universal techniques that work with any TTS engine
Whether you're using commercial or open-source TTS solutions, these text preprocessing steps will help you achieve more natural-sounding speech output. I will use @NVIDIA NeMo for text cleaning / normalization as one possibility.
#TextToSpeech #TTS #AIVoice #Tutorial #VoiceAI
00:00 Intro & samples / goals
01:32 What to achived by tutorial end
03:13 Wha...
✓ Common text issues that ruin TTS quality
✓ Step-by-step text cleaning process
✓ How to handle numbers, abbreviations, and special characters
✓ Universal techniques that work with any TTS engine
Whether you're using commercial or open-source TTS solutions, these text preprocessing steps will help you achieve more natural-sounding speech output. I will use @NVIDIA NeMo for text cleaning / normalization as one possibility.
#TextToSpeech #TTS #AIVoice #Tutorial #VoiceAI
00:00 Intro & samples / goals
01:32 What to achived by tutorial end
03:13 Wha...
Просмотров: 732
Видео
🎙️ Home Assistant Voice Preview Edition (VPE) #03 | Local Setup with Whisper & Piper 🗣️
Просмотров 1,3 тыс.21 день назад
Welcome to our new series about Home Assistant Voice! In this episode, we'll setup the device for local voice processing using whisper (STT) and piper (TTS). 📋 What we'll cover: - Install / configure OpenAI whisper for speech recognition - Install /configure Piper TTS for speech synthesis - Add Wyoming protocol - Configure voice assistant - Turning on/off entities with local voice control ⚡ Dev...
🎙️ Home Assistant Voice Preview Edition (VPE) #02 | First Setup & Connection 🔌
Просмотров 50421 день назад
After unboxing Home Assistant Voice in the previous episode, let's get this device up and running! In this video, we'll go through the initial setup process and connect the device to your Home Assistant installation. 📋 What we'll cover: - Powering on the device for the first time - Connecting to Home Assistant - Exploring created entities - Demo entities overview ⚡ Using Home Assistant version:...
🎙️ Home Assistant Voice Preview Edition (VPE) #01 | Unboxing & Tech Specs 📦
Просмотров 38821 день назад
Welcome to our new series about Home Assistant Voice! In this first episode, we'll unbox this exciting new device and take a detailed look at its technical specifications. 📋 What we'll cover: - Complete unboxing experience - Box contents overview - Hardware specifications - Quick look at documentation - Preview of upcoming episodes ⚡ Device Details: - Home Assistant Voice Preview Edition - Rele...
F5 Text to Speech Tutorial | Hit "Refresh" on Your AI Voice!
Просмотров 6 тыс.2 месяца назад
🔥🔥🔥 Impressive voice cloning with F5 TTS! Clone your voice with a few seconds audio data for your personal AI voice. Step-by-step tutorial For comparison reason - here's my computer spec: * CPU: 4x Intel(R) Core(TM) i5-3550 CPU @ 3.30GHz * RAM: 16GB * GPU: NVIDIA GeForce GTX 1050 Ti Based on some comments you might want to watch it on 1.5x speed 😁. Thanks to @kardiokode-g8v for pointing out lic...
3 steps to run HuggingFace 🤗 "Parler TTS" AI Voice on your local machine
Просмотров 7 тыс.3 месяца назад
How to run "Parler TTS" from @HuggingFace on your local machine in 3 simple steps (using python code)! Including audio samples. #python #parler #tts #huggingface 00:00 Intro 02:22 Parler TTS Github repo 03:10 Dataset basis for Parler TTS 05:40 Huggingface space to try it out 06:20 Set up Python venv for Parler TTS & Install 09:47 Using python script to synthesize audio 14:45 Synthesizing audio ...
Best AI Voice Generator | 2024.08
Просмотров 21 тыс.5 месяцев назад
Free #TTS with #Mars5 #Parler #MetaVoice #Toucan and #ChatTTS. First look and comparison video on voice cloning and more. Thanks to you great #opensource text to speech projects and @HuggingFace for providing cool spaces to play around with 🤗. And thank you "VB" for pointing to these cool projects on LinkedIn 👏: www.linkedin.com/posts/vaibhavs10_text-to-speech-ecosystem-has-been-booming-activit...
Automate Voice Dataset Creation Using Whisper AI
Просмотров 2,1 тыс.6 месяцев назад
Easy tutorial on creating a structured voice dataset on raw audio data using Python and Whisper by OpenAI for speech recognition. #ai #whisper #tts #voice #data #python 00:00 Intro 01:10 Set up python virtual environment 03:00 Working with "the magic" script :) 07:00 Run voice dataset generation with Whisper AI STT 07:58 Checking results 09:45 Outro * github.com/thorstenMueller/Audio-to-Voice-D...
TTS Voice Dataset | LJSpeech | Voice Cloning
Просмотров 2,8 тыс.6 месяцев назад
Close look to ljspeech voice dataset and it's structure for tts voice cloning. The ljspeech voice dataset is widely supported by tts voice cloning software. Videos is describing the structure and how you can create it for your personal voice clone. 00:00 Intro 02:23 LJSpeech info and download 04:15 LJSpeech in research (Google Scholar) 05:17 Close look to the voice dataset file structure 06:25 ...
Unlock AI Superpowers with NVIDIA CUDA: Boost Performance in Python!
Просмотров 1,6 тыс.7 месяцев назад
Boost your AI performance by using NVIDIA CUDA on Windows. Step by step tutorial on how to use CUDA with Python / pytorch and performance comparison with Coqui TTS. #performance #nvidia #python #ai #machinelearning #tts Please subscribe to my channel 😊. ruclips.net/user/ThorstenMueller Thanks dear @MightyReiti for your inspiration and support on my new recording setup ❤️. 00:00 Intro 01:55 What...
Home Assistant ❤️ Voice - Tutorial 05 - Wyoming protocol
Просмотров 5 тыс.10 месяцев назад
Home Assistant ❤️ Voice - Tutorial 05 - Wyoming protocol
Home Assistant ❤️ Voice - Tutorial 04 - Piper TTS
Просмотров 8 тыс.10 месяцев назад
Home Assistant ❤️ Voice - Tutorial 04 - Piper TTS
Home Assistant ❤️ Voice - Tutorial 03 - Conversation / NLP
Просмотров 1,6 тыс.10 месяцев назад
Home Assistant ❤️ Voice - Tutorial 03 - Conversation / NLP
Home Assistant ❤️ Voice - Tutorial 02 - Text Assist
Просмотров 2 тыс.10 месяцев назад
Home Assistant ❤️ Voice - Tutorial 02 - Text Assist
Home Assistant ❤️ Voice - Tutorial 01 - Basic setup & demo entities
Просмотров 4,5 тыс.10 месяцев назад
Home Assistant ❤️ Voice - Tutorial 01 - Basic setup & demo entities
Running a local Piper TTS server with Python on Linux
Просмотров 7 тыс.11 месяцев назад
Running a local Piper TTS server with Python on Linux
🔥 Voice interview Michael Hansen | HA | Raspberry | Piper | Rhasspy
Просмотров 2,2 тыс.11 месяцев назад
🔥 Voice interview Michael Hansen | HA | Raspberry | Piper | Rhasspy
Local voice cloning with 6 seconds audio | Coqui XTTS on Windows
Просмотров 47 тыс.Год назад
Local voice cloning with 6 seconds audio | Coqui XTTS on Windows
🇩🇪 Künstliche Sprachausgabe uff Hessisch | Kostenlos und OHNE CLOUD !
Просмотров 1,1 тыс.Год назад
🇩🇪 Künstliche Sprachausgabe uff Hessisch | Kostenlos und OHNE CLOUD !
TEXT TO SPEECH | Piper TTS on Windows 🚀 AI voice 10x faster Realtime!
Просмотров 30 тыс.Год назад
TEXT TO SPEECH | Piper TTS on Windows 🚀 AI voice 10x faster Realtime!
XTTS FAQ | Interview with Josh Meyer from Coqui AI
Просмотров 2,3 тыс.Год назад
XTTS FAQ | Interview with Josh Meyer from Coqui AI
Python virtual environment / venv | Windows, Linux & Mac OS X
Просмотров 3,3 тыс.Год назад
Python virtual environment / venv | Windows, Linux & Mac OS X
Free voice recording for BEST voice cloning | Piper-Recording-Studio | Windows
Просмотров 10 тыс.Год назад
Free voice recording for BEST voice cloning | Piper-Recording-Studio | Windows
Is Mycroft Mark 2 the better Alexa?! | Private | Voice Assistant
Просмотров 3,8 тыс.Год назад
Is Mycroft Mark 2 the better Alexa?! | Private | Voice Assistant
Create your AI digital voice clone locally with Piper TTS | Tutorial
Просмотров 54 тыс.Год назад
Create your AI digital voice clone locally with Piper TTS | Tutorial
Increase Text to Speech pronunciation quality with eSpeak | Tutorial
Просмотров 13 тыс.Год назад
Increase Text to Speech pronunciation quality with eSpeak | Tutorial
Talk locally (no ChatGPT) with your documents 😄 | PrivateGPT + Whisper + Coqui TTS
Просмотров 6 тыс.Год назад
Talk locally (no ChatGPT) with your documents 😄 | PrivateGPT Whisper Coqui TTS
Raspberry Pi | Local TTS | High Quality | Faster Realtime with Piper TTS
Просмотров 33 тыс.Год назад
Raspberry Pi | Local TTS | High Quality | Faster Realtime with Piper TTS
Thorsten-Voice TTS in Windows nutzen | DDC / VITS
Просмотров 6 тыс.Год назад
Thorsten-Voice TTS in Windows nutzen | DDC / VITS
Thorsten-Voice TTS in Linux nutzen | DDC / VITS / Piper
Просмотров 3,5 тыс.Год назад
Thorsten-Voice TTS in Linux nutzen | DDC / VITS / Piper