How I Do Voice Cloning in Other Languages with Tortoise TTS - Dataset and Tokenizer
- Published: 20 Sep 2024
- Links referenced in the video:
Github - github.com/Jar...
Karpathy's tokenizer video - • Let's build the GPT To...
Timestamps:
0:40 - Explaining the process
1:23 - ytdlp script
3:30 - Transcription script with whisperx
7:20 - Merge folders after transcription
8:30 - Resampling to 22 kHz
13:25 - Uploaded scripts :)!
14:03 - Making a tokenizer in another language
16:30 - What is the tokenizer for?
21:05 - Quick explanation on tortoise cleaners
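The resampling step at 8:30 can be illustrated with a minimal sketch. This is not the script from the video — real pipelines would use ffmpeg or librosa — but linear interpolation shows what converting audio to Tortoise's expected 22,050 Hz rate actually does (`resample` here is a hypothetical helper):

```python
# Minimal sketch: resample a mono signal to 22050 Hz by linear
# interpolation. Illustrates the rate conversion only; real dataset
# prep should use ffmpeg or librosa, which apply anti-aliasing filters.

TARGET_SR = 22050

def resample(samples, src_sr, dst_sr=TARGET_SR):
    """Linearly interpolate `samples` from src_sr to dst_sr."""
    if src_sr == dst_sr:
        return list(samples)
    n_out = int(len(samples) * dst_sr / src_sr)
    out = []
    for i in range(n_out):
        pos = i * src_sr / dst_sr          # fractional index into the source
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out

if __name__ == "__main__":
    one_second_at_44k = [0.0] * 44100
    print(len(resample(one_second_at_44k, 44100)))  # 22050
```

In practice `ffmpeg -ar 22050` or `librosa.resample` does the same conversion with proper filtering; naive interpolation like this can alias high frequencies.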
Hardware for my PC:
Graphics Card - amzn.to/3pcREux
CPU - amzn.to/43O66Ir
Cooler - amzn.to/3p98TwX
RAM - amzn.to/3NBAsIq
SSD Storage - amzn.to/42NgMFR
Power Supply (PSU) - amzn.to/430bIhy
PC Case - amzn.to/447499T
Mother Board - amzn.to/3CziMXI
Alternative prebuilds to my PC:
Corsair Vengeance i7400 - amzn.to/3p64r22
MSI MPG Velox - amzn.to/42MnJHl
Cheapest recommended PC:
Cyberpower 3060 - amzn.to/3XjtZoP
Come join The Learning Journey!
Discord - / discord
Github - github.com/Jar...
TikTok - / jarodsjourney
If you found anything helpful, please consider supporting me and the content I am trying to produce!
www.buymeacoff...
Will this be on GitHub later? Also, I appreciate your effort in making these kinds of videos. Keep up the good work.
I don't understand where I went wrong. I'm training a Vietnamese model. I used about 1 hour of my own voice for training and created a tokenizer with your Python file for the Vietnamese language ("vi"). Then I tested it with a sentence that was already in the audio samples. It produced sound in my voice, but the output was meaningless — not Vietnamese at all. Please tell me where I went wrong?
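The tokenizer step behind this question (14:03 in the video) is worth illustrating. The video's actual tokenizer is BPE-trained, but the core requirement is simpler: the vocabulary must cover every character of the target language, or symbols like Vietnamese diacritics get dropped. A hedged character-level sketch (`build_char_vocab` is a hypothetical helper, not the video's script):

```python
# Hedged sketch: build a character-level vocabulary from transcripts so a
# custom tokenizer covers every symbol in the target language (e.g.
# Vietnamese diacritics). The real tokenizer is BPE-trained; this only
# shows why an English-only vocabulary mangles other languages.
import json

def build_char_vocab(transcripts, specials=("[STOP]", "[UNK]", "[SPACE]")):
    """Map special tokens, then every distinct character, to integer ids."""
    chars = sorted({c for line in transcripts
                      for c in line.lower() if not c.isspace()})
    vocab = {tok: i for i, tok in enumerate(specials)}
    for c in chars:
        vocab[c] = len(vocab)
    return vocab

if __name__ == "__main__":
    vocab = build_char_vocab(["Xin chào", "tiếng Việt"])
    print(json.dumps(vocab, ensure_ascii=False))
```

If a character from your dataset is missing from the vocabulary, the model can never emit or condition on it — one plausible cause of garbled non-English output.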
Thanks 🙏 you deserve more subscribers.
I deserve more subscribers too, not only Jarod.
Is there some place we can just download some working voices? I don't need a "specific" voice, just something as polished as possible. I'm wondering if I can use this to do higher-quality TTS for listening to documents or ebooks. The processing time seems like it will make that impossible regardless, but I'd like to have some reasonable voices in the can just to play with. I've tried making a couple of voices; they work, but they're not great. I just want to download a polished sample voice if possible.
Wow! Outstanding! Can you please tell me: when taking the playlists for training, were those from a single speaker or several?
Nice explanation
Jarod, what do you think: if there is a Hugging Face dataset that contains audio tracks and transcription text for them, is it possible to use such a dataset with this project, without having to extract the audio yourself?
P.S. A very useful video, especially the part about how english_cleaners breaks non-English languages. I'm going to put together a Slavic tokenizer.
P.P.S. I'm looking forward to the second part of the preparation!
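On the Hugging Face dataset question above: regardless of where the audio comes from, training ultimately wants paired clips and transcripts on disk. Assuming the pairs are already extracted (the `datasets` loading itself is not shown), writing an LJSpeech-style metadata file — the `path|text` format Tortoise fine-tuning setups commonly expect — is only a few lines; `write_metadata` is a hypothetical helper:

```python
# Hedged sketch: write an LJSpeech-style metadata file from (wav, text)
# pairs. Whether the pairs come from yt-dlp + whisperx or a Hugging Face
# dataset, this "relative/path.wav|transcript" layout is a common target.
def write_metadata(pairs, path="train.txt"):
    """pairs: iterable of (wav_relpath, transcript) tuples."""
    with open(path, "w", encoding="utf-8") as f:
        for wav, text in pairs:
            f.write(f"{wav}|{text.strip()}\n")

if __name__ == "__main__":
    write_metadata([("wavs/0.wav", "xin chào"),
                    ("wavs/1.wav", "tiếng Việt")])
```

The exact filename and delimiter depend on the training config you point at it, so check your setup before relying on this layout.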
I'm also thinking about verifying the transcriptions of audio tracks from such datasets, since I've seen cases where the transcription and the audio don't match (sometimes people mess around and record unrelated sounds). The idea is to exclude tracks that mostly don't match what's in the transcript.
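The mismatch check described above can be sketched cheaply: re-transcribe each clip with ASR (e.g. whisperx, not shown here), compare the dataset's transcript against the ASR output, and drop pairs that disagree too much. `keep_pair` and the 0.8 threshold are illustrative assumptions, not a tuned recipe:

```python
# Hedged sketch: filter out audio/transcript pairs whose given transcript
# disagrees with an ASR re-transcription. difflib gives a cheap,
# dependency-free similarity; edit distance or WER would be more standard.
from difflib import SequenceMatcher

def keep_pair(given, asr, threshold=0.8):
    """True if the given transcript and the ASR output are similar enough."""
    ratio = SequenceMatcher(None, given.lower(), asr.lower()).ratio()
    return ratio >= threshold

if __name__ == "__main__":
    print(keep_pair("xin chào các bạn", "xin chào các bạn"))
    print(keep_pair("xin chào các bạn", "completely unrelated sounds"))
```

Character-level similarity is crude — a proper pipeline would compute word error rate after text normalization — but even this catches the "unrelated sounds" clips the comment describes.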
Hi sir!
You're doing a great job with TTS. Are you planning to release the Hindi TTS model?
Won't be releasing the model, but the code to train it will all be available.
Thank you! What is your recommendation for dataset length for a high-quality result?
Do we need to make a new tokenizer if the language only uses Latin characters?
Suppose I have Hindi-language audio with transcriptions I created manually or with a script,
You changed the code to the point that many things are broken. For now, it's unusable.
Can you share the code you used in this video?
I appreciate your work, but it's complicated to understand. Could you please explain with simple examples?