Train your custom Speech Recognition Model with Hugging Face models
HTML-код
- Опубликовано: 9 окт 2024
- This tutorial will show you how to train a custom voice recognition model using Hugging face models. With the increasing popularity of voice-enabled devices and services, having accurate and reliable voice recognition is crucial. By training your own custom voice recognition model, you can improve the accuracy of your voice-enabled applications and services, and tailor them to your specific needs.
We will walk you through the process of preparation of training data and how to fine-tune on top of a custom voice recognition model, and tuning. We will also describe how to inference this model in real-time.
Github: github.com/dre...
#speechrecognition #huggingface
claps hands for thumbnail bait
Great content sir . Thank you so much . Learnt a lot from this one . Keep growing 🎉
Thank you :)
Nice tutorial Deepak. Keep making such useful content. ❤
Thanks Avinash :)
the model you created is it based on the whole datasets?
Hello, I ran the codes and trained the model on the entire dataset, but when I run the inference code, the predictions are empty. Any idea why? Could it have anything to do with the fact that I don't have pytorch_model.bin in my model folder, but model.safetensors instead?
nope
hello sir,
does this model detect filler words ?
sir, I am note able to import huggingsound, Please help
its showing the following error :- ERROR: Ignored the following versions that require a different python version: 0.0.1 Requires-Python >=3.7,=3.7,=3.7,=3.7,=3.7,=3.7,=3.7,
@@GouravKumar-qi5gt based on the error, it seems you are not using compatible python version, could you upgrade and check
guru@@deepakjohnreji , please play input and output sounds
your voice_cloning python module have several issues
This experiment was done as a research project to test out previous approaches. Thia package has a lot of flaws, but these approaches are a great alternative to the paid services available.
Bro how to create dataset ?
Do you mean, like huggingface dataset?
@@deepakjohnreji i wont to create voice to 3d sign language animation web app in to my research. Its already have google speech recognition in my language(sinhala) its not a research i think. Then i want to create speech identification model in my language. Can you give me a better solution for it? 😊
@@suchidoodles have you tested whether Google Speech recognition solution is working well? for your data.
I am getting too many errors when I run the model_training code. I have tried running it on Google Colab, but I still cannot get any results. Can you please help me?
Is it working now?