00:00 Intro: Turning audio into text for free
00:09 No Downloads: No local installation needed
00:17 Whisper AI & Colab: Using Whisper AI with Google Colab
00:50 Google Colab Setup: How to use Google Colab
01:58 Runtime Options: Choosing CPU vs GPU
03:37 Install Packages: Setting up Whisper and FFmpeg
04:25 Upload Files: Adding audio/video files to Colab
05:25 Choose Model: Picking the right Whisper model
06:16 Run Transcription: Executing transcription
07:07 File Outputs: Different file types explained
07:44 Avoid File Loss: Save files before Colab resets
Thank you for watching! Let me know if you have any questions down in the comment section! 😀
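For the "Avoid File Loss" step, here is a minimal sketch of copying the transcript files somewhere safe before the Colab session resets. It assumes the outputs sit in one folder and that the destination is a folder you can still reach afterwards (for example a mounted Google Drive). The function name `save_outputs` and the folder names are made up for illustration and are not taken from the video.

```python
import shutil
from pathlib import Path

# File types the Whisper command-line tool can write.
OUTPUT_EXTENSIONS = {".txt", ".srt", ".vtt", ".tsv", ".json"}


def save_outputs(src_dir, dest_dir, extensions=OUTPUT_EXTENSIONS):
    """Copy transcript files from src_dir into dest_dir; return the copied file names."""
    src = Path(src_dir)
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)  # create the backup folder if needed
    copied = []
    for path in sorted(src.iterdir()):
        # Only copy transcript outputs, not the uploaded audio/video files.
        if path.is_file() and path.suffix.lower() in extensions:
            shutil.copy2(path, dest / path.name)
            copied.append(path.name)
    return copied
```

In Colab, one way to use it would be to mount Drive first with `from google.colab import drive; drive.mount("/content/drive")` and then call `save_outputs("/content", "/content/drive/MyDrive/whisper_outputs")`. The `whisper_outputs` folder name is only an example.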
Interesting video, well done and explained. Thanks, it was helpful to me.
So glad to know it was helpful to you! Feel free to share any questions you have. 😊
Can you also make a Google Colab tutorial on a free talking avatar? Thank you.
Hi, is there a way to get a transcript from a YouTube video that doesn't have a built-in transcription? For example, when they didn't turn on CC captions. Thanks
Hi there, you can record the audio on your local computer, then use the method in the video to get the transcription.
Thank you so much! I really appreciate it!
Thank you for your kind comment! So glad it was helpful to you!
Idk what is wrong with mine, I can't do it. I've got 500 audio files to transcribe.
Did you get an error message? Sometimes uploading your audio files to the folder takes a while. Wait to run the script until after your audio upload has completed.
Hi ElleWang, I have now installed Colab, but I can't find it afterwards in the selection. What is the problem?
That sounds rare. Make sure you are logged in to the same Gmail account and try refreshing. (It doesn't work on tablets or phones, so try using the browser on a PC/Mac.) Good luck!
What about YouTube? I have the link or the video downloaded onto YouTube but can’t get the video to my computer
Great question! Whisper AI can't transcribe directly from a YouTube link. I'd first check if YouTube's own transcription is satisfactory. You can click on the "description" of any YouTube video, scroll all the way down, and click on "Show Transcript". Then you will find the transcription on the right-hand side. Hope this helps!
Hi! Can you make a Google Colab tutorial on free TTS, à la ElevenLabs?
What about the large model?
The large model takes longer to process. I’d recommend testing the large model on audio with heavy accents or strong noise. Or, if you are not satisfied with the result from the medium model, switch up to large. Hope this helps!
@@ElleWang There's another project called Seamless M4T, which has speech-to-speech translation, but it's not installing on Colab. Do you have any idea about it?
Thank you for asking about Seamless M4T! Yes, it's a powerful tool, but I should mention that speech-to-speech tasks require significantly more GPU resources than speech-to-text. That's why it might struggle on Colab. I'd recommend running it locally on your own GPU if possible.
Hey, thank you so much for this video, so helpful
Just wondering
Do you know a method to convert a foreign-language speech video to English text, please?
Thank you for your comment! Yes, you can use Whisper AI to translate other languages into English as well. It depends on what source language you are using. You can check the URL in the description to find the command for it. I may also plan a tutorial video on translation with Whisper. :-)
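For anyone who wants a rough idea of what a translation command looks like, here is a small sketch that builds the arguments for the `whisper` command-line tool. It is not necessarily the exact command from the URL in the description. The helper name `build_whisper_command` and the file name `interview.mp3` are made up for illustration. The `--model`, `--task` and `--language` options are the ones the openai-whisper command-line tool accepts.

```python
import shlex


def build_whisper_command(audio_path, model="medium", task="transcribe", language=None):
    """Return the argument list for the `whisper` command-line tool."""
    if task not in ("transcribe", "translate"):
        raise ValueError("task must be 'transcribe' or 'translate'")
    cmd = ["whisper", audio_path, "--model", model, "--task", task]
    if language is not None:
        # Naming the spoken language skips Whisper's automatic detection.
        cmd += ["--language", language]
    return cmd


# Hypothetical file name, for illustration only.
cmd = build_whisper_command("interview.mp3", task="translate", language="Chinese")
print(shlex.join(cmd))
# prints: whisper interview.mp3 --model medium --task translate --language Chinese
```

In a Colab cell the printed line would be run with a leading `!`. With `task="transcribe"` the text stays in the spoken language, and with `task="translate"` Whisper writes English.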
@@ElleWang Thank you for your response. Found the command in the URL, will give it a try and see how it does. Thank you :)
Thank you Elle. I found a Chinese drama on YouTube (Mandarin speaking) and I would like to get the Chinese text to convert it into Pinyin to learn Mandarin Chinese.
Do you think it's possible with this tool? The YouTube video has English captions and Chinese characters (the transcription is only in English), but I need Pinyin to learn how to pronounce each word. If I get the text of the video in Chinese, I know how to convert it into Pinyin with free websites. Watching dramas is a great way to study a language.
Yes, you can use Whisper AI to transcribe videos in Chinese and then translate them into English, also with Whisper. (You can check out another video on my channel that focuses on the "translation" function of Whisper AI.) :-)
@@ElleWang Thank you very much, I did it and it worked fine! Very impressive tool. I'm also building a language app, so this kind of AI tool can help me a lot.
@@ElleWang it's cool to make captions for videos in different languages. Subtitles took a long time to make when I was doing that on my Commodore Amiga in the 80's ^^
So glad you find the video helpful! Yes, indeed, it used to take forever to manually transcribe and translate. Your language app idea sounds fascinating! Good luck with everything!
@@ElleWang Thank you. My app will be free, no ads. I hope to give 300 free lessons with the help of AI. I'm trying to make a conversational live cartoon. For example, in a restaurant, you speak and answer the waitress/waiter, then depending on your answers or questions, the next dialogue is different. I'm trying to reproduce a real situation. If you know an AI that can do this (create animated cartoons, voice recognition, text to speech), pls let me know. :)
Thank you so much for the tool! However, I uploaded a large wav file (a one-hour lecture) and the text was incomplete. Do you know what could have happened?
Thank you for your comment! Re: your question - It might be a Google Colab session timeout. Sometimes the Wi-Fi connection can affect it. I've processed a 2-hour wav file successfully in the past. Good luck!
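If timeouts keep cutting off a long recording, one possible workaround (not something shown in the video) is to split the file into shorter pieces with FFmpeg and transcribe them one at a time, so a dropped session only loses one piece. The sketch below only builds the FFmpeg command. The helper name `build_split_command`, the 10-minute chunk length and the file names are assumptions for illustration.

```python
def build_split_command(audio_path, chunk_seconds=600, pattern="chunk_%03d.wav"):
    """Return an ffmpeg argument list that cuts audio_path into fixed-length chunks."""
    if chunk_seconds <= 0:
        raise ValueError("chunk_seconds must be positive")
    return [
        "ffmpeg",
        "-i", audio_path,                     # input file
        "-f", "segment",                      # use ffmpeg's segment muxer
        "-segment_time", str(chunk_seconds),  # target length of each chunk
        "-c", "copy",                         # copy the audio without re-encoding
        pattern,                              # chunk_000.wav, chunk_001.wav, ...
    ]


# To actually run it (requires ffmpeg to be installed):
#   import subprocess
#   subprocess.run(build_split_command("lecture.wav"), check=True)
```

Cuts made this way can land in the middle of a word, so the text around each boundary is worth a quick check.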
I can’t find it in Google Drive on my iPad
Hi there, you will need to use a desktop/laptop to use Google Colab. :-)
@ElleWang Thanks for your quick response 👍
It took me 4 mins to transcribe a 30 sec audio wav file, is that expected? Also, does this work with aac files? Also, thank you so much for sharing this
Sorry, I was using the regular CPU for this. When I switched to the T4 GPU it took me 47 secs. But do I have to install Whisper every time I transcribe a file? It showed me the error "/bin/bash: line 1: whisper: command not found", and it only went away when I reran the install command
Yes, when using Google Colab you need to run the whole script, including the install lines, every time. And yes, use “T4 GPU” whenever you can! :-)
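The "command not found" error comes from starting a fresh Colab session, where nothing installed earlier is kept. As a rough sketch, a cell could check which tools are missing and reinstall only those. The helper names are made up, and the install commands are an assumption (the `openai-whisper` package from pip and `ffmpeg` from apt), not necessarily the exact lines used in the video.

```python
import shutil
import subprocess
import sys


def missing_tools(tools=("whisper", "ffmpeg"), which=shutil.which):
    """Return the tools that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


def install_commands(missing):
    """Return one install command per missing tool (assumed commands, see above)."""
    commands = []
    if "whisper" in missing:
        commands.append([sys.executable, "-m", "pip", "install", "-U", "openai-whisper"])
    if "ffmpeg" in missing:
        commands.append(["apt-get", "install", "-y", "ffmpeg"])
    return commands


def ensure_installed():
    """Install whatever is missing; call this at the top of a new session."""
    for command in install_commands(missing_tools()):
        subprocess.run(command, check=True)
```

Calling `ensure_installed()` before transcribing does nothing in a session where both tools are already present.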
@@ElleWang Thank you
i am running faster-whisper on my 2016 entry level potato laptop