How to Install & Use Whisper AI Voice to Text

Kevin Stratvert

481K views

Published: Sep 27, 2024

Comments • 752

@KevinStratvert 1 year ago ⁺⁵⁷
Run Whisper AI in the cloud using Google Colab (requires no install and is also free): ruclips.net/video/8SQV-B83tPU/видео.html
@amlaaaa479 1 year ago ⁺⁵
Didn't work for me. I just get error reports.
@daedalusjones4228 1 year ago
Works great for me using Colab. Or on my hard drive. Both work great.
But here's something:
I have multiple Gmail accounts. And I have a number of tools, add-ons, and extensions to Google Drive/Docs/Sheets, including Colab, Apps Script, etc.
And I initially set them all up on one Google account. But when I go to set up those same tools in my other Google Drive accounts, I get an error message and can't do it.
It seems that I can't have stuff in Colab, for example, in more than one Google account.
@francescooliva5951 1 year ago
Is there a way, with the installation on Windows, to use Whisper OFFLINE?
@KevinStratvert 1 year ago ⁺⁷
@@francescooliva5951 Once you install, you can use it offline.
@francescooliva5951 1 year ago ⁺¹
@@KevinStratvert So the only time I go online is to download the pre-trained model for the first time (tiny/medium/large, according to my choice)? I have an AMD Radeon 530 GPU, but Whisper doesn't seem to detect it. In fact, I see 99% CPU usage in Task Manager. What is the average time to transcribe a medium-sized file?
@BijuuMike 8 months ago ⁺⁴⁷
For those having issues with "file doesn't exist": make sure you add the file extension at the end, even if the file isn't named that way. For example, if your file is named "file" and it's an mp3, then you must type whisper file.mp3. Hope this helps, because this was not specified.
@lauram14 7 months ago
I need help. "FP16 is not supported on CPU; using FP32 instead". What does this mean?
@federicobartolozzi680 7 months ago
@@lauram14 Nothing, just more RAM used and lower speed.
@iphoneapple1892 7 months ago ⁺¹
Thank you, I was stuck for two weeks and now it works.
@hiteshbiswas776 6 months ago
Still facing the issue with an m4a file... is it possible that only certain file types are supported?
@lambdaboy9999 6 months ago
wait why are you here?
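The two tips in this thread can be sketched in Python as follows. This is an illustration under stated assumptions, not something shown in the video: `resolve_audio_path` and `cpu_command` are hypothetical helper names, and the extension list is a guess at common formats, not Whisper's official list. The `--fp16 False` option of the whisper command line requests FP32 up front, which should avoid the CPU warning quoted above.

```python
from pathlib import Path

# Extensions to try. Illustrative only, not an official list of supported formats.
AUDIO_EXTENSIONS = (".mp3", ".wav", ".m4a", ".mp4", ".flac")


def resolve_audio_path(name, folder="."):
    """Return the on-disk path for `name`, trying common extensions.

    File managers often hide known extensions, so a file displayed as
    "file" may really be "file.mp3" on disk.
    """
    folder = Path(folder)
    candidate = folder / name
    if candidate.is_file():
        return candidate
    for ext in AUDIO_EXTENSIONS:
        with_ext = folder / (name + ext)
        if with_ext.is_file():
            return with_ext
    raise FileNotFoundError(f"No audio file matching {name!r} in {folder}")


def cpu_command(audio_path, model="small"):
    """Build a whisper command line that asks for FP32 explicitly."""
    return ["whisper", str(audio_path), "--model", model, "--fp16", "False"]
```

For example, `cpu_command(resolve_audio_path("file"), "medium")` would produce the argument list for `whisper file.mp3 --model medium --fp16 False` if `file.mp3` exists in the current folder.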
@RussMcClay 1 year ago ⁺⁴³
Gosh, Kevin, this is the first video of yours I've seen and I am mightily impressed! I've been in IT for over 30 years and I can tell you that your presentation is one of the leanest and meanest I've ever seen. What a great contribution this is to the community. Thank you very much!
@dandixon7 1 year ago ⁺³¹
This is probably my favorite video on YouTube ever. It is amazing. It takes a process that I found complicated and turns it into easy-to-follow steps. It actually takes what could be stress-inducing and makes it relaxing with some unintentional ASMR presentation. Very well done.
@zenmasterwannabe 1 year ago ⁺⁵
Thank you for doing a complete walkthrough, unlike so many other YouTubers who act like they're being thorough, only for us to find out later that they skipped small but essential steps as if we already knew them!
@rolfbackstrom6029 1 year ago ⁺⁷⁴
Thanks again, Kevin, for a very useful video. Nice to see Python at work. It reminds me of old-time programming - at least a little. I am 71 and wrote my first program using punch cards... :)
@noreenstxs9605 1 year ago ⁺⁵
I dropped my whole stack of punch cards once :)
@JohnDoe-rx3vn 1 year ago ⁺⁷
I was just telling my buddy about that. I think AI is going to be as big a jump as going from punch cards/numbered lines to named variables was.
@ALifetimeofFitness 1 year ago ⁺⁵
@@noreenstxs9605 I did that back around 1970!
@hubertmallard7254 1 year ago ⁺⁵
I'm 72... Cards.. IBM 1130 Fortran Apple 2 Pascal 😅
@PoeLemic 1 year ago ⁺³
@@hubertmallard7254 Yeah, same here. Programmed in Fortran, Cobol, Pascal, etc. What about the TRS-80? Remember those?
@CharCharSJ 1 year ago ⁺¹⁹
Amazing walkthrough. Thank you. You've made something that would have been overwhelming for me and taken me hours (if I could do it at all) seem so easy, and I was done in under half an hour!!
@WilliamMMiller 1 year ago ⁺¹¹
Hi Kevin. Been watching you for a while and just want to say thanks for all the explanations. Concise and interesting. You've helped me a lot and, again, I thank you. Keep it rolling!
@SpeedRacer24X 1 year ago ⁺³¹
Another incredibly useful video and so very easy to follow as well! It works perfectly for my large assembly recordings. Thanks so much Kevin. You're such a great teacher, I just love your stuff!
@riomarketing4668 8 months ago ⁺¹
It worked after some serious debugging, but I couldn't have done it without this video. Thank you a ton!!
@Noneofyourbusiness2000 7 months ago ⁺²
When checking your Python version from the command line, make sure you use a capital V:
python -V
This is because python -v is the verbose option, and you will get a wall of strange text instead of the version number.
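The same check can be made from inside the interpreter, which sidesteps the -V / -v mix-up entirely. A small sketch using only the standard library (the printed labels are arbitrary):

```python
import platform
import sys

# The same version number that `python -V` reports, read from inside Python.
version_text = platform.python_version()
print("Python", version_text)

# Shows which installation is actually being run, useful when several are installed.
print("Interpreter:", sys.executable)
```

Saving this as a file and running it with `python` shows which version and which installation the `python` command points to.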
@MichaelLaFrance1 1 year ago ⁺⁴
This exercise gave me some solid experience troubleshooting errors. I had to pull teeth to get Homebrew (using a Mac) to install properly, and then had an SSL certificate error, but Google & Stack Overflow came to the rescue, and Whisper is working like a charm. Thanks for the great video!
By the way, if anyone gets an SSL certificate error using Python 3 (which apparently is common), just enter the following in Terminal, exactly as written (but check your version*):
/Applications/Python\ 3.11/Install\ Certificates.command
* Just adjust the version number to match your release; in the example above, I updated it to 3.11.
@mehmetbakideniz 4 months ago ⁺¹
People like you further motivate me to share my knowledge with the internet. Thank you so much! You have saved me a ton of time.
@DirkBeukers98 3 months ago
I have this problem, but when I try your solution the terminal says: "no such file or directory: /Applications/Python". Do you know how to fix that?
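The error in the last reply stops at "/Applications/Python", which suggests the shell split the path at the space, most likely because the backslashes before the spaces were lost when the command was typed or pasted. A hedged sketch that builds the path for the Python version actually running and reports whether the file exists; `certificates_command` is a hypothetical helper, and the folder layout is the one quoted in the comment above, which may not apply to Homebrew or other installs:

```python
import sys
from pathlib import Path


def certificates_command(major=sys.version_info.major, minor=sys.version_info.minor):
    """Path of the certificate installer, following the layout quoted above."""
    return Path("/Applications") / f"Python {major}.{minor}" / "Install Certificates.command"


path = certificates_command()
status = "found" if path.exists() else "not found on this machine"
print(f"{path} ({status})")
```

If the file is found, wrapping the whole path in double quotes in Terminal is an alternative to escaping each space with a backslash.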
@sashabagdasarow497 1 year ago ⁺¹
This is why I love the internet! To run a neural network, you just have to follow simple guidelines!
There are issues and stuff to figure out yourself, but this is such a great jumpstart!
@the.squidd 3 months ago
You are a legend. What an amazingly helpful and easy to listen to tutorial on this.
@ETSemajase 1 year ago ⁺¹
Thank you Kevin for what you do. I followed the instructions. I'm adding the following in case some newbies want it.
I installed Python version 3.11.5 on Windows 11 and it works fine. In Windows Explorer, I created a folder on the C: drive called Whisper. I then copied my mp3 audio file (from my data drive) to C:\Whisper, typed cmd in the address field to bring up the Command Prompt, and then typed
whisper filename.mp3 --model medium [and then Enter].
A 36-minute conversation (50 MB) took a little over 39 minutes to run. I then cut all the files from C:\Whisper and pasted them into a folder on my data drive. Then I copied the text version into a version of Word that I don’t pay a monthly fee for and saved it. 😊
Hope this helps someone.
@struppifrohlich2008 1 year ago
I tried Python 3.11.5 too, but every time I go into my C:\Whisper folder, open CMD, and type whisper test.wav, it says:
FileNotFoundError: [WinError 2] The system cannot find the specified file
Do you know a solution?
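One common cause of this WinError 2, though not confirmed for this commenter, is that Whisper cannot find the ffmpeg program it calls to decode audio, rather than the audio file itself. A quick sketch to check which tools are reachable from the current prompt; `missing_tools` is a hypothetical helper built on the standard library's `shutil.which`:

```python
import shutil


def missing_tools(tools=("ffmpeg", "whisper")):
    """Return the names from `tools` that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


missing = missing_tools()
if missing:
    print("Not found on PATH:", ", ".join(missing))
else:
    print("ffmpeg and whisper are both reachable from this prompt")
```

If ffmpeg is reported missing, reinstalling it and opening a new Command Prompt window (so the updated PATH is picked up) is a reasonable first step.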
@Alex-jb4ke 11 months ago ⁺¹
YOUR VIDEO IS AMAZING!!! It helped me so much with learning languages. I used this Whisper program to convert speech to text, and then I used ChatGPT as a super translator. IT IS ABSOLUTELY AMAZING. Thanks to this video, I did 4 days' worth of work in 1 day. The quality of Whisper is absolutely amazing. Kevin Stratvert is the BEST, thank you.
@ydhirsch 5 months ago
This was indeed a helpful video, even if I wish you had skipped package managers for the ffmpeg installation. I got Whisper installed and working, testing transcription on a recording of a 70-minute meeting. With a fairly muscular PC, I tried the small, medium and large models. Surprisingly, I got more accurate results with small, in addition to quicker results. Great tool, wonderful intro.
@jimdarley 1 year ago ⁺³
Outstanding tutorial as always, Kevin. Thank you. I used this to transcribe my recording of a 45-minute webinar so I could read along and highlight as I listened to the replay. It took just 11 minutes on my high-end gaming computer with a GeForce RTX 3060 Ti graphics card. Very useful tool! ‼
@generalgeert 1 year ago ⁺¹
Sounds great, which model did you use? The default small model or a higher one?
@jimdarley 1 year ago ⁺²
@@generalgeert I used whisper --model medium
@huy3148 9 months ago
Which CPU did you have for that transcription? Thank you
@iagovinicius3228 3 months ago
I have an RTX 7800xt, but when transcribing it is the CPU that does the work... how do I use the GPU?
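Whether Whisper can use a GPU depends on what the installed PyTorch build reports. A minimal sketch of that check, assuming PyTorch was installed as a Whisper dependency; `pick_device` is a hypothetical helper and `recording.mp3` is a placeholder file name. Standard PyTorch builds look for CUDA, which targets NVIDIA cards, so an AMD card will usually show up as "cpu" here.

```python
def pick_device():
    """Return "cuda" when PyTorch reports a usable CUDA GPU, otherwise "cpu"."""
    try:
        import torch  # normally present after installing Whisper
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


device = pick_device()
print("Whisper would run on:", device)

# The whisper command line accepts --device; the file name is a placeholder.
print("whisper recording.mp3 --model medium --device", device)
```

If this prints "cpu" on a machine with an NVIDIA card, the installed PyTorch is probably a CPU-only build, and installing a CUDA-enabled build is the usual remedy.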
@haechi9381 19 days ago
Thank you for the smooth tutorial! I do subs in my free time and I'm struggling with the audio timings, sooo it will be a big help ^^
@robh5695 1 year ago ⁺²
I was recently thinking how great it would be to have Whisper local, instead of online only. And, voila, here's Kevin! Readin' minds, and don't even know it; well, you do now. Thanks!
@Flatwound 1 year ago
I had previous success with your Stable Diffusion video for a local install. It was the only one I found that was clear and perfectly detailed! This video also was excellent. I just followed your step-by-step instructions and everything is working great!
@temperanceplaysgenshin 6 months ago
This is one of the best step-by-step instructions I've ever seen. Thank you!
@ashraffouad 1 year ago ⁺¹
Wow, many thanks Kevin. I had my own videos that I was planning to do voiceover for, and I found it very difficult to listen to and translate the video. This way I was able to generate Arabic text, and it is pretty good, and even the translate-to-English feature is excellent. This video solved a lot for me. I have tested it, and it is very promising. Many thanks again.
@marcoomana8130 6 months ago
My brother, you have saved me literally over a thousand hours of work. This made a life-changing improvement to my productivity.
@ochuutv7195 1 year ago ⁺³
Kevin, you're such a sweetheart... just when I needed transcription software...
vaahlaaaaah!!! Here you are with the solution..
Kevin, are you reading my mind? Answer me
Nice one bruh... you make everything seem easy.. And working SMART. Muah!!! Kevin Kevin!!!!! Thank you ❤
@calypso168 28 days ago
Thank you so much! Very helpful. Apart from a small problem with a dll file that I ran into and fixed, it's working perfectly now.
@rebeccasimpson7477 4 months ago
Wow! Really impressed how quick and easy this was. Would love a follow-up video on how to incorporate something like pyannote into this so that we can also have speaker diarization!
@dagmar4580 1 year ago ⁺¹
Thank you for all the time you put into making this step-by-step guide, all worked, yay! It did, however, take over 2.5 h to transcribe a 40-minute interview in .wav. Is that how it's supposed to be? Anyone else noticing similar sluggishness? 🤔
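To compare the timings reported in these comments, it helps to divide processing time by audio length (a "real-time factor", where values below 1 mean faster than playback). The sketch below uses only the figures commenters gave on this page, rounded where they wrote "a little over" or "over"; the hardware behind the first and last figures is not stated, so the labels are deliberately vague.

```python
# (audio minutes, processing minutes) as reported by commenters on this page
reports = {
    "36-minute mp3, medium model": (36, 39),
    "45-minute webinar, medium model, PC with RTX 3060 Ti": (45, 11),
    "40-minute wav interview": (40, 150),  # reported as "over 2.5 h"
}


def real_time_factor(audio_minutes, processing_minutes):
    """Processing time divided by audio length; below 1.0 is faster than playback."""
    return processing_minutes / audio_minutes


for label, (audio, processing) in reports.items():
    print(f"{label}: {real_time_factor(audio, processing):.2f}x real time")
```

On these numbers the slowest report is roughly fifteen times slower than the fastest, which is consistent with the difference between running on a CPU and on a capable GPU, though the commenters do not confirm which was used.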
@killakaiju 3 months ago
Dude, thank you, this actually worked compared to other tutorials!
@gringo7864 1 year ago
As an educator, I really like your style of explaining. Thanks.
@Trunkerad 9 months ago
Incredibly helpful. Thank you.
Whenever I want to use some (free and very useful) open-source tool, I'm always baffled by how difficult and unintuitive it is to get it running by yourself.
@sabofx 9 months ago
Crystal clear tutorial. Worked the first time trying. Thanx buddy! 😁
@KevinStratvert 9 months ago ⁺¹
Great to hear!
@iondu655 1 year ago ⁺⁵
I got the
'whisper' is not recognized as an internal or external command,
operable program or batch file.
response when I wrote the command. Is there any solution to this issue?
@omidsahebzamani8428 6 months ago ⁺²
Hey, did you manage to fix that? I'm having the same issue :(
@iondu655 6 months ago
@@omidsahebzamani8428 My advice is to use Whisper AI on Google Workspace (Google Colab). I still cannot make it work on my computer.
@turkozfataliique1854 2 days ago
Adding this path to the PATH environment variable worked:
'C:\Users\\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.8_qbz5n2kfra8p0\LocalCache\local-packages\Python38\Scripts'
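The fix in the last reply works because pip places the `whisper` launcher in a Scripts folder, and Windows can only run it by name if that folder is listed in PATH. A hedged sketch that prints the Scripts folders for the Python installation running it and whether each is on PATH; `is_on_path` is a hypothetical helper, and the exact folder on a given machine may differ from the one quoted above (for instance with the Microsoft Store build of Python).

```python
import os
import sysconfig


def is_on_path(directory, path_value=None):
    """Check whether `directory` is one of the entries in a PATH-style string."""
    if path_value is None:
        path_value = os.environ.get("PATH", "")
    wanted = os.path.normcase(os.path.normpath(directory))
    entries = [
        os.path.normcase(os.path.normpath(entry))
        for entry in path_value.split(os.pathsep)
        if entry
    ]
    return wanted in entries


# Scripts folder for a normal install, plus the per-user one used by "--user" installs.
candidates = {"default": sysconfig.get_path("scripts")}
user_scheme = f"{os.name}_user"
if user_scheme in sysconfig.get_scheme_names():
    candidates["per-user"] = sysconfig.get_path("scripts", user_scheme)

for kind, folder in candidates.items():
    print(f"{kind}: {folder} (on PATH: {is_on_path(folder)})")
```

Any folder reported as not on PATH is a candidate to add under "Edit the system environment variables", followed by opening a new Command Prompt window.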
@vickiebarnes4326 3 months ago
Awesome! Thank you so much. You helped me actually get this to work (after watching several other videos!).
@DoD33 11 months ago
Holy... I've been following you for quite some time now and I have to say, you lost me on this one. I'm sure there's another way I can accomplish this. That's not to say you are wrong or giving bad advice or whatever; in fact, just the opposite: you explained it perfectly, and of course I have no doubt it's doable. In fact, I'm writing this to get you more comments on the video. Great job, it's just one I'll pass on.
@davidcohen1861 7 months ago
Like other users, I had to update pip to get everything to run correctly, but cmd gave me the prompt I needed to update, so it was really easy on Windows 11. --- Also, it is worth mentioning that I tried both the cloud version and the download version, and the cloud version limits how much you can use it. If you have audio files of 30 minutes or more, you will only be able to do one or two using the cloud version, but the download version seems to be unlimited so far. I am converting 1 hour+ audio files with the medium model and it has not limited me at all.
@Jo-xu2vi 1 year ago
It's a great help for sorting and summarizing important info from a video 😊. Thank you, Mr. Kevs!
@EvilLittleCar 1 year ago
Thank you! This is exactly what we needed to transcribe our tiny DnD podcast!
@Elena-00 9 months ago ⁺¹
It currently works with Python version 3.11.7. Thanks Kevin
@benjagomol 8 months ago
What about 3.12 or 3.13?
@markushastreiter9640 1 year ago
Thank you so much. Great instructions with exactly the right level of detail. Got Whisper running on the first try.
@robertparenton7470 1 year ago
Thank you! I was able to transcribe my mp3 file. Excellent technology for next week's online course.
@berfont1994 11 months ago
Appreciate your teaching, Kevin. Love and respect from Singapore :)
@aminaelyoussoufi3235 5 months ago
Incredible!! Thank you so much, I understood everything, it was super clear. Bravo, keep it up.
@yulo8987 6 months ago
God bless you! Thank you for explaining the process in a simple and easy-to-follow way.
@joshuamethven 8 months ago ⁺²
PLEASE NOTE: I needed to update my pip to get this to run!! (23.3.2) - Windows 11
WINDOWS CMD ADMIN - run this command:
python -m pip install --upgrade pip
ALSO - You may need to add Python to your Windows "Path" in the application called "Edit the system environment variables"
(I did all this after the installation process in the video)
@shawnnot 7 months ago
Thanks, this worked.
@iu-di3lj 1 year ago
So useful and clearly presented, never stop making videos
@renerens 1 year ago
It is really amazing how good it is at transcribing songs! Using that for my home-built arranger/karaoke keyboard :)
@digicinematic 4 months ago
Nice video. I was able to install Whisper on Ubuntu 22.04 LTS and transcribe without a hitch. 😄 Well done.
@PhilipSalt 1 year ago
Thank you Kevin for sharing your walkthrough. I've been looking at paid platforms for transcription. So easy when you know how.
@iggymach 5 months ago
I would have never managed without this video. Thanks man
Also, 2.7 GB for Torch. Wow!!
@rebine_lee 1 year ago
Wow, this really works well, and it's explained in great detail!! You could never find a tutorial this detailed in China.
@anonymousperson45152 1 year ago
Bro, I dunno what to say, but this is the thing that I have been looking for. Thank you a lot.
@fortheloveofcake93 1 year ago
Running flawlessly for me. What a fantastic guide. I had to download the latest version of pip for it to work, but no hitches installing anything for me.
@MarkAntinozzi 5 months ago
I was able to transcribe and translate audio with Whisper!! Thank you so much!!
>M
@InnocentiusLacrimosa 8 months ago
This is great. I'll check whether this can be the basis for a local pipeline that transcribes audio and then runs an AI-powered summary and analysis of a meeting's audio track.
UPDATE: The transcription worked fine locally with the small model. Now attempting the same with the large model (working on multiple languages, mostly not English, so the large model should be better). It is looking pretty sweet so far and not too slow either (with a 4070 Ti). I did a manual analysis by copy-pasting into ChatGPT to get a summary of a 45-minute interview and it was OK. Now I need to build a proper pipeline to use the API to do the analysis, or preferably do it locally as well with something like a Llama model.
@telvinstein8354 3 months ago
Great explanation, thank you very much!!!
@expcavaliers 1 year ago
Amazing tutorial. Thank you for this super high-quality, well-thought-out tutorial. Went super smooth.
@markmikhaeel 1 year ago
Kevin, you are my tech genius! That came at the right time.
Thanks heaps for your amazing video :)
@Barrys_Workshop 1 year ago ⁺¹
Very useful, thanks. As always, very clear, succinct videos.
@cezarang 1 month ago
Working perfectly... thank you
@SequeriaAnil 1 year ago ⁺²
Hey Kevin, it's always a pleasure to listen to how you simplify tech for your viewers. Thank you.
Would it be possible to use, let's say, a YouTube video link instead of an mp3 or wav file and transcribe it to text?
@MrKalavak 1 year ago
You can record your screen by pressing Win+Alt+R while playing the video,
then, after you record the video, convert it on your computer into an mp3 or wav file.
@IDMYM8 1 year ago
@@MrKalavak Or he can use ytdl to download only the audio of that video, which does the whole process with a single command.
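The single-command approach from the last reply can be sketched as follows. The reply only says "ytdl", so the tool name here is an assumption: the sketch defaults to `yt-dlp`, a maintained fork of youtube-dl that accepts the same `--extract-audio` and `--audio-format` options. `audio_download_command` is a hypothetical helper that only builds the argument list and does not download anything, and the URL is a placeholder.

```python
def audio_download_command(url, tool="yt-dlp", audio_format="mp3"):
    """Build an argument list that downloads only the audio track of a video."""
    return [tool, "--extract-audio", "--audio-format", audio_format, url]


# Placeholder URL. Replace it with the video to transcribe.
command = audio_download_command("https://example.com/watch?v=VIDEO_ID")
print(" ".join(command))
```

The resulting list can be passed to `subprocess.run(command, check=True)` once the tool is installed; converting to mp3 relies on ffmpeg, which the Whisper setup already requires. The downloaded file can then be given to `whisper` like any other mp3.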
@vicentesoto1628 9 months ago
I was not able to continue, since you covered Chocolatey but gave no further explanation for Mac users.
Anyway, awesome work.
@whitewingsmedia0 1 year ago
Well, I got it to work so I'm good. Your instructions are excellent!
@marknelson6514 11 months ago
Fantastic video. I'm going to grab the transcript and start installing on another i7 laptop and see what happens. Thank you sir!
@Dadastudying 1 year ago
Amazing! Thanks for such a helpful video, dude!
@Noumenon11 1 year ago
Awesome tutorial. Thanks Kevin. Whisper AI is an amazing tool.
@johannamarci8429 1 year ago ⁺¹
I am lost at the step at 6:00. I really don't understand where to find it or how to do it. I'm really disappointed.
@joseponce9567 8 months ago
Great explanation and all straightforward
@Dragonka523 1 year ago
Thank you very much, great walkthrough, and thanks for the uninstall information too
@jaredhinton5662 1 year ago
Very cool my dude, thank you for helping with this. I would have never gotten this on my own
@pritish47 1 year ago ⁺³
Hi Kevin, I tried this, but I am getting this error while running Whisper: "'whisper' is not recognized as an internal or external command,
operable program or batch file."
@sara_zharax5324 4 months ago
Extremely good tutorial! Thank you!
@the_kvadronikus 1 year ago
Thank you for that guide: simple and to the point, but full of info. Liked.
@the_kvadronikus 1 year ago
And yes, I've installed and used Whisper, and it works. In some places it loses the correct word endings or picks the wrong letter, but the transcription quality is insanely good, even for Russian on the normal base model.
@akramshaik2913 1 year ago
Well, it is a wonderful and useful video, but it's taking a long time to produce the transcript. Thanks to you, Kevin!!
@davidmesaros9733 1 month ago
Really useful video! Thx!
@abararahmed9402 10 months ago
Just 2 words for you.. you are incredibly awesome.
@philweaver457 8 months ago
Thanks, Kevin. Super helpful!
@Luckotheirish213 1 year ago
THANK YOU - this tutorial is fantastic.
@Abhishek-cy1il 3 months ago
That's what we call quality content
@sashabagdasarow497 1 year ago
Brooo! The CMD trick is so good!
@DAS-kg1vz 5 months ago ⁺¹
Thanks so much! I ran into some problems when installing the Whisper model. It shows "Defaulting to user installation because normal site-packages is not writeable" and a bunch of "Requirement already satisfied: openai-whisper in c:\users\1092123\appdata\roaming\python\python39\site-packages (20231117)". How can I resolve it?
@perruris 1 year ago
Your content is a jewel, ty!
@ps.cbaraona 6 months ago
Thanks! Worked perfectly ;)
@AnthonyVGibby 10 months ago
Thanks for the instructional video. I'm getting to this right away.
@endemion06463 10 months ago
You're better off using Subtitle Edit, much easier to use.
@essamhubail596 1 year ago
Great illustration, and I have successfully installed it on my computer. Thank you @kevin
@pavlohak4525 1 year ago
It's working!!! Thank you for the help ))
@member3673 8 months ago ⁺¹
Could you please tell me where to change the language? I have an audio file in a different language, and it should be written out in that same language. Otherwise, how can Whisper recognize the language? Thank you
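The whisper command line has two options that bear on this question: `--language` names the spoken language (when it is omitted, Whisper tries to detect the language from the start of the audio), and `--task` selects between `transcribe` (text in the spoken language) and `translate` (text in English). A small sketch that assembles such a command; `whisper_command` is a hypothetical helper and `interview.mp3` is a placeholder file name.

```python
def whisper_command(audio, language=None, task="transcribe", model=None):
    """Build a whisper argument list for transcription or translation to English."""
    if task not in ("transcribe", "translate"):
        raise ValueError("task must be 'transcribe' or 'translate'")
    command = ["whisper", audio]
    if model:
        command += ["--model", model]
    if language:
        # Leaving this out lets Whisper detect the language on its own.
        command += ["--language", language]
    command += ["--task", task]
    return command


# Keep the text in the spoken language (file name is a placeholder).
print(" ".join(whisper_command("interview.mp3", language="Japanese")))
# Produce English text from the same audio.
print(" ".join(whisper_command("interview.mp3", language="Japanese", task="translate")))
```

Stating the language explicitly can help when detection picks the wrong one, for example on recordings that open with music or silence.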
@lordbyron7918 5 months ago
Amazing content! Thank you!
@pauljones7798 1 year ago
Hi Kevin, thank you for the training video.
@SvaJonny 11 months ago
Is it possible to get which speaker is talking by timestamp? Like:
00:00 -> 00:10 : [Person1] Hello my name is …
00:10 -> 00:15 : [Person2] Nice to meet you …
00:15 -> 00:20 : [Person1] What is your …
Or at least just mark when the speaker changes?
Is there a way to detect different voices?
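Whisper on its own produces timestamped text segments but does not label speakers, so output like the one requested above needs a separate diarization tool (pyannote is mentioned elsewhere in these comments). A minimal sketch of the merging step only, under stated assumptions: the segments mimic the `start`/`end`/`text` fields of Whisper's segment output, while the speaker turns are a made-up list of `(start, end, name)` tuples standing in for whatever a diarization tool would return. `overlap`, `label_segments` and `fmt` are hypothetical helpers.

```python
def overlap(a_start, a_end, b_start, b_end):
    """Seconds shared by two time ranges (0.0 when they do not intersect)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_segments(segments, turns):
    """Give each text segment the speaker whose turn overlaps it the most."""
    labeled = []
    for seg in segments:
        speaker, best = "Unknown", 0.0
        for turn_start, turn_end, name in turns:
            shared = overlap(seg["start"], seg["end"], turn_start, turn_end)
            if shared > best:
                speaker, best = name, shared
        labeled.append((seg["start"], seg["end"], speaker, seg["text"].strip()))
    return labeled


def fmt(seconds):
    """Format seconds as MM:SS."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"


# Made-up sample data for illustration only.
segments = [
    {"start": 0.0, "end": 10.0, "text": " Hello my name is ..."},
    {"start": 10.0, "end": 15.0, "text": " Nice to meet you ..."},
]
turns = [(0.0, 9.5, "Person1"), (9.5, 16.0, "Person2")]

for start, end, speaker, text in label_segments(segments, turns):
    print(f"{fmt(start)} -> {fmt(end)} : [{speaker}] {text}")
```

Assigning by largest overlap is a simple design choice; it mislabels segments in which two people speak, so a real pipeline might split segments at speaker changes instead.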
@Snowdudee 1 year ago
Amazing video dude, thanks!
@alisahinkabak4876 7 months ago ⁺¹
I have a problem! I tried translating a Japanese video to English. The command I gave was: 'whisper yuuriytprank1.mp4 --language Japanese --task translate', but then it said this: ''whisper' is not recognized as an internal or external command,
operable program or batch file.' Where did I make a mistake?
@andresplayz8998 11 months ago
Good tutorial! Easy to follow
@someshkhatawe3404 10 months ago
omggggg!!!!!!!! this was so smooth .... Thanks!!
@christopherely4364 1 year ago
Worked for me, thank you.
@philipjamesajagabos2519 1 month ago
I just have to subscribe to your channel. Good and detailed video.
@Learn_English_Deutsch 1 year ago
Hi Kevin, thanks for your video, really helpful.
@calebvaughan587 1 year ago
Super helpful, thank you so much
@elmaog 2 months ago
That was awesome, thanks a lot
@ismaellorenzi4544 9 months ago
Man, I don't know how I found this video, but thanks a lot.
