Fastest speech to text transcription, 100% offline - Whisper.cpp | Zero latency

Поделиться
HTML-код
  • Опубликовано: 26 дек 2024
  • Today we will see how to download and use whisper offline.
    Whisper from openai: github.com/ope...
    Whisper.cpp: github.com/gge...
    Models: github.com/gge...
    - - - - - - - - - - - - - - - - - - - - -
    Follow us on social networks:
    Instagram: / codewithbro_
    ---
    Support us on patreon: / codewithbro
    #whisper #openai #whispercpp #speechtotext #programming #softwaredeveloper #softwareengineer #transcription #developer #iosdeveloper #mobiledevelopment #coding #coder #javascript #developer #computerscience #computersciencestudent #100daysofcode #html #css #programmer #vue #npmpackage #npm #package #CodeNewbies #Code_with_bro #code_withbro #youtubechannel #youtube #youtuber #youtubers #subscribe #youtubevideos #sub #youtubevideo #like #instagram #follow #video #vlog #subscribetomychannel #gaming #music #explorepage #love #smallyoutuber #vlogger #youtubegaming #instagood #gamer #youtubecommunity #likes #explore #youtubelife #youtubecreator #ps #bhfyp #fotiecodes

Комментарии • 74

  • @codewithbro95
    @codewithbro95  7 месяцев назад +4

    If you have any questions please feel free to drop them below!
    Please don't forget to like and subscribe for more interesting content like this🔥

    • @maxxflyer
      @maxxflyer 4 месяца назад +1

      hey bro, does it offer italian language?

    • @codewithbro95
      @codewithbro95  4 месяца назад

      @@maxxflyer I belive it does, you can check the repo

  • @hjoseph777
    @hjoseph777 3 месяца назад +8

    I am using a combination of "faster-whisper" and "whisper. cpp" offline. I will use "faster-whisper" for fast machines or servers with GPU and whisper in my project.cpp will be on a regular laptop on CPU. Thanks for sharing; your demo was crystal clear; keep it up. New subscriber

  • @endresbielefeldt2050
    @endresbielefeldt2050 7 месяцев назад +4

    thank you for the amazing content!

  • @meow2646
    @meow2646 2 дня назад

    Is there a project that uses whisper to replace the keyboard? Ie can I use it to replace the built in speech to text app in windows? or said another way can it output text to whatever text field my cursor is in?

  • @theMonkeyMonkey
    @theMonkeyMonkey 7 месяцев назад +4

    Your english is excellent. may i make a suggestion - python is not pronounced pie-ton but pie-thon - with the 'th' being the same as the 'th' in 'this'

  • @edmondgoddy
    @edmondgoddy 5 месяцев назад +1

    1K Subs. Congrats bro

    • @codewithbro95
      @codewithbro95  5 месяцев назад +2

      @@edmondgoddy thanks man, really appreciate the support 🙌🏾🙌🏾

  • @Dixon105
    @Dixon105 Месяц назад +1

    i have a problem i dont know why dont detect fine my voice with de base en model, i even scream and no all is detected. i using integrated microphone but is a good microphone. i will test with another microphone soon, but something tellme that not will work neither.

    • @codewithbro95
      @codewithbro95  Месяц назад +1

      @@Dixon105 should be a problem with your microphone, try your settings and make sure your mic is well setup, that should work. Also I noticed some issues as well with airports, when I have them on it picks up sound not in the best way with them on.

  • @mentalview8703
    @mentalview8703 6 месяцев назад +1

    Great video bro. Keep it up 👍

    • @codewithbro95
      @codewithbro95  6 месяцев назад +1

      Thanks, really appreciate 🙌🏾

  • @teclascelestiais9328
    @teclascelestiais9328 2 месяца назад +1

    incredible! Do you know if it only transcribes wave files? Can I also get mp3?

    • @codewithbro95
      @codewithbro95  2 месяца назад +1

      not sure but i believe you can convert to wav and transcribe from there!

  • @QHawk7
    @QHawk7 3 месяца назад +2

    It picks up sounds? weird... Doesn't it phone home?

  • @Robert-fl6ei
    @Robert-fl6ei 16 дней назад +1

    I need to install c++ to make it work on windows 10 ?

  • @Jeka476
    @Jeka476 4 месяца назад +2

    Why is there black screen in middle of the video?

    • @codewithbro95
      @codewithbro95  4 месяца назад +1

      Hey man, apologies for this, that should have been spotted before publishing.
      Sorry!

  • @dazdazfzf
    @dazdazfzf 2 месяца назад +1

    thanks for you content for West Indies in the carribbeans. Guadeloupe :-) I am curious to know on what kind of machine you are working on ? Is there a big GPU ? I saw metal, normal apple laptop ?

    • @codewithbro95
      @codewithbro95  2 месяца назад +3

      apple silicon m1, with 8 core gpu I think

  • @JackieUUU
    @JackieUUU 7 месяцев назад +3

    amazing! what gpu are you running? or it’s on cpu?

    • @codewithbro95
      @codewithbro95  7 месяцев назад +4

      Running on macOS M1 chip with 8 core GPU, I believe whisper.cpp makes use of metal on mac

  • @DenzilSheldon
    @DenzilSheldon 5 месяцев назад +1

    Wow amazing!
    Question: how much faster is it estimated working faster then Python?
    Thanks a lot!

    • @codewithbro95
      @codewithbro95  5 месяцев назад +1

      No specific data on that but after trying both I’d say it’s just about 5x faster in transcription

  • @aryanbamane1281
    @aryanbamane1281 3 месяца назад +1

    How do I implement this on website??
    Please help.

  • @gomgom330
    @gomgom330 3 месяца назад +1

    What different with speech recognition library?? As i know speech recognition support engine like whisper,watsonx,and google speech, but for offline it use vosk by default

    • @codewithbro95
      @codewithbro95  3 месяца назад +1

      This is more accurate in terms of recognition

    • @Dixon105
      @Dixon105 Месяц назад +1

      @@codewithbro95 i will be force to use vosk becouse this dont work for me

  • @Plash14
    @Plash14 3 месяца назад +1

    Hey umm, can faster whisper detect sounds like that too or is it only Whisper.cpp?

    • @codewithbro95
      @codewithbro95  3 месяца назад +1

      @@Plash14 not sure what you mean

    • @Plash14
      @Plash14 2 месяца назад +1

      @codewithbro95 basically it can detect your keyboard typing sounds etc right? Was wondering if it can be done on faster_whisper as well

    • @codewithbro95
      @codewithbro95  2 месяца назад +1

      @@Plash14 I see, not so sure about that(haven’t tried it) however, if it’s based off of whisper then I believe it should be able to do that

    • @Plash14
      @Plash14 2 месяца назад +1

      @@codewithbro95 I see... thanks for the reply!

  • @RoarStaze
    @RoarStaze 5 месяцев назад +1

    How do you get the make command to work on windows?, i got the make command but i just get error saying cc not found and someone said gcc=cc but i dont know how to do anything from there

    • @codewithbro95
      @codewithbro95  5 месяцев назад +1

      @@RoarStaze not tried it yet on windows but from the error you got, I believe you have to install gcc on your windows machine

    • @RoarStaze
      @RoarStaze 5 месяцев назад

      @@codewithbro95 i do have gcc someone said i need to make it gcc=cc but ive no idea how to do that

  • @mbegangsylvain1076
    @mbegangsylvain1076 6 месяцев назад +1

    love it !!!

    • @codewithbro95
      @codewithbro95  6 месяцев назад +1

      Glad you love it... Please, don't forget to like and subscribe for more interesting content like this one🔥😎

  • @Robert-fl6ei
    @Robert-fl6ei 18 дней назад +1

    Guys,
    this is just for english language?

    • @codewithbro95
      @codewithbro95  18 дней назад +1

      It does support other languages

    • @Robert-fl6ei
      @Robert-fl6ei 17 дней назад

      @@codewithbro95 ok. Thank you very much.
      I'm trying to install this at my place, but I can't manage it myself. I have never programmed, I don't know what c++ is, for example.
      It is very interesting, but unfortunately I haven't managed to do it yet.
      Do you perhaps run your own programming community, where I could find support :) ?

  • @gnosisdg8497
    @gnosisdg8497 7 месяцев назад +4

    can you put this offline whisper with a local llm model lets say phi3 to get reply based on whisper? i mean lets see how fast it can actually put out what the llm model will reply, this way you can make an offline ai assistant with no latency in responses and local 100 %

    • @codewithbro95
      @codewithbro95  7 месяцев назад +5

      i am actually working on something like this, check out my recent videos on Jarvis. I am building Jarvis so you don't have to

    • @gnosisdg8497
      @gnosisdg8497 7 месяцев назад +2

      @@codewithbro95 cool nice job keep it up, can you also add a way to use phi3 llm with phidata as well for Local RAG and also options for reading csv , pdf ,word documents as well ? this will give you a lot of views also, we are talking for an actual use of an ai assistant with this abilities !!!

    • @codewithbro95
      @codewithbro95  6 месяцев назад +1

      ​@@gnosisdg8497 definitely something i am looking to work on, stay tuned!!!

  • @hjoseph777
    @hjoseph777 3 месяца назад +1

    Your screen went black at 6:10

    • @codewithbro95
      @codewithbro95  3 месяца назад +1

      yeah, editing mistake, my appologies

  • @siddharthchadha3930
    @siddharthchadha3930 6 месяцев назад +1

    Thanks your video goes blank in the middle for a little bit

    • @codewithbro95
      @codewithbro95  6 месяцев назад +1

      @@siddharthchadha3930 really? Didn’t notice that. Apologies nonetheless

    • @HimanshuChanda
      @HimanshuChanda 5 месяцев назад

      @@codewithbro95@ 06:13 onwards

  • @contactmebaba
    @contactmebaba 4 месяца назад +1

    The guide to install and make it working was not clearly captured in this video. In between it was only voice and no screen record visible to us. I appreciate your effort, but you need to cover the content for wide audience from beginner to Advance in step by step procedure. The command ''make" still doesn't work. The problem with all these AI youtubers are not providing solution to an issue and keep moving to other AI tools with new content. Try to follow-up and provide solutions to your audience in order to get more followers.

    • @codewithbro95
      @codewithbro95  4 месяца назад +2

      @@contactmebaba I recon the screen went black at a point, sincere apologies for that. That was an editing error. Will try my best to do a better job at double checking before publishing.

  • @snatvb
    @snatvb 6 месяцев назад +1

    I wait same speed TTS(text to speech), it would be great to have

    • @codewithbro95
      @codewithbro95  6 месяцев назад +1

      Not sure i understand what you mean!

    • @snatvb
      @snatvb 6 месяцев назад +1

      @@codewithbro95 we have option recognize speech to text in realtime, but text to speech is really slow now

    • @codewithbro95
      @codewithbro95  6 месяцев назад +1

      @@snatvb definitely agree with you, inferencing with TTS is very bad at the moment, though I recently stumbled on a really promising project called ChatTTS apparently it’s being built specifically for this purpose, I haven’t tried it though, maybe I will and make a video on it.

    • @snatvb
      @snatvb 6 месяцев назад

      @@codewithbro95 yep, I've seen recently. I tried "bark" from suno and it work pretty slow (I have rtx 3070) and sometimes it voices llm imagination text instad of I gave :D

  • @TiTanos168
    @TiTanos168 2 месяца назад +2

    Thanks for the info. But the screen is completely blacked out.

  • @ToMooNoT
    @ToMooNoT 6 месяцев назад +1

    Hi, noob here.. Trying to figure out how to get the `make` working from VSCode terminal, on windows so far I installed MSYS2 added C:\msys64\usr\bin and C:\msys64\mingw64\bin to PATH env variables but... still says command not recognized..

    • @RoarStaze
      @RoarStaze 5 месяцев назад +1

      same did u find a fix?

    • @codewithbro95
      @codewithbro95  5 месяцев назад +1

      @@ToMooNoT does it work outside of vscode ? That’s the normal terminal

    • @ToMooNoT
      @ToMooNoT 5 месяцев назад

      @@codewithbro95 I had to install Visual Studio and build the C code from there or something, but it didn't build the microphone one, and I don't know how to add it to the build step, so kinda gave up, also was trying to get my AMD GPU to work with ZLUDA which is a library that should make CUDA code work on AyyMD, but no luck there either even with AI helping with troubleshooting..