RealtimeSTT: A low-latency speech-to-text library with advanced voice activity detection

  • Published: 26 Oct 2024

Comments • 20

  • @kritikusi-666
    @kritikusi-666 1 day ago

    I just tried this. Holy hell. This is some fast rendering. Thank you for sharing your repo. I tested it using Stereo Mix with a YT video. It did not miss a beat.

  • @Miyauti
    @Miyauti 1 month ago

    This looks really promising. I will try to test it in my programs! Thanks for the work, my dude!

  • @LostPostProductions
    @LostPostProductions 7 months ago

    Thank you for showing us your library in action as well as letting us know how we can support it!

  • @ccapp3389
    @ccapp3389 7 months ago

    Nice one! I look forward to trying this out.

  • @aboudezoa
    @aboudezoa 11 months ago

    This is awesome! Thanks

  • @modicool
    @modicool 5 months ago

    That's incredibly accurate. Nice work! Can you active-transcribe AND wake-word for commands? It'd be great if you could have it always listening and then do something on wake word.

    • @Linguflex
      @Linguflex 5 months ago

      No, not currently. The idea is good, and I can see some use cases for it. I'll think about that.

  • @allfather_ogre
    @allfather_ogre 4 months ago

    Great work. Do you have any ideas for reducing latency in text-to-speech? I'm working on it.

  • @deathlyecks
    @deathlyecks 1 month ago

    I'd like to use it, but there aren't code snippets covering just specific functions. I only want the live speech transcription and real-time TTS in a simple script.
    I don't need all of the code for wake words and so on, and there are something like 2000 lines of code, so I can't even figure out which parts belong to that specific feature.

    • @Linguflex
      @Linguflex 1 month ago

      Please look at the github.com/KoljaB/LocalAIVoiceChat/ project and how both Realtime libraries are used in ai_voicetalk_local.py.

    • @Linguflex
      @Linguflex 1 month ago

      Also, please look at the tests folder of both Realtime libs. The RealtimeSTT tests folder contains simple code examples that don't use wake words.

    • @deathlyecks
      @deathlyecks 1 month ago

      @Linguflex Okay, thanks for replying!

  • @sebastiangonzales46
    @sebastiangonzales46 1 month ago

    I'll try to use this and customize it for our undergrad thesis. Is that okay?

    • @Linguflex
      @Linguflex 1 month ago

      Yes, sure. It's MIT-licensed, so you can use it for whatever you like.

  • @datfeeshboi9526
    @datfeeshboi9526 4 months ago

    I'm gonna try to make a VRChat STT app that puts the words above my head using their OSC system :D

  • @kimjongoof5000
    @kimjongoof5000 7 months ago

    Is it possible to make the voice dictation instantaneous at the cost of accuracy? I want to try controlling the servos on an animatronic mouth with voice dictation. It doesn't have to be perfect; it just needs to be accurate enough to be convincing and as fast as possible.

    • @Linguflex
      @Linguflex 6 months ago

      You probably want to use whisper.cpp with a quantized tiny model and grammar sampling. Look up Georgi Gerganov's chess example.

    • @Linguflex
      @Linguflex 6 months ago

      You could also train a wake word model to do this. They are crazy fast and reliable, but specialized for a few keywords. Check out Openwakeword or PvPorcupine.

  • @Moicestlunehiver
    @Moicestlunehiver 5 months ago

    I don't understand how to use it...

    • @Linguflex
      @Linguflex 5 months ago

      What do you want to do?
      The "tests" folder contains some examples of how you can use it:
      github.com/KoljaB/RealtimeSTT/tree/master/tests
      The "tests" folder of RealtimeTTS may also help, since those examples use RealtimeSTT a lot:
      github.com/KoljaB/RealtimeTTS/tree/master/tests