Видео 32
Просмотров 23 021

0:56

Ollama & Home Assistant: Ultimate Privacy with Linguflex for Smart Homes

1:20

Linguflex remote controlled with phone browser

1:05

Milkshake Scene with realtime speaker diarization algorithm

1:10

Realtime speaker diarization algorithm

4:25

Current progress: developing a realtime speaker diarization algorithm

2:33

Working on a general memory for Linguflex

AI memory solution with minimal latency that automatically extracts, stores and retrieves user information during conversations. In systems like MemGPT users must explicitly say "save this info". With this approach the AI chooses when to save information autonomously.
So when a user says "my name is Steve" this system automatically stores this detail without additional prompts. With MemGPT, the user would need to say "My name is Steve, please save that information."
#RAG #Memory

Видео

0:56

Linguflex with custom wakewords

Просмотров 109Месяц назад

Custom wakewords (like "samantha" and "linguflex") now possible in Linguflex. github.com/KoljaB/Linguflex/

Ollama & Home Assistant: Ultimate Privacy with Linguflex for Smart Homes

1:20

Ollama & Home Assistant: Ultimate Privacy with Linguflex for Smart Homes

Просмотров 580Месяц назад

Say hello to total privacy with #Ollama support for #HomeAssistant in Linguflex. Smart home automation running fully locally. #OpenSource #SmartHome github.com/KoljaB/Linguflex github.com/KoljaB/Linguflex/blob/main/docs/home.md github.com/KoljaB/Linguflex/tree/main/lingu/modules/home

Linguflex remote controlled with phone browser

1:05

Linguflex remote controlled with phone browser

Просмотров 3232 месяца назад

Linguflex remote controlled with phone browser

Milkshake Scene with realtime speaker diarization algorithm

1:10

Milkshake Scene with realtime speaker diarization algorithm

Просмотров 3274 месяца назад

3 Speakers, short sentences and music at the end. Code available here: github.com/KoljaB/WhoSpeaks File used: realtime_diarize.py

4:25

Realtime speaker diarization algorithm

Просмотров 1,7 тыс.4 месяца назад

Realtime diarization of the coin toss scene from "No Country for Old Men", which is a challenge for every speaker diarization engine. Both speakers have a similar voice characteristics, they speak quiet and are hard to understand. This tests a new voice characteristics grouping algorithm with automatical speaker number detection that learns and gets better with time. Code is online here: github...

Current progress: developing a realtime speaker diarization algorithm

2:33

Current progress: developing a realtime speaker diarization algorithm

Просмотров 1594 месяца назад

Completely unpolished tests. Basic idea seems to work, yet very early work state still, needs more tests for verification and lots of refinement.

First realtime speaker diarization algorithm test

3:30

First realtime speaker diarization algorithm test

Просмотров 1534 месяца назад

Completely unpolished first test. Basic idea seems to work, yet very early work state still, needs more tests for verification and lots of refinement.

Linguflex 2.0 with Samuel L. Jackson voice

1:08

Linguflex 2.0 with Samuel L. Jackson voice

Просмотров 1354 месяца назад

Not Elevenlabs, just XTTS and RVC postprocessing with finetuned models. All local generated in realtime.

0:50

Linguflex 2.0 with Snoop Dogg voice

Просмотров 1274 месяца назад

Not Elevenlabs, just XTTS and RVC postprocessing with finetuned models. All local generated in realtime.

Linguflex 2.0 with David Attenborough voice

0:44

Linguflex 2.0 with David Attenborough voice

Просмотров 9974 месяца назад

Not Elevenlabs, just XTTS and RVC postprocessing with finetuned models. All local generated in realtime.

MoneyPrinterTurbo AI English Installation Manual

8:37

MoneyPrinterTurbo AI English Installation Manual

Просмотров 5144 месяца назад

Original video: ruclips.net/video/vWBf5p fr4/видео.html Automatically translated with: github.com/KoljaB/TurnVoice CLI command: turnvoice vWBf5p fr4 -l en -v female

6:52

Setup Guide: Linguflex 2.0 on Windows

Просмотров 7014 месяца назад

Step-by-step video for installing Linguflex 2.0 AI assistant on Windows, featuring local operation and ultra-low latency. For detailed documentation, visit the GitHub repository. github.com/KoljaB/Linguflex

Speech interruption in linguflex 2.0, a free open-source personal ai assistant

0:41

Speech interruption in linguflex 2.0, a free open-source personal ai assistant

Просмотров 3055 месяцев назад

Source code: github.com/KoljaB/Linguflex /tree/lingu-2.0-preview Features: - lightning fast assistant - custom personalities - allows usage of local llms - high quality local realtime tts - easily extendable with your own functions - allows huge number of functions with keyword filtering

Replacing six speakers at once with azure voices with a single cli command

3:54

Replacing six speakers at once with azure voices with a single cli command

Просмотров 1518 месяцев назад

Replacing six speakers at once with azure voices with a single cli command

#coqui ➕ https://github.com/KoljaB/TurnVoice = 🔥

1:01

#coqui ➕ https://github.com/KoljaB/TurnVoice = 🔥

Просмотров 2078 месяцев назад

#coqui ➕ github.com/KoljaB/TurnVoice = 🔥

0:41

Voice changed with TurnVoice

Просмотров 1758 месяцев назад

Voice changed with TurnVoice

0:34

Exchange YT video voices.

Просмотров 578 месяцев назад

Exchange YT video voices.

RealtimeTTS v0.3.3 now supports OpenAI TTS

0:16

RealtimeTTS v0.3.3 now supports OpenAI TTS

Просмотров 4458 месяцев назад

RealtimeTTS v0.3.3 now supports OpenAI TTS

RealtimeTTS v0.3.0 with simplified chinese support

0:40

RealtimeTTS v0.3.0 with simplified chinese support

Просмотров 1348 месяцев назад

RealtimeTTS v0.3.0 with simplified chinese support

0:49

RealtimeTTS Speed and Quality Demo

Просмотров 6159 месяцев назад

RealtimeTTS Speed and Quality Demo

Fast local AI talk with a custom voice based on Zephyr model, RealtimeSTT and RealtimeTTS libraries.

3:53

Fast local AI talk with a custom voice based on Zephyr model, RealtimeSTT and RealtimeTTS libraries.

Просмотров 1,6 тыс.9 месяцев назад

Fast local AI talk with a custom voice based on Zephyr model, RealtimeSTT and RealtimeTTS libraries.

Realtime translation in 6 languages with RealtimeSTT library in under 80 lines of code.

0:48

Realtime translation in 6 languages with RealtimeSTT library in under 80 lines of code.

Просмотров 2 тыс.10 месяцев назад

Realtime translation in 6 languages with RealtimeSTT library in under 80 lines of code.

1:29

TTS and STT in a loop

Просмотров 14811 месяцев назад

TTS and STT in a loop

RealtimeSTT: A low-latency speech-to-text library with advanced voice activity detection

1:54

RealtimeSTT: A low-latency speech-to-text library with advanced voice activity detection

Просмотров 2 тыс.11 месяцев назад

RealtimeSTT: A low-latency speech-to-text library with advanced voice activity detection

0:15

RealtimeSTT: A low-latency speech-to-text library with advanced voice activity detection

Просмотров 28411 месяцев назад

RealtimeSTT: A low-latency speech-to-text library with advanced voice activity detection

Voicebased interface to language model based on two new python libraries I developed

1:34

Voicebased interface to language model based on two new python libraries I developed

Просмотров 21711 месяцев назад

Voicebased interface to language model based on two new python libraries I developed

Low latency AI voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming.

0:38

Low latency AI voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming.

Просмотров 6 тыс.Год назад

Low latency AI voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming.

Wake word activation, smart home control, music playout and the sophia girlfriend personality

2:51

Wake word activation, smart home control, music playout and the sophia girlfriend personality

Просмотров 389Год назад

Wake word activation, smart home control, music playout and the sophia girlfriend personality

1:47

Complex requests with linguflex

Просмотров 1,5 тыс.Год назад

Complex requests with linguflex

@sujaldarode1649 8 дней назад
u got github ?
@Linguflex 8 дней назад
Yes, github.com/KoljaB. No code for this one up yet tho, bcs it's too early work state
@JosuéHenriqueBeckerSchwartzhau 18 дней назад
Can I run it in a raspberry pi 3 B?
@lokeshart3340 19 дней назад
Does it need gpu cause I have a i3 only
@Tigas4ever Месяц назад
Can you help me? :( TomlDecodeError: Reserved escape sequence used (line 100 column 1 char 3696) Traceback: File "C:\Users\tiago\miniconda3\envs\MoneyPrinterTurbo\lib\site-packages\streamlit untime\scriptrunner\script_runner.py", line 584, in _run_script exec(code, module.__dict__) File "C:\Users\tiago\Desktop\Try\MoneyPrinterTurbo\webui\Main.py", line 34, in <module> from app.services import task as tm, llm, voice File "C:\Users\tiago\Desktop\Try\MoneyPrinterTurbo\app\services\task.py", line 8, in <module> from app.config import config File "C:\Users\tiago\Desktop\Try\MoneyPrinterTurbo\app\config\__init__.py", line 6, in <module> from app.config import config File "C:\Users\tiago\Desktop\Try\MoneyPrinterTurbo\app\config\config.py", line 42, in <module> _cfg = load_config() File "C:\Users\tiago\Desktop\Try\MoneyPrinterTurbo\app\config\config.py", line 30, in load_config _config_ = toml.loads(_cfg_content) File "C:\Users\tiago\miniconda3\envs\MoneyPrinterTurbo\lib\site-packages\toml\decoder.py", line 514, in loads raise TomlDecodeError(str(err), original, pos)
@Linguflex Месяц назад
Sorry, can't help. I have nothing to do with MoneyPrinterTurbo. This is a translation example as showcase for my TurnVoice GitHub project.
@AbishekAjaiSatnur-v4x 22 дня назад
You gotta put the pexels API key in quotes like this - """ pexels_api_keys = [ "nE113EOVlRVbpWvRE0yZFuy6KmM9WAqvelyadayadayada",] """.
@nairdrive4825 Месяц назад
Action latency is yet to be improved , amazing project❤
@allfather_ogre Месяц назад
Great work..do you have any ideas to reduce latency in text to speech..im working on it..
@MrMoralHazard Месяц назад
Looking promising!
@ToddCWalker 2 месяца назад
Does Linguflex allow you to interrupt the model?
@Linguflex 2 месяца назад
Yes, it does. Voicebased or via escape key. See: ruclips.net/video/uQ8jJtalc9M/видео.html
@bigdaddy5303 2 месяца назад
Hey mate love your work. Unfortunately i cant get this software to run. Finally got your chat package working though (forget the name), adapted it for ollama (using langchain) as i just cant get the cuda version of llama cpp working (so takes about 20 secs for a response)...langchain just seems a little more straight forward and ollama is blazing fast. I was also experiencing a lot of those pipe issues, changing python version fixed it for me. Seems a difficult bit of software to maintain (like all AI packages) given the number of dependencies which are very sensitive to exact versions of other packages. But keep it up, amazing work.
@Linguflex 2 месяца назад
Codebase for this: github.com/KoljaB/EmoTTS It uses OpenAI. Ollama is best choice I think for local models. You're right about maintaining AI projects ends up in dependency nightmares. Of course you can set fixed versions, but if not all dependent libs do this too it will mess up the environment nevertheless sooner or later.
@bigdaddy5303 2 месяца назад
@@Linguflex will be amazing when local llms are capable of this. Ive never bothered with openAI for these things, though I am a paid subscriber, aren't the costs in addition to that to use their API? whilst I only have 12GB of vram on my laptop, stuck with the comparatively dumb llms. Though even some of the coqui voice models at time express emotion, the quality of the text they are reading obviously plays a huge role. The thing. I hate about all ai libraries being stuck to specific python versions and package versions...is I must have installed a venv for just about every bloody python version and every bloody py torch version. My love/hate relationship with python.
@Linguflex 2 месяца назад
@@bigdaddy5303 Very true words. It was an insane hazzle to combine all needed libs for linguflex into one single environment. We already have ollama support for linguflex in some implementations on our discord (so I have and another dev but we didnt release it yet). Should be coming soon...
@Archie_Moran 2 месяца назад
would this project be implementable into a business
@Linguflex 2 месяца назад
depends (tts license etc), but sure
@breakerboy365 2 месяца назад
I'm attempting to run a few example clients provided in the git. Maybe there's something obvious I'm missing, but an error I commonly run into is: ModuleNotFoundError: No module named 'RealtimeSTT'
@Linguflex 2 месяца назад
If you did perform "pip install realtimestt" before then probably your pip/python env is somehow corrupted. You could try it in a virtual env.
@breakerboy365 2 месяца назад
@@Linguflex I knew it was something obvious 🤦 Don't know how I missed that step lol Thank you for your help
@datfeeshboi9526 2 месяца назад
im gonna try to make a Vrchat STT app that puts the words above my head using their osc system :D
@NAE333 2 месяца назад
Awesome. How is interruptions implemented here? Thank
@Linguflex 2 месяца назад
The interruptions are triggered by the volume level detected on the microphone. It's a straightforward implementation that unfortunately also gets interrupted if the text-to-speech (TTS) output is too loud. To eliminate this issue, implementing echo cancellation is necessary, but that process is quite complex and not trivial
@NAE333 2 месяца назад
Very nice. Greatjob❤ Out of curiosity, how would you handle back to back conversation with interruptionhandling without using space?
@Linguflex 2 месяца назад
Thank you. We talked about how to do solid interruption in my discord channel recently: discord.gg/f556hqRjpv Highly encourage you to join, it's a great place to ask questions, share progress and get support from tech enthusiasts. Would love to see you there!
@DengAbuoiDeng 2 месяца назад
I really love your content and aspire to be like you. Although I am new to programming, I am learning Python and JavaScript simultaneously, which has been quite challenging. I live in the Kakuma refugee camp, where few people are interested in contributing to technology, focusing more on their own lives. Could you please update your full roadmap so that I can follow it? Additionally, it would be great if you could create a Discord community for discussions and support.
@Linguflex 2 месяца назад
Thank you so much for your kind words! ❤ Great to hear that you're learning both Python and JavaScript at the same time in Kenya, I think your determination will definitely pay off. I don't have a specific roadmap since I usually just develop projects based on inspiration and current interests. I highly encourage you to join our Discord community: discord.gg/f556hqRjpv It's a great place to ask questions, share progress and get support from tech enthusiasts. Would love to see you there!
@DengAbuoiDeng 2 месяца назад
🤑🥰🥰🥰🥰🥰
@AK-ox3mv 2 месяца назад
I was looking to a way to add auto generated subtitle to offline videos. Can it be used for this porpuse? Didn't it need any speech recognition model?
@Linguflex 2 месяца назад
I wouldn't use my realtime libraries for this: they are optimized for speed, not quality. I'd generate a srt file transcript with word-level timestamps with a good STT library like stable_whisper, then add the subtitles to the video using that srt file (ffmpeg can do this for example).
@modicool 3 месяца назад
That's incredibly accurate. Nice work! Can you active-transcribe AND wake-word for commands? It'd be great if you could have it always listening and then do something on wake word.
@Linguflex 3 месяца назад
No, currently not. The idea is good, I can see some use-cases for this. I'll think about that.
@DengAbuoiDeng 3 месяца назад
I love you work
@DengAbuoiDeng 3 месяца назад
Can I have it
@Linguflex 3 месяца назад
Sure, here's the repo: github.com/KoljaB/WhoSpeaks
@Centaurman 3 месяца назад
Can this be used hands free or does the mic need toggle per reply?
@Linguflex 3 месяца назад
Can be used hands free
@Centaurman 3 месяца назад
This amazing! Is there anyway I could run this on my computer then talk to it using a headset connected to my phone so it’s mobile?
@Moicestlunehiver 3 месяца назад
Hello man I have difficulties running your library, can you help me ?
@Linguflex 3 месяца назад
Sure. Could you write your problems as an issue, so others can see how it's solved afterwards? Depending on if it's input or output: github.com/KoljaB/RealtimeSTT/issues github.com/KoljaB/RealtimeTTS/issues
@Moicestlunehiver 3 месяца назад
I don't understand how to use it...
@Linguflex 3 месяца назад
What do you want to do? The "tests" folder contains some examples how you can use it: github.com/KoljaB/RealtimeSTT/tree/master/tests Maybe also the "tests" of RealtimeTTS can help, they also use RealtimeSTT a lot: github.com/KoljaB/RealtimeTTS/tree/master/tests
@RolandoLopezNieto 3 месяца назад
Impressive work, thanks
@42ndMoose 3 месяца назад
at the prerequisites 0:39 i downloaded cuDNN as a zip file, but i have no idea where to place it. is there an executable inside or do i extract them somewhere specific? (edit, for people that will be wondering the same: ruclips.net/video/OEFKlRSd8Ic/видео.html ) as well as ffmpeg. as a non developer, it's not pretty straightforward :( (edit: tutorial for this one ruclips.net/video/jZLqNocSQDM/видео.html )
@Linguflex 3 месяца назад
Thanks a lot for this information, as a dev I'm often just not aware of the problems the users have. I'll rework the Prerequisites section, thx again.
@denysolleik9896 4 месяца назад
And this is the worst it will be. Can’t wait for next update.
@GlassesAndCoffeeMugs 4 месяца назад
Diarrheazation?
@karimjedda 4 месяца назад
This is epic, thank you for sharing!
@ThomasRooth 4 месяца назад
Insanely good distinction after just a moment. Meanwhile i had to try and close my eyes and try to guess myself. Might aswell have been a monologue
@chrisivan_yt 4 месяца назад
Interesting but clearly struggling a bit lol
@nomad4850 4 месяца назад
What language you used?
@Linguflex 4 месяца назад
Python
@niscchay 4 месяца назад
I know basic python, where do I go next?
@GlassesAndCoffeeMugs 4 месяца назад
@@niscchay he literally posted the github page lol
@thegrassguy2871 4 месяца назад
Wheat
@MrWuzey 4 месяца назад
Hello, thank you for this great project. I know it's not the point but would it be possible to add the option to use text as input for the user ?
@Linguflex 4 месяца назад
Yes absolutely, we need that. Any ideas where to place it in the UI?
@MrWuzey 4 месяца назад
@@Linguflex I know there are quiet few icons already but the simplest would be maybe to add another icon that would open a chat window ?
@thecalicocowboy 4 месяца назад
Super cool! is this in a public repo?
@Linguflex 4 месяца назад
Not yet but soon
@ccapp3389 4 месяца назад
Nice one! I look forward to trying this out
@MrSuntask 4 месяца назад
very erotic voice 🙂
@Linguflex 4 месяца назад
Thx 😀 took me forever to get that soft whisper into the voice, it's included in linguflex btw
@tiredbusinessdad 4 месяца назад
That is awesome 👏 I just tested it. Great work! Is it possible to change some settings in the interruption? And if so, which file should I do it in?
@Linguflex 4 месяца назад
Thanks a lot. ❤You can click on the 👂button, then a window opens. At the bottom of that window you can customize the trigger volume for speech interruption. Also you can disable speech interruption completely by adding allow_speech_interruption: false to the listen section of the settings.yaml file (in the Linguflex/lingu folder).
@tiredbusinessdad 4 месяца назад
@@Linguflex Thank you ☺️ But I was thinking about to try to build on top of it. Because you might want to only interrupt with certain things, so it doesn’t get interrupted with phrases like “Ahh, I see” or short confirming answers
@Linguflex 4 месяца назад
Right, that's actually a great idea!
@tiredbusinessdad 4 месяца назад
@@Linguflex - just thought you would be able to have a more realistic kind of conversation, if it could be able to detect things like that while it’s speaking, so it doesn’t interrupt for everything said.
@tiredbusinessdad 4 месяца назад
@@Linguflex - Another thing; is there somehow that I can reach out to you privately? I have a business idea that I have been working on for a while, and I have been following you for quite some time now, and would be interested to pitch an idea for a collaboration, if you are interested?
@MrSuntask 5 месяцев назад
Wow thats fast!
@DarkChaosMC 5 месяцев назад
Gut dass ich Deutsch, Englisch und Spanisch spreche, sonst würde diese Demo keinen Sinn machen
@LostPostProductions 5 месяцев назад
Thank you for showing us your library in action as well as letting us know how we can support it!
@kimjongoof5000 5 месяцев назад
Is it possible to make the voice dictation instantaneous at the cost of accuracy? I want to try controlling the servos on an animatronic mouth with voice dictation. It doesn't have to be accurate, it just needs to be accurate enough to be convincing and as fast as possible
@Linguflex 4 месяца назад
You probably want to use whisper.cpp with a quantized tiny model and grammar sampling, look up Georgi Gerganov's chess example.
@Linguflex 4 месяца назад
You could also train a wake word model to do this. They are crazy fast and reliable but specialized on few keywords. Check Openwakeword or PvPorcupine.
@sergitorrabadella 6 месяцев назад
It's impressive! Which GPU are you using?
@Linguflex 6 месяцев назад
Thank you. I have a RTX 2080 Super.
@sergitorrabadella 6 месяцев назад
@@Linguflex Thanks for your answer! I have some questions. I've seen your email in the comments, can I email you?
@Smashachu 6 месяцев назад
Actually you want about 100 MS of delay at the very least. We're human and take time to process information and it would just seem unnatural to have a conversation where you felt like someone was finishing your sentences for you all the time.
@tiredbusinessdad 6 месяцев назад
I have followed a lot of your projects on GitHub! And I am amazed by your work. What platform would be able to generate the fastest and most realistic sounding voices from an input?
@MrStellateWaffle 6 месяцев назад
I'm on windows 10 and when I try to install linguflex, it says "Python not found". Do you know how to fix this?
@Linguflex 6 месяцев назад
You need to install python 3.9.9 first from here www.python.org/downloads/release/python-399/ Another linguflex release is coming soon too (hopefully release in ~2 weeks)
@MrStellateWaffle 6 месяцев назад
@@Linguflex Already installed that python version, but I'm happy to hear another release is coming! Good luck!
@Linguflex 6 месяцев назад
@@MrStellateWaffle Then maybe python was not added to the system's PATH environment variable
@MrWuzey 6 месяцев назад
@@Linguflex It's a good news, i really like the work you are doing with your various libs.
@MrStellateWaffle 6 месяцев назад
I can't seem to install linguflex. Step 7: Launch Linguflex To do this, run the following batch file in the Linguflex installation folder: "start_linguflex" Alternatively: python linguflex.py When I do step 7, it says "Python not found". Do you know how I can fix this?
@akashraut8129 7 месяцев назад
hi Buddy!! Im trying this approach but getting error, I have trained voice assitant using langchain and gpt 3.5 turbo and using elevenlabs api and opean ai api but latency is not reducing
@roller1211 7 месяцев назад
Wow, awsome!
@preenanahnaf 7 месяцев назад
Hey brother! When i am running your program it is showing rate limit error. btw I am using free tier of openai
@Linguflex 7 месяцев назад
Elevenlabs or Openai API ran into rate limit. Check characters used in elevenlabs and settings limits in your openai account
@preenanahnaf 7 месяцев назад
@@Linguflex it is saying openai limit crossed. i am using free tier of openai. is free tier enough for this program to run or i must upgrade to paid tier?
@Linguflex 7 месяцев назад
Paid account, it needs openai api key.
@_mobasshir_ 7 месяцев назад
Great
@k0hacuu 7 месяцев назад
Incredible work! Found your projects today and I cannot describe in words how impressive this all is. +1!

Linguflex

Комментарии