Hi @techiesms, awesome .. thank YOU so much for using/featuring my KALO Recording & Deepgram transcription libraries and your kind ‘Shoutout’, it is my pleasure that i could help you a little bit and your community!. I love your cool projects, and you know: Nobody can explain all tech stuff better than you, your videos and explanations are best !.
Btw: cool hardware details, nice too see e.g. your headphone audio connector, the powering circuits, your auto-reset idea and replay button is a great idea too. So go forward with your great projects .. and i am always happy to support further. Thank you Techiesms for all your great daily work !
Wow! Super interesting project! How about porting that sketch to a device available in the market like a M5 Cardputer, a Lilly-Go T-Deck, a Lilly-Go T- Embed, etc? Those devices include everything needed to just get it up and running as long as the proper libraries are placed into the Arduino Library, then burn the sketch on them. Since many of us don't have the skills to assemble stuff on a PCB board, get the parts needed and can't even solder a button? Other pre-made devices that have ESP32 S3 chips, SD card slot, speaker, buttons, battery connector, lights, screens, microphone, etc, the ones I mentioned would save many of the pains on building our own from scratch. I do see your point of creating home made projects though. I wish I had the skill you have and the talents of creating these IoT's. I love gadgets, but lack the programming aspects of these ESP32's. It's why it's a blessing to have you available in YT. Thanks for sharing this video brother! Stay safe and blessings!
I am trying to make all this using ESP8266, and I have actually worked on it for the last 30 days, not continuously but with a little change according to your thinking. Like, I have put this system in my wristwatch like a hidden spy. I have already made the script for ESP8266, it is working but only the part of STT and TTS is left. This is very challenging for me, like custom script or something more. But now I am copy-pasting and thank you very much for this video.
@@lavanyabhamidipati992 use the ESP32-C3 Super Mini. For the microphone, I use the xcluma INMP441 Omnidirectional High Precision MEMS Microphone Module with I2S Support.
Hi everyone, i tried this project and found the way to avoid use of SD card by directly using SPIFFS memory which can be able to store 220 seconds of audio at 8k and 8 bit which is more than enough, for this AI assistant purpose we genrally need about 5-10 seconds, so i think it is the most reliable way to use it, also it will boost some speed for reading and writing files...
Gd evening sir , can I contact you for some guidance on the above project because u have already tried , please sir reply because we r going through this project
Hello. First of all, thank you very much for sharing each and every detail of the project. I have one query regarding the schematic, what is the purpose of Q1 MOSFET?
As has been mentioned, it would be nice for the project to be operated with a wake word, so that you didn't need to push a button. It might also be nice, for it to recognize certain words and phrases & activate some subroutines, to control devices attached to it. If there aren't enough I/O ports, maybe it could send messages to nearby ESP32 devices & activate equipment that way. What I'd like to use it for, is for creating a voice interactive robot, that you could control with your voice. Perhaps, it could follow you, or take pictures with its camera, after being to told to do so, with a voice command. Maybe, it could communicate with a Raspberry Pi or other computer, to figure out answers, without the Internet.
Good one, but instead of onboard coding, we should use a free cloud hosted endpoint. It will be better allowing us our own custom knowledge and more 🎉🎉🎉
You can use Deepgram itself for text to speech too. You won't have the hassle of chunking the response text. Just like 750 hours of voice to text free, Deepgram text to voices is free up to 250 hours.
you are right, Deepgram offers TTS .. but as I know they do NOT offer any languages beyond English (as today). The Open AI TTS voices are multilingual, each of the 6 voices can speak multiple languages (even changing language in one sentence). And i could not find any ESP32 library supporting their 'steaming concept' for TTS. If you know a library or function call, let us know please!
Wow, bravo. I did something similar (in python) on a 2gb RAM le Potato (minus the convenient button features, BMS, etc.). Didn't think this would be possible on a microcontroller with 520kb RAM. It does seem a bit slow at times in your demonstration, at least compared to my 2gb version. Could this be sped up with streaming? Or do you already include streaming in your .ino code? Very nice project!
I am also working on the idea to extend my recording library with Streaming (streaming I2S bytes realtime from microphone to Deepgram 'during' recording).. but this is crazy complex at least for me (with ESP32 & C, maybe easier in Python, which i won't use on ESP due several reasons), not sure i will ever get int work :D . We will see.
Amazing project! Do you think that maybe you'll be offering the gerber files for this project in the future so some of us can order custom PCBs from PCBGOGO in the future?
brother, your schematic doesnt show all the components you used. like resistors, battery monitoring circuit. can you please please tell me every component you used and how many
@techiesms I have an esp32 lyraT 4.3 board on which I have installed alexa using esp32-adf sdk, just curious if I can use this code on same with all required components already present on the board.
Hlo bro can you share the circuit diagram for making it on a zero pcb board without the smd components I seen the Schematic diagram i dont understand that make it simple plz
yha hai i am a B-Tech final year student i thought of making a smart device with Raspberry pi 4 where the os look like Ai, And if we ask any thing it will show in display and respond like alexa indirectly an alexa with a display any idea for it
I have interesting idea and I am currently working on that I want to make tutorial of that like how to mke fully independent ai pocket device compact in size and has voice assistants and object recognization and has display on it and showing captions on that portable and use with sim card maybe thats great idea for your channel can you please make that ❤
So the ESP32 sends the wave file to DeepGram, then it sends the text result to the (Gemini) LLM, where that output is sent to Google's Text-To-Speech? Since DeepGram and the TTS are not free.. is it possible to just have a paid OpenAI account, and send the wave to the it for end-to-end processing?
that is a great project. But can we do away with the button pressing part ? Can the listening trigger be voice activated with a specific word or phrase ?
Hi @techiesms can you name all the parts or components used in this project and also all things simply and easy to understand because i am just a beginner and don't know about it anything but I want to make this
sir i am buying one iot project from your website but i have only 1500 rs, and it is of 2922, its multi pursose gps tracker kit, please tell me u can help me or not because i requre it for my project please sir 😢
I am really intrestest in build various kind of project but due to the cost and desire component availability i cant build one , i have a request if u could develop a affordabe micricontroller board under 200 rupess having type c interface for programming and power wifi connectivity, as well as consisting of all type of communication interface if possoble like i2c,i2s,uart,spi and also if possible can bus having small foothprint with castaleted pins and also inbuilt charging ic if possible but i know it difficult to build this within such a low cost , but even if u could add a type c port to the node mcu esp8266 and launch it on your website i would really like to purchase it even my friend ones one for our project i really want a affordable mcu under 200 to tinker and learn on it plz share this comment as much as possible
Hi @techiesms, awesome .. thank YOU so much for using/featuring my KALO Recording & Deepgram transcription libraries and your kind ‘Shoutout’, it is my pleasure that i could help you a little bit and your community!. I love your cool projects, and you know: Nobody can explain all tech stuff better than you, your videos and explanations are best !.
Btw: cool hardware details, nice too see e.g. your headphone audio connector, the powering circuits, your auto-reset idea and replay button is a great idea too. So go forward with your great projects .. and i am always happy to support further. Thank you Techiesms for all your great daily work !
Thank you so much....
Sir can you send me the complete circuit diagram not schematic@@techiesms
Hello @kaloprojects please is there a way to contact you, through email or something, I am stuck on a project and desperately need your help
Wow! Super interesting project! How about porting that sketch to a device available in the market like a M5 Cardputer, a Lilly-Go T-Deck, a Lilly-Go T- Embed, etc? Those devices include everything needed to just get it up and running as long as the proper libraries are placed into the Arduino Library, then burn the sketch on them. Since many of us don't have the skills to assemble stuff on a PCB board, get the parts needed and can't even solder a button? Other pre-made devices that have ESP32 S3 chips, SD card slot, speaker, buttons, battery connector, lights, screens, microphone, etc, the ones I mentioned would save many of the pains on building our own from scratch. I do see your point of creating home made projects though. I wish I had the skill you have and the talents of creating these IoT's. I love gadgets, but lack the programming aspects of these ESP32's. It's why it's a blessing to have you available in YT. Thanks for sharing this video brother! Stay safe and blessings!
Based on your suggestion I’ll try to do with other boards that I have
@@techiesms Looking forward to seeing your next video related to the sketch on other boards/devices. Take care and blessings. 🙏✨
I am trying to make all this using ESP8266, and I have actually worked on it for the last 30 days, not continuously but with a little change according to your thinking. Like, I have put this system in my wristwatch like a hidden spy. I have already made the script for ESP8266, it is working but only the part of STT and TTS is left. This is very challenging for me, like custom script or something more. But now I am copy-pasting and thank you very much for this video.
hi what speaker and mic did you use for this project
@@lavanyabhamidipati992 use the ESP32-C3 Super Mini. For the microphone, I use the xcluma INMP441 Omnidirectional High Precision MEMS Microphone Module with I2S Support.
Hi everyone, i tried this project and found the way to avoid use of SD card by directly using SPIFFS memory which can be able to store 220 seconds of audio at 8k and 8 bit which is more than enough, for this AI assistant purpose we genrally need about 5-10 seconds, so i think it is the most reliable way to use it, also it will boost some speed for reading and writing files...
Please tell me
Holly Molly! How did you come about this? I have been looking for ways around this problem for some time now
Gd evening sir , can I contact you for some guidance on the above project because u have already tried , please sir reply because we r going through this project
@@sandhyakota5350 sure
@@sandhyakota5350 sure
In the next one you build, add an AI-THINKER VC-02 so that it is automatically activated by voice without pressing the switch.
Thank you for what I learned from you in Sri Lanka
WOW AMAZING DETAIL!! THANK YOU!! YOU ARE AWESOME!!!!!!!
Great work. Thank you for your great support.
Hello. First of all, thank you very much for sharing each and every detail of the project. I have one query regarding the schematic, what is the purpose of Q1 MOSFET?
Very interesting project. From this you can make a project that can send commands to control your house. I love it.
Awesome video!
As has been mentioned, it would be nice for the project to be operated with a wake word, so that you didn't need to push a button. It might also be nice, for it to recognize certain words and phrases & activate some subroutines, to control devices attached to it. If there aren't enough I/O ports, maybe it could send messages to nearby ESP32 devices & activate equipment that way. What I'd like to use it for, is for creating a voice interactive robot, that you could control with your voice. Perhaps, it could follow you, or take pictures with its camera, after being to told to do so, with a voice command. Maybe, it could communicate with a Raspberry Pi or other computer, to figure out answers, without the Internet.
I would like suggest you to make a Google assistant device using raspberry pi zero. There are no detailed tutorial videos on this topic.
Good one, but instead of onboard coding, we should use a free cloud hosted endpoint. It will be better allowing us our own custom knowledge and more 🎉🎉🎉
I didn't get your point. Can you elaborate
The esp code cannot be off loaded 😂. The software that can be off loaded has already been off loaded and mentioned as so in the video
Just wanted to know how did you guys solder the small components with the pcb ?
can you make a simple circuit just like gpt in esp? please!
You can bulid a live language translator with this, which will be super useful.
Hi bro . Could you make a updated version by adding a bigger speaker and with camera
Perfect!
Thanks
You can use Deepgram itself for text to speech too. You won't have the hassle of chunking the response text. Just like 750 hours of voice to text free, Deepgram text to voices is free up to 250 hours.
you are right, Deepgram offers TTS .. but as I know they do NOT offer any languages beyond English (as today). The Open AI TTS voices are multilingual, each of the 6 voices can speak multiple languages (even changing language in one sentence). And i could not find any ESP32 library supporting their 'steaming concept' for TTS. If you know a library or function call, let us know please!
Wow, bravo. I did something similar (in python) on a 2gb RAM le Potato (minus the convenient button features, BMS, etc.). Didn't think this would be possible on a microcontroller with 520kb RAM. It does seem a bit slow at times in your demonstration, at least compared to my 2gb version. Could this be sped up with streaming? Or do you already include streaming in your .ino code? Very nice project!
I am also working on the idea to extend my recording library with Streaming (streaming I2S bytes realtime from microphone to Deepgram 'during' recording).. but this is crazy complex at least for me (with ESP32 & C, maybe easier in Python, which i won't use on ESP due several reasons), not sure i will ever get int work :D . We will see.
@@kaloprojects would love to check that out
Amazing project! Do you think that maybe you'll be offering the gerber files for this project in the future so some of us can order custom PCBs from PCBGOGO in the future?
Can we add a ov2640 module and send a frame to gemini api when a particular word like “what am i seeing” and get response?
what audio library did you used for this project?
can we use a bluetooth speaker so that we can use mic and speaker + reduse the size of circuite
esp have inbuilt bluetooth right?
is it possible?
what is the BUTTON1 in BMS and Power Supply schematic
Maintenance mode is on
Site will be available soon. Thank you for your patience!
brother, your schematic doesnt show all the components you used. like resistors, battery monitoring circuit. can you please please tell me every component you used and how many
use sim for internet connection please . Then we can use it without wifi and mobile
@techiesms I have an esp32 lyraT 4.3 board on which I have installed alexa using esp32-adf sdk, just curious if I can use this code on same with all required components already present on the board.
Thank you sir ❤
Most welcome
Please make a video on streaming the song from Spotify using esp32 and audio decoder module, it's very useful for us.
Brilliantly clear tutorial. The soldering seems to happen by magic
Thanks
Can we add multiple wifi said and pass so it will connect to any avilable
Hello,Any plans to provide this as building kit?
Building Kit Means?
@@techiesms components and pcb
bro you are really intelligent
Hlo bro can you share the circuit diagram for making it on a zero pcb board without the smd components
I seen the Schematic diagram i dont understand that make it simple plz
@techiesms reply me
yha hai i am a B-Tech final year student i thought of making a smart device with Raspberry pi 4 where the os look like Ai, And if we ask any thing it will show in display and respond like alexa indirectly an alexa with a display any idea for it
Please make a video of making it with easy ways! ❤
bro can you please make a voice module price frendly in veroboard with deepgram and esp32 which is full pack of (alaxa or chtgpt)
I literally had the same idea one week ago. Even built a prototype hahaha
Best of all: With the same components, even.
teach me bro
LOL
No issues, Now you can easily make it with the help of this video
@@techiesms bro did u have any connection diagram
Can this Gemini AI control things over esp32 like house automation , or servos , or leds in any project ?
Not exactly
But to do that you can use the speech to text response and compare it to control the appliances
I have interesting idea and I am currently working on that I want to make tutorial of that like how to mke fully independent ai pocket device compact in size and has voice assistants and object recognization and has display on it and showing captions on that portable and use with sim card maybe thats great idea for your channel can you please make that ❤
Did you succeed?
Not yet 😂@@sakaidosan
Nice project 🎉
Glad you like it!
Fantastic Assistance Portable (FAP) !!!
AP+STA mode would be better where AP mode should take wifi credentials and API inputs
It can be turned into the business of bilingual interpretation .
But can it be operate offline?
Kaise kharede ya wala circuit board
How is this different from Google assistant?
Bro can i use esp32 with micro USB port with out using USB to ttl converter system
Hello sir, Can I get this product along with display extension to check text, if you make so, I will buy this product sir, please add this on?
Where to Put The Google TTS Api Key?
please integrate more AI MODELS like LLAMA, etc.
Thanks!!!
Welcome!
lindo seu projeto amigo
seria possivel acopla esse seu projeto a rede de comunicação na rede can do veiculo para comandar algumas funções
Awesome
So the ESP32 sends the wave file to DeepGram, then it sends the text result to the (Gemini) LLM, where that output is sent to Google's Text-To-Speech?
Since DeepGram and the TTS are not free.. is it possible to just have a paid OpenAI account, and send the wave to the it for end-to-end processing?
BTW, any thoughts on "Home / VOICE RECOGNITION MODULE- VC02 -AI THINKER"?
that is a great project. But can we do away with the button pressing part ?
Can the listening trigger be voice activated with a specific word or phrase ?
Go ahead and do that, nobody’s stopping you
Maza a gaya is project say . Alampana Tusi great ho tofa Kabul Karo .🎉🎉
Bs kr pagle, Rulayega kya....
I want to buy this completely assembled hardware but on ur website its showing it will be back on 30th November can u help me out to get it sooner 🙏
Sir it’s not in stock
We ordered it and waiting for the delivery of components
@@techiesms how many days after 30th will we get the product ( living in Mumbai )
@cerlpearl kindly share the pincode on our WhatsApp
8200079034
Which software used for 3d vision
It was 3-D view in easy EDA
Hi @techiesms can you name all the parts or components used in this project and also all things simply and easy to understand because i am just a beginner and don't know about it anything but I want to make this
You can refer this schematic to check out all the parts
Please made a video on simple circuit like on chat gpt
make an 3d printed case for better convinence
bro is this capable of real time data like time date and weather
最高です!
Tシャツ欲しい。
We can make device to anytime listening
Kitne paise lg jayenge ise bnane me including all parts and pcb and assembling them
We are selling it already through our website
I would like to integrate it with a sim card so that it doesn't require to connect to any hotspot
try edgetts?
Plz Send the circuit diagram in easy manner for without PCB makers
Bro i am completed your masters of iot course but how to claim certificate ???
Share the assignments on techiesms@gmail.com
Can the code be entered into a Panel touch screen ESP32-4848S040 so that we can see the answer on the screen ?
Can I buy the complete built hardware part without buying pcb from pcbgogo
You can buy complete hardware assembled on PCB from our website
Can we run this locally......?
no. its over internet
what if we add a display to it, please make it
Do you ship to Australia
Yes.
Kindly WhatsApp us on +91 82000 79034
what will be the application of this project
Sir di chij bata do google tts free kaise mile Or sir second chij sir ye esp8266 se kar sakte h
can you make faster reply from the device ?
Where did get pcb bro ?
We are not selling PCB, rather we are selling the complete project
@@techiesms assemble project or compounds of project?
Add NXP local voice recognition ic to this
Does the hardware cimw in srilanka
Yes
Kindly WhatsApp on +918200079034
Bro your chatgpt voice assestant v2 speech to text library file is error occur (compile time)give me any solution
can u pease make a 3d case fr this
Sir can I know the actual cost of the product
It's rabbit r1 with the latest patches.
good vido
Could you please share your PCB gerber file?
Helo bro where is schematic diagram,, can u provide link plz...
Its in the GitHub repo.
the link is in the description
@@techiesms thanks bro....
can u provide the gerber file for the pcb?
No sorry
But you can find the schematic in the video
❤❤🤩🤩🤩🤩
Hlo sir you will make one for me please
ERROR in Record_Start() - I2S not initialized, call 'I2S_Record_Init()' missed, I'm getting this error when pressing record button, please help
Same Here
Price
Sir program shows the error can you please help us
Share what error you are facing
I checked all connections I checked i2c microphone but it doesn't recognise speech to text it doesn't convert speech in to the text
Can i get the files for pcb pls?
sir i am buying one iot project from your website but i have only 1500 rs, and it is of 2922, its multi pursose gps tracker kit, please tell me u can help me or not because i requre it for my project please sir 😢
That's great but you're entirely dependent on 1) an internet connection and 2) 3rd party services...
It’s a DIY project so I guess it’s ok…
I am really intrestest in build various kind of project but due to the cost and desire component availability i cant build one , i have a request if u could develop a affordabe micricontroller board under 200 rupess having type c interface for programming and power wifi connectivity, as well as consisting of all type of communication interface if possoble like i2c,i2s,uart,spi and also if possible can bus having small foothprint with castaleted pins and also inbuilt charging ic if possible but i know it difficult to build this within such a low cost , but even if u could add a type c port to the node mcu esp8266 and launch it on your website i would really like to purchase it even my friend ones one for our project i really want a affordable mcu under 200 to tinker and learn on it plz share this comment as much as possible