Portable AI Voice Assistant using ESP32 & Gemini AI 🔥🔥 |

Поделиться
HTML-код
  • Опубликовано: 31 янв 2025

Комментарии • 177

  • @kaloprojects
    @kaloprojects 5 месяцев назад +17

    Hi @techiesms, awesome .. thank YOU so much for using/featuring my KALO Recording & Deepgram transcription libraries and your kind ‘Shoutout’, it is my pleasure that i could help you a little bit and your community!. I love your cool projects, and you know: Nobody can explain all tech stuff better than you, your videos and explanations are best !.

    • @kaloprojects
      @kaloprojects 5 месяцев назад +1

      Btw: cool hardware details, nice too see e.g. your headphone audio connector, the powering circuits, your auto-reset idea and replay button is a great idea too. So go forward with your great projects .. and i am always happy to support further. Thank you Techiesms for all your great daily work !

    • @techiesms
      @techiesms  5 месяцев назад +1

      Thank you so much....

    • @hemanthkumarmp1886
      @hemanthkumarmp1886 4 месяца назад +1

      Sir can you send me the complete circuit diagram not schematic​@@techiesms

    • @DarylChris
      @DarylChris 3 месяца назад

      Hello @kaloprojects please is there a way to contact you, through email or something, I am stuck on a project and desperately need your help

  • @johnny14794
    @johnny14794 5 месяцев назад +15

    Wow! Super interesting project! How about porting that sketch to a device available in the market like a M5 Cardputer, a Lilly-Go T-Deck, a Lilly-Go T- Embed, etc? Those devices include everything needed to just get it up and running as long as the proper libraries are placed into the Arduino Library, then burn the sketch on them. Since many of us don't have the skills to assemble stuff on a PCB board, get the parts needed and can't even solder a button? Other pre-made devices that have ESP32 S3 chips, SD card slot, speaker, buttons, battery connector, lights, screens, microphone, etc, the ones I mentioned would save many of the pains on building our own from scratch. I do see your point of creating home made projects though. I wish I had the skill you have and the talents of creating these IoT's. I love gadgets, but lack the programming aspects of these ESP32's. It's why it's a blessing to have you available in YT. Thanks for sharing this video brother! Stay safe and blessings!

    • @techiesms
      @techiesms  5 месяцев назад +5

      Based on your suggestion I’ll try to do with other boards that I have

    • @johnny14794
      @johnny14794 5 месяцев назад +1

      @@techiesms Looking forward to seeing your next video related to the sketch on other boards/devices. Take care and blessings. 🙏✨

  • @HLX01Creater
    @HLX01Creater 5 месяцев назад +7

    I am trying to make all this using ESP8266, and I have actually worked on it for the last 30 days, not continuously but with a little change according to your thinking. Like, I have put this system in my wristwatch like a hidden spy. I have already made the script for ESP8266, it is working but only the part of STT and TTS is left. This is very challenging for me, like custom script or something more. But now I am copy-pasting and thank you very much for this video.

    • @lavanyabhamidipati992
      @lavanyabhamidipati992 4 месяца назад

      hi what speaker and mic did you use for this project

    • @HLX01Creater
      @HLX01Creater 4 месяца назад

      @@lavanyabhamidipati992 use the ESP32-C3 Super Mini. For the microphone, I use the xcluma INMP441 Omnidirectional High Precision MEMS Microphone Module with I2S Support.

  • @dipanshudhote5671
    @dipanshudhote5671 4 месяца назад +12

    Hi everyone, i tried this project and found the way to avoid use of SD card by directly using SPIFFS memory which can be able to store 220 seconds of audio at 8k and 8 bit which is more than enough, for this AI assistant purpose we genrally need about 5-10 seconds, so i think it is the most reliable way to use it, also it will boost some speed for reading and writing files...

    • @inventors_india_technologies
      @inventors_india_technologies 4 месяца назад

      Please tell me

    • @Charliedave4332
      @Charliedave4332 4 месяца назад

      Holly Molly! How did you come about this? I have been looking for ways around this problem for some time now

    • @sandhyakota5350
      @sandhyakota5350 Месяц назад

      Gd evening sir , can I contact you for some guidance on the above project because u have already tried , please sir reply because we r going through this project

    • @dipanshudhote5671
      @dipanshudhote5671 Месяц назад

      @@sandhyakota5350 sure

    • @dipanshudhote5671
      @dipanshudhote5671 Месяц назад

      @@sandhyakota5350 sure

  • @francegall-web9819
    @francegall-web9819 5 месяцев назад +8

    In the next one you build, add an AI-THINKER VC-02 so that it is automatically activated by voice without pressing the switch.

  • @dishsadanu8979
    @dishsadanu8979 2 месяца назад +1

    Thank you for what I learned from you in Sri Lanka

  • @marshallmann7620
    @marshallmann7620 5 месяцев назад

    WOW AMAZING DETAIL!! THANK YOU!! YOU ARE AWESOME!!!!!!!

  • @braveonder
    @braveonder 5 месяцев назад

    Great work. Thank you for your great support.

  • @Talha80777
    @Talha80777 2 месяца назад

    Hello. First of all, thank you very much for sharing each and every detail of the project. I have one query regarding the schematic, what is the purpose of Q1 MOSFET?

  • @anlpereira
    @anlpereira 5 месяцев назад

    Very interesting project. From this you can make a project that can send commands to control your house. I love it.

  • @RealSnail3D
    @RealSnail3D 4 месяца назад

    Awesome video!

  • @jeremygeorgia4943
    @jeremygeorgia4943 2 месяца назад

    As has been mentioned, it would be nice for the project to be operated with a wake word, so that you didn't need to push a button. It might also be nice, for it to recognize certain words and phrases & activate some subroutines, to control devices attached to it. If there aren't enough I/O ports, maybe it could send messages to nearby ESP32 devices & activate equipment that way. What I'd like to use it for, is for creating a voice interactive robot, that you could control with your voice. Perhaps, it could follow you, or take pictures with its camera, after being to told to do so, with a voice command. Maybe, it could communicate with a Raspberry Pi or other computer, to figure out answers, without the Internet.

  • @jaydebkarmakar624
    @jaydebkarmakar624 4 месяца назад +1

    I would like suggest you to make a Google assistant device using raspberry pi zero. There are no detailed tutorial videos on this topic.

  • @60pluscrazy
    @60pluscrazy 5 месяцев назад +2

    Good one, but instead of onboard coding, we should use a free cloud hosted endpoint. It will be better allowing us our own custom knowledge and more 🎉🎉🎉

    • @techiesms
      @techiesms  5 месяцев назад +1

      I didn't get your point. Can you elaborate

    • @artisticyeti22
      @artisticyeti22 5 месяцев назад

      The esp code cannot be off loaded 😂. The software that can be off loaded has already been off loaded and mentioned as so in the video

  • @DreamScapeRobloxStudio
    @DreamScapeRobloxStudio 4 месяца назад +1

    Just wanted to know how did you guys solder the small components with the pcb ?

  • @sriharin8641
    @sriharin8641 3 месяца назад +2

    can you make a simple circuit just like gpt in esp? please!

  • @yashrajshah7766
    @yashrajshah7766 4 месяца назад

    You can bulid a live language translator with this, which will be super useful.

  • @ShahinazDreams
    @ShahinazDreams 14 дней назад

    Hi bro . Could you make a updated version by adding a bigger speaker and with camera

  • @ha13151
    @ha13151 5 месяцев назад

    Perfect!

  • @gjreditor9835
    @gjreditor9835 5 месяцев назад +3

    You can use Deepgram itself for text to speech too. You won't have the hassle of chunking the response text. Just like 750 hours of voice to text free, Deepgram text to voices is free up to 250 hours.

    • @kaloprojects
      @kaloprojects 5 месяцев назад +1

      you are right, Deepgram offers TTS .. but as I know they do NOT offer any languages beyond English (as today). The Open AI TTS voices are multilingual, each of the 6 voices can speak multiple languages (even changing language in one sentence). And i could not find any ESP32 library supporting their 'steaming concept' for TTS. If you know a library or function call, let us know please!

  • @bens4446
    @bens4446 5 месяцев назад +1

    Wow, bravo. I did something similar (in python) on a 2gb RAM le Potato (minus the convenient button features, BMS, etc.). Didn't think this would be possible on a microcontroller with 520kb RAM. It does seem a bit slow at times in your demonstration, at least compared to my 2gb version. Could this be sped up with streaming? Or do you already include streaming in your .ino code? Very nice project!

    • @kaloprojects
      @kaloprojects 5 месяцев назад +2

      I am also working on the idea to extend my recording library with Streaming (streaming I2S bytes realtime from microphone to Deepgram 'during' recording).. but this is crazy complex at least for me (with ESP32 & C, maybe easier in Python, which i won't use on ESP due several reasons), not sure i will ever get int work :D . We will see.

    • @devbiceps1628
      @devbiceps1628 4 месяца назад +1

      ​@@kaloprojects would love to check that out

  • @SallyCrest
    @SallyCrest 2 месяца назад

    Amazing project! Do you think that maybe you'll be offering the gerber files for this project in the future so some of us can order custom PCBs from PCBGOGO in the future?

  • @mukulsunda
    @mukulsunda 5 месяцев назад +1

    Can we add a ov2640 module and send a frame to gemini api when a particular word like “what am i seeing” and get response?

  • @glennimmanuel9338
    @glennimmanuel9338 28 дней назад

    what audio library did you used for this project?

  • @poppinsred
    @poppinsred 28 дней назад

    can we use a bluetooth speaker so that we can use mic and speaker + reduse the size of circuite
    esp have inbuilt bluetooth right?
    is it possible?

  • @ankurmajumdar7383
    @ankurmajumdar7383 20 дней назад

    what is the BUTTON1 in BMS and Power Supply schematic

  • @electriquefarmaci
    @electriquefarmaci 5 месяцев назад

    Maintenance mode is on
    Site will be available soon. Thank you for your patience!

  • @mabrurareefinroll4625
    @mabrurareefinroll4625 3 месяца назад

    brother, your schematic doesnt show all the components you used. like resistors, battery monitoring circuit. can you please please tell me every component you used and how many

  • @its_yuno.
    @its_yuno. 5 месяцев назад +2

    use sim for internet connection please . Then we can use it without wifi and mobile

  • @apexdc07
    @apexdc07 4 месяца назад

    @techiesms I have an esp32 lyraT 4.3 board on which I have installed alexa using esp32-adf sdk, just curious if I can use this code on same with all required components already present on the board.

  • @k2r.253
    @k2r.253 5 месяцев назад

    Thank you sir ❤

  • @basicelectronics6324
    @basicelectronics6324 2 месяца назад

    Please make a video on streaming the song from Spotify using esp32 and audio decoder module, it's very useful for us.

  • @KiteTurbine
    @KiteTurbine 5 месяцев назад +1

    Brilliantly clear tutorial. The soldering seems to happen by magic

  • @NOOBKRISH
    @NOOBKRISH 5 месяцев назад

    Can we add multiple wifi said and pass so it will connect to any avilable

  • @TheRajeev7778
    @TheRajeev7778 5 месяцев назад +1

    Hello,Any plans to provide this as building kit?

    • @techiesms
      @techiesms  5 месяцев назад +1

      Building Kit Means?

    • @TheRajeev7778
      @TheRajeev7778 5 месяцев назад

      @@techiesms components and pcb

  • @ShayanV-w2c
    @ShayanV-w2c 5 месяцев назад

    bro you are really intelligent

  • @ajuxx
    @ajuxx 5 месяцев назад +1

    Hlo bro can you share the circuit diagram for making it on a zero pcb board without the smd components
    I seen the Schematic diagram i dont understand that make it simple plz

    • @ajuxx
      @ajuxx 5 месяцев назад

      @techiesms reply me

  • @shashivadhangunti3411
    @shashivadhangunti3411 5 месяцев назад +1

    yha hai i am a B-Tech final year student i thought of making a smart device with Raspberry pi 4 where the os look like Ai, And if we ask any thing it will show in display and respond like alexa indirectly an alexa with a display any idea for it

  • @tsshadowff
    @tsshadowff 5 месяцев назад

    Please make a video of making it with easy ways! ❤

  • @rummankarin6332
    @rummankarin6332 3 месяца назад

    bro can you please make a voice module price frendly in veroboard with deepgram and esp32 which is full pack of (alaxa or chtgpt)

  • @fusseldieb
    @fusseldieb 5 месяцев назад

    I literally had the same idea one week ago. Even built a prototype hahaha
    Best of all: With the same components, even.

    • @sarathkumar785
      @sarathkumar785 5 месяцев назад

      teach me bro

    • @techiesms
      @techiesms  5 месяцев назад

      LOL
      No issues, Now you can easily make it with the help of this video

    • @SarathkumarP-mu3ob
      @SarathkumarP-mu3ob 5 месяцев назад

      @@techiesms bro did u have any connection diagram

  • @shardulambikecs1192
    @shardulambikecs1192 4 месяца назад

    Can this Gemini AI control things over esp32 like house automation , or servos , or leds in any project ?

    • @techiesms
      @techiesms  4 месяца назад +2

      Not exactly
      But to do that you can use the speech to text response and compare it to control the appliances

  • @pravinabaldha7240
    @pravinabaldha7240 5 месяцев назад +1

    I have interesting idea and I am currently working on that I want to make tutorial of that like how to mke fully independent ai pocket device compact in size and has voice assistants and object recognization and has display on it and showing captions on that portable and use with sim card maybe thats great idea for your channel can you please make that ❤

  • @sinmim1
    @sinmim1 5 месяцев назад

    Nice project 🎉

    • @techiesms
      @techiesms  5 месяцев назад

      Glad you like it!

  • @baotrangia3417
    @baotrangia3417 5 месяцев назад

    Fantastic Assistance Portable (FAP) !!!

  • @sakethpuppala1016
    @sakethpuppala1016 Месяц назад

    AP+STA mode would be better where AP mode should take wifi credentials and API inputs

  • @gopash
    @gopash 5 месяцев назад

    It can be turned into the business of bilingual interpretation .
    But can it be operate offline?

  • @Shorts01-f3h
    @Shorts01-f3h 5 месяцев назад +1

    Kaise kharede ya wala circuit board

  • @sudharsana5439
    @sudharsana5439 4 месяца назад

    How is this different from Google assistant?

  • @UniqueIdeas151
    @UniqueIdeas151 3 месяца назад

    Bro can i use esp32 with micro USB port with out using USB to ttl converter system

  • @MikaKaito
    @MikaKaito 2 месяца назад

    Hello sir, Can I get this product along with display extension to check text, if you make so, I will buy this product sir, please add this on?

  • @JIP.Creations
    @JIP.Creations 10 дней назад

    Where to Put The Google TTS Api Key?

  • @iOT_India
    @iOT_India 3 месяца назад +1

    please integrate more AI MODELS like LLAMA, etc.

  • @hippopothomas1980
    @hippopothomas1980 5 месяцев назад

    Thanks!!!

  • @Rgkroger
    @Rgkroger Месяц назад

    lindo seu projeto amigo
    seria possivel acopla esse seu projeto a rede de comunicação na rede can do veiculo para comandar algumas funções

  • @MikeNugget
    @MikeNugget 5 месяцев назад

    Awesome

  • @bennguyen1313
    @bennguyen1313 Месяц назад

    So the ESP32 sends the wave file to DeepGram, then it sends the text result to the (Gemini) LLM, where that output is sent to Google's Text-To-Speech?
    Since DeepGram and the TTS are not free.. is it possible to just have a paid OpenAI account, and send the wave to the it for end-to-end processing?

    • @bennguyen1313
      @bennguyen1313 13 дней назад

      BTW, any thoughts on "Home / VOICE RECOGNITION MODULE- VC02 -AI THINKER"?

  • @amitdas3484
    @amitdas3484 Месяц назад +1

    that is a great project. But can we do away with the button pressing part ?
    Can the listening trigger be voice activated with a specific word or phrase ?

    • @heelercs
      @heelercs Месяц назад

      Go ahead and do that, nobody’s stopping you

  • @usmanumer9871
    @usmanumer9871 5 месяцев назад +1

    Maza a gaya is project say . Alampana Tusi great ho tofa Kabul Karo .🎉🎉

    • @techiesms
      @techiesms  5 месяцев назад +1

      Bs kr pagle, Rulayega kya....

  • @2butgaming991
    @2butgaming991 2 месяца назад +1

    I want to buy this completely assembled hardware but on ur website its showing it will be back on 30th November can u help me out to get it sooner 🙏

    • @techiesms
      @techiesms  2 месяца назад +1

      Sir it’s not in stock
      We ordered it and waiting for the delivery of components

    • @cerlpearl
      @cerlpearl 2 месяца назад

      @@techiesms how many days after 30th will we get the product ( living in Mumbai )

    • @techiesms
      @techiesms  2 месяца назад

      @cerlpearl kindly share the pincode on our WhatsApp
      8200079034

  • @darshandarshu1906
    @darshandarshu1906 4 месяца назад

    Which software used for 3d vision

    • @techiesms
      @techiesms  4 месяца назад

      It was 3-D view in easy EDA

  • @harshsaini5979
    @harshsaini5979 4 месяца назад

    Hi @techiesms can you name all the parts or components used in this project and also all things simply and easy to understand because i am just a beginner and don't know about it anything but I want to make this

    • @techiesms
      @techiesms  4 месяца назад

      You can refer this schematic to check out all the parts

  • @brijeshupadhyay8942
    @brijeshupadhyay8942 Месяц назад

    Please made a video on simple circuit like on chat gpt

  • @shock2k3
    @shock2k3 5 месяцев назад

    make an 3d printed case for better convinence

  • @kavinesar4559
    @kavinesar4559 5 месяцев назад

    bro is this capable of real time data like time date and weather

  • @takayasushuto4176
    @takayasushuto4176 5 месяцев назад

    最高です!
    Tシャツ欲しい。

  • @chandrakalashinde8960
    @chandrakalashinde8960 4 месяца назад

    We can make device to anytime listening

  • @harshsaini5979
    @harshsaini5979 4 месяца назад

    Kitne paise lg jayenge ise bnane me including all parts and pcb and assembling them

    • @techiesms
      @techiesms  4 месяца назад

      We are selling it already through our website

  • @unkwon.h
    @unkwon.h 3 месяца назад

    I would like to integrate it with a sim card so that it doesn't require to connect to any hotspot

  • @Sri_Harsha_Electronics_Guthik
    @Sri_Harsha_Electronics_Guthik 5 месяцев назад

    try edgetts?

  • @m.shayanshamim1202
    @m.shayanshamim1202 4 месяца назад

    Plz Send the circuit diagram in easy manner for without PCB makers

  • @psycho_Aresyt
    @psycho_Aresyt День назад

    Bro i am completed your masters of iot course but how to claim certificate ???

    • @techiesms
      @techiesms  День назад

      Share the assignments on techiesms@gmail.com

  • @JohnKourdomenos
    @JohnKourdomenos 5 месяцев назад

    Can the code be entered into a Panel touch screen ESP32-4848S040 so that we can see the answer on the screen ?

  • @harshsaini5979
    @harshsaini5979 4 месяца назад

    Can I buy the complete built hardware part without buying pcb from pcbgogo

    • @techiesms
      @techiesms  4 месяца назад

      You can buy complete hardware assembled on PCB from our website

  • @_s2700
    @_s2700 5 месяцев назад

    Can we run this locally......?

    • @techiesms
      @techiesms  5 месяцев назад

      no. its over internet

  • @clips4399
    @clips4399 4 месяца назад

    what if we add a display to it, please make it

  • @elisalant
    @elisalant 5 месяцев назад

    Do you ship to Australia

    • @techiesms
      @techiesms  5 месяцев назад

      Yes.
      Kindly WhatsApp us on +91 82000 79034

  • @ArtBuddyIndian
    @ArtBuddyIndian 5 месяцев назад

    what will be the application of this project

  • @ManoranjanMithas
    @ManoranjanMithas 5 месяцев назад

    Sir di chij bata do google tts free kaise mile Or sir second chij sir ye esp8266 se kar sakte h

  • @ridwanmulyana99
    @ridwanmulyana99 5 месяцев назад

    can you make faster reply from the device ?

  • @vimal_jayavel3007
    @vimal_jayavel3007 5 месяцев назад

    Where did get pcb bro ?

    • @techiesms
      @techiesms  5 месяцев назад +1

      We are not selling PCB, rather we are selling the complete project

    • @vimal_jayavel3007
      @vimal_jayavel3007 5 месяцев назад

      @@techiesms assemble project or compounds of project?

  • @ShubhamSen-dn1ub
    @ShubhamSen-dn1ub 4 месяца назад

    Add NXP local voice recognition ic to this

  • @VettivelSivakumari
    @VettivelSivakumari 5 месяцев назад

    Does the hardware cimw in srilanka

    • @techiesms
      @techiesms  5 месяцев назад +1

      Yes
      Kindly WhatsApp on +918200079034

  • @anbur1790
    @anbur1790 4 месяца назад

    Bro your chatgpt voice assestant v2 speech to text library file is error occur (compile time)give me any solution

  • @NOOBKRISH
    @NOOBKRISH 4 месяца назад

    can u pease make a 3d case fr this

  • @jesus__voice_for_all8865
    @jesus__voice_for_all8865 3 месяца назад

    Sir can I know the actual cost of the product

  • @armisis
    @armisis 5 месяцев назад

    It's rabbit r1 with the latest patches.

  • @syedmutahir1917
    @syedmutahir1917 3 месяца назад

    good vido

  • @inhlapngo1601
    @inhlapngo1601 4 месяца назад

    Could you please share your PCB gerber file?

  • @rockfireist
    @rockfireist 5 месяцев назад

    Helo bro where is schematic diagram,, can u provide link plz...

    • @techiesms
      @techiesms  5 месяцев назад

      Its in the GitHub repo.
      the link is in the description

    • @rockfireist
      @rockfireist 5 месяцев назад

      @@techiesms thanks bro....

  • @hassansiddiqui5282
    @hassansiddiqui5282 4 месяца назад

    can u provide the gerber file for the pcb?

    • @techiesms
      @techiesms  4 месяца назад

      No sorry
      But you can find the schematic in the video

  • @meetpatel5142
    @meetpatel5142 5 месяцев назад +1

    ❤❤🤩🤩🤩🤩

  • @socalgaming8269
    @socalgaming8269 12 часов назад

    Hlo sir you will make one for me please

  • @sakaidosan
    @sakaidosan 3 месяца назад

    ERROR in Record_Start() - I2S not initialized, call 'I2S_Record_Init()' missed, I'm getting this error when pressing record button, please help

  • @hmalamgir3489
    @hmalamgir3489 4 месяца назад

    Price

  • @vansh1521
    @vansh1521 16 дней назад

    Sir program shows the error can you please help us

    • @techiesms
      @techiesms  15 дней назад

      Share what error you are facing

    • @vansh1521
      @vansh1521 8 дней назад

      I checked all connections I checked i2c microphone but it doesn't recognise speech to text it doesn't convert speech in to the text

  • @unkwon.h
    @unkwon.h 3 месяца назад

    Can i get the files for pcb pls?

  • @eyeofsanatan
    @eyeofsanatan 5 месяцев назад

    sir i am buying one iot project from your website but i have only 1500 rs, and it is of 2922, its multi pursose gps tracker kit, please tell me u can help me or not because i requre it for my project please sir 😢

  • @Patapom3
    @Patapom3 4 месяца назад

    That's great but you're entirely dependent on 1) an internet connection and 2) 3rd party services...

    • @techiesms
      @techiesms  4 месяца назад

      It’s a DIY project so I guess it’s ok…

  • @CuriosityChronicles-we5cu
    @CuriosityChronicles-we5cu 4 месяца назад

    I am really intrestest in build various kind of project but due to the cost and desire component availability i cant build one , i have a request if u could develop a affordabe micricontroller board under 200 rupess having type c interface for programming and power wifi connectivity, as well as consisting of all type of communication interface if possoble like i2c,i2s,uart,spi and also if possible can bus having small foothprint with castaleted pins and also inbuilt charging ic if possible but i know it difficult to build this within such a low cost , but even if u could add a type c port to the node mcu esp8266 and launch it on your website i would really like to purchase it even my friend ones one for our project i really want a affordable mcu under 200 to tinker and learn on it plz share this comment as much as possible