Альберт Иванов
  • Videos: 161
  • Views: 140,410
Infini-Megrez-Omni (analysis of image, text, and audio modalities) test on RTX 3050 (CUDA)
github.com/infinigence/Infini-Megrez-Omni
- the model can also take audio as a prompt
- can describe an audio fragment (English)
- supports chatting about an image
- OCR works well (it can extract the numbers from a price)
- handwritten OCR works worse
- inference time on GPU (image description): 32 sec
Views: 11

Videos

voice cloning (rus-to-rus) with fish speech 1.5 on raspberry pi5
57 views, 14 days ago
speech.fish.audio/#windows-setup
conclusions:
- generating a couple of sentences takes up to 5 minutes
- the cloning result is decent but not always successful
- it sometimes handles word stress and intonation poorly
- there is no onnx implementation yet
show network stat with netplot on raspberry pi5
14 views, 14 days ago
Find out which programs send data to the network on a Raspberry Pi. github.com/fedeb95/netplot
*The package has a broken install, so to get it working just copy the files into the main dir and change the imports in netplot.py; see the fork: github.com/zoldaten/netplot
yolov8n.onnx vs yolov8.rknn on luckfox 3566 speed test on 1 image
28 views, 21 days ago
github.com/airockchip/rknn_model_zoo
yolov5m.rknn test on luckfox 3566
37 views, 1 month ago
The luckfox 3566 has an NPU with 0.8 TOPS; to use it, models need to be converted: .pt -> .onnx -> .rknn. Models from github.com/Qengineering/YoloV5-NPU
./YoloV5_NPU rk3566/yolov5s-240-240_rk3566.rknn small-vehicles.jpg
yolov5m.rknn: 5 FPS, very good. Keep in mind that yolov5s.pt gives only 0.3 FPS.
sana 1024 × 1024 (local image generation) test on 3090 rtx ubuntu
20 views, 1 month ago
github.com/NVlabs/Sana prompt = 'a stunning portrait of a woman with flawless skin, her face adorned with intricate floral patterns painted in sort pastel hues,\ gazing thoughtfully while standing in a vast lavender field swaying gently in the breeze;ultra-realistic details,natural lighting?and a dreamy,\ blurred background'
conclusions:
- fast (4 sec), noticeably faster than flux schnell
- qual...
yolov8 cpp ncnn with vulkan inference on raspberry pi4b - very slow
41 views, 1 month ago
1. Convert yolov8n.onnx to ncnn format: pnnx yolov8n.onnx inputshape=[1,3,640,640]. This gives yolov8n.bin and yolov8n.param, ready for inference.
2. Start C++ inference with ncnn: ./yolov8 image.jpg
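The two steps above can be scripted. The following is only a sketch: it assembles the same two command lines quoted in the description and prints them. The helper names `build_pnnx_cmd` and `build_infer_cmd` are hypothetical, and the actual execution is left commented out because it assumes `pnnx` and the compiled `./yolov8` binary are available on the Pi.

```python
import shlex


def build_pnnx_cmd(onnx_path, input_shape):
    """Build the pnnx conversion command from step 1 (hypothetical helper)."""
    shape = ",".join(str(dim) for dim in input_shape)
    return ["pnnx", onnx_path, f"inputshape=[{shape}]"]


def build_infer_cmd(binary, image_path):
    """Build the ncnn C++ inference command from step 2 (hypothetical helper)."""
    return [binary, image_path]


steps = [
    build_pnnx_cmd("yolov8n.onnx", (1, 3, 640, 640)),
    build_infer_cmd("./yolov8", "image.jpg"),
]

for argv in steps:
    # shlex.join quotes arguments so the line can be pasted into a shell.
    print(shlex.join(argv))
    # To actually run a step on the device (assumes the tools are installed):
    # import subprocess; subprocess.run(argv, check=True)
```

Passing the arguments as a list rather than one string avoids the shell interpreting the square brackets in `inputshape=[...]`.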
vulkan on raspberry pi 4b bullseye aarch64 - samples tests
49 views, 1 month ago
How to install Vulkan on bullseye aarch64: qengineering.eu/install-vulkan-on-raspberry-pi.html. The guide at that link misses some libs, so additional libs have to be installed. The Sascha Willems samples won't work; they may need the SDK to be installed. Use these samples instead: github.com/KhronosGroup/Vulkan-Samples/blob/main/docs/build.adoc#linux
yolov11.mnn vs yolov11.onnx speed comparison on raspberry pi4b
93 views, 1 month ago
export to mnn, onnx: docs.ultralytics.com/integrations/mnn/#how-do-i-export-ultralytics-yolo11-models-to-mnn-format
mnn install on raspberry: qengineering.eu/install-mnn-on-raspberry-pi-4.html (*now it is also possible with pip install mnn)
OpenCL does not build on the Pi 4: github.com/doe300/VC4CL (Pi 3 only: qengineering.eu/install-opencl-on-raspberry-pi-3.html)
RMBG 2.0 - background remove model - test on raspberry pi 4b
13 views, 1 month ago
huggingface.co/briaai/RMBG-2.0
ruadapt_qwen2.5_3B test on raspberry pi 4b 8gb raspbian bullseye
103 views, 1 month ago
./build/bin/llama-cli -m models/Q4_K_M.gguf -co -cnv -p "You are Qwen, created by Alibaba Cloud. You are a helpful assistant." -fa -ngl 80 -n 512 -t 0.1
conclusions:
- in most cases the model gives correct, detailed answers.
- it sometimes makes gross factual errors, for example that the USSR fought on Germany's side at the start of World War II and then against it.
- generation speed...
ruadapt_qwen2.5_3B test on luckfox 3566 ubuntu 24.04
102 views, 1 month ago
- luckfox 3566, 4GB RAM, 32GB eMMC: wiki.luckfox.com/Core3566/
- ubuntu 20.04 was updated to 24.04
- llama.cpp built: github.com/ggerganov/llama.cpp/blob/8f275a7c4593aa34147595a90282cf950a853690/docs/build.md#l4
- quantized rus-rus model: huggingface.co/RefalMachine/ruadapt_qwen2.5_3B_ext_u48_instruct_v4_gguf/tree/main
./build/bin/llama-cli -m models/Q4_K_M.gguf -co -cnv -p "You are Qwen, crea...
Mini-Omni2 test on cpu i5-10400 (no cuda)
106 views, 2 months ago
speech (and/or image)-to-text model.
- questions can also be asked in Russian; the answer is always in English.
- the real-time speech output stutters; the developers write that this is because the model is in float32.
- the finished wav audio file has no stutter.
github.com/gpt-omni/mini-omni2
yolo11n.pt vs yolo11n.onnx (imgsz=dynamic) vs yolo11n.onnx (imgsz=256) on raspberry pi 4b
35 views, 2 months ago
1400 ms - yolo11n.pt 640x640
726 ms - yolo11n.onnx imgsz=dynamic int8
333 ms - yolo11n.onnx imgsz=256 int8
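To make the three timings easier to compare, they can be converted to frames per second and to a speedup relative to the .pt baseline. This is a small arithmetic sketch using only the numbers quoted above; the dictionary keys and the `summarize` helper are illustrative names, not part of any tool used in the video.

```python
# Per-image latencies in milliseconds, as reported in the description.
latencies_ms = {
    "yolo11n.pt 640x640": 1400,
    "yolo11n.onnx imgsz=dynamic int8": 726,
    "yolo11n.onnx imgsz=256 int8": 333,
}


def summarize(latencies, baseline):
    """Return (name, ms, fps, speedup vs. baseline) for every entry."""
    base_ms = latencies[baseline]
    return [
        (name, ms, 1000.0 / ms, base_ms / ms)
        for name, ms in latencies.items()
    ]


rows = summarize(latencies_ms, "yolo11n.pt 640x640")
for name, ms, fps, speedup in rows:
    print(f"{name:34s} {ms:5d} ms  {fps:5.2f} FPS  {speedup:4.2f}x")
```

Note that the three runs differ in both input size and precision, so the speedup reflects the combined effect rather than the ONNX export alone.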
stable diffusion mojo vs stable diffusion onnx inference tests
63 views, 2 months ago
together.ai api test (meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo - image-to-text multi-modal)
55 views, 2 months ago
mojo(fast python) on raspberry pi - install and some tests
59 views, 2 months ago
how to convert craft-text-detector .pt model to .onnx model
25 views, 2 months ago
craft-text-detector onnx inference on raspberry pi 4b
45 views, 2 months ago
whisper vs whisper-cpp vs whisper-jax vs vosk on raspberry pi 4b
85 views, 3 months ago
Tajik audio to text with vosk on raspberry pi4b
68 views, 3 months ago
HLK-LD2450 test on raspberry pi 4 uart
150 views, 3 months ago
yolov9-t-onnx - test on raspberry pi 4b
67 views, 3 months ago
llava-v1.5-7b-4096-preview from groq - test on raspberry pi 4b
168 views, 3 months ago
saiga_llama3_8b_gguf - test models q2 and q4 on raspberry pi 4b
43 views, 3 months ago
mini-omni (Real-time speech-to-speech) test on raspberry pi 4b 8Gb RAM - very slow
218 views, 3 months ago
trocr-base-onnx rus test on cpu
97 views, 4 months ago
prometheus+grafana - monitoring windows and linux PCs from raspberry pi 4b
94 views, 4 months ago
FLUX:SOTA (schnell) test on RTX 3060 12Gb
364 views, 4 months ago
whisper_cpp (speech-to-text) test on raspberry 5b
55 views, 4 months ago

Comments

  • @Diabolous 12 hours ago

    from version 3.1.0 it apparently no longer works

  • @dl-hydrographics 1 day ago

    Does this also work with 3.1?

  • @Arnoud-nf6iz 2 days ago

    this removes some bugs and ghosting

  • @miladkaydi9627 3 days ago

    Thanks a lot.

  • @thanekrios3211 7 days ago

    Not all heroes wear capes. Thank you, sir. Still working on 3.0.1.

    • @Arnoud-nf6iz 2 days ago

      Don't say it works on 3; it's not smart to put all that info here.

    • @thanekrios3211 2 days ago

      @Arnoud-nf6iz It's free, open source software. Stop worrying so much.

  • @Igrom_Gamedev 10 days ago

    Cool. Is there a portable version, so I don't have to bother with the installation?

  • @zavierhofstetter3809 14 days ago

    What kind of model was this before it was in the ONNX format? I am trying to convert the file into a CoreML model with the ONNX 1.3 package, and it said it was unable to detect what type of model it was originally (PyTorch, TensorFlow, etc.), so it's throwing an error. Also, sorry if the video already answered this question; I only speak English.

    • @zavierhofstetter3809 14 days ago

      Never mind, I see the description says it is a .pt model.

  • @JadeRomero-c2p 18 days ago

    It works, but only in the preview; when started, it does not create the image.

  • @MajorBuzzKill 19 days ago

    3.0.1 works, do as he says in the comment, thanks!

  • @CRIMELAB357 19 days ago

    Thank u awesome-uploader-dude

  • @varg1814 24 days ago

    How can u do it if you have facefusion on pinokio? 3.0.0.

    • @stevenquijano2982 1 day ago

      When you run facefusion in pinokio, you get Stop and View File; click View File and follow the steps in the video.

  • @igrikum 24 days ago

    3.0.1 works, 04.12.24

  • @dpartvision2063 25 days ago

    I have version 3.0.1. I tried to open the analyser file, but I can't open it. I'm using Python 3.9.6. Why?

    • @chriss3154 7 days ago

      For some reason the IDE is acting up. Just open it in another text editor and make the change there, or copy the file to the desktop, make the change there with the IDE, and copy it back 👍

    • @Arnoud-nf6iz 2 days ago

      Just open pinokio and click Files; you'll find it there. There is a built-in editor.

  • @dopexmc7901 29 days ago

    which python did you install to open the data?

    • @dopexmc7901 10 days ago

      boa constrictor in lil slits

  • @adrianau4246 1 month ago

    I cannot find this place to change it ..

    • @Arnoud-nf6iz 2 days ago

      pinokio < files < find it there, change the values

  • @alexanderjsjd 1 month ago

    Hey, can you help me please? I can't find a driver for this LCD display 😢

  • @Xio-kr9nt 1 month ago

    3.0.1 👍

  • @ayrisgames 1 month ago

    Can you write an explanation for Mac users? How can we change it?

  • @HunterX-666 1 month ago

    Thanks boss❤

  • @boudaoudpipo9597 1 month ago

    Nice, bro. Do you recommend that I use Mojo on my Raspberry Pi 5?

  • @thealexdfheianryu6069 1 month ago

    What about in 3.0.1?

  • @wayneout 1 month ago

    I used pinokio, I don't know where to find the content.py file. With pinokio open, I can "see files" but I can't change it. How would I change it? Thanks

    • @dopexmc7901 29 days ago

      where you saved pinokio: pinokio>api>facefusion pinokio>facefusion>facefusion>"content_analyser.py

    • @Arnoud-nf6iz 2 days ago

      Open pinokio and open Files on the left side. It has a built-in editor. Change the value and save; you don't need a special program. Stop the app and restart Windows.

  • @Anirudhagedam27 1 month ago

    Worked for me

  • @martinkollar1488 1 month ago

    3.0.0 works, thank you

  • @ulauoft 1 month ago

    03.11.2024 works with 3.01, thank you for that great tip!!!

    • @LuizMoraes-fn5ir 1 month ago

      It doesn't work on my 3.0.1 😭

    • @ulauoft 1 month ago

      @LuizMoraes-fn5ir what problem did you have?

    • @EdwinPouwel 1 month ago

      @ulauoft I also use version 3.0.1, but I still get the blurred picture. What did you do to remove it? I changed the value from 0.8 to 1.0, but with no result.

    • @phrankme 1 month ago

      Where is the file in 3.0.1?

    • @Arnoud-nf6iz 2 days ago

      @EdwinPouwel Open pinokio and click Files (on the left). There you'll find what you need to change, as shown in the YouTube video. Click save, close the app, and restart the computer.

  • @andr09-g7f 1 month ago

    3.0.1 still works 😂

    • @adrianau4246 1 month ago

      Where can I find the setting to change it? This video is confusing.

    • @andr09-g7f 1 month ago

      I used the video for the settings

  • @koipoi8111 1 month ago

    Hi as a mac user how does one even access this file to change the code?

  • @markuz-gk8re 1 month ago

    3.0.1 working

    • @EdwinPouwel 1 month ago

      What did you do to get it to work? I changed the value to 1.0 but the blurry picture stays. Need some help.

  • @user-ri1iv5fz6e 1 month ago

    👍

  • @FreeRadical3001 1 month ago

    10/31/2024 still works for 3.0.1 After I saved the change, I had to fully close program and then re-open. When I did that, "poof", the blur was gone :-D

    • @adrianau4246 1 month ago

      Where do I go to change this setting ?

    • @varg1814 24 days ago

      How can u do it if you have facefusion on pinokio?

  • @AzikKhakimovich 1 month ago

    I have the same problem. The charger is the original one, but the device has started rebooting often. What can be done? Please advise.

    • @АльбертИванов-ц4х 1 month ago

      @AzikKhakimovich remove the cover and glue a raspberry pi heatsink onto whatever is heating up.

  • @LeonardoGanzerli 2 months ago

    Do you have a tool to translate the text? I would like to translate the game into Italian (Google translation).

  • @Edprodd 2 months ago

    27-10-2024 Works perfectly. Probability limit = 1.00

  • @thatboyp444 2 months ago

    w, thank you

  • @lucianorodriguez7726 2 months ago

    Remember to stop and re run facefusion

  • @ddd7007 2 months ago

    hi sir, im wondering how do you remove the sfw filter in roop unleashed?

  • @spkoftdvl 2 months ago

    16/10/2024 still working on FaceFusion 3.0.0. thanks!

    • @fkri-n8v 2 months ago

      are you sure bro?

    • @Arnoud-nf6iz 2 days ago

      Don't say that; it's not smart.

  • @КонстантинОнчуков-т4я

    Hello, do you have any contact details? I have a project specifically on mapping. Could you help or advise?

  • @gabrielramos 2 months ago

    working in 3.0 thanks

  • @Mike-jj8nq 3 months ago

    does it work for the new 3.0 version

    • @Turbulent2000 2 months ago

      yes, works like a charm

    • @francoismarousez647 2 months ago

      Not for me …

    • @Turbulent2000 2 months ago

      @francoismarousez647 try completely deleting the whole pinokio folder and reinstalling it with the latest cuda. That worked for me. But in my opinion the 2.6 version of the app is better than the latest one. No need to change. I would roll it back, tbh.

    • @EdwinPouwel 1 month ago

      @Turbulent2000 can you let me know what you did? Or can you send me a copy of your content_analyzer file to see if this solves my problem?

    • @Turbulent2000 1 month ago

      I'll share a copy with you a bit later, when I'm home

  • @Pet-l8w 3 months ago

    its not working

  • @tokhenz 3 months ago

    working on 3.0.0 as of 25/9/2024

  • @EchoesWeaved 3 months ago

    Does anyone know how to bypass on 3.0?

  • @UltraMarine_1PSUT12 3 months ago

    3.0.0 is working, thanks

    • @physobornsicx 3 months ago

      how it could be on Mac??

    • @UltraMarine_1PSUT12 3 months ago

      @physobornsicx idk, I'm on Windows

    • @physobornsicx 3 months ago

      @UltraMarine_1PSUT12 I really envy you. Thanks for the reply, brother.

  • @bennyxx90 3 months ago

    Still works on 3.0.0

    • @ilanchico8375 3 months ago

      I changed it but now the faces don't swap... Do you know how to make it work? I'm on 3.0

    • @ShawnHaning 3 months ago

      @@ilanchico8375 check that you followed the instructions carefully and correctly. I found the file, edited in VB Studio to comment out the 0.8 line and add the 1.00 line and it's working for me in 3.0

    • @bujin5455 2 months ago

      @@ilanchico8375 it does work, but it's a bit flaky. A better approach is to go down to the actual valuation line and set it to always return false. Search for: return probability > PROBABILITY_LIMIT and change to: return False #probability > PROBABILITY_LIMIT

    • @spkoftdvl 2 months ago

      @ilanchico8375 You did something wrong; I changed it and it works fine!

    • @Arnoud-nf6iz 2 days ago

      not smart to say that stuff

  • @goron-jo3oj 3 months ago

    2.6.1 still working

  • @Edos-s1y 3 months ago

    Does it work?

  • @alimohsen7071 3 months ago

    Thanks for this video . Can i contact you on telegram please

  • @alimohsen7071 3 months ago

    Hello sir, thank you for this explanation. Can I contact you privately, please?

  • @jamesbaker5136 3 months ago

    Great video! Thank you! What is the inference time you are seeing for a single image?