🔴Using OmniParser in Less Than 100 Lines of Code: Microsoft's First Step Towards Computer Automation

Поделиться
HTML-код
  • Опубликовано: 28 ноя 2024

Комментарии • 3

  • @RajSingh-of1fs
    @RajSingh-of1fs 24 дня назад +2

    can make vedio on using this omniparsrer for computer control use using open source llm like claude did.

  • @jim02377
    @jim02377 25 дней назад

    Do you know if it will also work if the text on the screen was actually hand written? An example would be an image of text written on a whiteboard?

    • @AryanKargwal
      @AryanKargwal  25 дней назад

      Hey, yes it should work, but the primary use of the model is for parsing screen information, I am sure if you are looking for just handwritten text recognition, a lightweight CNN should be fine like Efficient Net B0.