Размер видео: 1280 X 720853 X 480640 X 360
Показать панель управления
Автовоспроизведение
Автоповтор
can make vedio on using this omniparsrer for computer control use using open source llm like claude did.
Do you know if it will also work if the text on the screen was actually hand written? An example would be an image of text written on a whiteboard?
Hey, yes it should work, but the primary use of the model is for parsing screen information, I am sure if you are looking for just handwritten text recognition, a lightweight CNN should be fine like Efficient Net B0.
can make vedio on using this omniparsrer for computer control use using open source llm like claude did.
Do you know if it will also work if the text on the screen was actually hand written? An example would be an image of text written on a whiteboard?
Hey, yes it should work, but the primary use of the model is for parsing screen information, I am sure if you are looking for just handwritten text recognition, a lightweight CNN should be fine like Efficient Net B0.