Moondream 2: Tiny Visual Language Model For Document Understanding

Поделиться
HTML-код
  • Опубликовано: 13 дек 2024

Комментарии • 3

  • @Gigabyteserviceofficial
    @Gigabyteserviceofficial 4 дня назад

    can you able to make this for mp4 video or live stream instead of image?

  • @kashifrit
    @kashifrit 4 месяца назад

    its not a tiny model problem, its data format. My experience is when data is in a table LLM struggles whereas it excels when its all written in a text

    • @scholarly360
      @scholarly360  4 месяца назад

      You are right. MultiModal LLMs might be the answer in the future.