Gemini 2 Multimodal and Spatial Awareness in Python

Поделиться
HTML-код
  • Опубликовано: 29 дек 2024

Комментарии •

  • @micbab-vg2mu
    @micbab-vg2mu 12 дней назад

    I need to rethink my AI workflows-this model offers many new opportunities.

    • @jamesbriggs
      @jamesbriggs  12 дней назад

      yes, looking forward to testing gemini more

    • @ultrasound1459
      @ultrasound1459 12 дней назад

      keep rethinking every month then with new models coming out 💀

  • @johnny017
    @johnny017 12 дней назад

    I don't find this model really impressive for object detecion. Florence2 can already do a similar job and it is under 1B param model. For real world case, I would not trust prompt engineering to get my results. Rather I would prefer to fine tune the model. It is also a nightmare when google does some tweaking on the model as you were experiencing. I'm also experimenting grounding with Qwen 2VL as they have , , , tokens specifically for object detection.
    Thanks James for the update 🙏

    • @jamesbriggs
      @jamesbriggs  12 дней назад

      thanks for the info - I'll try florence2 and qwen 2vl

  • @madkimchi5444
    @madkimchi5444 12 дней назад +1

    Thanks for the Demo. An interesting model for sure, but anything non open source is not suitable for enterprise use. Not now, not ever, especially since even models that are even tagged as "Appropriate for Enterprise" go through a lot of changes and have their instructions changed while being live. It's an absolute nightmare to work with.

    • @absta1995
      @absta1995 12 дней назад +1

      A lot if not most apps used by enterprise are not open source

    • @jamesbriggs
      @jamesbriggs  12 дней назад +1

      imo in the future we'll be using more open source LLMs for the reasons @madkimchi5444 said, but rn open source LLMs can't do what we can do with OpenAI and other providers, so although it's annoying with model changes I think the only option for a lot of use-cases (not all) is to go with proprietary models locked behind APIs