Layout Parser Main Presentation

Поделиться
HTML-код
  • Опубликовано: 12 сен 2024

Комментарии • 17

  • @phillipjmurphy
    @phillipjmurphy 2 года назад +3

    AMAZING ! I've been looking for something similar to this for a while. Thank you ! I can't wait to dive into it and leverage in my processing.

  • @theq18
    @theq18 3 года назад +2

    Excellent toolkit
    Great job

  • @MrLyonliang
    @MrLyonliang Месяц назад

    Great job! how about latest progress?

  • @connorryan8376
    @connorryan8376 Год назад

    This is amazing! Any suggestion to only grabbing questions off an image of an test using this?

  • @mouadtouzani7120
    @mouadtouzani7120 2 месяца назад

    Is the code for fine tuning the same as retraining after annotation ?

  • @parthrangarajan3241
    @parthrangarajan3241 2 года назад

    Hello, great work on this toolkit. I really love it.
    I have a question.
    Which model would be the best according to you to extract titles and subtitles?

    • @shannonshen258
      @shannonshen258  2 года назад

      Thank you! I'd say the PubLayNet models might be helpful for your task github.com/Layout-Parser/platform/issues/5

  • @ayushchoubey8021
    @ayushchoubey8021 2 года назад +1

    Can we use this on binary images ?

  • @aymenmtibaa4582
    @aymenmtibaa4582 2 года назад +1

    Hi
    Is posible to detect the police and the style of the document

  • @victortarnovskiy8407
    @victortarnovskiy8407 2 года назад

    Hi, great work indeed!
    Can it extract buttons from the document, e.g. buttons in emails?

    • @shannonshen258
      @shannonshen258  2 года назад

      Thanks! I would say yes -- though you might want to customize the layout detection models based on your samples.

  • @misbahfahamsyah7023
    @misbahfahamsyah7023 2 года назад +1

    Hey, thank you for sharing this.
    I have a problem about AttributeError: module layoutparser has no attribute Detectron2LayoutModel
    Anyone solved this?

    • @shannonshen258
      @shannonshen258  2 года назад +1

      Thanks! You might want to take a look at the installation instruction for the detectron2 backend: layout-parser.readthedocs.io/en/latest/notes/installation.html#additional-instruction-install-detectron2-layout-model-backend

  • @KB-pl9vl
    @KB-pl9vl 2 года назад

    Does it support OCR for Ukrainian language?

  • @zaheerbeg4810
    @zaheerbeg4810 2 года назад

    Can we extract paragraph as it is in image file?

    • @shannonshen258
      @shannonshen258  2 года назад

      Yes -- and perhaps the PubLayNet model can help with your task github.com/Layout-Parser/platform/issues/5

  • @user-tf2tm9ry7i
    @user-tf2tm9ry7i 2 года назад

    Thanks! I have installed the package and use the demo code but it always returns me Invalid argument: 'C:\\Users\\xxxx/.torch/iopath_cache\\s/f3b12qc4hc0yh4m\\config.yml?dl=1.lock'. Do you have any idea why that happened?