Customized Layout Detection for Scientific PDFs with LayoutParser and Label Studio

Cursor Is Beating VS Code (...by forking it)

IAP 2024 Visual Design in Scholarly Communication | Lec 5 Slides

Can Shayne Guess Our Fridges?

Making Meatloaf

DRAGON BALL GT CONFIRMED!!!! Super Saiyan 4 Gogeta Trailer SPARKING ZERO Reaction

Layout Parser Main Presentation

Shannon Shen

Просмотров 14 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 12 сен 2024

Комментарии • 17

@phillipjmurphy 2 года назад ⁺³
AMAZING ! I've been looking for something similar to this for a while. Thank you ! I can't wait to dive into it and leverage in my processing.
@theq18 3 года назад ⁺²
Excellent toolkit
Great job
@MrLyonliang Месяц назад
Great job! how about latest progress?
@connorryan8376 Год назад
This is amazing! Any suggestion to only grabbing questions off an image of an test using this?
@mouadtouzani7120 2 месяца назад
Is the code for fine tuning the same as retraining after annotation ?
@parthrangarajan3241 2 года назад
Hello, great work on this toolkit. I really love it.
I have a question.
Which model would be the best according to you to extract titles and subtitles?
@shannonshen258 2 года назад
Thank you! I'd say the PubLayNet models might be helpful for your task github.com/Layout-Parser/platform/issues/5
@ayushchoubey8021 2 года назад ⁺¹
Can we use this on binary images ?
@aymenmtibaa4582 2 года назад ⁺¹
Hi
Is posible to detect the police and the style of the document
@victortarnovskiy8407 2 года назад
Hi, great work indeed!
Can it extract buttons from the document, e.g. buttons in emails?
@shannonshen258 2 года назад
Thanks! I would say yes -- though you might want to customize the layout detection models based on your samples.
@misbahfahamsyah7023 2 года назад ⁺¹
Hey, thank you for sharing this.
I have a problem about AttributeError: module layoutparser has no attribute Detectron2LayoutModel
Anyone solved this?
@shannonshen258 2 года назад ⁺¹
Thanks! You might want to take a look at the installation instruction for the detectron2 backend: layout-parser.readthedocs.io/en/latest/notes/installation.html#additional-instruction-install-detectron2-layout-model-backend
@KB-pl9vl 2 года назад
Does it support OCR for Ukrainian language?
@zaheerbeg4810 2 года назад
Can we extract paragraph as it is in image file?
@shannonshen258 2 года назад
Yes -- and perhaps the PubLayNet model can help with your task github.com/Layout-Parser/platform/issues/5
@user-tf2tm9ry7i 2 года назад
Thanks! I have installed the package and use the demo code but it always returns me Invalid argument: 'C:\\Users\\xxxx/.torch/iopath_cache\\s/f3b12qc4hc0yh4m\\config.yml?dl=1.lock'. Do you have any idea why that happened?

Следующие

Автовоспроизведение

Customized Layout Detection for Scientific PDFs with LayoutParser and Label Studio

Customized Layout Detection for Scientific PDFs with LayoutParser and Label Studio

Cursor Is Beating VS Code (...by forking it)

Cursor Is Beating VS Code (...by forking it)

IAP 2024 Visual Design in Scholarly Communication | Lec 5 Slides

IAP 2024 Visual Design in Scholarly Communication | Lec 5 Slides

Can Shayne Guess Our Fridges?

Can Shayne Guess Our Fridges?

Making Meatloaf

Making Meatloaf

DRAGON BALL GT CONFIRMED!!!! Super Saiyan 4 Gogeta Trailer SPARKING ZERO Reaction

DRAGON BALL GT CONFIRMED!!!! Super Saiyan 4 Gogeta Trailer SPARKING ZERO Reaction

This Game is NOT The Office

This Game is NOT The Office

Extract Key Information from Documents using LayoutLM | LayoutLM Fine-tuning | Deep Learning

Extract Key Information from Documents using LayoutLM | LayoutLM Fine-tuning | Deep Learning

Document Classification with Transformers and PyTorch | Setup & Preprocessing with LayoutLMv3

Document Classification with Transformers and PyTorch | Setup & Preprocessing with LayoutLMv3

LayoutLMv3 Training with CORD (receipts) dataset

LayoutLMv3 Training with CORD (receipts) dataset

LayoutLM: Pre-training of Text and Layout for Document Image Understanding (Paper Summary)

LayoutLM: Pre-training of Text and Layout for Document Image Understanding (Paper Summary)

GPT-4 Tutorial: How to Chat With Multiple PDF Files (~1000 pages of Tesla's 10-K Annual Reports)

GPT-4 Tutorial: How to Chat With Multiple PDF Files (~1000 pages of Tesla's 10-K Annual Reports)

How to Preprocess Images for Text OCR in Python (OCR in Python Tutorials 02.02)

How to Preprocess Images for Text OCR in Python (OCR in Python Tutorials 02.02)

Extract Text, Title, Paragraph, Image From A Image Document using Deep Learning.

Extract Text, Title, Paragraph, Image From A Image Document using Deep Learning.

[15] Use Python to extract invoice lines from a semistructured PDF AP Report

[15] Use Python to extract invoice lines from a semistructured PDF AP Report

Image Document Classification using LayoutLM | Document understanding |

Image Document Classification using LayoutLM | Document understanding |

ПЕРЕПИСКА НА САЙТЕ ЗНАКОМСТВ | БЕРЕМЕННАЯ против СТУДЕНТКИ

ПЕРЕПИСКА НА САЙТЕ ЗНАКОМСТВ | БЕРЕМЕННАЯ против СТУДЕНТКИ

Новейший ИРП Франции! Вот это технологии! Я такого еще не видел

Новейший ИРП Франции! Вот это технологии! Я такого еще не видел

НЕВОЗМОЖНЫЙ ЭКСПЕРИМЕНТ

НЕВОЗМОЖНЫЙ ЭКСПЕРИМЕНТ

ну это жиза... #standoff2

ну это жиза... #standoff2

Миллиардный бизнес на салфетке

Миллиардный бизнес на салфетке

😲 Гаишник шокировал водителя Мерседеса такими новостями! | Новостничок

😲 Гаишник шокировал водителя Мерседеса такими новостями! | Новостничок

А на каком языке ты ДУМАЕШЬ?

А на каком языке ты ДУМАЕШЬ?

Самый БЕДНЫЙ ГОРОД РОССИИ! Ужасная правда о Тольятти

Самый БЕДНЫЙ ГОРОД РОССИИ! Ужасная правда о Тольятти