Updated 2025-09-01 11:35:25 +02:00
This repo is forked from https://github.com/Yuliang-Liu/MultimodalOCR
Updated 2025-08-29 15:14:12 +02:00
Updated 2025-08-27 13:28:13 +02:00
Updated 2025-08-18 10:08:30 +02:00
This repository is forked from https://github.com/IDEA-Research/Grounded-SAM-2
Updated 2025-08-14 11:27:12 +02:00
Updated 2025-08-12 17:46:21 +02:00
This repo is used for clustering document embeddings using vison encoders from well pretrained VLM models (LayoutLM, QwenVL, etc.)
Updated 2025-07-11 16:21:00 +02:00
Updated 2025-06-09 15:47:27 +02:00
Updated 2025-01-16 00:26:18 +01:00
Updated 2024-09-16 14:11:50 +02:00