Tri-Modal DocRAG uses YOLO, OCR, and VLMs to detect and extract titles, text, figures, and tables from document images, producing structured JSON outputs for RAG systems.
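
To make the "structured JSON outputs" concrete, here is a minimal sketch of how detected regions might be assembled into one JSON record per page. The `Region` class, its field names, the `build_page_record` helper, and the sample values are all assumptions made for illustration; they are not taken from the project's actual schema or code. The detector, OCR, and VLM stages are replaced by hand-written stand-in values.

```python
import json
from dataclasses import dataclass, asdict


# Hypothetical schema: these field names are assumptions for illustration,
# not the project's actual output format.
@dataclass
class Region:
    label: str    # e.g. "title", "text", "figure", or "table"
    bbox: tuple   # (x_min, y_min, x_max, y_max) in pixels
    content: str  # OCR text, or a VLM-generated description


def build_page_record(page_id: str, regions: list) -> str:
    """Order regions top-to-bottom, then left-to-right, and serialize to JSON."""
    ordered = sorted(regions, key=lambda r: (r.bbox[1], r.bbox[0]))
    record = {
        "page_id": page_id,
        "regions": [asdict(r) for r in ordered],
    }
    return json.dumps(record, indent=2)


# Hand-written stand-ins for what the detector, OCR, and VLM stages would emit.
regions = [
    Region("text", (40, 120, 560, 300), "Body paragraph recognized by OCR."),
    Region("title", (40, 30, 560, 80), "Quarterly Report"),
    Region("figure", (40, 320, 560, 600), "Bar chart described by a VLM."),
]

page_json = build_page_record("page-001", regions)
print(page_json)
```

Sorting by the top-left corner is only a rough approximation of reading order; multi-column layouts would likely need a more careful ordering step before the record is passed to a RAG indexer.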