A NotebookLM-inspired multimodal RAG platform for ingesting, encoding, and reasoning over text, images, and video with traceable, source-grounded generation.
text-extraction vlm text-encoding image-encoding llm rag-pipeline multimodal-ai doc-layout figure-extraction
-
Updated
Mar 5, 2026 - Python