docs: add async batch annotation for table extraction from PDFs #301

jean-malo · 2025-12-28T14:30:58Z

This commit introduces a new feature that demonstrates how to perform asynchronous batch processing of PDF documents for table extraction using Mistral's OCR capabilities. The implementation includes:

Creating a batch request with specific table extraction parameters
Uploading the batch file to Mistral's API
Creating and monitoring a batch job
Processing the results once the job completes

The example uses Pydantic models to define the expected response format and handles both successful and error cases from the batch processing. This provides a complete workflow for batch OCR operations with Mistral's API.

This commit introduces a new feature that demonstrates how to perform asynchronous batch processing of PDF documents for table extraction using Mistral's OCR capabilities. The implementation includes: 1. Creating a batch request with specific table extraction parameters 2. Uploading the batch file to Mistral's API 3. Creating and monitoring a batch job 4. Processing the results once the job completes The example uses Pydantic models to define the expected response format and handles both successful and error cases from the batch processing. This provides a complete workflow for batch OCR operations with Mistral's API.

jean-malo requested review from aac228 and lorenzosignoretti December 28, 2025 14:31

jean-malo changed the title ~~feat(ocr): add async batch annotation for table extraction from PDFs~~ docs: add async batch annotation for table extraction from PDFs Dec 28, 2025

lorenzosignoretti approved these changes Dec 29, 2025

View reviewed changes

jean-malo merged commit ee543e4 into main Dec 29, 2025
10 checks passed

jean-malo deleted the docs/ocr-batch-doc-annotation branch December 29, 2025 08:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docs: add async batch annotation for table extraction from PDFs #301

docs: add async batch annotation for table extraction from PDFs #301

Uh oh!

jean-malo commented Dec 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

docs: add async batch annotation for table extraction from PDFs #301

docs: add async batch annotation for table extraction from PDFs #301

Uh oh!

Conversation

jean-malo commented Dec 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants