Skip to content

[Feature]: Multi-Language Support #1728

Description

@xjasonliu

Feature Area

Ingestion (document processing, upload, Docling)

Problem Description

when trying to ingest a Chinese Document, got this error "'ascii' codec can't encode characters in position 0-10: ordinal not in range(128)", Any roadmap for multi-language support

Proposed Solution

Support UTF-8

Use Case

To handle multi language documents, and docling already can do that.

Alternatives Considered

No response

Priority

Critical for my use case

Additional Context

No response

Contribution

  • I would be willing to help implement this feature.
  • I can help test this feature once implemented.

Checklist

  • I have searched existing issues and discussions to ensure this feature hasn't been requested before.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancement🔵 New feature or request

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions