Improving language detection by a-nasstrom · Pull Request #77 · Vexa-ai/vexa

a-nasstrom · 2025-11-18T15:11:35Z

I improved the language detection algorithm by adding segment-level probability aggregation, weighted scoring, early stopping logic, and more robust handling of noisy or mixed-language audio.

…, weighted scoring, early stopping, and more robust handling of noisy/mixed audio.

DmitriyG228 · 2026-04-24T12:54:17Z

This PR has been open since November 2025 and is currently CONFLICTING with main. Are you still working on it?

If yes: happy to coordinate a rebase; ping us and we'll prioritize review.
If no: we'll close and surface the idea (segment-level probability aggregation for language detection) in a future groom cycle in case anyone else wants to pick it up.

No pressure either way — just avoiding an indefinite in-flight PR.

a-nasstrom added 2 commits November 18, 2025 16:08

Improve language detection: add segment-level probability aggregation…

0fd5a1a

…, weighted scoring, early stopping, and more robust handling of noisy/mixed audio.

silent improvements

5cb3c91

DmitriyG228 added this to the 0.7 patches milestone Feb 4, 2026

DmitriyG228 added the area: transcription (Whisper/STT) Transcription / Whisper / STT label Feb 4, 2026

DmitriyG228 removed this from the 0.7 patches milestone Feb 13, 2026

DmitriyG228 added this to the 0.11 milestone Apr 25, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improving language detection#77

Improving language detection#77
a-nasstrom wants to merge 2 commits intoVexa-ai:mainfrom
Symfa-Inc:language_detection_algorithm

a-nasstrom commented Nov 18, 2025

Uh oh!

DmitriyG228 commented Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

a-nasstrom commented Nov 18, 2025

Uh oh!

DmitriyG228 commented Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants