Multi-language live streaming in Panopto

This repository documents a client-side stream switching solution designed to support multiple simultaneous audio tracks (languages) for a single high-profile event.

Because Panopto currently supports a single audio track per video stream, each language is ingested as its own standard RTMP stream, and the resulting sessions are wrapped in a customised HTML interface. This presents multiple language streams under a single URL and lets viewers switch players without a page reload.

Architecture overview

This prototype bridges standard Panopto delivery with multi-track requirements via a lightweight web wrapper. Three distinct upstream topologies have been modelled for this implementation: separate hardware encoders, an enterprise "fan-out" using AWS Elemental Live, and a cloud-native automated approach utilising AI translation.

Topology A: discrete hardware encoders (standard implementation)

In this configuration, separate physical encoding units are provisioned for each language. This is often utilised when legacy hardware is available or when language feeds originate from disparate physical locations.

Ingest: Multiple encoders (e.g., Encoder 1 for English, Encoder 2 for Spanish) operate independently.
Routing: Each encoder pushes to a unique Panopto Session GUID.
Synchronisation: Alignment relies on the manual synchronisation of "Start" times. Minor latency variances (drift) are expected between languages.

Topology B: AWS Elemental Live "Fan-Out" (enterprise implementation)

To ensure frame-accurate synchronisation and reduce hardware footprint, this topology utilises a single unified encoding appliance.

Ingest: A single AWS Elemental Live appliance receives a master feed containing all audio tracks (e.g., SDI Embedded Audio Ch1-4).
Fan-Out: The encoder processes the video signal once but generates unique RTMP streams for each language by mapping specific audio pairs to distinct Output Groups.
Synchronisation: As all RTMP streams derive from a single system clock, the timecode is identical across all sessions. This minimises visual discontinuity when a user switches languages.

Topology C: cloud-native automation (AI-driven localisation)

This topology addresses requirements for high-volume automated translation without the logistical complexity of human interpreters. It leverages cloud-based video processing and an AI-driven middleware layer for dubbing and captioning (an approach comparable to this reference architecture).

Ingest: A single contribution stream is sent to a Cloud Encoding Service.
Processing: The encoder creates an adaptive bitrate (ABR) stream which is pushed to the AI Localisation Middleware via a supported protocol (e.g., WebDAV or SRT).
AI Translation & Dubbing:
- The middleware performs automatic speech-to-text transcription.
- Machine translation engines generate multi-language subtitles.
- A Text-to-Speech (TTS) engine generates synthetic audio dubbing, synchronised to the original event dialogue.
Routing to Panopto: The resulting distinct audio/video feeds (now containing AI-dubbed audio) are forwarded as RTMP inputs to their respective Panopto Session GUIDs, maintaining compatibility with the client-side switcher.

The client-side wrapper

Regardless of the upstream topology chosen, the client-side experience remains consistent. While Panopto provides an Embed API, the native interface lacks dynamic session switching. To address this:

Hosting: A standalone HTML page hosts a Mapping Object (Language Name to Panopto Session GUID).
Interface: A simplified dropdown UI facilitates language selection.
Initialisation: The page targets a single iframe (id="panopto-player-frame") to load the default language immediately.
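As a rough illustration of the mapping object and dropdown described above, the following sketch assumes a select element with the id `language-select` and uses placeholder GUIDs. The names `sessionsByLanguage` and `toOptions` are hypothetical and are not taken from the repository's HTML files.

```javascript
// Hypothetical mapping of language names to Panopto session GUIDs.
// The GUIDs below are placeholders, not real sessions.
const sessionsByLanguage = {
  English: '00000000-0000-0000-0000-000000000001',
  Spanish: '00000000-0000-0000-0000-000000000002',
};

// Convert the mapping into { label, value } pairs for a <select> element.
function toOptions(mapping) {
  return Object.entries(mapping).map(([label, value]) => ({ label, value }));
}

// Populate the dropdown only when running in a browser.
if (typeof document !== 'undefined') {
  const select = document.getElementById('language-select'); // assumed id
  if (select) {
    for (const { label, value } of toOptions(sessionsByLanguage)) {
      select.add(new Option(label, value));
    }
  }
}
```

Keeping the mapping in one object means that adding a language is a one-line change, and the dropdown stays in step with the available sessions.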

Stream switching logic

To avoid the full page reload that ordinary browser navigation would trigger, the wrapper employs the following logic:

Hot Swapping: JavaScript intercepts the dropdown change event to retrieve the corresponding Session ID.
URL Construction: The script dynamically constructs the new source URL using the specific session GUID.
Autoplay Enforcement: The autoplay=true parameter is appended to request that playback resume immediately (subject to the browser's autoplay policy).
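The three steps above could be sketched as follows. This is a minimal sketch rather than the repository's implementation: the hostname `example.hosted.panopto.com`, the select id `language-select`, and the function name `buildEmbedUrl` are assumptions, and each option value is assumed to hold a session GUID.

```javascript
// Build the embed URL for a given session.
// `siteHost` is a placeholder for the organisation's Panopto hostname.
function buildEmbedUrl(siteHost, sessionId) {
  const url = new URL(`https://${siteHost}/Panopto/Pages/Embed.aspx`);
  url.searchParams.set('id', sessionId);
  url.searchParams.set('autoplay', 'true');
  return url.toString();
}

// Wire up the dropdown only when running in a browser.
if (typeof document !== 'undefined') {
  const frame = document.getElementById('panopto-player-frame');
  const select = document.getElementById('language-select'); // assumed id
  if (frame && select) {
    select.addEventListener('change', (event) => {
      // Replacing the iframe src reloads only the player, not the page.
      frame.src = buildEmbedUrl('example.hosted.panopto.com', event.target.value);
    });
  }
}
```

Building the URL with `URL` and `searchParams` rather than string concatenation ensures that the session ID is encoded correctly.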

Encoder configuration reference

Hardware Encoders (Topology B)

The encoder must be configured to map specific source audio channels to the corresponding Panopto RTMP endpoints.

Setting Category	Parameter	Value
Input	Audio Selector 1	Name: `Audio_ENG` Source: `Embedded` Track: `1` (Pair 1)
Input	Audio Selector 2	Name: `Audio_ESP` Source: `Embedded` Track: `2` (Pair 2)
Output Group 1	Destination	`rtmp://[Panopto-Ingest-URL]/[StreamKey-A]`
Stream 1	Audio Source	`Audio_ENG`
Output Group 2	Destination	`rtmp://[Panopto-Ingest-URL]/[StreamKey-B]`
Stream 2	Audio Source	`Audio_ESP`
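Before an event, it can help to check that every output group references a defined audio selector and that no two groups share a destination. The sketch below restates the table as plain data for that purpose. It is not an AWS Elemental Live job format, and `findMappingProblems` is a hypothetical helper.

```javascript
// Plain-data restatement of the table above (placeholders kept as-is).
const audioSelectors = [
  { name: 'Audio_ENG', source: 'Embedded', track: 1 },
  { name: 'Audio_ESP', source: 'Embedded', track: 2 },
];

const outputGroups = [
  { destination: 'rtmp://[Panopto-Ingest-URL]/[StreamKey-A]', audioSource: 'Audio_ENG' },
  { destination: 'rtmp://[Panopto-Ingest-URL]/[StreamKey-B]', audioSource: 'Audio_ESP' },
];

// Return a list of human-readable problems; an empty list means the
// mapping is internally consistent.
function findMappingProblems(selectors, groups) {
  const problems = [];
  const selectorNames = new Set(selectors.map((s) => s.name));
  const seenDestinations = new Set();
  for (const group of groups) {
    if (!selectorNames.has(group.audioSource)) {
      problems.push(`Unknown audio selector: ${group.audioSource}`);
    }
    if (seenDestinations.has(group.destination)) {
      problems.push(`Duplicate destination: ${group.destination}`);
    }
    seenDestinations.add(group.destination);
  }
  return problems;
}

console.log(findMappingProblems(audioSelectors, outputGroups));
```

A duplicate destination would mean two languages pushing to the same Panopto session, which is the misconfiguration this check is meant to catch.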

Cloud-Native AI Middleware (Topology C)

For automated workflows, the configuration moves from physical appliances to cloud infrastructure.

Upstream Encoder: Configured to output a clean ABR stream (HLS/DASH) suitable for ingestion by the localisation platform.
Localisation Platform: Requires integration to define target languages (e.g., French, German) and voice profiles for the synthetic audio generation.
Origin/Packaging: The middleware acts as the origin service, packaging the content with specific language tracks.
Output: The translated streams must be bridged back to RTMP if direct ingestion into Panopto is required.

User journeys

1. AV producer journey (setup)

The AV team selects the upstream topology based on resource availability (Human Interpreters vs AI Automation).
- Topology A/B: Interpreters provide live audio; hardware routes feeds to Panopto.
- Topology C: The producer configures the AI localisation platform to automatically translate the master feed into English and Spanish.
Outcome: Two distinct, parallel broadcasts are active within the Panopto cloud environment.

2. Viewer journey (consumption)

[Demo video: github_ready.mp4]

The viewer navigates to the custom event URL.
The page initialises; JavaScript injects the Session ID for the default language (English).
The viewer selects "Spanish" via the dropdown menu.
The script updates the iframe src attribute.
Outcome: The player buffers briefly (approximately 1-2 seconds) and resumes playback with the Spanish audio track.

Further enhancements: Locale-based stream initialisation

At present, the player wrapper defaults to a static primary language (e.g., English) upon loading. A proposed enhancement involves implementing logic to automatically detect the viewer's locale, reducing the need for manual selection by the end-user.

This feature functions by querying the browser's navigator.language property to ascertain the user's preferred system language. The script parses this string to identify the primary language ISO code (for instance, truncating es-MX to es).

This code is subsequently cross-referenced against the internal streamMap object:

Match found: If the detected locale corresponds to an available stream, that specific Panopto Session ID is injected into the iframe source immediately.
No match (fallback): If the locale is unsupported, the system reverts to the standard default language to ensure the broadcast remains accessible.

To deploy this functionality, the static initialisation script within the HTML wrapper would be augmented with conditional logic.

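// 1. Define the default fallback (e.g., English)
const defaultLang = 'en';

// 2. Detect and normalise the browser language (e.g., 'es-MX' becomes 'es').
// navigator.userLanguage is a legacy Internet Explorer property; if neither
// property is set, fall back to the default instead of throwing.
const userLocale = (navigator.language || navigator.userLanguage || defaultLang)
    .split('-')[0]
    .toLowerCase();

// 3. Determine the startup ID.
// streamMap (defined elsewhere in the wrapper) maps language codes to session IDs.
// If the detected language exists in the map, use it. Otherwise, use the default.
const initialId = Object.prototype.hasOwnProperty.call(streamMap, userLocale)
    ? streamMap[userLocale]
    : streamMap[defaultLang];

// 4. Initialise the player ({site} is a placeholder for the Panopto hostname)
document.getElementById('panopto-player-frame').src =
    `https://{site}/Panopto/Pages/Embed.aspx?id=${encodeURIComponent(initialId)}&autoplay=true`;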
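A possible refinement, sketched here as an assumption rather than as part of the proposal, is to walk the browser's full preference list (`navigator.languages`) instead of checking only the first language. The function name `pickStartupLanguage` and the placeholder map are hypothetical.

```javascript
// Return the first preferred language that has a stream, or the default.
// `preferredLocales` is an array such as ['es-MX', 'en-US'].
function pickStartupLanguage(preferredLocales, streamMap, defaultLang) {
  for (const locale of preferredLocales) {
    const primary = String(locale).split('-')[0].toLowerCase();
    if (Object.prototype.hasOwnProperty.call(streamMap, primary)) {
      return primary;
    }
  }
  return defaultLang;
}

// Placeholder map; real values would be Panopto session GUIDs.
const exampleStreamMap = { en: 'GUID-EN-PLACEHOLDER', es: 'GUID-ES-PLACEHOLDER' };

// Use the browser's preference list when one is available.
const localesFromBrowser =
  typeof navigator !== 'undefined' && Array.isArray(navigator.languages)
    ? navigator.languages
    : [];

const startupLang = pickStartupLanguage(localesFromBrowser, exampleStreamMap, 'en');
console.log(startupLang);
```

With this approach, a viewer whose preferences are French then Spanish would start on the Spanish stream rather than falling straight back to English.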

Implementation notes

The following considerations were raised during peer review:

Autoplay Policies: While autoplay=true is included, browser autoplay policies may still require user interaction. A fallback message or play button overlay is recommended for production deployments.
Error Handling: Validation logic should be implemented to handle invalid language keys or missing GUIDs.
User Experience: Ideally, a loading indicator should be displayed during stream switches to mask the buffering period.
Accessibility: aria-label attributes should be applied to the iframe and select elements.
Viewer Pane: The parameter offerviewer=false should be utilised to conceal the viewer list, ensuring a cleaner interface.

These suggestions have been accommodated in the stream-locale-switcher.HTML file located in this repository.
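For illustration only, the error-handling and viewer-pane notes might translate into something like the sketch below. It does not reproduce the contents of stream-locale-switcher.HTML. The GUID pattern, the function names, and the hostname `example.hosted.panopto.com` are assumptions.

```javascript
// Assumed format: session IDs are standard 8-4-4-4-12 hexadecimal GUIDs.
const GUID_PATTERN = /^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$/i;

// Return a valid session ID for langKey, falling back to defaultLang.
// Returns null when neither entry exists or looks like a GUID.
function resolveSessionId(streamMap, langKey, defaultLang) {
  for (const key of [langKey, defaultLang]) {
    const id = Object.prototype.hasOwnProperty.call(streamMap, key)
      ? streamMap[key]
      : undefined;
    if (typeof id === 'string' && GUID_PATTERN.test(id)) {
      return id;
    }
  }
  return null;
}

// Build the embed URL with autoplay requested and the viewer list hidden.
function buildPlayerUrl(siteHost, sessionId) {
  const params = new URLSearchParams({
    id: sessionId,
    autoplay: 'true',
    offerviewer: 'false',
  });
  return `https://${siteHost}/Panopto/Pages/Embed.aspx?${params}`;
}
```

When `resolveSessionId` returns null, the wrapper could show an error message instead of loading a broken player.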
