feat: add pdf-podcast-agent - PDF-to-podcast with live debate and Stripe payments by stripathy1999 · Pull Request #28 · fetchai/innovation-lab-examples

stripathy1999 · 2026-04-10T19:59:07Z

Summary

This PR adds the PDF-to-Podcast Agent example.

The pdf-podcast-agent is a multi-agent ASI:One workflow that converts uploaded research PDFs into a debate-style podcast between two AI hosts (Skeptic + Expert), then supports interactive post-show Q&A and paid live debates.

Core flow:

Extractor agent: parses long PDF text into key insights (thesis, metrics, controversy)
Scriptwriter agent: generates a structured multi-turn debate script
Voice Studio agent: synthesizes host dialogue into MP3
Orchestrator agent: manages end-to-end pipeline, chat UX, artifacts, and payment/debate orchestration
Host A / Host B agents: handle follow-up Q&A and live debate turns
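
As a rough illustration of the core flow above, the stages can be read as a chain of hand-offs. The sketch below uses plain Python functions and hypothetical dataclasses (`Insights`, `ScriptLine`) rather than the PR's actual uAgents message models, and it stubs out the LLM and TTS calls entirely.

```python
from dataclasses import dataclass, field


@dataclass
class Insights:
    # Hypothetical container for the Extractor's output.
    thesis: str
    metrics: list = field(default_factory=list)
    controversy: str = ""


@dataclass
class ScriptLine:
    # One debate turn; speaker is "Skeptic" or "Expert".
    speaker: str
    text: str


def extract(pdf_text: str) -> Insights:
    # Stub: a real extractor would call an LLM; here we keep the first sentence.
    first_sentence = pdf_text.split(".")[0].strip()
    return Insights(thesis=first_sentence)


def write_script(insights: Insights) -> list:
    # Stub: a fixed two-turn exchange between the two hosts.
    return [
        ScriptLine("Expert", f"The paper argues: {insights.thesis}."),
        ScriptLine("Skeptic", "What evidence supports that?"),
    ]


def synthesize(lines: list) -> bytes:
    # Stub: a real voice stage would return MP3 bytes from a TTS service.
    return "\n".join(f"{line.speaker}: {line.text}" for line in lines).encode()


def run_pipeline(pdf_text: str) -> bytes:
    # Orchestrator role: pass each stage's output to the next stage.
    return synthesize(write_script(extract(pdf_text)))
```

In the PR itself each stage is a separate agent and the hand-offs are messages, so the orchestrator reacts to replies instead of calling functions directly.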

Also includes:

Personality customization (multiple host style combinations)
Stripe-gated live debate via Agent Payment Protocol
Transcript + downloadable output artifacts
Follow-up robustness fix from review: guard empty ResourceContent lists in chat attachment handling to avoid handler crashes

Type of Change

Checklist

I have starred this repository.
I ran `ruff check .`.
I ran `ruff format .`.
I added/updated README.md for changed example(s).
I added .env.example if environment variables are required.
I added demo image/GIF (if applicable).
I added agent profile link (if applicable).
I updated CHANGELOG.md (required for non-doc changes).
I verified paths/commands used in docs.

Related Issue

N.A.

Notes for Reviewers

Main implementation is under pdf-podcast-agent/.
Review commit includes a defensive guard in orchestrator.py for empty attachment resources (ResourceContent.resource=[]) to prevent IndexError during message handling.

6-agent PDF-to-podcast pipeline that converts research papers into debate podcasts with live Q&A, turn-by-turn debate, host personality customization, and Stripe payment gating via AgentPaymentProtocol. Tech: uAgents, ASI:One LLM, OpenAI TTS, pdfplumber, pydub, Stripe. Includes Dockerfile and docker-compose.yml for containerised deployment. Made-with: Cursor

sentry · 2026-04-10T20:03:09Z

+
+        await asyncio.sleep(8)
+
+        # Build a plain-text debate history from all accumulated lines


Bug: An await asyncio.sleep(8) call in the handle_debate_response message handler will block the agent's sequential message processing loop, causing significant performance degradation.
Severity: MEDIUM

Suggested Fix

Instead of using await asyncio.sleep() directly in the message handler, refactor the logic to use a non-blocking mechanism. For example, schedule a background task to handle the next step after the delay, allowing the message handler to return immediately.

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid.

Location: pdf-podcast-agent/orchestrator.py#L1030

Potential issue: The `handle_debate_response` message handler contains an `await asyncio.sleep(8)` call. The orchestrator agent is configured to process messages sequentially, not concurrently. This means the 8-second sleep will block the agent's message processing loop. During a multi-turn debate, this will introduce significant, cumulative delays, degrading the user experience of the live debate feature.
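
One way to read the suggested fix, as a minimal sketch and not the change the PR actually made: the handler schedules the delayed follow-up with `asyncio.create_task` and returns immediately. The handler signature, `send_next_turn`, and the `pacing_seconds` parameter are placeholders here, not the orchestrator's real API.

```python
import asyncio

# Strong references to in-flight tasks so they are not garbage-collected early.
_background_tasks = set()


async def send_next_turn(log: list, turn: str) -> None:
    # Placeholder for the real "ask the next host to speak" step.
    log.append(turn)


async def _delayed_next_turn(log: list, turn: str, pacing_seconds: float) -> None:
    # The pause happens inside the background task, not inside the handler.
    await asyncio.sleep(pacing_seconds)
    await send_next_turn(log, turn)


async def handle_debate_response(log: list, turn: str, pacing_seconds: float = 8.0) -> None:
    # Schedule the follow-up and return right away so the next message can be handled.
    task = asyncio.create_task(_delayed_next_turn(log, turn, pacing_seconds))
    _background_tasks.add(task)
    task.add_done_callback(_background_tasks.discard)
```

The trade-off is that errors raised inside the background task no longer propagate to the handler, so a real implementation would likely want logging in `_delayed_next_turn`.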

Auto-resolve missing sub-agent addresses from deterministic seeds at container startup so orchestrator wiring works without manual address injection. Document the behavior so Docker deployments stay aligned with the existing multi-agent workflow. Made-with: Cursor

Drop Docker files and remove Docker setup references from the example docs so the PR no longer includes Docker build or installation guidance. Made-with: Cursor

sentry · 2026-04-13T16:12:30Z

+
+    ASI:One sends CommitPayment with ``transaction_id`` set to the Stripe
+    Checkout Session ID.  We verify with Stripe, send CompletePayment,
+    mark the session as paid, and notify the user.
+    """
+    if msg.funds.payment_method != "stripe" or not msg.transaction_id:


Bug: The code accesses item.resource[0] without checking if the list is empty, leading to an unhandled IndexError that will crash the message handler.
Severity: HIGH

Suggested Fix

Add a check to ensure item.resource is not an empty list before attempting to access its first element. For example: if isinstance(item.resource, list) and item.resource:.

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid.

Location: pdf-podcast-agent/orchestrator.py#L886-L891

Potential issue: In the `handle_chat_message` handler, if an incoming `ResourceContent` message has an empty list for the `item.resource` attribute, the code attempts to access `item.resource[0]`. This access occurs before the `try...except` block, causing an unhandled `IndexError` if the list is empty. This will crash the handler, preventing any response from being sent back to the user for what could be a realistic edge case, such as a failed file upload.
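
The guard described above could look roughly like the following. This is a sketch under assumptions: `first_resource` is a hypothetical helper, and `SimpleNamespace` stands in for the real `ResourceContent` object, whose exact shape is not shown in this thread.

```python
from types import SimpleNamespace
from typing import Any, Optional


def first_resource(item: Any) -> Optional[Any]:
    """Return the first attached resource, or None if there is nothing usable."""
    resource = getattr(item, "resource", None)
    if isinstance(resource, list):
        # An empty list (e.g. a failed upload) is skipped instead of indexed.
        return resource[0] if resource else None
    # A single, non-list resource is passed through unchanged.
    return resource


# Stand-in for an attachment whose upload produced no resources.
empty_attachment = SimpleNamespace(resource=[])
skipped = first_resource(empty_attachment) is None
```

A caller can then `continue` to the next content item when the helper returns `None` instead of letting an `IndexError` escape the handler.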

Skip empty attachment resource lists in chat handling so malformed or failed uploads do not raise IndexError and crash the message handler. Made-with: Cursor

Normalize import ordering and formatting across the example, add requests stubs for mypy, and fix the orchestrator variable typing edge so Ruff and mypy checks pass in CI. Made-with: Cursor

Suppress import-untyped on requests where CI skips stub installation, and update Stripe Checkout ui_mode to the typed embedded value so changed-file mypy checks pass. Made-with: Cursor

Replace the single demo placeholder with real chat flow screenshots covering podcast generation, live Q&A/paywall, Stripe confirmation, and host personality customization. Made-with: Cursor

sentry · 2026-04-13T18:29:27Z

+            f"  Host A       {HOST_A_ADDRESS or '(not set)'}\n"
+            f"  Host B       {HOST_B_ADDRESS or '(not set)'}"
+        )
+    ctx.storage.set(_PENDING_PAYMENTS_KEY, "{}")


Bug: Concurrent payment attempts overwrite a shared _PENDING_PAYMENTS_KEY, causing the system to credit the wrong user's session and denying access to the paying user.
Severity: HIGH

Suggested Fix

Use a unique key for each pending transaction instead of the shared _PENDING_PAYMENTS_KEY. For example, the Stripe checkout_session_id could be used as part of the storage key to ensure each user's payment information is stored and retrieved separately, preventing data overwrites during concurrent sessions.

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid.

Location: pdf-podcast-agent/orchestrator.py#L951

Potential issue: A race condition exists in the payment flow due to the use of a single shared storage key, `_PENDING_PAYMENTS_KEY`, for all concurrent transactions. When multiple users initiate payment, the data for the second user overwrites the first. When the first user completes their payment, the `on_commit_payment` function retrieves the second user's session ID, marking the wrong session as paid. This prevents the first user from accessing the feature they paid for. The fallback mechanism is not triggered because the session ID is incorrect, not empty.
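
To make the suggested fix concrete, here is a small sketch of per-transaction storage keys. It is not the PR's implementation: `DictStorage` is an in-memory stand-in for a key-value store with `get`/`set`, and the `pending_payment:` prefix and helper names are invented for illustration.

```python
import json
from typing import Optional

_PENDING_PREFIX = "pending_payment:"  # hypothetical key prefix


class DictStorage:
    """In-memory stand-in for a key-value store exposing get/set."""

    def __init__(self) -> None:
        self._data = {}

    def get(self, key: str) -> Optional[str]:
        return self._data.get(key)

    def set(self, key: str, value: str) -> None:
        self._data[key] = value


def save_pending(storage: DictStorage, checkout_session_id: str, chat_session: str) -> None:
    # One record per Stripe Checkout Session, so concurrent payers never share a key.
    record = json.dumps({"chat_session": chat_session})
    storage.set(_PENDING_PREFIX + checkout_session_id, record)


def load_pending(storage: DictStorage, checkout_session_id: str) -> Optional[str]:
    # Look up the chat session that started this specific checkout, if any.
    raw = storage.get(_PENDING_PREFIX + checkout_session_id)
    return json.loads(raw)["chat_session"] if raw else None
```

With this layout, a commit for one checkout session can only ever resolve to the chat session that created it, regardless of how many other payments are pending.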

Store pending Stripe payment context per checkout_session_id instead of a shared singleton record so concurrent payment attempts cannot overwrite each other and mis-credit sessions. Made-with: Cursor

gautammanak1

Code review (pdf-podcast-agent):

Must fix / clarify

README says "6 agents" for python run.py, but run.py only starts 4 processes and does not start host_a / host_b: update the docs or extend the launcher.
run.py should parse and export HOST_A_ADDRESS and HOST_B_ADDRESS from get_addresses.py output into child_env, same as the other addresses.
host_a_agent.py / host_b_agent.py docstrings say run hosts "after orchestrator"; README says orchestrator last — fix docstrings to match (hosts before orchestrator is ready to receive messages).

Consider

Document or refactor the await asyncio.sleep(8) in the live debate handler (blocks sequential processing).
Prefer subprocess over exec(open("get_addresses.py").read()) for maintainability.

Looks good

Empty ResourceContent guard, Stripe pending map keyed by checkout session, ruff clean on the example folder.
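
For the two launcher points above (exporting the host addresses and preferring subprocess over `exec`), a sketch might look like this. It assumes get_addresses.py prints `NAME=value` lines, which this thread does not confirm, and both function names are hypothetical.

```python
import subprocess
import sys


def parse_addresses(output: str) -> dict:
    """Collect NAME=value lines whose name ends in _ADDRESS (assumed format)."""
    addresses = {}
    for line in output.splitlines():
        name, sep, value = line.partition("=")
        if sep and name.strip().endswith("_ADDRESS"):
            addresses[name.strip()] = value.strip()
    return addresses


def addresses_from_script(*args: str) -> dict:
    # Run the helper in a child interpreter instead of exec()-ing its source.
    result = subprocess.run(
        [sys.executable, *args],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_addresses(result.stdout)
```

A launcher could then build `child_env = {**os.environ, **addresses_from_script("get_addresses.py")}` so that HOST_A_ADDRESS and HOST_B_ADDRESS reach the child processes the same way as the other addresses.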

Update host agent mailbox instantiation, launch all six agents from run.py with HOST_A/HOST_B addresses, align startup-order docs, and make live debate pacing configurable/documented. Made-with: Cursor

Update pdf-podcast-agent requirements to uagents>=0.24.1 and uagents-core>=0.4.4. Made-with: Cursor

Add --explicit-package-bases to the pull_request_ci typecheck step so same-named modules in different directories are resolved by path and no longer collide as top-level module names. Made-with: Cursor

Run mypy once per changed Python file (with existing flags) instead of passing all files in one command, which prevents same-named modules in different directories from colliding. Made-with: Cursor

sentry · 2026-04-14T11:02:23Z

+        combined = segments[0]
+        for seg in segments[1:]:
+            if gap:
+                combined = combined + gap + seg


Bug: The _stitch_audio function lacks input validation, which could lead to creating a 0-byte MP3 file if it receives an empty list of audio chunks.
Severity: LOW

Suggested Fix

Add a guard at the beginning of the _stitch_audio function or its caller to check if the msg.lines or the resulting audio chunks list is empty. If it is, raise an error or handle it gracefully instead of proceeding to stitch the audio.

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid.

Location: pdf-podcast-agent/voice_studio_agent.py#L79-L82

Potential issue: The `_stitch_audio` function will raise an `IndexError` if its `chunks` argument is an empty list, as it attempts to access `segments[0]`. This exception is caught, but the function then returns an empty byte string `b""`, which is written to disk as a silent, corrupt 0-byte MP3 file. While upstream agents currently prevent this by ensuring non-empty inputs, the `voice_studio_agent` lacks its own defensive validation, making it fragile to changes or malformed inputs from other sources.
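
The defensive check suggested here can be sketched independently of the audio library. The `stitch` function below is a hypothetical stand-in for `_stitch_audio`: it joins any objects that support `+` (as the segments in the diff above do) and fails loudly on empty input instead of producing empty output.

```python
from functools import reduce
from typing import Any, Optional, Sequence


def stitch(segments: Sequence[Any], gap: Optional[Any] = None) -> Any:
    """Join segments in order, inserting gap between neighbours when given."""
    if not segments:
        # Raise rather than return empty audio that would become a 0-byte file.
        raise ValueError("no audio segments to stitch")
    if gap is None:
        return reduce(lambda acc, seg: acc + seg, segments[1:], segments[0])
    return reduce(lambda acc, seg: acc + gap + seg, segments[1:], segments[0])
```

The caller can catch the `ValueError` and report a synthesis failure to the orchestrator, so no file is written for an empty script.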

sentry Bot reviewed Apr 10, 2026

stripathy1999 assigned gautammanak1 and rajashekarcs2023 Apr 13, 2026

sakshitripathy added 2 commits April 13, 2026 09:01

chore: remove docker artifacts from pdf-podcast-agent

f0fde08

sentry Bot reviewed Apr 13, 2026

sakshitripathy and others added 4 commits April 13, 2026 09:45

fix: guard empty ResourceContent lists in orchestrator

c4b9314

Merge branch 'main' into feat/pdf-podcast-agent

5d98789

fix: resolve pdf-podcast-agent lint and type CI failures

cbbcca0

fix: unblock mypy checks for requests and stripe ui mode

5c880ad

stripathy1999 changed the title ~~feat: add pdf-podcast-agent — 6-agent PDF-to-podcast with live debate and Stripe payments~~ feat: add pdf-podcast-agent - PDF-to-podcast with live debate and Stripe payments Apr 13, 2026

docs: add ASI:One demo screenshots for pdf-podcast-agent

3970f1a

sentry Bot reviewed Apr 13, 2026

fix: key pending payments by checkout session

2b56289

gautammanak1 reviewed Apr 13, 2026

Comment thread pdf-podcast-agent/host_a_agent.py Outdated

gautammanak1 requested changes Apr 13, 2026

Comment thread pdf-podcast-agent/host_b_agent.py Outdated

Comment thread pdf-podcast-agent/requirements.txt Outdated

Comment thread pdf-podcast-agent/requirements.txt Outdated

Comment thread pdf-podcast-agent/run.py

Comment thread pdf-podcast-agent/run.py Outdated

sakshitripathy added 4 commits April 14, 2026 03:52

Address PR feedback for host wiring and launcher consistency.

8eeba86

Bump uAgents dependencies to latest reviewer-requested baselines.

ed900d7

Fix PR mypy duplicate-module failures across sibling projects.

c86d9de

Avoid mypy duplicate-module collisions in PR typecheck.

390392f

sentry Bot reviewed Apr 14, 2026

stripathy1999 wants to merge 13 commits into fetchai:main from stripathy1999:feat/pdf-podcast-agent

4 participants

