Migrate from chat.completions to Responses API by filip-komarzyniec · Pull Request #65 · IBM/ai4rag

filip-komarzyniec · 2026-05-19T07:35:05Z

Description

change API calls from chat/completions to responses (both OpenAI-compatible)
modified streamed pattern according to the document: https://github.com/LukaszCmielowski/architecture-decision-records/blob/autox_docs_updates/documentation/components/autorag/features/rag_pattern_inference.md
other small changes to different classes and functions not really affecting the overall behaviour

Motivation

Use the built-in agent loop working under the hood of Responses API,
stay on top of the industry standards

Changes

API calls change
small refactoring of various names so that they better represent current behaviour

Testing

Commit with updated tests will be added once manual tests are completed.

Checklist

Tests added/updated
Documentation updated
Code follows style guide
All checks passing

jakub-walaszczyk · 2026-05-19T14:35:24Z

@filip-komarzyniec please restore changes that introduce formatting for 80 characters per line. 120 has been fine here, and these changes make noise in the PR

rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com>

…s API rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com>

Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com> rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED

jakub-walaszczyk

Please follow the requested changes and consider handling chroma within the ai4rag context.

rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com>

Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com> rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED

rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED

jakub-walaszczyk · 2026-05-22T08:16:17Z

    """Constants used for setting the generation (inference) parameters for chat models only."""

-    MAX_COMPLETION_TOKENS = 2048
+    MAX_TOKENS = 2048


If the parameter name is now max_output_tokens why do we use max_tokens? Wouldn't it be better to reflect the name of the parameter?

It would, if we dropped the chat.completions support. The problem is that in the older API (completions) this param is called max_completion_tokens. In newer API (responses) it's max_output_tokens.

Now that we have some abstract constants class, I've decided to call it just max_tokensthere because it's consumed by both API interfaces.
What's more, the max_tokens is a deprecated name once used by chat.completions API so it's not totally made up by me.

The whole problem is caused by the fact that we decided earlier to leave the chat method support. No one uses it now so maybe it's better to just replace it with responses-aligned methods.

Let's replace it and keep proper parameters names, Thanks

filip-komarzyniec requested review from LukaszCmielowski and jakub-walaszczyk and removed request for jakub-walaszczyk May 19, 2026 14:17

filip-komarzyniec added 4 commits May 19, 2026 16:52

source code transitioned from chat/completions API to Responses API

f46209b

rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com>

changed returned pattern content; formatter and linter related changes

f8dc86d

rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com>

updated tests to match recent changes

9932186

rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com>

fixup! source code transitioned from chat/completions API to Response…

59d7320

…s API rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com>

filip-komarzyniec force-pushed the RHOAIENG-60206-AutoRAG-migrate-to-OGX-Responses-API branch from 3a91573 to 59d7320 Compare May 19, 2026 14:52

filip-komarzyniec requested a review from jakub-walaszczyk May 19, 2026 14:53

formatter and linter enforced changes

8317802

Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com> rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED

filip-komarzyniec force-pushed the RHOAIENG-60206-AutoRAG-migrate-to-OGX-Responses-API branch from adbece5 to 8317802 Compare May 19, 2026 14:55

fixup! updated tests to match recent changes

4d362cd

Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com> rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED

jakub-walaszczyk requested changes May 20, 2026

View reviewed changes

filip-komarzyniec commented May 20, 2026

View reviewed changes

Comment thread ai4rag/rag/retrieval/retriever.py Outdated

filip-komarzyniec added 2 commits May 21, 2026 15:18

changes mentioned in PR review #1

8e74789

rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com>

updated test files to reflect newest changes

0cde922

rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com>

filip-komarzyniec force-pushed the RHOAIENG-60206-AutoRAG-migrate-to-OGX-Responses-API branch from 38210dc to 0cde922 Compare May 21, 2026 13:18

unwanted formatting changes reverted

71460bd

Signed-off-by: Filip Komarzyniec <fkomarzy@redhat.com> rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED

filip-komarzyniec force-pushed the RHOAIENG-60206-AutoRAG-migrate-to-OGX-Responses-API branch from 52fead2 to 71460bd Compare May 21, 2026 14:00

filip-komarzyniec requested a review from jakub-walaszczyk May 21, 2026 15:17

fixed invalid parameter in resposnes API call; related tests updated

07f3715

rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED

jakub-walaszczyk reviewed May 22, 2026

View reviewed changes

filip-komarzyniec requested a review from jakub-walaszczyk May 22, 2026 10:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Migrate from chat.completions to Responses API#65

Migrate from chat.completions to Responses API#65
filip-komarzyniec wants to merge 10 commits into
mainfrom
RHOAIENG-60206-AutoRAG-migrate-to-OGX-Responses-API

filip-komarzyniec commented May 19, 2026 •

edited

Loading

Uh oh!

jakub-walaszczyk commented May 19, 2026

Uh oh!

jakub-walaszczyk left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jakub-walaszczyk May 22, 2026

Uh oh!

filip-komarzyniec May 22, 2026

Uh oh!

jakub-walaszczyk May 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

filip-komarzyniec commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation

Changes

Testing

Checklist

Uh oh!

jakub-walaszczyk commented May 19, 2026

Uh oh!

jakub-walaszczyk left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jakub-walaszczyk May 22, 2026

Choose a reason for hiding this comment

Uh oh!

filip-komarzyniec May 22, 2026

Choose a reason for hiding this comment

Uh oh!

jakub-walaszczyk May 22, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

filip-komarzyniec commented May 19, 2026 •

edited

Loading