
Configurable URL for local models + remove Aitta-specific code#205

Open
maciejjan wants to merge 1 commit into CliDyn:main from maciejjan:aitta

@maciejjan maciejjan commented Apr 23, 2026

This PR introduces new config options that set the endpoint URLs of the local models:

  • llm_local_endpoint_url (for LLMs),
  • rag_settings.local_endpoint_url (for the RAG embedding model).

All references to Aitta are removed from the code. Instead, Aitta can be used by setting the above URLs and the API token. (We can get in touch on Mattermost to provide a token.) By default, the URLs point to localhost, where for example your own vLLM server can be running.
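To illustrate the fallback behaviour described above, here is a minimal sketch of how the two options might be resolved from a loaded config. The option names come from this PR; the helper name `local_endpoints`, the dict-shaped config, and the exact default URL (`http://localhost:8000/v1`) are assumptions for illustration, not the code in this PR.

```python
from typing import Any, Mapping

# Assumed default: the PR only states that the URLs point to localhost
# by default; the scheme and the "/v1" path are illustrative.
DEFAULT_LOCAL_URL = "http://localhost:8000/v1"


def local_endpoints(config: Mapping[str, Any]) -> dict[str, str]:
    """Return the LLM and embedding endpoint URLs, falling back to localhost."""
    rag = config.get("rag_settings") or {}
    return {
        "llm": config.get("llm_local_endpoint_url") or DEFAULT_LOCAL_URL,
        "embeddings": rag.get("local_endpoint_url") or DEFAULT_LOCAL_URL,
    }
```

With an empty config both entries resolve to the localhost default; setting either option overrides only that endpoint.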

The script for creating the RAG DB has been adjusted (see #182):

  • the redundant embedding calculation is removed,
  • documents are now embedded in batches for better performance.
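The batching idea can be sketched roughly as follows. This is not the script from this PR: `embed_in_batches`, the `embed_batch` callable, and the batch size of 64 are hypothetical stand-ins for whatever embedding client and settings the script actually uses.

```python
from itertools import islice
from typing import Callable, Iterable, Iterator, List, Sequence


def batched(items: Iterable[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive chunks of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    it = iter(items)
    while batch := list(islice(it, size)):
        yield batch


def embed_in_batches(
    docs: Iterable[str],
    embed_batch: Callable[[Sequence[str]], List[List[float]]],
    batch_size: int = 64,  # illustrative value, not taken from the PR
) -> List[List[float]]:
    """Embed each document exactly once, one request per batch."""
    vectors: List[List[float]] = []
    for batch in batched(docs, batch_size):
        vectors.extend(embed_batch(batch))
    return vectors
```

Sending one request per batch instead of one per document reduces round trips to the embedding endpoint, and collecting the vectors once avoids computing any embedding twice.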

Rationale: Aitta is now a drop-in replacement for the OpenAI API, so it can be used by setting the base URL and access token in classes like langchain_openai.ChatOpenAI. I have therefore repurposed the "local" endpoint option by making the URL configurable (instead of the hard-coded localhost:8000), so that it can point to any custom OpenAI-compatible LLM provider, whether local or remote, for example Aitta.
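As a rough sketch of what "setting the base URL and access token" looks like with `langchain_openai.ChatOpenAI`: the helper names, the model name in the usage comment, and the `"EMPTY"` placeholder key are assumptions for illustration and are not taken from this PR. `base_url`, `api_key`, and `model` are existing `ChatOpenAI` parameters.

```python
from typing import Any, Optional


def openai_compatible_kwargs(
    base_url: str, model: str, api_token: Optional[str] = None
) -> dict[str, Any]:
    """Build client arguments for any OpenAI-compatible endpoint."""
    return {
        "base_url": base_url.rstrip("/"),
        "model": model,
        # Assumption: a local server ignores the key, but the client
        # still needs a non-empty value, so use a placeholder.
        "api_key": api_token or "EMPTY",
    }


def make_chat_model(**kwargs: Any):
    """Create the chat model; requires the langchain-openai package."""
    # Imported lazily so the helper above works without langchain installed.
    from langchain_openai import ChatOpenAI

    return ChatOpenAI(**kwargs)


# Hypothetical usage with a local vLLM server:
# llm = make_chat_model(**openai_compatible_kwargs(
#     "http://localhost:8000/v1", model="my-local-model"))
```

The same call would work for a remote provider by passing its URL and a real token, which is the point of making the endpoint configurable.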

Early versions of Aitta (a year ago) needed some custom initialization code and the aitta-client dependency, which were added in #174 and other earlier PRs. This is no longer needed, so I have cleaned up all Aitta-specific code and settings.
