Wire model-config-datasource into the runner and helm chart.#175
Wire model-config-datasource into the runner and helm chart.#175Mohammad-nassar10 wants to merge 2 commits into
Conversation
Signed-off-by: Mohammad <mohammad.nassar@ibm.com>
| {{- if .Values.payloadProcessor.listModels }} | ||
| {{- $models := list }} | ||
| {{- range .Values.payloadProcessor.listModels }} | ||
| {{- $models = append $models (dict "name" .) }} | ||
| {{- end }} | ||
| models.json: {{ dict "models" $models | toJson | quote }} | ||
| {{- end }} |
There was a problem hiding this comment.
it's not clear what is listModels from this code.
can you please add the relevant documentation to the readme.MD inside the helm chart dir?
the optional flags should be documentation so whoever uses the helm can configure IPP easily.
There was a problem hiding this comment.
Good point, added, thanks!
Signed-off-by: Mohammad <mohammad.nassar@ibm.com>
There was a problem hiding this comment.
@Mohammad-nassar10 thanks for adding the doc.
now that I understand what was the intention in the PR - this looks somehow very limiting - the list of models is static, there is no way to add a model dynamically (nor remove). it seems like an easy way for a PoC but not the proper way moving forward.
the payload processing config is intended to be used for plugins configuration.
seems like this PR is making the config define the source of truth of available models (this is not a plugin, although one could argue data collector is).
If we take it to EPP's world to compare just to make the point -
think how it would look like if one would have to configure the available endpoints of EPP through the plugins config file. this sounds incorrect to me.
alternative direction that could work:
add a configmap that stores models. configmap could also add metadata for every model entry (e.g., like its cost per 1M tokens). add a controller that watches the "modelsList" configmap(s) and fills that data into the datastore.
|
@nirrozenbaum wrote:
The updates to the README.md state:
That is not part of the IPP's configuration ConfigMap. It is a separate ConfigMap. If and how that ConfigMap is checked for changes is a different thing, which should be handled separately from this PR. |
We already have a |
What type of PR is this?
/kind feature
What this PR does / why we need it:
Register
model-config-datasourceplugin in the runner and expose a way to ship the JSON file through the chart, so users use it to seed candidate models.Which issue(s) this PR fixes:
Fixes #
Release note (write
NONEif no user-facing change):