- All LLM/VLM/CLIP models are served as APIs with caching enabled, because loading an LLM/VLM/CLIP is expensive and we never modify the models themselves.
- LLM functions in `utils_llm.py`, VLM functions in `utils_vlm.py`, CLIP functions in `utils_clip.py`, and others in `utils_general.py`.
- Write unit tests to understand major functions.
- Set up the OpenAI API key: `export OPENAI_API_KEY='[your key]'`
- Pip install environments: `pip install vllm`
- Configure global variables in `global_vars.py`
- Run `python -m vllm.entrypoints.openai.api_server --model HuggingFaceM4/Idefics3-8B-Llama3 --port 8080 --max_model_len 5000`
- Run `python -m serve.utils_llm` or `python -m serve.utils_vlm` to test.
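Once the vLLM server above is running, it exposes an OpenAI-compatible endpoint. The sketch below queries `/v1/chat/completions` on port 8080 with only the standard library; the prompt and `max_tokens` value are illustrative assumptions, not values from this repo.

```python
import json
import urllib.request

URL = "http://localhost:8080/v1/chat/completions"

# Standard OpenAI-style chat payload targeting the model served above.
payload = {
    "model": "HuggingFaceM4/Idefics3-8B-Llama3",
    "messages": [{"role": "user", "content": "Describe a cat in one sentence."}],
    "max_tokens": 64,
}

def query_server(url: str = URL) -> str:
    """POST the chat request and return the first completion's text."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]

if __name__ == "__main__":
    try:
        print(query_server())
    except OSError:
        print("Server not reachable; start the vLLM server first.")
```

The same request shape works with the official `openai` client by pointing its `base_url` at `http://localhost:8080/v1`.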
- Pip install environments: `pip install open-clip-torch flask`
- Configure global variables in `global_vars.py`
- Run `python serve/clip_server.py`
- Run `python -m serve.utils_clip` to test.
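For intuition, a Flask CLIP server like `serve/clip_server.py` can be sketched as below. The `/embed` route name and the dummy `embed_texts` function are assumptions for illustration only; a real server would encode the texts with an `open_clip` model instead of returning placeholder vectors.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

def embed_texts(texts):
    """Placeholder for open_clip text encoding; returns fixed-size vectors."""
    return [[float(len(t)), 0.0, 1.0] for t in texts]

@app.route("/embed", methods=["POST"])
def embed():
    # Expects a JSON body like {"texts": ["a photo of a cat", ...]}.
    texts = request.get_json()["texts"]
    return jsonify({"embeddings": embed_texts(texts)})

if __name__ == "__main__":
    app.run(port=7070)
```

Serving CLIP behind HTTP this way means the model is loaded once at startup, matching the cache-and-serve design described at the top of this section.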