feat : : implement session-based local caching to reduce redundant Su… by Soniyakmt · Pull Request #143 · sugarlabs/speak-ai

Soniyakmt · 2026-05-12T06:17:00Z

fixes #87
summary:
Added local response caching for GenAI to reduce repeated Sugar-AI requests and improve response speed for repeated questions.

What changed
I had made: Added [cache.py] with an LRU-style [ResponseCache]
and Updated [gguf_inference.py] to:

Use a session-level response cache
return cached answers for identical questions
store successful and blocked responses in the cache
Limit the cache size to avoid unbounded memory growth
Coverage: Caches both successful responses and blocked profanity responses.
Caching Logic: Normalises question text (lowercase, trimmed), uses SHA256 for keys, and limits cache size to prevent memory issues.
Updated: [gguf_inference.py] - Integrates caching into [GGUFInference.ask_question()] to check for cached responses before generating new ones.

Note
: Cache keys are normalised to ignore extra whitespace/casing
. If persistence is enabled via [cache_file], the cache can survive restarts

…gar-AI requests

… output and prevent TTS regressions.

Soniyakmt added 2 commits May 12, 2026 11:12

feat : : implement session-based local caching to reduce redundant Su…

3de208c

…gar-AI requests

feat : Add language-specific pronunciation test cases to validate G2P…

f163eae

… output and prevent TTS regressions.

Provide feedback