Teacher-forced confidence as a false-belief sensor for language models.
nlp machine-learning transformers pytorch model-evaluation confidence teacher-forcing mandela-effect llm-interpretability false-beliefs
-
Updated
Mar 3, 2026 - Python