Tutorials for the software project on mechanistic interpretability (WS24/25).

tanikina/mi-tutorials


Model Interpretability with Captum and Logit Lens @ UdS WS 2024/2025

🧭 Tutorial Roadmap


Main Path Notebooks

| Topic | Keywords | Jupyter Notebook | Colab Link |
|---|---|---|---|
| Interpreting LLMs for text generation | Llama, Shapley values, Integrated gradients | LLM_Attribution_with_Llama | Open in Colab |
| Interpreting BERT QA models | BERT, embeddings, attention attributions | BERT_QA_Interpretability | Open in Colab |
| Interpreting BERT QA models | BERT, attention matrices, importance scores | BERT_QA_Interpretability2 | Open in Colab |
| Logit Lens Example | Logit Lens | LogitLens | Open in Colab |
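To give a feel for the gradient-based attribution methods covered in the notebooks above, here is a minimal, library-free sketch of Integrated Gradients. A toy differentiable function with a hand-written gradient stands in for a real model (the notebooks use Captum's `IntegratedGradients` with autograd instead); the function `f`, the baseline, and the step count are illustrative choices, not part of the tutorials.

```python
# Toy stand-in for a model output: f(x) = x0^2 + 3*x1.
def f(x):
    return x[0] ** 2 + 3 * x[1]

def grad_f(x):
    # Analytic gradient of f; a real pipeline computes this via autograd.
    return [2 * x[0], 3.0]

def integrated_gradients(x, baseline, steps=100):
    """Approximate IG by averaging gradients along the straight-line
    path from `baseline` to `x` (midpoint Riemann sum)."""
    n = len(x)
    avg_grad = [0.0] * n
    for k in range(1, steps + 1):
        alpha = (k - 0.5) / steps
        point = [baseline[i] + alpha * (x[i] - baseline[i]) for i in range(n)]
        g = grad_f(point)
        for i in range(n):
            avg_grad[i] += g[i] / steps
    # Scale by (input - baseline); by the completeness axiom the
    # attributions sum to f(x) - f(baseline) up to discretization error.
    return [(x[i] - baseline[i]) * avg_grad[i] for i in range(n)]

attr = integrated_gradients([2.0, 1.0], [0.0, 0.0])
print(attr)            # per-feature attributions
print(sum(attr))       # ≈ f(x) - f(baseline) = 7.0
```

The same recipe applies to token attribution in the Llama notebook: each input dimension becomes a token embedding, and the baseline is typically a pad/zero embedding.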

Mini Project on Token Attribution

| Notebook | Colab Link |
|---|---|
| Mini Project | Open in Colab |
| Mini Project (with solution) | Open in Colab |

