Skip to content

Commit acf9bfa

Browse files
committed
Update documentation
0 parents  commit acf9bfa

File tree

176 files changed

+15649
-0
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

176 files changed

+15649
-0
lines changed

.buildinfo

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,4 @@
1+
# Sphinx build info version 1
2+
# This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done.
3+
config: 49ae333cedb3a6b39daf109ee177130d
4+
tags: 645f666f9bcd5a90fca523b33c5a78b7

.nojekyll

Whitespace-only changes.
Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
# Feature Importance Scoring methods
2+
TBA, check back soon!

_sources/index.md

Lines changed: 29 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,29 @@
1+
# Interpretability Techniques for Speech Models
2+
3+
Pre-trained foundation models have revolutionized speech technology like many other adjacent fields. The combination of their capability and opacity has sparked interest in researchers trying to interpret the models in various ways. While interpretability in fields such as computer vision and natural language processing has made significant progress towards understanding model internals and explaining their decisions, speech technology has lagged behind despite the widespread use of complex, black-box neural models. Recent studies have begun to address this gap, marked by a growing body of literature focused on interpretability in the speech domain. This tutorial provides a structured overview of interpretability techniques, their applications, implications, and limitations when applied to speech models, aiming to help researchers and practitioners better understand, evaluate, debug, and optimize speech models while building trust in their predictions. In hands-on sessions, participants will explore how speech models encode distinct features (e.g., linguistic information) and utilize them in their inference. By the end, attendees will be equipped with the tools and knowledge to start analyzing and interpreting speech models in their own research, potentially inspiring new directions.
4+
5+
```{note}
6+
We will present our tutorial about _Interpretability Techniques for Speech Models_ on **Sunday, August 17th** at this year's Interspeech conference in Rotterdam. <br> Check out the [preliminary programme](#interspeech-programme) below, and sign up through the [Interspeech registration form](https://www.interspeech2025.org/registration)!
7+
```
8+
9+
(interspeech-programme)=
10+
## Preliminary programme at Interspeech 2025
11+
12+
> **Introduction** to challenges of speech data for interpretability research <br>
13+
> [Grzegorz Chrupała](https://grzegorz.chrupala.me/)
14+
15+
> Tutorial on **Representational Analysis methods** for speech model interpretability, including: <br> Probing{cite}`bentumProcessingStressEndEnd2024,shen-etal-2024-encoding,cormacenglishDomainInformedProbingWav2vec2022`, Representational Similarity Analyses (RSA{cite}`chrupala-etal-2020-analyzing,shenWaveSyntaxProbing2023a`, CCA{cite}`Pasad2021`, CKA{cite}`pmlr-v97-kornblith19a`), <br> CTC & Decoder lenses{cite}`deheerklootsHumanlikeLinguisticBiases2024,langedijkDecoderLensLayerwiseInterpretation2024`, ABX tests{cite}`schatz2016,algayresDPParseFindingWord2022,seysselDiscriminatingFormMeaning2025` <br>
16+
> [Martijn Bentum](https://www.ru.nl/personen/bentum-m), [Charlotte Pouw](https://www.illc.uva.nl/People/Table/person/5440/Charlotte-Pouw)
17+
18+
> Tutorial on **Feature Importance Scoring methods** for speech model interpretability, including: <br> Context-mixing (Attention{cite}`pengWordDiscoveryVisually2022a,a-shams-etal-2024-uncovering`, Value-Zeroing{cite}`mohebbiHomophoneDisambiguationReveals2023a`), <br> Feature attribution{cite}`fucciExplainabilitySpeechModels2024` (Gradient-based{cite}`prasadHowAccentsConfound2020,guptaPhonemeDiscretizedSaliency2024` and Perturbation-based{cite}`wuExplanationsforASR2023,pastor-etal-2024-explaining`) <br>
19+
> [Hosein Mohebbi](https://hmohebbi.github.io/), [Gaofei Shen](https://www.gaofeishen.com/)
20+
21+
> **Outlook** on future work in interpretability, **Discussion** of key takeaways and findings <br>
22+
> [Marianne de Heer Kloots](https://mdhk.net/), [Tom Lentz](https://www.tilburguniversity.edu/staff/t-o-lentz), [Willem Zuidema](https://staff.fnwi.uva.nl/w.zuidema/)
23+
24+
## References
25+
26+
```{bibliography}
27+
:style: unsrt
28+
```
29+
<div>
Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
# Feature Importance Scoring methods
2+
TBA, check back soon!

_sources/interspeech2025/intro.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
# TBA, check back soon!
Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
# Representational Analysis methods
2+
TBA, check back soon!

_sources/intro.md

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
# Interpretability Techniques for Speech Models
2+
3+
Pre-trained foundation models have revolutionized speech technology like many other adjacent fields. The combination of their capability and opacity has sparked interest in researchers trying to interpret the models in various ways. While interpretability in fields such as computer vision and natural language processing has made significant progress towards understanding model internals and explaining their decisions, speech technology has lagged behind despite the widespread use of complex, black-box neural models. Recent studies have begun to address this gap, marked by a growing body of literature focused on interpretability in the speech domain. This tutorial provides a structured overview of interpretability techniques, their applications, implications, and limitations when applied to speech models, aiming to help researchers and practitioners better understand, evaluate, debug, and optimize speech models while building trust in their predictions. In hands-on sessions, participants will explore how speech models encode distinct features (e.g., linguistic information) and utilize them in their inference. By the end, attendees will be equipped with the tools and knowledge to start analyzing and interpreting speech models in their own research, potentially inspiring new directions.
4+
5+
```{note}
6+
We will present our tutorial about _Interpretability Techniques for Speech Models_ on **Sunday, August 17th** at this year's Interspeech conference in Rotterdam. <br> Check out the [preliminary programme](#preliminary-programme) below, and sign up through the [Interspeech registration form](https://www.interspeech2025.org/registration)!
7+
```
8+
9+
## Preliminary programme at Interspeech 2025
10+
11+
> **Introduction** to challenges of speech data for interpretability research <br>
12+
> [Grzegorz Chrupała](https://grzegorz.chrupala.me/)
13+
14+
> Tutorial on **Representational Analysis methods** for speech model interpretability <br> incl. Probing, Representational Similarity Analysis, CTC & Decoder lenses, ABX tests <br>
15+
> [Martijn Bentum](https://www.ru.nl/personen/bentum-m), [Charlotte Pouw](https://www.illc.uva.nl/People/Table/person/5440/Charlotte-Pouw)
16+
17+
> Tutorial on **Feature Importance Scoring methods** for speech model interpretability <br> incl. Context-mixing (Attention, Value-Zeroing) and Feature attribution (Gradient- and Perturbation-based methods) <br>
18+
> [Hosein Mohebbi](https://hmohebbi.github.io/), [Gaofei Shen](https://www.gaofeishen.com/)
19+
20+
> **Outlook** on future work in interpretability, **Discussion** of key takeaways and findings <br>
21+
> [Marianne de Heer Kloots](https://mdhk.net/), [Tom Lentz](https://www.tilburguniversity.edu/staff/t-o-lentz), [Willem Zuidema](https://staff.fnwi.uva.nl/w.zuidema/)

_sources/markdown-notebooks.ipynb

Lines changed: 85 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,85 @@
1+
{
2+
"cells": [
3+
{
4+
"cell_type": "markdown",
5+
"id": "b5ae153c",
6+
"metadata": {},
7+
"source": [
8+
"# Notebooks with MyST Markdown\n",
9+
"\n",
10+
"Jupyter Book also lets you write text-based notebooks using MyST Markdown.\n",
11+
"See [the Notebooks with MyST Markdown documentation](https://jupyterbook.org/file-types/myst-notebooks.html) for more detailed instructions.\n",
12+
"This page shows off a notebook written in MyST Markdown.\n",
13+
"\n",
14+
"## An example cell\n",
15+
"\n",
16+
"With MyST Markdown, you can define code cells with a directive like so:"
17+
]
18+
},
19+
{
20+
"cell_type": "code",
21+
"execution_count": null,
22+
"id": "19b2d938",
23+
"metadata": {},
24+
"outputs": [],
25+
"source": [
26+
"print(2 + 2)"
27+
]
28+
},
29+
{
30+
"cell_type": "markdown",
31+
"id": "a2b7492b",
32+
"metadata": {},
33+
"source": [
34+
"When your book is built, the contents of any `{code-cell}` blocks will be\n",
35+
"executed with your default Jupyter kernel, and their outputs will be displayed\n",
36+
"in-line with the rest of your content.\n",
37+
"\n",
38+
"```{seealso}\n",
39+
"Jupyter Book uses [Jupytext](https://jupytext.readthedocs.io/en/latest/) to convert text-based files to notebooks, and can support [many other text-based notebook files](https://jupyterbook.org/file-types/jupytext.html).\n",
40+
"```\n",
41+
"\n",
42+
"## Create a notebook with MyST Markdown\n",
43+
"\n",
44+
"MyST Markdown notebooks are defined by two things:\n",
45+
"\n",
46+
"1. YAML metadata that is needed to understand if / how it should convert text files to notebooks (including information about the kernel needed).\n",
47+
" See the YAML at the top of this page for example.\n",
48+
"2. The presence of `{code-cell}` directives, which will be executed with your book.\n",
49+
"\n",
50+
"That's all that is needed to get started!\n",
51+
"\n",
52+
"## Quickly add YAML metadata for MyST Notebooks\n",
53+
"\n",
54+
"If you have a markdown file and you'd like to quickly add YAML metadata to it, so that Jupyter Book will treat it as a MyST Markdown Notebook, run the following command:\n",
55+
"\n",
56+
"```\n",
57+
"jupyter-book myst init path/to/markdownfile.md\n",
58+
"```"
59+
]
60+
}
61+
],
62+
"metadata": {
63+
"jupytext": {
64+
"formats": "md:myst",
65+
"text_representation": {
66+
"extension": ".md",
67+
"format_name": "myst",
68+
"format_version": 0.13,
69+
"jupytext_version": "1.11.5"
70+
}
71+
},
72+
"kernelspec": {
73+
"display_name": "Python 3",
74+
"language": "python",
75+
"name": "python3"
76+
},
77+
"source_map": [
78+
13,
79+
25,
80+
27
81+
]
82+
},
83+
"nbformat": 4,
84+
"nbformat_minor": 5
85+
}

_sources/markdown-notebooks.md

Lines changed: 53 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,53 @@
1+
---
2+
jupytext:
3+
formats: md:myst
4+
text_representation:
5+
extension: .md
6+
format_name: myst
7+
format_version: 0.13
8+
jupytext_version: 1.11.5
9+
kernelspec:
10+
display_name: Python 3
11+
language: python
12+
name: python3
13+
---
14+
15+
# Notebooks with MyST Markdown
16+
17+
Jupyter Book also lets you write text-based notebooks using MyST Markdown.
18+
See [the Notebooks with MyST Markdown documentation](https://jupyterbook.org/file-types/myst-notebooks.html) for more detailed instructions.
19+
This page shows off a notebook written in MyST Markdown.
20+
21+
## An example cell
22+
23+
With MyST Markdown, you can define code cells with a directive like so:
24+
25+
```{code-cell}
26+
print(2 + 2)
27+
```
28+
29+
When your book is built, the contents of any `{code-cell}` blocks will be
30+
executed with your default Jupyter kernel, and their outputs will be displayed
31+
in-line with the rest of your content.
32+
33+
```{seealso}
34+
Jupyter Book uses [Jupytext](https://jupytext.readthedocs.io/en/latest/) to convert text-based files to notebooks, and can support [many other text-based notebook files](https://jupyterbook.org/file-types/jupytext.html).
35+
```
36+
37+
## Create a notebook with MyST Markdown
38+
39+
MyST Markdown notebooks are defined by two things:
40+
41+
1. YAML metadata that is needed to understand if / how it should convert text files to notebooks (including information about the kernel needed).
42+
See the YAML at the top of this page for example.
43+
2. The presence of `{code-cell}` directives, which will be executed with your book.
44+
45+
That's all that is needed to get started!
46+
47+
## Quickly add YAML metadata for MyST Notebooks
48+
49+
If you have a markdown file and you'd like to quickly add YAML metadata to it, so that Jupyter Book will treat it as a MyST Markdown Notebook, run the following command:
50+
51+
```
52+
jupyter-book myst init path/to/markdownfile.md
53+
```

0 commit comments

Comments
 (0)