Wav2vec2Interpretation

Scripts and additional images for article "Investigating wav2vec2 context representations and the effects of fine-tuning, a case-study of a Finnish model"

Umap visualization of the embeddings

Visualization of the CNN outputs

You can find the visualizations of the embeddings produced by the CNN component in pics/cnn_*.svg

Visualization of the pretrained Transformer's outputs

You can find the high res visualizations of the embeddings produced by the pre-trained Transformer component in pics/pre_*.[svg/eps]

Visualization of the finetuned Transformer's outputs

You can find the high res visualizations of the embeddings produced by the fine-tuned Finnish Transformer component in pics/fine_*.[svg/eps]

Gender, age and speaker information

Pictures marked with utt2age show how the age information is embedded in the models, utt2speaker files demonstrate how well the models could differentiate between speakers and utt2gender visualizes the gender information in the embeddings

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
data		data
pics		pics
scripts		scripts
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wav2vec2Interpretation

Umap visualization of the embeddings

Visualization of the CNN outputs

Visualization of the pretrained Transformer's outputs

Visualization of the finetuned Transformer's outputs

Gender, age and speaker information

About

Releases

Packages

Languages

License

aalto-speech/Wav2vec2Interpretation

Folders and files

Latest commit

History

Repository files navigation

Wav2vec2Interpretation

Umap visualization of the embeddings

Visualization of the CNN outputs

Visualization of the pretrained Transformer's outputs

Visualization of the finetuned Transformer's outputs

Gender, age and speaker information

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages