From 43e0c5df964a8fc992589bb36201427ff190a97c Mon Sep 17 00:00:00 2001 From: "Linke, Julian" <linke@tugraz.at> Date: Fri, 4 Aug 2023 15:19:44 +0200 Subject: [PATCH] Update file README.md --- README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 24cbec7..a3a8681 100644 --- a/README.md +++ b/README.md @@ -71,7 +71,7 @@ conda activate speechcodebookanalysis The file ```conda.sh``` is sourced at the beginning of ```run.sh```. -### Fairseq Repository +### Fairseq Repository (as of 27th July, 2023) You need to clone the fairseq repository to another directory (e.g., ```../fairseq```). ``` @@ -80,7 +80,7 @@ git clone https://github.com/facebookresearch/fairseq.git Make sure to modify the file ```path.sh``` in order to export the necessary environment variables. The file ```path.sh``` is also sourced at the beginning of ```run.sh```. -### Model File +### Model File (as of 27th July, 2023) **You need to download and store a model file**. In the main script (```run.sh```) you can specify the ```model_path```. This study is based on the large pretrained model **XLSR-53** which can be downloaded here: [wav2vec2](https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec) **Unfortunately loading/initializing the model with version ```fairseq 0.12.2``` lead to errors because of mismatches with respect to dictionary keys. Anyway, we provide a script (```local/create_xlsr_new.py```) which removes some dictionary keys and stores a new version of the model preventing those errors** (see also [ISSUE](https://github.com/facebookresearch/fairseq/issues/3741)). -- GitLab