Hello Nauman, and vr14:
Iβm in a similar situation; I believe Iβve followed the setup instructions diligently, but Iβm also getting the same error when running the experiment. Iβll try to enumerate my relevant state below:
OS: Ubuntu 20.04, with CUDA installed and verified working.
AMI Dataset lives at: demarco/datasets/ami
, with a tree structure of:
(py38) demarco@demarco-ubuntu:~/dev/datasets/ami$ tree . --filelimit 27
.
βββ corpus
β βββ ES2002a
β βββ audio
β β βββ ES2002a.Mix-Headset.wav
β βββ video
β βββ ES2002a.Closeup1.avi
β βββ ES2002a.Closeup2.avi
β βββ ES2002a.Closeup3.avi
β βββ ES2002a.Closeup4.avi
β βββ ES2002a.Corner.avi
β βββ ES2002a.Overhead.avi
βββ manual_annot
βββ 00README_MANUAL.txt
βββ abstractive [142 entries exceeds filelimit, not opening dir]
βββ AMI-metadata.xml
βββ argumentation
β βββ ae [376 entries exceeds filelimit, not opening dir]
β βββ ar [94 entries exceeds filelimit, not opening dir]
β βββ dis [95 entries exceeds filelimit, not opening dir]
βββ configuration
β βββ amiConfig.xml
β βββ amiSubConfig.xml
βββ corpusdoc [73 entries exceeds filelimit, not opening dir]
βββ corpusResources
β βββ meetings.xml
β βββ participants.xml
βββ decision
β βββ manual [47 entries exceeds filelimit, not opening dir]
βββ dialogueActs [695 entries exceeds filelimit, not opening dir]
βββ disfluency [160 entries exceeds filelimit, not opening dir]
βββ extractive [274 entries exceeds filelimit, not opening dir]
βββ focus [56 entries exceeds filelimit, not opening dir]
βββ handGesture [61 entries exceeds filelimit, not opening dir]
βββ headGesture [173 entries exceeds filelimit, not opening dir]
βββ LICENCE.txt
βββ manifest_1.7.html
βββ MANIFEST_MANUAL.txt
βββ movement [498 entries exceeds filelimit, not opening dir]
βββ namedEntities [468 entries exceeds filelimit, not opening dir]
βββ ontologies
β βββ ae-types.xml
β βββ ap-types.xml
β βββ ar-types.xml
β βββ chunks.xml
β βββ chunk-types.xml
β βββ da-types.xml
β βββ default-topics.xml
β βββ dis-types.xml
β βββ dsfl-types.xml
β βββ floor-types.xml
β βββ foa-targets.xml
β βββ leg-targets.xml
β βββ ne-types.xml
β βββ rse-types.xml
β βββ rsr-types.xml
β βββ subj-types.xml
β βββ you-types.xml
βββ participantRoles
β βββ ES2002d.A.role.xml
β βββ ES2002d.B.role.xml
β βββ ES2002d.C.role.xml
β βββ ES2002d.D.role.xml
β βββ ES2008b.A.role.xml
β βββ ES2008b.B.role.xml
β βββ ES2008b.C.role.xml
β βββ ES2008b.D.role.xml
β βββ ES2008d.A.role.xml
β βββ ES2008d.B.role.xml
β βββ ES2008d.C.role.xml
β βββ ES2008d.D.role.xml
β βββ ES2009d.A.role.xml
β βββ ES2009d.B.role.xml
β βββ ES2009d.C.role.xml
β βββ ES2009d.D.role.xml
β βββ IS1003d.A.role.xml
β βββ IS1003d.B.role.xml
β βββ IS1003d.C.role.xml
β βββ IS1003d.D.role.xml
βββ participantSummaries [323 entries exceeds filelimit, not opening dir]
βββ resource.xml
βββ segments [687 entries exceeds filelimit, not opening dir]
βββ topics [139 entries exceeds filelimit, not opening dir]
βββ words [687 entries exceeds filelimit, not opening dir]
βββ youUsages [63 entries exceeds filelimit, not opening dir]
30 directories, 54 files
Note I simply downloaded the Edinburgh ES 2002a meeting, selected βLow-size DivX videosβ and βHeadset mixβ.
I installed extra_requirements.txt
and updated hparams/ecapa_tdnn.yaml
as follows:
seed: 1234
__set_seed: !apply:torch.manual_seed [!ref <seed>]
# Folders
# data: http://groups.inf.ed.ac.uk/ami/download/
data_folder: /home/demarco/dev/datasets/ami/corpus/
manual_annot_folder: /home/demarco/dev/datasets/ami/manual_annot/
output_folder: results/ami/sd_ecapa_tdnn
save_folder: !ref <output_folder>/save
device: 'cuda:0'
I only changed the data_folder
and manual_annot_folder
paths, but also removed a trailing /
from output_folder
; I noted under a debugger that full_ref_rttm_file
had a double //
in its path; unclear if that could negatively affect things.
My error output is identical:
(py38) demarco@demarco-ubuntu:~/dev/when/ml/speechbrain/recipes/AMI/Diarization$ python experiment.py hparams/ecapa_tdnn.yaml
speechbrain.core - Beginning experiment!
speechbrain.core - Experiment folder: results/ami/sd_ecapa_tdnn
speechbrain.pretrained.fetching - Fetch embedding_model.ckpt: Using existing file/symlink in results/ami/sd_ecapa_tdnn/save/embedding_model.ckpt.
speechbrain.utils.parameter_transfer - Loading pretrained files for: embedding_model
__main__ - Tuning for nn (Multiple iterations over AMI Dev set)
__main__ - Diarizing dev set
0it [00:00, ?it/s]
speechbrain.core - Exception:
Traceback (most recent call last):
File "experiment.py", line 443, in <module>
best_nn = dev_nn_tuner(full_csv, "dev")
File "experiment.py", line 292, in dev_nn_tuner
concate_rttm_file = diarize_dataset(
File "experiment.py", line 229, in diarize_dataset
concate_rttm_file = out_rttm_dir + "/sys_output.rttm"
UnboundLocalError: local variable 'out_rttm_dir' referenced before assignment
Iβm not sure how to proceed. Any help you can offer would be greatly appreciated. Thank you in advance for your help and your contributions to this excellent library.