You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When accessing the first element of the commonvoice dataset, it fails with :
File "test_data.py", line 17, in test001
waveform, sample_rate, dictionnary = train_dataset[0]
File "/home/fix_jer/.local/lib/python3.6/site-packages/torchaudio/datasets/commonvoice.py", line 233, in __getitem__
return load_commonvoice_item(line, self._header, self._path, self._folder_audio)
File "/home/fix_jer/.local/lib/python3.6/site-packages/torchaudio/datasets/commonvoice.py", line 96, in load_commonvoice_item
waveform, sample_rate = torchaudio.load(filename)
File "/home/fix_jer/.local/lib/python3.6/site-packages/torchaudio/backend/sox_backend.py", line 35, in load
raise OSError("{} not found or is a directory".format(filepath))
OSError: /opt/Datasets/CommonVoice/fr_79h_2019-02-25/fr/clips/89e67e7682b36786a0b4b4022c4d42090c86edd96c78c12d30088e62522b8fe466ea4912e6a1055dfb91b296a0743e0a2bbe16cebac98ee5349e3e8262cb9329 not found or is a directory
Indead, the clip exists, but it seams to me the extension ".mp3" is missing. The filename provided to "torchaudio.load" does not contain the extension since the train.tsv file does not contain the extension.
One workaround is to edit commonvoice.py:L93, and changing :
wget https://raw.githubusercontent.com/pytorch/pytorch/master/torch/utils/collect_env.py
# For security purposes, please check the contents of collect_env.py before running it.
python collect_env.py
PyTorch version: 1.7.1
Is debug build: False
CUDA used to build PyTorch: 10.2
ROCM used to build PyTorch: N/A
Python version: 3.6 (64-bit runtime)
Is CUDA available: False
CUDA runtime version: No CUDA
GPU models and configuration: No CUDA
Nvidia driver version: No CUDA
cuDNN version: No CUDA
HIP runtime version: N/A
MIOpen runtime version: N/A
Versions of relevant libraries:
[pip3] botorch==0.1.4
[pip3] gpytorch==1.0.1
[pip3] numpy==1.17.4
[pip3] numpy-stl==1.6.2
[pip3] pytorch-complex==0.0.8
[pip3] torch==1.7.1
[pip3] torchaudio==0.7.2
[pip3] torchdiffeq==0.0.1
[pip3] torchtext==0.8.0
[pip3] torchvision==0.8.1+cpu
[conda] Could not collect
The text was updated successfully, but these errors were encountered:
🐛 Bug
When accessing the first element of the commonvoice dataset, it fails with :
Indead, the clip exists, but it seams to me the extension ".mp3" is missing. The filename provided to "torchaudio.load" does not contain the extension since the train.tsv file does not contain the extension.
One workaround is to edit commonvoice.py:L93, and changing :
To Reproduce
Steps to reproduce the behavior:
Expected behavior
After adding the ".mp3" extension in commonvoice.py , I get the expected behavior :
Environment
torchaudio.__version__
print? (If applicable) 0.7.2Please copy and paste the output from our
environment collection script
(or fill out the checklist below manually).
You can get the script and run it with:
PyTorch version: 1.7.1
Is debug build: False
CUDA used to build PyTorch: 10.2
ROCM used to build PyTorch: N/A
OS: Ubuntu 18.04.5 LTS (x86_64)
GCC version: (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0
Clang version: 6.0.0-1ubuntu2 (tags/RELEASE_600/final)
CMake version: version 3.10.2
Python version: 3.6 (64-bit runtime)
Is CUDA available: False
CUDA runtime version: No CUDA
GPU models and configuration: No CUDA
Nvidia driver version: No CUDA
cuDNN version: No CUDA
HIP runtime version: N/A
MIOpen runtime version: N/A
Versions of relevant libraries:
[pip3] botorch==0.1.4
[pip3] gpytorch==1.0.1
[pip3] numpy==1.17.4
[pip3] numpy-stl==1.6.2
[pip3] pytorch-complex==0.0.8
[pip3] torch==1.7.1
[pip3] torchaudio==0.7.2
[pip3] torchdiffeq==0.0.1
[pip3] torchtext==0.8.0
[pip3] torchvision==0.8.1+cpu
[conda] Could not collect
The text was updated successfully, but these errors were encountered: