Generate training data code #7

GabrielHaoHao · 2024-04-07T03:18:29Z

Hello! Thank you very much for your work! I am currently facing an issue. The paper "Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting" mentions that the training set contains approximately 8000k phrases. However, when I use your test-set splitting strategy to divide the data into 100h and 360h subsets, I only obtain a dataset close in size to the test set. This is obviously incorrect, so I would really like to know what the training-set splitting strategy is. Looking forward to your reply.
