You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I recently downloaded this dataset, and noticed that the 10min and 1h subsets are of equal size (in number of utterances).
Both account to 1,571 lines of phonetic transcriptions.
Fetching the corresponding audios results in two sets that are 05:29:37 long (HH:MM:SS).
I'm guessing this is a mistake? :)
The text was updated successfully, but these errors were encountered:
I have the same issue. Moreover, loading the instances in Python I get the exact same filelists in both files, so the files are identical.
EDIT:
It seems like the 1h folder is split into 6 sub-folders, 10 mins each. So by taking all paths of any of those sub-folders you would have the 10mins of data, and by taking the entire subfolder you would have the 1h. So you could rebuild the .txt files using the established directory structure.
Hello,
I recently downloaded this dataset, and noticed that the 10min and 1h subsets are of equal size (in number of utterances).
Both account to 1,571 lines of phonetic transcriptions.
Fetching the corresponding audios results in two sets that are 05:29:37 long (HH:MM:SS).
I'm guessing this is a mistake? :)
The text was updated successfully, but these errors were encountered: