Pre-processing code for Korean Speech Data with Number provided by AI Hub
This reposisotry is pre-processing code for Korean Speech Data with Number provided by AI Hub
Korean Speech Data with Number consists of more than 10,000 hours of voice data with 84 categories including numbers in Chinese characters (한자어), native words(고유어) and foreign words(외래어).
-
Modify the data_root, data_sets.. options on run.sh
-
Run 'bash run.sh' to pre-process data