# cantone **Repository Path**: hf-datasets/cantone ## Basic Information - **Project Name**: cantone - **Description**: Mirror of https://huggingface.co/datasets/AlienKevin/cantone - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2024-11-03 - **Last Updated**: 2024-11-03 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README --- license: mit task_categories: - audio-classification language: - yue tags: - speech - cantonese - yue - syllable - pronunciation pretty_name: Cantone size_categories: - 10K ``` Apple voices are gathered using jyutping text directly and a native Cantonese ASR system is used to filter out unsupported syllables. 2. Preprocessing * All audios are converted to 16kHz WAV files * Peak normalize all audios to -20 dBFS * Clip silence at the beginning and end (sound below -50 dBFS are deemed silence) 3. Verification Occassionally, some syllables are not synthesized correctly. * Apple voices usually renders tone 5 syllables as tone 2: we remove all tone 5 syllables from apple voices * Microsoft voices prepends consonants like ng, g, and b in front of isolate vowel syllables like aa: we remove all vowel syllables from microsoft voices ## License MIT