This dataset is a specialized collection of bilingual Chinese-English speech recordings, tailored to cater to the needs of speech recognition technology development. It is characterized by its unique blend of languages, high-fidelity audio quality, and the diversity of its contributors.
This dataset was recorded in a quiet office/home environment, with the participation of 200 speakers, including 123 males and 77 females. All speakers who took part in the recording were professionally screened to ensure standardized pronunciation and clear articulation. The recorded text materials cover information such as news.