This dataset was recorded in noisy environments such as cafes, restaurants, and streets. A total of 54 speakers participated: 27 male and 27 female. All speakers were professionally selected to ensure standard pronunciation and clear articulation. The recorded text covers content from sources such as news and Twitter.
This dataset was recorded in quiet office and home environments. A total of 200 speakers participated: 123 male and 77 female. All speakers were professionally screened to ensure standard pronunciation and clear articulation. The recorded text covers content such as news.