This dataset covers 29,954 dialect speakers from 26 provinces in China, ranging in age from 12 to 75, with a total recording time of 34,073 hours and an average recording duration of nearly 60 minutes, maintaining a balanced gender ratio. The topics covered are very extensive, including news, text messages, vehicle control, music, general, maps, daily colloquial speech, family, health, travel, work, socializing, celebrities, weather, and other common life topics.
This dataset was recorded in a quiet office/home environment, with the participation of 200 speakers, including 123 males and 77 females. All speakers who took part in the recording were professionally screened to ensure standardized pronunciation and clear articulation. The recorded text materials cover information such as news.