This dataset was recorded in a car noise environment with 40 speakers: 20 male and 20 female. All speakers were professionally screened to ensure standard pronunciation and clear enunciation. The recording scripts cover a range of topics, including navigation, text messages, and media information.