This dataset was recorded in both quiet and noisy environments, with the participation of 1,000 speakers, consisting of 500 males and 500 females. All speakers involved in the recordings were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts encompass information such as news updates, everyday conversations, and text messages.