This dataset was recorded in a quiet office environment by 40 speakers, evenly divided between male and female (20 each). All speakers were professionally selected for standard pronunciation and clear articulation. The recorded text consists of sentences drawn from daily conversations and news.