All Datasets

Search our off-the-shelf datasets.

Filter by
Category
Category
840 Person Image Collection by Front-facing Camera and Face 21Points Labeling
Accented English Pronunciation Evaluation Corpus (Word Level)
This dataset was recorded in a quiet office/home environment, with the participation of 22 speakers, including 11 males and 11 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation, and they were scored on a word-by-word basis by linguists.
Advertising and Marketing-Female voice
Labeled: Pronunciation & Rhythm
Advertising and Marketing-Male Voice
Labeled: Pronunciation & Rhythm
Aesthetic Composition Training Corpus
Images are captured by professional photographers. Composition types include rule-of-thirds, horizontal, diagonal, triangular, and central composition. All images are evaluated and annotated by personnel with high aesthetic standards. Each image meets at least one composition type and at most three composition types.
Aesthetics Video Corpus
Afghanistan Dari Pronunciation Lexicon
This Afghanistan Dari Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Dari language as spoken in Afghanistan. With 30,075 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Afghanistan Pashto Pronunciation Lexicon
This Afghanistan Pashto Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Pashto language as spoken in Afghanistan. With 50,170 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Afrikaans Speech Recognition Corpus (Mobile)
This dataset was recorded in a quiet home environment, with the participation of 125 speakers, including 124 males and 1 female. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover news, everyday conversations, and other information.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Category
Category