All Datasets

Search our off-the-shelf datasets.

Filter by

ASR

TTS

Model Evaluaion Report

NLP

Lexicon

Machine Translation

OCR

Multimodal

840 Person Image Collection by Front-facing Camera and Face 21Points Labeling

Image collection Yellow skin Camera

Accented English Pronunciation Evaluation Corpus (Word Level)

This dataset was recorded in a quiet office/home environment, with the participation of 22 speakers, including 11 males and 11 females. All speakers involved in the recording were professionally selected to ensure standard pronunciation and clear articulation, and they were scored on a word-by-word basis by linguists.

Education and learning Smart search Speech recognition

Advertising and Marketing-Female voice

Labeled: Pronunciation & Rhythm

E-education Advertising，Marketing

Advertising and Marketing-Male Voice

Labeled: Pronunciation & Rhythm

Advertising，Marketing，E-education

Aesthetic Composition Training Corpus

Images are captured by professional photographers. Composition types include rule-of-thirds, horizontal, diagonal, triangular, and central composition. All images are evaluated and annotated by personnel with high aesthetic standards. Each image meets at least one composition type and at most three composition types.

Aesthetics Video Corpus

Video Collection

Afghanistan Dari Pronunciation Lexicon

This Afghanistan Dari Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Dari language as spoken in Afghanistan. With 30,075 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.

Speech recognition speech synthesis Dari

Afghanistan Pashto Pronunciation Lexicon

This Afghanistan Pashto Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Pashto language as spoken in Afghanistan. With 50,170 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.

Speech recognition speech synthesis Pashto

Afrikaans Speech Recognition Corpus (Mobile)

This dataset was recorded in a quiet home environment, with the participation of 125 speakers, including 124 males and 1 female. All speakers involved in the recording were professionally selected to ensure standardized pronunciation and clear enunciation. The recorded texts cover news, everyday conversations, and other information.

Education and learning Smart search Speech recognition

All Datasets

Filter by

840 Person Image Collection by Front-facing Camera and Face 21Points Labeling

Accented English Pronunciation Evaluation Corpus (Word Level)

Advertising and Marketing-Female voice

Advertising and Marketing-Male Voice

Aesthetic Composition Training Corpus

Aesthetics Video Corpus

Afghanistan Dari Pronunciation Lexicon

Afghanistan Pashto Pronunciation Lexicon

Afrikaans Speech Recognition Corpus (Mobile)

Get started

Join our newsletter to stay updated

Filter by