All Datasets

Search our off-the-shelf datasets.

Filter by

ASR

TTS

Model Evaluaion Report

NLP

Lexicon

Machine Translation

OCR

Multimodal

Aesthetic Composition Training Corpus

Images are captured by professional photographers. Composition types include rule-of-thirds, horizontal, diagonal, triangular, and central composition. All images are evaluated and annotated by personnel with high aesthetic standards. Each image meets at least one composition type and at most three composition types.

Handheld Object Portrait Corpus

Data collection covers both indoor and outdoor environments, including offices, meeting rooms, parking lots, gardens, and other common work and daily-life scenarios. Lighting conditions include normal lighting, low-light, and backlit scenarios commonly encountered in real-world settings. Twenty video clips per participant, with each clip corresponding to the presentation of a single object and accompanied by three close-up images of the object from different angles. The subject appears in upper-body or full-body views, holding one object with one or both hands, recorded in standing or seated postures. The object is moved according to predefined actions. Each video clip includes a brief verbal description of the object provided by the model. The subject is clearly visible, and the face is not occluded for extended periods during recording.

American English Female Speech Synthesis Corpus – Energetic Sweet Voice (F404-50)

Female American English Mature Voice

American English Female Speech Synthesis Corpus – Mature Voice (F404-47)

Female American English Mature Voice

Chinese-English Code-Mixed Female Speech Synthesis Corpus (Inspirational Content)

Chinese Male Voice Character

All Datasets

Filter by

Aesthetic Composition Training Corpus

Handheld Object Portrait Corpus

American English Female Speech Synthesis Corpus – Energetic Sweet Voice (F404-50)

American English Female Speech Synthesis Corpus – Mature Voice (F404-47)

Chinese-English Code-Mixed Female Speech Synthesis Corpus (Inspirational Content)

Chinese Male Voice Character Imitation Speech Synthesis Corpus – Rulai Fozu

Chinese Male Voice Character Imitation Speech Synthesis Corpus – Pao Xiaoge

Taiwanese English Conversational Speech Recognition corpus (Telephone)

Northern Irish English Conversational Speech Recognition Corpus (Mobile)

Get started

Join our newsletter to stay updated

Filter by