All Datasets

Search our off-the-shelf datasets.

Filter by
Category
Category
Conference Video Action Collection Corpus
Product Features: Captured in a conference setting, with participants maintaining a neutral facial expression throughout, slowly walking around the room, without side glances or looking up/down, and with faces unobstructed. Each participant records 1–2 sets of videos (standing/sitting) using four cameras simultaneously (Logitech Rally, Aver CAM 550, Yealink UVC86, Poly E60). Additionally, two sets of photos are collected: one taken on the day using a computer, and one personal photo taken within the past two years using a phone. Ethnicities Collected: Black, White, Asian (non-Chinese), Brown. Age Range: All age groups, with balanced gender ratio. Video/Image Specifications: Video resolution 1080P or higher, photo resolution 720P; each video is approximately 1 minute long.
Android Front-Facing Multi-Skin-Tone Face Collection Corpus
Product Features: Captured using the front-facing camera of Oppo series smartphones released after 2023. Models maintain eye contact with the camera, with a shooting distance of 20–50 cm (selfie unlock distance). Includes six lighting conditions (normal light, side light, backlight, front light, low light, warm light). For each participant: at least 15 photos for Black, White, and Asian (Chinese) individuals; at least 25 photos for Brown individuals. The dataset includes 25 pairs/groups of twins or look-alike individuals. Ethnicities Collected: Black, White, Asian (Chinese), Brown. Age Range: All adults. Image Specifications: 1080P resolution or higher.
AIGC Image Corpus – Portrait Category
Product Features: AIGC-generated portrait data covering four styles—3D cartoon, comic, watercolor, and sketch—with each style including four ethnicities. The original portrait images are shared across styles, with about 10 images generated per person for each style. Ethnicities Collected: Black, White, Asian (non-Chinese), and Brown. Age Range: All adults, with a balanced gender ratio. Image Specifications: Resolution of 1080P or higher.
Wu Chinese Single-Speaker Free Talk Speech Synthesis Corpus
Premium Chinese Female Voice Speech Synthesis Corpus (3 Speakers)
Features: Agent-style duplex conversation; the main speaker acts as the Agent and engages in multi-turn dialogues with companion voices in scenarios such as emotional companionship and daily Q&A. Emotion: Both main and companion voices are annotated with 28 emotions—including neutral, joy, anger, sadness, fear, surprise, disgust, etc.—with clause-level emotion tagging.
Standard Arabic Average Timbre Voice Speech Synthesis Corpus – Free Talk
Single-Host Podcast Style
Standard Arabic Male Voice Speech Synthesis Corpus
Covers news, dialogues, encyclopedic content, time, date, menus, hotels, and English abbreviations, suitable for multi-scenario, multi-domain speech synthesis.
Standard Arabic Female Voice Speech Synthesis Corpus – Natural Style
Covers daily conversations, news, travel, table/manual introductions, and mixed Arabic-English content, suitable for multi-scenario, multi-domain speech synthesis.
Standard Arabic Male Voice Speech Synthesis Corpus – Natural Style
Covers daily conversations, news, travel, table/manual introductions, and mixed Arabic-English content, suitable for multi-scenario, multi-domain speech synthesis.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Category
Category