Multimodal

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
AIGC Image Corpus – Portrait Category
Product Features: AIGC-generated portrait data covering four styles—3D cartoon, comic, watercolor, and sketch—with each style including four ethnicities. The original portrait images are shared across styles, with about 10 images generated per person for each style. Ethnicities Collected: Black, White, Asian (non-Chinese), and Brown. Age Range: All adults, with a balanced gender ratio. Image Specifications: Resolution of 1080P or higher.
Android Front-Facing Multi-Skin-Tone Face Collection Corpus
Product Features: Captured using the front-facing camera of Oppo series smartphones released after 2023. Models maintain eye contact with the camera, with a shooting distance of 20–50 cm (selfie unlock distance). Includes six lighting conditions (normal light, side light, backlight, front light, low light, warm light). For each participant: at least 15 photos for Black, White, and Asian (Chinese) individuals; at least 25 photos for Brown individuals. The dataset includes 25 pairs/groups of twins or look-alike individuals. Ethnicities Collected: Black, White, Asian (Chinese), Brown. Age Range: All adults. Image Specifications: 1080P resolution or higher.
Chinese-Lao Parallel Corpus
Corpus Field: Most are inclined to fields such as news, transportation and tourism, daily life, sports and health, finance, and technology.
Chinese-Thai Parallel Corpus
Corpus Field: Most are inclined to fields such as news, transportation and tourism, daily life, sports and health, finance, and technology.
Conference Video Action Collection Corpus
Product Features: Captured in a conference setting, with participants maintaining a neutral facial expression throughout, slowly walking around the room, without side glances or looking up/down, and with faces unobstructed. Each participant records 1–2 sets of videos (standing/sitting) using four cameras simultaneously (Logitech Rally, Aver CAM 550, Yealink UVC86, Poly E60). Additionally, two sets of photos are collected: one taken on the day using a computer, and one personal photo taken within the past two years using a phone. Ethnicities Collected: Black, White, Asian (non-Chinese), Brown. Age Range: All age groups, with balanced gender ratio. Video/Image Specifications: Video resolution 1080P or higher, photo resolution 720P; each video is approximately 1 minute long.
DMS with Multi-skin color Drivers Corpus
This dataset serves as a comprehensive resource for developing and testing driver monitoring systems, specifically designed to enhance in-vehicle safety through behavioral analysis.
English-Lao Parallel Corpus
Corpus Field: Most are inclined to fields such as news, transportation and tourism, daily life, sports and health, finance, and technology.
English-Thai Parallel Corpus
Corpus Field: Most are inclined to fields such as news, transportation and tourism, daily life, sports and health, finance, and technology.
Filipino-English Parallel Corpus (Spoken)
Corpus Field: Most are inclined to fields such as news, transportation and tourism, daily life, sports and health, finance, and technology.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More