Open Datasets: GigaSpeech 2 – 30,000 Hours of Southeast Asian Multilingual Speech Recognition Open Source Dataset Released
The term “Giga” originates from “gigantic,” reflecting the vast audio resources available on the internet. However, the quality of these audio resources varies significantly, and high-quality audio-text pairs are particularly scarce and expensive to annotate, especially for low-resource languages. GigaSpeech, a highly successful open-source English dataset, addresses this issue by providing thousands of hours of […]
Dataocean AI New Datasets – December
The new dataset from Dataocean AI for December is here! This release includes datasets in speech recognition, speech synthesis, multimodal learning, and more, designed to support the training of multimodal large models. Developers can easily overcome data bottlenecks and efficiently improve model performance. Professional Scenario Text-Image Pair Dataset 🔥 Product Features: Includes images taken […]
Dataocean AI: An Expert in Content Moderation for a Safe and Reliable Network Environment
With the popularization and development of technology, fake information and illegal content are more easily disseminated, and harmful content has become more complex and subtle, while the demand for content safety is more urgent than ever. A comprehensive compliance solution includes reviewing various content modalities such as text, images, audio, and video, and identifying deceptive […]
Dataocean AI New Datasets – September
We are excited to announce our NEW arrivals, including Accented English Speech Recognition Corpus from 60+ Countries, Accented Spanish Speech Recognition Corpus from 8 Countries, India Multilingual Speech Recognition Corpus, Spontaneous Dialogue Speech Synthesis Corpus, Multi-Style, Multi-Tone Speech Synthesis Corpus. These high-quality AI training datasets will help enhance model performance, allowing AI products to meet […]
Chinese Continuous Visual Speech Recognition Challenge Workshop 2024 Has Concluded Successfully
On the morning of August 16th, the Chinese Continuous Visual Speech Recognition Challenge Workshop 2024 (CNVSRC Workshop 2024) was held at the 19th National Conference on Man-Machine Speech Communication (NCMMSC 2024) in Urumqi,China. The workshop includes CNVSRC 2024 introduction, address, rank announcement, technical report and system description sharing. The workshop is a forum to exchange ideas regarding Chinese large […]
Hi-Scene’s Dataset of Over 50,000 Sports Videos Enhances AI Referees’ Precision in Capturing Thrilling Moments
The Paris Olympics have been in full swing since July 26. As a premier international stage representing fairness and justice, every score in the Olympics holds significant importance for the athletes’ futures. With the advancement of AI technology, AI is playing an increasingly vital role in assisting with event judgments and decisions. Alibaba Cloud: […]
How AthleteGPT is Perfectly Prepared for the Paris Olympics: The Technology Behind the Success
At this year’s Paris Olympics, approximately 10,000 athletes from over 200 countries and regions will compete with determination for the Olympic spirit and their dreams. “AthleteGPT,” a powerful intelligent voice assistant, will assist athletes and staff from different countries and regions, enabling smoother communication and interaction throughout this summer event. How do I get to […]