Chinese Continuous Visual Speech Recognition Challenge 2024
Initiated by the NCMMSC 2024 Organizing Committee and jointly hosted by Tsinghua University, Beijing University of Posts and Telecommunications, Speech Home, and Dataocean AI, the second Chinese Continuous Visual Speech Recognition Challenge (CNVSRC 2024) kicks off today. We sincerely invite you to register and participate. Event Introduction: Visual speech recognition, also known as lip reading, is […]
Essential Data for Training Large Speech Foundation Models
OpenAI recently delivered another breakthrough in speech technology. Given a text input and a 15-second audio sample, it can generate speech that sounds both natural and remarkably similar to the original voice. What’s particularly impressive is that even with a small model, a 15-second sample is enough to create […]
Multilingual ASR Parallel Corpora Empower Smart Education
AI-assisted smart education has become a key driver of learning efficiency and teaching quality. Language learning is not only a means of acquiring knowledge but also a way to meet people's growing communication needs. Moreover, it serves as a bridge for connecting with the world and understanding different cultures. At the application level, people can […]
Dataocean AI Unveils NEW Brand, NEW Site, and NEW Multilingual Speech Corpus for Speech Foundation Models at ICASSP 2024
Dataocean AI announced a brand upgrade at ICASSP. The new brand, characterized by a dynamic gradient design, symbolizes the company’s unwavering commitment to excellence and innovation in the AI data industry. With almost 20 years of experience in the AI data industry, Dataocean AI will continue to innovate, disrupt the industry with technology, and […]
Unleashing Data Potential: Sora Leads a New Era
Sora is an AI video synthesis model trained on large-scale video data that can create realistic and imaginative scenes from text instructions. OpenAI is teaching artificial intelligence to understand and simulate the physical world in motion, with the goal of training models to help people solve problems that require interaction with […]
Google Open Sources Lite Version of Gemini – Gemma
After OpenAI's continuous promotion of its own models and their impressive results, Google has finally fired its first shot in open-source large models. Recently, Google introduced the Gemma series, a powerful and lightweight open-source large model. Gemma adopts the same technology as Gemini and belongs to the […]