Lexicon

Search our off-the-shelf datasets.

Chinese Wuhanhua Pronunciation Lexicon
This Chinese Wuhanhua Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Chinese Wuhan dialect as spoken in Wuhan City, Hubei Province of China. With 113,334 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular PINYIN phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Chinese Xi’anhua Pronunciation Lexicon
This Chinese Xi'anhua Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Chinese Xi'an dialect as spoken in Xi'an City, Shaanxi Province of China. With 55,645 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular DPY phonemic system. Additionally, each entry is classified by category, providing added value to the dataset. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Chinese Zhengzhouhua Pronunciation Lexicon
This Chinese Zhengzhouhua Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Chinese Zhengzhou dialect as spoken in Zhengzhou City, Henan Province of China. With 55,129 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular DPY phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Chinese-Korean Phoneme Lexicon
This Chinese-Korean Phoneme Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Chinese Mandarin language as spoken in China and the Korean language as spoken in Korea. With 7,000 meticulously crafted entries and an impressive 97.00% entry accuracy rate, this lexicon covers 5,000 frequently used Chinese characters and their corresponding Korean Hangul, and provides accurate pronunciation transcription in the popular RR phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Czechia Czech Pronunciation Lexicon
This Czechia Czech Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Czech language as spoken in Czechia. With 107,942 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Denmark Danish Pronunciation Lexicon
This Denmark Danish Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Danish language as spoken in Denmark. With 103,287 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
English-Korean Phoneme Lexicon
This English-Korean Phoneme Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the English language as spoken in the United States and the Korean language as spoken in Korea. With 80,010 meticulously crafted entries and an impressive 97.00% entry accuracy rate, this lexicon covers over 80,000 English words and their corresponding Korean Hangul, and provides accurate pronunciation transcription in the popular RR phonemic system. Additionally, each entry is classified by category, providing added value to the dataset. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Eritrea Tigrinya Pronunciation Lexicon
This Eritrea Tigrinya Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Tigrinya language as spoken in Eritrea. With 30,001 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Estonia Estonian Pronunciation+POS Lexicon
This Estonia Estonian Pronunciation+POS Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Estonian language as spoken in Estonia. With 119,272 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. Additionally, the lexicon is POS annotated for each entry, providing added value to the dataset. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.

