Lexicon

Search our off-the-shelf datasets.

Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More
Brazilian Portuguese Pronunciation Lexicon
This Brazilian Portuguese Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Portuguese language as spoken in Brazil. With 100,388 meticulously crafted entries and an impressive 97.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Burundi Kirundi Pronunciation Lexicon
This Burundi Kirundi Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Kirundi language as spoken in Burundi. With 30,000 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Cambodia Khmer Pronunciation Lexicon
This Cambodia Khmer Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Khmer language as spoken in Cambodia. With 101,895 meticulously crafted entries and an impressive 97.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Canadian French Pronunciation Lexicon
This Canadian French Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the French language as spoken in Canada. With 116,077 meticulously crafted entries and an impressive 97.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Chile Spanish Pronunciation Lexicon
This Chile Spanish Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Spanish language as spoken in Chile. With 123,488 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Chinese DuoYinZi Pronunciation Lexicon
This Chinese DuoYinZi (polyphone) Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Chinese Mandarin language as spoken in China. With 103,364 meticulously crafted polyphoneic entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular PINYIN phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Chinese Hakka Pronunciation Lexicon
This Chinese Hakka Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Hakka Chinese language as spoken in Meizhou City, Guangzhou Province of China. With 102,067 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular PINYIN phonemic system. Additionally, the lexicon is categorically classified for each entry, providing added value to the dataset. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Chinese Hefeihua Pronunciation Lexicon
This Chinese Hefeihua Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Chinese Hefei dialect as spoken in Hefei City, Anhui Province of China. With 56,127 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular DPY phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.
Chinese Jinanhua Pronunciation Lexicon
This Chinese Jinanhua Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Chinese Jinan dialect as spoken in Jinan City, Shandong Province of China. With 53,184 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular DPY phonemic system. Additionally, the lexicon is categorically classified for each entry, providing added value to the dataset. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.

Join our newsletter to stay updated

Thank you for signing up!

Stay informed and ahead with the latest updates, insights, and exclusive content delivered straight to your inbox.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our company.

Filter by
Filter by
Language
Filter by Languages
Language
Devices
Devices
Applicable Fields
Applicable Fields
More
Applicable Scenarios
Applicable Scenarios
More