Name: American English ASR Model Evaluation Report - DataoceanAI
SKU: King-EVA-017
Availability: InStock

American English ASR Model Evaluation Report

In this report, we will evaluate the performance of the 6 most representative PA engines, focusing on the result of accuracy, fluency, and prosody in utterance level, accuracy in word level, and accuracy in phoneme level, which are critical and commonly recognized parameters for measuring. For the following content of this report, we will first introduce the 6 engines evaluated and the features we use. Later, we will describe the data we prepared for this experiment. Next, we will illustrate the methodology and evaluation metrics used for the pronunciation assessment. Moreover, we will present the evaluation results and the analysis based on each aspect. Last but not least, this report ends with a conclusion and further discussion.

Specifications:

ID:

King-EVA-017

Language:

English

Test Data

2.12 hours, 125 speakers

Evaluation Metrics

Accuracy, fluency, and prosody Test in utterance level， Accuracy Test in word level， Accuracy Test in phoneme level

People also searched for

German TTS Model Evaluation Report

This report evaluates the performance of leading commercial text-to-speech (TTS) systems, focusing on their German audio synthesis capabilities. There are four sections in the report. Section one provides a brief introduction. In section two, we illustrate the methodology and evaluation metrics used for the Functional Metrics Evaluation and Mean Opinion Score (MOS) Evaluation. Sections three present the evaluation results and analysis. The report concludes with a discussion in section four.

French TTS Model Evaluation Report

This report evaluates the performance of leading commercial text-to-speech (TTS) systems, focusing on their France French audio synthesis capabilities. There are four sections in the report. Section one provides a brief introduction. In section two, we illustrate the methodology and evaluation metrics used for the Functional Metrics Evaluation and Mean Opinion Score (MOS) Evaluation. Sections three present the evaluation results and analysis. The report concludes with a discussion in section four.

Italian TTS Model Evaluation Report

This report evaluates the performance of leading commercial text-to-speech (TTS) systems, focusing on their Italian audio synthesis capabilities. There are four sections in the report. Section one provides a brief introduction. In section two, we illustrate the methodology and evaluation metrics used for the Functional Metrics Evaluation and Mean Opinion Score (MOS) Evaluation. Sections three present the evaluation results and analysis. The report concludes with a discussion in section four.

Spanish TTS Model Evaluation Report

This report evaluates the performance of leading commercial text-to-speech (TTS) systems, focusing on their Spain Spanish audio synthesis capabilities. There are four sections in the report. Section one provides a brief introduction. In section two, we illustrate the methodology and evaluation metrics used for the Functional Metrics Evaluation and Mean Opinion Score (MOS) Evaluation. Sections three present the evaluation results and analysis. The report concludes with a discussion in section four.