Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Hi Experts, I'm checking answer for my assignemts and I don't know if I'm on the right track. Could you give me some advices? Thanks.

Hi Experts,

I'm checking answer for my assignemts and I don't know if I'm on the right track.

Could you give me some advices?

Thanks.

3.A team of researchers describe their plans to collect a corpus in the following way: Our purpose is to collect a speech corpus of international English that can be used to train an ASR system for transcribing doctors spoken comments as they visit their patients. We plan on collecting speech from male and female physicians based in North America. Participants will be asked to read from freely available childrens literature as their speech is recorded using a webcam microphone. To save time and avoid participant dropout, they will not fill out a demographic questionnaire. Professional annotators will annotate the recordings with orthographic transcriptions and Part of Speech tags in a spreadsheet application. The annotations will later be used to train acoustic and language models for the ASR. Describe three issues with the way the research team plans to collect the data. [3 points]

1: limited generalizability in data collection This data collection plan for speech corpus in the development of an Automatic Speech Recognition (ASR) system is limited to a specific group of people, namely, physicians based in North America. This poses an issue in terms of the representativeness of the speech corpus and its ability to account for the diversity of speech patterns and accents. The potential consequences of this limitation include the development of a biased ASR system with limited generalizability.

2: limited source of speech data The study design restricts the participants to solely utilizing publicly accessible children's literature as the source of speech data. This could result in a narrow range of speech in the corpus, potentially compromising its representativeness and ability to accurately transcribe speech from healthcare professionals in practical settings. The potential limitations of this limited source of speech data may negatively affect the performance of the Automatic Speech Recognition (ASR) system.

3: quality of speech data impacted by the equipment If the equipment is of poor quality, the resulting speech data may be noisy, making it difficult for the ASR system to accurately transcribe the speech.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database And Expert Systems Applications 24th International Conference Dexa 2013 Prague Czech Republic August 2013 Proceedings Part 2 Lncs 8056

Authors: Hendrik Decker ,Lenka Lhotska ,Sebastian Link ,Josef Basl ,A Min Tjoa

2013th Edition

3642401724, 978-3642401725

More Books

Students also viewed these Databases questions