Question
Hi Experts, I'm checking my answers for my assignment and I don't know if I'm on the right track. Could you give me some advice? Thanks.
3. A team of researchers describe their plans to collect a corpus in the following way: "Our purpose is to collect a speech corpus of international English that can be used to train an ASR system for transcribing doctors' spoken comments as they visit their patients. We plan on collecting speech from male and female physicians based in North America. Participants will be asked to read from freely available children's literature as their speech is recorded using a webcam microphone. To save time and avoid participant dropout, they will not fill out a demographic questionnaire. Professional annotators will annotate the recordings with orthographic transcriptions and part-of-speech tags in a spreadsheet application. The annotations will later be used to train acoustic and language models for the ASR." Describe three issues with the way the research team plans to collect the data. [3 points]
1: Limited generalizability in data collection. The stated goal is a corpus of international English, but the plan records only physicians based in North America. The corpus therefore cannot represent the diversity of accents and speech patterns the Automatic Speech Recognition (ASR) system is meant to handle. A system trained on it is likely to be biased toward North American speech and to generalize poorly to other speakers.
2: Limited source of speech data. Participants read only from freely available children's literature. This gives the corpus a narrow range of speech that does not match the target use: children's stories contain little or no medical vocabulary, and reading aloud differs from the comments doctors speak while visiting patients. Models trained on this material are therefore likely to transcribe healthcare professionals poorly in practical settings.
3: Quality of speech data affected by the equipment. The recordings are made with a webcam microphone. If this equipment is of poor quality, the resulting speech data may be noisy, which makes it harder to train an ASR system that transcribes speech accurately.