
Question

I need Python code for this exercise, plus the basic idea of how you solved it.

Exercise 4 Description

A software development company has contacted you to create a speech recognition system to integrate into a Python project they are developing. Specifically, the project is an airport virtual assistant. You have to build a prototype of the application (exercise 4) that meets the following requirements:

1. The application must be written in Python.
2. Your client prefers to host the ASR software package inside the application, to avoid slowdowns or interruptions to the system if there are problems with the internet connection. In particular, they want you to base the speech recognition system on the Mozilla DeepSpeech software package.
3. The application must support language selection and be compatible with at least the following languages: English, Italian and Spanish (see document Ex4_models.pdf).
4. The airport virtual assistant will be installed in an environment that can be extremely noisy, so the speech recognition system should be configured to handle this situation. Your client gives you freedom to implement any solution (for example, configuring in Python a gain/amplification stage, a low-pass filter, or some other audio filter to improve the error rate).
5. The company has prepared a set of audio files with which you can evaluate the system. For this evaluation, you will test how well it recognises several phrases in each language. You also have to record and evaluate two short sentences (your_sentence1.wav and your_sentence2.wav). Feel free to prepare your own sentences. The output of your work should be a table with the following information, where WER (%) is the word error rate for the phrase (see Ex4_audio_files.zip):

   Language   File                 WER
   English    suitcase.wav         0%
   Spanish    maleta.wav           25%
   English    your_sentence.wav    0%
   English    your_sentence2.wav   20%

6. In this step of the project you have to build a prototype, so focus on the functionality of the application rather than on its visual design.

Ideas for further development:

1. Evaluate several automatic speech recognition (ASR) systems and produce a report with the results of the evaluation. You should recommend which ASR system the company should use or, if you do not think any of them are suitable, say so. Justify your findings with data. Important: for this task, evaluate only the English language. For maximum marks, you should evaluate at least a couple of ASR systems. We recommend that you use the offline ones shown in the course, as they have no limit on the number of requests. You can use cloud APIs if you wish, but be careful with the number of requests you make.
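For requirements 2 and 3, one possible shape for language selection is a lookup from language name to a DeepSpeech model and scorer, followed by a transcription call. This is only a sketch under stated assumptions: the `models` directory and every file name in `MODELS` are hypothetical placeholders (the real names would come from Ex4_models.pdf, which is not reproduced here), and the `Model`, `enableExternalScorer`, `sampleRate` and `stt` calls assume the `deepspeech` Python package from the 0.7 to 0.9 release series plus NumPy.

```python
from pathlib import Path

# Hypothetical layout: replace with the files listed in Ex4_models.pdf.
MODEL_DIR = Path("models")
MODELS = {
    "english": ("english.pbmm", "english.scorer"),
    "italian": ("italian.pbmm", "italian.scorer"),
    "spanish": ("spanish.pbmm", "spanish.scorer"),
}


def resolve_model(language):
    """Return (model_path, scorer_path) for a language name, case-insensitively."""
    key = language.strip().lower()
    if key not in MODELS:
        raise ValueError(f"unsupported language: {language!r}")
    model_file, scorer_file = MODELS[key]
    return MODEL_DIR / model_file, MODEL_DIR / scorer_file


def transcribe(wav_path, language):
    """Transcribe a 16-bit mono WAV file with the model for the given language."""
    # Imported lazily so the lookup above works without DeepSpeech installed.
    import wave

    import numpy as np
    from deepspeech import Model  # assumes deepspeech 0.7-0.9

    model_path, scorer_path = resolve_model(language)
    model = Model(str(model_path))
    if scorer_path.exists():
        # The external scorer (language model) is optional.
        model.enableExternalScorer(str(scorer_path))

    with wave.open(str(wav_path), "rb") as wf:
        if wf.getnchannels() != 1 or wf.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono PCM audio")
        if wf.getframerate() != model.sampleRate():
            raise ValueError(
                f"expected {model.sampleRate()} Hz audio, got {wf.getframerate()} Hz"
            )
        audio = np.frombuffer(wf.readframes(wf.getnframes()), dtype=np.int16)

    return model.stt(audio)
```

A call such as `transcribe("suitcase.wav", "English")` would then return the recognised text as a string. Loading the model on every call keeps the sketch short; a real prototype would probably cache one `Model` per language.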
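Requirement 4 suggests a gain stage or a low-pass filter as possible ways to cope with noise. The following is a minimal sketch of both using only the standard library, assuming 16-bit mono PCM WAV input. The one-pole filter is just one simple choice of low-pass filter, the function names are illustrative, and whether any particular gain or cutoff actually lowers the error rate has to be checked against the supplied audio files.

```python
import math
import sys
import wave
from array import array

INT16_MIN, INT16_MAX = -32768, 32767


def _clip(value):
    """Round to the nearest integer and clamp to the int16 range."""
    return max(INT16_MIN, min(INT16_MAX, round(value)))


def read_wav_mono16(path):
    """Read a 16-bit mono WAV file; return (sample_rate, samples)."""
    with wave.open(path, "rb") as wf:
        if wf.getnchannels() != 1 or wf.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono PCM audio")
        rate = wf.getframerate()
        samples = array("h")
        samples.frombytes(wf.readframes(wf.getnframes()))
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian
    return rate, samples


def apply_gain(samples, gain_db):
    """Amplify (or attenuate) samples by gain_db decibels, with clipping."""
    factor = 10 ** (gain_db / 20)
    return array("h", (_clip(s * factor) for s in samples))


def low_pass(samples, sample_rate, cutoff_hz):
    """One-pole low-pass filter: y[n] = y[n-1] + alpha * (x[n] - y[n-1])."""
    dt = 1.0 / sample_rate
    rc = 1.0 / (2 * math.pi * cutoff_hz)
    alpha = dt / (rc + dt)
    out = array("h")
    y = 0.0
    for x in samples:
        y += alpha * (x - y)
        out.append(_clip(y))
    return out
```

The filtered samples can be written back with the `wave` module, or passed straight to the recogniser, so that the WER can be compared with and without the filter.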
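The WER column required in requirement 5 is conventionally computed as the word-level edit distance (substitutions, deletions and insertions) between the reference sentence and the recognised sentence, divided by the number of words in the reference. A minimal sketch of that calculation follows; lowercasing and whitespace splitting are simplifying assumptions, and punctuation handling is left out.

```python
def word_error_rate(reference, hypothesis):
    """Return WER = word-level edit distance / number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return prev[-1] / len(ref)


def format_row(language, filename, wer):
    """Format one line of the results table, with WER as a whole percentage."""
    return f"{language:<10} {filename:<20} {wer:.0%}"
```

For instance, a four-word reference with one wrongly recognised word gives a WER of 1/4, which `format_row` renders as 25%. Note that WER can exceed 100% when the hypothesis contains many inserted words.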
