Question

Posted on Sep 21, 2024

from transformers import Wav2Vec2ForCTC, Wav2Vec2Tokenizer
import torch
import librosa
import soundfile as sf

model = Wav2Vec2ForCTC.from_pretrained('theainerd/Wav2Vec2-large-xlsr-hindi')
tokenizer = Wav2Vec2Tokenizer.from_pretrained('theainerd/Wav2Vec2-large-xlsr-hindi')

import pandas as pd

df = pd.read_csv("train.csv")
df.head(1)
df = df.drop(columns=["age", "gender", "up_votes", "down_votes", "accents", "variant", "locale", "segment"])
print(df.columns)
## df = df.drop(columns=["client_"])

list = []
sentences = []
samples = 30
for i in range(0, samples):
    list.append(df["path"][i])
    sentences.append(df["sentence"][i])
print(list)
print(sentences)
## print(len(sentences))

import os

# Raw string so the backslash is not read as an escape sequence
path = r"common_voice\clips"
file_path = []
for i in range(0, samples):
    file_path.append(os.path.join(path, list[i]))
    ## print(file_path[i])

# Function to transcribe an audio file
def transcribe_audio(file_path):
    # Load the audio file
    audio, sr = librosa.load(file_path, sr=16000)  ## resampling
    # Convert the audio to a format that can be input to the model
    input_values = tokenizer(audio, return_tensors="pt").input_values
    # Run the audio through the model to generate the transcription
    #with torch.no_grad()
    logits = model(input_values).logits
    predicted_ids = torch.argmax(logits, dim=-1)
    transcription = tokenizer.decode(predicted_ids[0])
    return transcription

transcription = []
for i in range(0, samples):
    transcription.append(transcribe_audio(file_path[i]))
    print(f"Transcription for {file_path[i]} {transcription[i]} ")

Help me to fine-tune this model: add fine-tuning code to the code I have written and explain it in detail. Also explain in detail any alternative approaches.
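
For context on the decoding step in the code above: taking `torch.argmax` over the logits and then calling `tokenizer.decode` amounts to greedy CTC decoding, which is also the output convention that CTC fine-tuning trains against. The following is only a minimal pure-Python sketch of that idea, not the library's implementation. The toy vocabulary, the use of `<pad>` (id 0) as the CTC blank, `|` as the word delimiter, and the helper names `greedy_ctc_decode` and `one_hot` are all assumptions made for illustration; the real model's vocabulary is different.

```python
from itertools import groupby

# Hypothetical toy vocabulary for illustration only; the real model's differs.
VOCAB = {0: "<pad>", 1: "|", 2: "a", 3: "b", 4: "c"}
BLANK_ID = 0          # assumption: the padding token doubles as the CTC blank
WORD_DELIMITER = "|"  # assumption: this token marks word boundaries


def argmax(row):
    """Index of the largest score in one frame (first index wins ties)."""
    return max(range(len(row)), key=row.__getitem__)


def greedy_ctc_decode(logits):
    """Decode a list of per-frame score lists into text, greedily."""
    # 1. Pick the best-scoring token for every audio frame.
    frame_ids = [argmax(frame) for frame in logits]
    # 2. Collapse runs of the same token into a single token.
    collapsed = [token_id for token_id, _ in groupby(frame_ids)]
    # 3. Drop blanks, then turn word delimiters into spaces.
    tokens = [VOCAB[t] for t in collapsed if t != BLANK_ID]
    return "".join(tokens).replace(WORD_DELIMITER, " ").strip()


def one_hot(index, size=len(VOCAB)):
    """Fake logits for one frame that strongly prefer a single token."""
    return [1.0 if k == index else 0.0 for k in range(size)]


# Frames: a a <pad> a b | | c <pad>
example_logits = [one_hot(t) for t in [2, 2, 0, 2, 3, 1, 1, 4, 0]]
decoded = greedy_ctc_decode(example_logits)
print(decoded)
```

The blank between the second and third frames is what lets the doubled letter survive: without it, the repeated `a` frames would collapse into a single `a`.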
