Question

Posted on Jul 29, 2024

Source of data: Golub et al. (1999). Molecular classification of cancer: class discovery and class prediction by gene expression monitoring, Science, Vol. 286, 531-537.

The data set golub consists of the expression levels of 3051 genes for 38 tumor mRNA samples. Each tumor mRNA sample comes from one patient (i.e. 38 patients total), and 27 of these tumor samples correspond to acute lymphoblastic leukemia (ALL) and the remaining 11 to acute myeloid leukemia (AML).

You will need to discover how many genes can be used to differentiate the tumor types (meaning that their expression level differs between the two tumor types) using (i) the uncorrected p-values, (ii) the Holm-Bonferroni correction, and (iii) the Benjamini-Hochberg correction.

Feel free to use libraries for multiple hypothesis testing in R or Python.
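The three counting rules can be sketched without any library, which may help when checking a library's output. This is a minimal illustration, assuming a significance level of 0.05; the function names and the p-values are made up for the example and are not from the golub data. In practice a library routine such as `multipletests` in statsmodels (methods `'holm'` and `'fdr_bh'`) covers the same corrections.

```python
def holm_bonferroni(pvalues, alpha=0.05):
    """Return a list of booleans: True where Holm-Bonferroni rejects the null."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda k: pvalues[k])  # indices by ascending p
    reject = [False] * m
    for rank, idx in enumerate(order):           # rank 0 is the smallest p-value
        if pvalues[idx] <= alpha / (m - rank):   # step-down threshold
            reject[idx] = True
        else:
            break                                 # stop at the first failure
    return reject


def benjamini_hochberg(pvalues, alpha=0.05):
    """Return a list of booleans: True where Benjamini-Hochberg rejects the null."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda k: pvalues[k])
    # Find the largest 1-based rank k with p_(k) <= alpha * k / m.
    cutoff = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= alpha * rank / m:
            cutoff = rank
    reject = [False] * m
    for idx in order[:cutoff]:                    # reject ranks 1..cutoff
        reject[idx] = True
    return reject


# Made-up p-values, for illustration only (not from the golub data).
pvals = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
n_uncorrected = sum(p < 0.05 for p in pvals)
n_holm = sum(holm_bonferroni(pvals))
n_bh = sum(benjamini_hochberg(pvals))
print(n_uncorrected, n_holm, n_bh)  # prints: 5 1 2
```

On these toy values the uncorrected rule flags the most tests, Holm-Bonferroni the fewest, and Benjamini-Hochberg falls in between, which is the usual ordering.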

If you are using Python, you can use the following code to load the data:

    import zipfile

    import numpy as np

    # Read both CSV files from the zip archive into NumPy structured arrays.
    with zipfile.ZipFile("statsreview_release1.zip") as zip_file:
        golub_data, golub_classnames = (
            np.genfromtxt(
                zip_file.open('data_and_materials/golub_data/{}'.format(fname)),
                delimiter=',',
                names=True,
                # Column 0: strip surrounding spaces and quotes, then parse as int.
                converters={0: lambda s: int(s.strip(' "'))},
            )
            for fname in ['golub.csv', 'golub_.csv']
        )

Part (2.0 points, graded)

Let $\bar{x}_{\mathrm{ALL},i}$ be the mean of the expression levels for gene i across the ALL mRNA samples. Similarly, let $\bar{x}_{\mathrm{AML},i}$ be the same but for the AML mRNA samples instead.

For each of these, $N_{\mathrm{ALL}}$ and $N_{\mathrm{AML}}$ are the number of mRNA samples for the ALL tumors and AML tumors respectively.

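If $s_{\mathrm{ALL},i}^{2}$ is the sample variance for gene i across the ALL mRNA observations, then the corresponding variance for $\bar{x}_{\mathrm{ALL},i}$ is $s_{\bar{x}_{\mathrm{ALL},i}}^{2} = s_{\mathrm{ALL},i}^{2} / N_{\mathrm{ALL}}$. Similarly,

$$ s_{\bar{x}_{\mathrm{AML},i}}^{2} = \frac{s_{\mathrm{AML},i}^{2}}{N_{\mathrm{AML}}} $$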

We can use $\bar{x}_{i} = \bar{x}_{\mathrm{ALL},i} - \bar{x}_{\mathrm{AML},i}$ as a metric for the difference in expression levels for gene i. The variance of this metric is

$$ s_{\bar{x}_{i}}^{2} = s_{\bar{x}_{\mathrm{ALL},i}}^{2} + s_{\bar{x}_{\mathrm{AML},i}}^{2} $$

This allows us to use the following test statistic:

$$ t_{\mathrm{Welch},i} = \frac{\bar{x}_{\mathrm{ALL},i} - \bar{x}_{\mathrm{AML},i}}{\sqrt{\frac{s_{\mathrm{ALL},i}^{2}}{N_{\mathrm{ALL}}} + \frac{s_{\mathrm{AML},i}^{2}}{N_{\mathrm{AML}}}}} $$

which you can recognize as similar to the t-test statistic, and is itself known as the Welch unequal variances t-test.
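As a rough illustration of the statistic for a single gene, the sketch below computes it from two plain lists of expression levels using only the standard library. The function name `welch_t` and the sample values are hypothetical, chosen so the arithmetic is easy to follow by hand.

```python
import math
import statistics


def welch_t(sample_all, sample_aml):
    """Welch t statistic for one gene from two lists of expression levels."""
    mean_all = statistics.mean(sample_all)
    mean_aml = statistics.mean(sample_aml)
    # statistics.variance is the sample variance (it divides by N - 1).
    var_mean_all = statistics.variance(sample_all) / len(sample_all)
    var_mean_aml = statistics.variance(sample_aml) / len(sample_aml)
    return (mean_all - mean_aml) / math.sqrt(var_mean_all + var_mean_aml)


# Made-up expression levels for one hypothetical gene.
gene_all = [1.0, 2.0, 3.0, 4.0, 5.0]   # mean 3, sample variance 2.5
gene_aml = [2.0, 4.0, 6.0]             # mean 4, sample variance 4
t_stat = welch_t(gene_all, gene_aml)
print(t_stat)  # (3 - 4) / sqrt(2.5/5 + 4/3), roughly -0.74
```

Swapping the two groups flips the sign of the statistic but not its magnitude, so a two-sided test gives the same result either way.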

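The distribution for the Welch test statistic can be approximated by a t-distribution, but with a modified number of degrees of freedom. The number of degrees of freedom is approximately

$$ \nu_{i} \approx \frac{\left( \frac{s_{\mathrm{ALL},i}^{2}}{N_{\mathrm{ALL}}} + \frac{s_{\mathrm{AML},i}^{2}}{N_{\mathrm{AML}}} \right)^{2}}{\frac{1}{\nu_{\mathrm{ALL}}} \left( \frac{s_{\mathrm{ALL},i}^{2}}{N_{\mathrm{ALL}}} \right)^{2} + \frac{1}{\nu_{\mathrm{AML}}} \left( \frac{s_{\mathrm{AML},i}^{2}}{N_{\mathrm{AML}}} \right)^{2}} $$

where $\nu_{\mathrm{ALL}} = N_{\mathrm{ALL}} - 1$ and $\nu_{\mathrm{AML}} = N_{\mathrm{AML}} - 1$.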
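The degrees-of-freedom approximation (the Welch-Satterthwaite formula) can be sketched for one gene as follows. The function name `welch_dof` and the sample values are hypothetical and serve only as an illustration.

```python
import statistics


def welch_dof(sample_all, sample_aml):
    """Approximate degrees of freedom for the Welch test on one gene."""
    n_all, n_aml = len(sample_all), len(sample_aml)
    a = statistics.variance(sample_all) / n_all   # variance of the ALL mean
    b = statistics.variance(sample_aml) / n_aml   # variance of the AML mean
    return (a + b) ** 2 / (a ** 2 / (n_all - 1) + b ** 2 / (n_aml - 1))


# Made-up expression levels for one hypothetical gene.
gene_all = [1.0, 2.0, 3.0, 4.0, 5.0]
gene_aml = [2.0, 4.0, 6.0]
dof = welch_dof(gene_all, gene_aml)
print(dof)  # roughly 3.53, between min(N - 1) = 2 and N_ALL + N_AML - 2 = 6
```

The result is generally not an integer, and it always lies between the smaller of the two per-group values N - 1 and the pooled value N_ALL + N_AML - 2.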

Use the Welch t-test to find the number of significantly associated genes (at significance level 0.05) using uncorrected p-values.

How many genes are significant? (Please enter the value with a precision of at least two significant figures; your answer will be graded with a 10% tolerance.)
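One way to organize the count is sketched below on synthetic data, since the golub arrays are not reproduced here. It is a dependency-free illustration: the two-sided p-value is obtained by integrating the t density with Simpson's rule, which is an assumption of this sketch rather than the intended method. With SciPy available, `scipy.stats.ttest_ind(a, b, equal_var=False)` performs the Welch test directly. All function names and the simulated data are hypothetical.

```python
import math
import random
import statistics


def t_two_sided_pvalue(t, dof, panels=2000):
    """Two-sided p-value P(|T| >= |t|) for a t distribution, via Simpson's rule."""
    log_norm = (math.lgamma((dof + 1) / 2) - math.lgamma(dof / 2)
                - 0.5 * math.log(dof * math.pi))

    def pdf(x):
        return math.exp(log_norm - (dof + 1) / 2 * math.log1p(x * x / dof))

    upper = abs(t)
    h = upper / panels                       # panels must be even
    total = pdf(0.0) + pdf(upper)
    for k in range(1, panels):
        total += (4 if k % 2 else 2) * pdf(k * h)
    central_mass = 2 * total * h / 3         # P(-|t| <= T <= |t|)
    return max(0.0, 1.0 - central_mass)


def welch_pvalue(x_all, x_aml):
    """Uncorrected two-sided Welch p-value for one gene."""
    a = statistics.variance(x_all) / len(x_all)
    b = statistics.variance(x_aml) / len(x_aml)
    t = (statistics.mean(x_all) - statistics.mean(x_aml)) / math.sqrt(a + b)
    dof = (a + b) ** 2 / (a ** 2 / (len(x_all) - 1) + b ** 2 / (len(x_aml) - 1))
    return t_two_sided_pvalue(t, dof)


def count_significant(genes, is_all, alpha=0.05):
    """Count genes whose uncorrected Welch p-value is below alpha."""
    count = 0
    for values in genes:
        x_all = [v for v, flag in zip(values, is_all) if flag]
        x_aml = [v for v, flag in zip(values, is_all) if not flag]
        if welch_pvalue(x_all, x_aml) < alpha:
            count += 1
    return count


# Synthetic stand-in for the data: 200 genes, 27 ALL and 11 AML samples.
# The first 20 genes get a shifted AML mean; the rest have no true difference.
random.seed(0)
is_all = [True] * 27 + [False] * 11
genes = []
for g in range(200):
    shift = 2.0 if g < 20 else 0.0
    genes.append([random.gauss(0.0, 1.0) + (0.0 if flag else shift)
                  for flag in is_all])
n_significant = count_significant(genes, is_all)
print(n_significant)
```

Because no correction is applied, some of the genes with no true difference are expected to fall below 0.05 by chance, which is the motivation for the corrected counts asked for in the question.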
