Please consider and work through the following questions.
Breast cancer patients | Normal individuals |
---|---|
2.34 1.72 2.98 1.70 2.04 2.57 1.90 1.07 2.89 3.35 1.60 4.26 1.99 1.78 2.56 4.57 0.57 1.66 2.00 4.89 4.61 |
1.31 0.99 1.30 1.48 1.35 1.22 1.14 1.23 1.31 1.08 2.00 0.50 1.51 1.17 1.43 1.15 1.53 0.80 1.85 1.03 1.40 |
What is the representative or typical BRAC1 gene expression value for each group? Do the two groups have similar amounts of variability? What is the appropriate approach to describe the data?
Answer
Plot the data to check for potential outliers, typos, and skewness of the data. For data with symmetric distribution, mean and standard deviation can be used; otherwise, median and range can be used.
From the boxplot, we did not see any obvious outliers, and the data do not seem to be skewed. A Q-Q plot can also be generalized for visualizing data normality. The mean gene expression for the cancer patients and normal individuals are 2.5 and 1.2, respectively. The corresponding standard deviations are 1.2 and 0.33, respectively. A Bartlett’s test shows that there is a significant difference in data variation between the two groups (p<0.001), and a two sample t test shows that cancer patients have significant higher expression of this gene than the normal individuals (p<0.001).
If you have questions or need assistance, please email Gilbert Berdine at [email protected] or Shengping Yang at [email protected]
Submitted: 10/7/2017