STAT 3001 Week 2 Project New Work
STAT 3001 Week 2 Project NEW
Instructions: To complete this project, you will need the following materials:
• STATDISK User Manual (found in the classroom in DocSharing. Click on “+” under the word “Index” on the right side of the Doc Sharing page to see all links to my previous posts there)
• Access to the Internet to download the STATDISK program(www.statdisk.org).
This assignment is worth a total of 60 points.
Part I. Histograms and Frequency Tables
1. Open the file PARENTHT using menu option Datasets and then Elementary Stats, 9th Edition. This file contains some information about the height of a group of children and the height of their parents. What are the names of the variables in this file?
Heights, genders and age
2. Create a histogram for the mother’s height using the frequency of each value. Paste the chart here:
3. Using the information in the above histogram, complete this table. Be sure to estimate the range of heights for each bar as well as the frequency, relative frequency, and cumulative frequency. Height Frequency Relative Frequency Cumulative Frequency
a. How many of the mothers have a height of 67 inches or less? How do you know?
b. How many of the mothers are between 62 and 65 inches? Show your work.
c. What percent of the mothers have heights that are 71 inches or more?
Part II. Comparing Datasets
1. Create a boxplot that compares the heights of the mothers with the heights of the fathers on the same graph. Paste it here.
2. Describe the similarities and differences in the data sets.
Part III. Finding Descriptive Numbers
3. Open the file named CANS (using Datasets and then Elementary Stats, 9th Edition). List all the variables in the dataset.
4. Find the Mean, median, and midrange for the Data in Column 1.
5. Find the Range, variance, and standard deviation for the first column.
6. List any values for the first column that you think may be outliers. Why do you think that?
[Hint: You may want to sort the data and look at the smallest and largest values.]
7. Find the Mean, median, and midrange for the data in Column 2.
8. Find the Range, variance, and standard deviation for the data in Column 2.
9. List any values for the second column that you think may be outliers. Why do you think that?
10. Find the five-number summary for the cans data in Columns 1 and 2. You will need to label each of the columns with an appropriate measure
11. Compare the cans datasets using a boxplot of Columns 1 and 2. Paste your boxplot here
12. Create a histogram for the
Column 1 data.
13. Create a histogram for the
Column 2 data.
Part IV. Interpreting Statistical Information
The CANS data contains two columns, both of which are measurements of the axial loads (in pounds) of 175 aluminum cola cans. The axial load is the maximum amount of pressure that can be supported by the sides of the can before collapsing.
Group A: Column 1 (CN109) contains measurements for cans that are 0.0109 inches thick
Group B: Column 2 (CN111) contains measurements for cans that are 0.0111 inches thick
Using the descriptive statistics found above, what conclusions can you make comparing the axial loads of the cans in Group A with Group B. Be sure to include at least one of each of the following and explain how the statistics you computed above support your claim.
a. One conclusion about a measure of center
b. One conclusion about the variability in the two datasets
c. One conclusion about the shape of the distribution or the range of data.