WORKING WITH DATA
INSTRUCTIONS
**Submit your completed Week 2 Project on the attached Assignment Template Document**
Use the attached the excel document: ST3001Data_L_Rivera as a data to complete this assignment.
1. Create the following graphs in Excel:
1.1 Using the column labeled “Smoker” (not “BMI Sorted by Smoking Status”), create an appropriate graphical display to clearly show the breakdown of smokers and nonsmokers in your data set. Write a 1-sentence caption for your graph clearly indicating what it displays.
1.2 Create a histogram for the column labeled “BMI” (not broken down by smoking status). Use a bin width of 2 units. Start the bins at your minimum data point, as appropriate for your data set. Write a 1-sentence caption for your graph that clearly indicates what it displays.
1.3 Create two modified box plots for BMI, one for “Smokers” and one for “Non-smokers.” Write a 1-sentence caption for your graph clearly indicating what it displays.
2. Descriptive Statistics
2.1 Use the data analysis tool pack to create two (2) tables of descriptive statistics—one for “Smokers” and one for “Nonsmokers”—using the columns created by you in Week 1.
2.2 Use these statistics to answer the following questions, comparing smokers to nonsmokers. Be sure to provide values from your Excel output to support your reasoning.
A. Which group has an BMI that is typically higher?
B. Which group has greater variation in their BMI?
C. Do you suspect any outliers are present in the BMI for each group? Be sure to justify your reasoning.
3. Respond to the following questions:
3.1 Do you suspect that smoking is related to BMI status? Write a 2-sentence explanation. Include at least one (1) summary statistic to validate your reasoning.
3.2 Would you want to make broad conclusions about smoking and BMI based on this sample data? Explain why or why not.
** At least 3 references**