Applications of Statistical Methods

Question 1 Investment Portfolio (12 marks)

Consider the daily percent change in the stock price of two companies, A and B, in an investment portfolio. The dataset is called Investment Portfolio.

Answer the following questions manually. Use statistical software or MS Excel for help with the computation of any summary statistics needed for manual computations.

a) Draw a scatterplot of the company A daily percent changes against the company B daily percent changes. Describe the relationship between daily percent changes that you see in this scatterplot.

b) Determine the regression equation to predict the daily percent change in the stock price of company A from the daily percent change in the stock price of company B. Interpret the value of the slope coefficient.

c) Find the correlation between the percent changes. Does the correlation value support your description of the scatterplot in part a)?

d) Compute the corresponding coefficient of determination and interpret its value. In financial terms, it represents the proportion of non-diversifiable risk in company A.

e) Compute the 95% confidence interval for the slope coefficient. f) Test at the 5% significance level whether the slope coefficient is significantly different

from 1, representing the beta of a highly diversified portfolio. Dont forget to show your computations.

Questions 2 Location Analysis (38 marks)

Location analysis is an important decision in operations management of production and service industries. A critical decision for many organizations is where to locate a processing plant, warehouse, retail outlet, etc. A large number of business variables are typically considered in this decision problem.

The management of a large motel/inn chain is aware of the challenges in choosing new motel locations. The chains management uses the operating margin, which is the ratio of the sum of profit, depreciation, and interest expenses divided by total revenue, to make this type of decision. In general, the higher the operating margin, the greater the success of the motel/inn.

The chains management has collected data on 100 randomly selected of its current inns. By measuring the operating margin, the objective is to predict which sites would likely generate more profit. Below is a description of the different variables considered in this analysis.

Variable Description Location ID Number Location identifier

Operating Margin Operating margin, in percent

Number Number of motels, inns, and hotel rooms within 5 miles

Nearest Number of miles to the closest competitors

Enrollment Number of college and university enrollment (in thousands) in nearby college and universities

Income Average household income (in thousands) of the neighborhood

Distance Distance from downtown

Quality The quality of the service level of the location (1 = bad, 2 = average, 3 = good, 4 = excellent)

High Speed Internet High speed internet availability (1 = no, 2 = yes)

Gym Gym availability (1 = no, 2 = yes)

The dataset is called Location Analysis.

Part 1 (10 marks)

Using Minitab or any other statistical software, run a simple linear regression model to predict Operating Margin based on Distance and answer the following questions:

a) Using an appropriate graph, plot Operating Margin versus Distance and comment on the relationship between these two variables.

b) Write down your estimation of the regression equation for predicting Operating Margin from Distance. Draw the regression line on the plot in part a).

c) Assuming = 0.01, test whether Distance has statistically significant predictive power in estimating Operating Margin. State the hypotheses, provide a test statistic and p-value, and state your conclusion. Show your calculations.

d) Interpret the values of the regression coefficients (slope and intercept).

Part 2 (6 marks)

Using Minitab or any other statistical software, now perform a multiple linear regression analysis of Operating Margin (response variable) against all the remaining variables as predictors, excluding Location ID Number.

a) Write down the regression equation and provide at least two summary measures of the fit of the model. Based on the summary measures, does the model provide a good fit for the data? Explain.

b) Plot the residuals against the fitted values and comment on whether the usual model conditions are met.

c) The variable Operating Margin New in the dataset corresponds to the Operating Margin variable from which some values have been recorded as missing values. Identify those missing values and explain what they are and why they were recorded as missing.

Part 3 (12 marks)

Using statistical software, run the same multiple linear regression model as in Part 2 above but this time using Operating Margin New as the response variable. Then, answer the following questions:

a) Briefly compare the resulting regression equation and fit with those obtained in Part 2. b) Plot the residuals against the fitted values and comment on whether the model complies

with the usual conditions for multiple linear regression. c) Provide an interpretation for the model intercept and for the regression coefficients

associated with variables Income and Distance. Is an interpretation of the model intercept appropriate in this case? Compare the value of the regression coefficient for Distance with the one obtained in Part 1 above and clearly explain any difference.

d) Do you see any justification for dropping any variable(s) from the model? Explain (hint: multicollinearity; the significance of predictors).

e) Run a final model using Operating Margin New as the response variable and including only the significant predictors (hint: those with a p-value 5%).

f) Test the overall significance of the final model in part e). Use a 1% significance level and follow all the steps for hypothesis testing indicated in the Instructions section.

Part 4 (10 marks)

Based on your final model in Part 3 above, answer the following questions:

a) Test the marginal contribution of Quality, assuming that the other variables in the model remain constant. Use a 1% significance level, and make sure you follow all the steps for hypothesis testing indicated in the Instructions section. Show the computation of the t- statistics (i.e., the ratio used to compute it).

b) Calculate the 99% prediction interval for the actual operating margin of a new location with the same characteristics as those for Location ID Number 3098 in the data file. Check if the prediction interval includes the actual operating margin associated with Location ID Number 3098 and explain why it does or does not.

c) Calculate the 99% confidence interval for the mean operating margin of a new location with the same characteristics as those for Location ID Number 3098 in the data file. Explain any difference between the size of this interval and the one in part b) above.

Location Analysis

  

Location ID Number

Operating Margin

Number

Nearest

Enrollment

Income

Distance

Quality

High Speed Internet

Gym

Operating Margin New

 

2801

39.3

3591

2.0

18.5

29

7.6

2

1

1

39.3

 

2608

49.5

2726

1.2

19.0

36

9.3

3

2

2

49.5

 

3191

49.0

2890

2.4

20.0

35

2.6

3

1

2

49.0

 

3210

45.1

2172

1.4

7.0

35

9.2

3

2

2

45.1

 

3701

41.9

3517

2.9

6.0

38

9.1

2

2

1

41.9

 

2547

43.5

2910

1.4

12.5

35

1.8

2

2

1

43.5

 

2670

40.3

2848

2.5

19.0

30

8.4

2

1

2

40.3

 

2664

42.8

3003

2.2

19.5

39

12.0

2

1

1

42.8

 

2492

57.3

2601

3.1

18.5

39

11.0

4

2

2

57.3

 

2649

52.1

3140

2.2

23.5

34

3.7

3

2

2

52.1

 

3066

25.1

1613

1.7

21.5

29

4.1

4

2

2

 

2193

54.4

3234

3.2

19.5

39

8.3

4

2

2

54.4

 

2793

47.3

2761

3.4

16.0

39

6.6

3

1

2

47.3

 

3620

57.4

2687

0.9

15.5

42

6.9

4

2

2

57.4

 

2538

38.4

3264

2.7

22.5

29

10.4

2

1

1

38.4

 

3730

56.5

2045

1.5

15.0

38

4.3

4

2

2

56.5

 

3120

35.2

3251

1.7

13.0

35

4.3

1

1

1

35.2

 

2389

39.3

3159

2.7

11.0

34

0.2

2

1

1

39.3

 

3053

47.3

2642

1.8

9.0

30

4.8

3

1

2

47.3

 

3304

44.2

3471

2.2

12.0

39

5.4

3

2

1

44.2

 

3770

54.4

2756

3.2

14.5

39

7.9

4

2

1

54.4

 

3380

51.5

3069

3.3

20.5

37

7.7

3

2

2

51.5

 

2795

46.2

2244

3.6

15.5

36

5.9

3

1

1

46.2

 

2315

49.8

2859

1.7

14.5

39

3.0

3

2

2

49.8

 

2682

41.1

3098

2.3

11.5

34

7.7

2

1

2

41.1

 

2789

51.1

3227

1.9

11.0

37

6.0

3

2

2

51.1

 

2847

52.2

2879

2.1

19.5

36

2.9

3

2

2

52.2

 

2372

49.9

3255

2.2

22.5

36

8.7

3

2

2

49.9

 

3349

39.8

3823

3.6

17.0

38

4.8

2

1

2

39.8

 

3704

43.5

3198

1.8

17.0

36

4.6

2

2

1

43.5

 

2872

45.4

2443

0.1

10.5

36

7.5

3

2

1

45.4

 

2570

58.5

2210

2.7

17.5

35

2.7

4

2

2

58.5

 

2956

46.0

2655

1.1

22.0

34

8.1

3

1

1

46.0

 

3098

81.5

4214

2.4

16.0

34

5.6

1

1

1

 

3256

41.0

3480

1.9

18.0

31

11.0

2

1

2

41.0

 

2486

46.0

2341

2.3

23.0

29

7.4

3

2

2

46.0

 

3466

48.8

2935

3.9

20.0

35

6.1

3

1

2

48.8

 

2912

53.5

3285

2.0

23.5

42

2.9

3

2

2

53.5

 

3200

54.1

2862

2.9

16.5

41

11.8

3

2

1

54.1

 

3208

34.3

3548

2.5

13.5

31

8.3

1

1

2

34.3

 

3799

50.2

3021

1.7

8.5

41

5.5

3

1

2

50.2

 

3450

43.3

3383

2.2

11.5

33

9.5

2

2

1

43.3

 

2403

40.8

2676

3.3

25.0

41

5.1

2

1

1

40.8

 

2555

49.0

2826

4.2

17.0

34

1.2

3

2

2

49.0

 

3056

52.0

3354

2.0

26.5

37

7.2

3

2

2

52.0

 

2331

51.2

3082

1.5

22.0

40

11.2

3

2

2

51.2

 

2806

52.7

3378

3.3

10.0

34

7.6

3

2

2

52.7

 

2802

35.7

3591

1.4

9.5

43

5.0

1

1

1

35.7

 

2501

40.0

3397

1.6

19.5

32

3.1

2

1

1

40.0

 

2636

60.1

2619

2.9

12.0

31

6.8

4

2

2

60.1

 

3016

48.2

2429

2.3

20.0

33

7.6

3

1

2

48.2

 

3581

36.8

3006

2.5

15.5

37

4.2

2

1

1

36.8

 

2395

55.4

2790

1.6

12.0

37

9.1

4

2

1

55.4

 

2365

33.8

2810

2.8

17.5

35

14.4

1

1

1

33.8

 

3152

51.2

2775

2.5

15.0

39

5.6

3

1

2

51.2

 

3071

42.9

3838

2.4

10.0

40

2.9

2

1

1

42.9

 

2698

36.1

2962

0.9

12.5

41

8.6

1

1

1

36.1

 

3026

47.5

3080

1.9

13.5

43

8.1

3

1

2

47.5

 

2785

33.2

2629

1.4

20.0

35

7.2

1

1

1

33.2

 

2973

54.2

2484

2.8

16.0

33

0.6

3

2

1

54.2

 

2785

32.4

3124

1.6

17.0

29

7.3

1

1

1

32.4

 

2834

54.1

2432

2.9

17.5

35

10.3

3

2

1

54.1

 

2146

49.0

1998

3.6

19.0

41

4.9

3

2

2

49.0

 

3625

46.3

2554

2.1

10.0

32

9.9

3

1

1

46.3

 

3328

41.6

2776

2.1

16.0

31

7.3

2

1

2

41.6

 

3358

43.2

2724

1.2

14.5

42

0.3

2

2

1

43.2

 

2230

54.4

3241

2.2

21.5

44

5.1

4

2

2

54.4

 

2662

42.5

2403

1.0

19.0

32

8.2

2

1

1

42.5

 

3736

45.0

3697

2.1

20.5

40

9.0

3

1

1

45.0

 

3227

33.5

3657

1.5

18.0

28

3.5

1

1

1

33.5

 

2718

35.9

2786

1.9

15.5

37

8.5

1

1

1

35.9

 

3675

38.1

3568

2.3

16.5

37

3.7

2

1

1

38.1

 

2960

44.0

2813

3.8

15.0

33

11.8

2

1

1

44.0

 

3405

71.2

3567

2.5

13.5

32

9.1

1

1

2

 

2793

37.8

2980

3.2

12.0

32

6.6

2

1

1

37.8

 

2675

43.7

2915

1.6

13.5

36

5.3

2

2

1

43.7

 

2874

40.2

3670

2.8

17.5

37

9.9

2

1

1

40.2

 

3467

55.5

3203

4.2

8.0

37

2.7

4

2

2

55.5

 

3593

44.9

3003

0.9

15.5

37

10.2

3

2

1

44.9

 

2596

38.8

3330

2.7

13.5

41

4.0

2

1

1

38.8

 

3341

52.6

3013

3.7

20.0

41

14.1

3

2

1

52.6

 

3479

53.0

2844

3.5

13.5

30

10.8

3

2

1

53.0

 

2656

58.0

2382

3.2

12.0

39

1.3

4

2

2

58.0

 

2617

54.2

3018

3.1

20.0

39

8.2

3

2

1

54.2

 

2114

60.5

2932

1.8

19.5

39

6.1

4

2

2

60.5

 

3287

47.6

2751

3.0

10.5

36

9.1

3

2

2

47.6

 

3531

36.9

2923

1.2

14.0

40

7.6

2

1

1

36.9

 

3562

34.5

2730

0.3

17.0

33

4.3

1

1

1

34.5

 

3648

30.8

3622

2.3

15.0

34

7.6

1

1

1

30.8

 

3569

49.0

3759

2.9

19.0

33

10.8

3

2

2

49.0

 

3038

45.5

2691

3.2

13.5

46

5.7

3

2

2

45.5

 

2663

51.4

2640

3.8

19.5

37

7.6

3

2

2

51.4

 

2971

49.6

3359

1.6

10.5

44

2.6

3

2

2

49.6

 

3116

31.9

3422

3.3

15.5

38

12.1

1

1

1

31.9

 

2546

53.4

2772

0.8

14.5

45

11.3

3

2

2

53.4

 

2118

48.8

2825

1.1

13.0

40

9.0

3

1

2

48.8

 

2117

39.2

2996

1.3

15.5

38

8.5

2

1

1

39.2

 

3794

43.4

3597

3.9

15.5

35

13.2

2

1

1

43.4

 

2910

41.0

2846

3.1

20.5

31

8.8

2

1

2

41.0

 

3504

49.6

2922

1.9

19.5

32

4.4

3

1

2

49.6

             

Investment Portfolio

  

Company A

Company B

 

1.31

0.38

 

-0.26

1.27

 

0.88

0.56

 

0.48

0.62

 

-0.49

-0.55

 

0.37

-1.24

 

-0.22

-1.87

 

1.34

1.30

 

0.21

-0.19

 

0.92

0.42

 

0.76

-1.06

 

-0.53

-1.07

 

-0.72

0.58

 

0.06

0.65

 

0.20

0.76

 

0.41

0.85

 

0.26

0.74

 

1.46

0.83

 

0.53

0.83

 

1.28

0.36

 

0.12

0.18

 

-0.52

-1.08

 

-0.73

-0.91

 

1.06

1.87

 

1.33

2.85

 

-0.18

-0.53

 

2.04

1.79

 

1.57

2.73

 

-0.50

-0.76

 

0.91

1.33

 

-0.25

-0.50

 

0.62

1.93

 

-2.04

-0.10

 

0.46

-0.95

 

1.69

0.40

 

0.11

0.18

 

-0.55

-2.59

 

1.13

1.18

 

0.91

-0.73

 

-1.19

-1.71

 

-1.33

-3.11

 

0.89

0.77