Board logo

标题: [2008] Topic 9: Multiple Regression: Estimation and Hypothesis Testing 相关习 [打印本页]

作者: aianjie    时间: 2009-6-26 14:55     标题: [2008] Topic 9: Multiple Regression: Estimation and Hypothesis Testing 相关习

 

AIM 2: Define and interpret the partial slope coefficient.

1、Which of the following statements regarding the results of a regression analysis is FALSE? The:

A) slope coefficient in a multiple regression is the value of the dependent variable for a given value of the independent variable.

B) slope coefficient in a multiple regression is the change in the dependent variable for a one-unit change in the independent variable, holding all other variables constant.

C) intercept is the value that the dependent variable takes on if all the independent variables had a value of zero.

D) slope coefficients in the multiple regression are referred to as partial betas.


作者: aianjie    时间: 2009-6-26 14:55

 

The correct answer is A

The slope coefficient is the change in the dependent variable for a one-unit change in the independent variable.



作者: aianjie    时间: 2009-6-26 14:55

 

2、When interpreting the results of a multiple regression analysis, which of the following terms represents the value of the dependent variable when the independent variables are all equal to zero?

A) Slope coefficient.

B) p-value.

C) t-value.

D) Intercept term.


作者: aianjie    时间: 2009-6-26 14:55

 

The correct answer is D

The intercept term is the value of the dependent variable when the independent variables are set to zero.



作者: aianjie    时间: 2009-6-26 14:56

 

AIM 3: List the assumptions of the multiple linear regression model.

1、Which of the following statements least accurately describes one of the fundamental multiple regression assumptions?

A) The error term is normally distributed.

B) The variance of the error terms is not constant (i.e., the errors are heteroskedastic).

C) The independent variables are not random.

D) There is no exact linear relationship between any two or more independent variables.


作者: aianjie    时间: 2009-6-26 14:56

 

The correct answer is B

The variance of the error term IS assumed to be constant, resulting in errors that are homoskedastic.



作者: aianjie    时间: 2009-6-26 14:56

 

2、One of the underlying assumptions of a multiple regression is that the variance of the residuals is constant for various levels of the independent variables. This quality is referred to as:

A) a normal distribution.

B) homoskedasticity.

C) a linear relationship.

D) serial correlation.


作者: aianjie    时间: 2009-6-26 14:56

 

The correct answer is B

Homoskedasticity refers to the basic assumption of a multiple regression model that the variance of the error terms is constant.



作者: aianjie    时间: 2009-6-26 14:57

 

AIM 4: Explain the concept of multicollinearity and implications it has on modeling.

1、Which of the following statements regarding multicollinearity is FALSE?

A) Multicollinearity may be a problem even if the Multicollinearity is not perfect.

B) Multicollinearity makes it difficult to determine the contribution to explanation of the dependent variable of an individual explanatory variable.

C) Multicollinearity may be present in any regression model.

D) If the t-statistics for the individual independent variables are insignificant, yet he F-statistic is significant, this indicates the presence of Multicollinearity.


作者: aianjie    时间: 2009-6-26 14:57

 

The correct answer is C

Multicollinearity is not an issue in simple regression.



作者: aianjie    时间: 2009-6-26 14:57

 

2、A variable is regressed against three other variables, x, y, and z. Which of the following would NOT be an indication of multicollinearity? X is closely related to:

A) 3y + 2z.

B) 3.

C) y2.

D) 9y, and x is closely related to 4z.


作者: aianjie    时间: 2009-6-26 14:57

 

The correct answer is C

If x is related to y2, the relationship between x and y is not linear, so multicollinearity does not exist. If x is equal to a constant (3), it will be correlated with the intercept term.



作者: aianjie    时间: 2009-6-26 14:57

 

3、An analyst runs a regression of portfolio returns on three independent variables.  These independent variables are price-to-sales (P/S), price-to-cash flow (P/CF), and price-to-book (P/B).  The analyst discovers that the p-values for each independent variable are relatively high.  However, the F-test has a very small p-value.  The analyst is puzzled and tries to figure out how the F-test can be statistically significant when the individual independent variables are not significant.  What violation of regression analysis has occurred?

A) serial correlation.

B) conditional heteroskedasticity.

C) multicollinearity.

D) unconditional heteroskedasticity.


作者: aianjie    时间: 2009-6-26 14:58

 

The correct answer is C

An indication of multicollinearity is when the independent variables individually are not statistically significant but the F-test suggests that the variables as a whole do an excellent job of explaining the variation in the dependent variable.


作者: aianjie    时间: 2009-6-26 14:58

 

4、An analyst further studies the independent variables of a study she recently completed. The correlation matrix shown below is the result. Which statement best reflects possible problems with a multivariate regression?

 

Age

Education

Experience

Income

Age

1.00

 

 

 

Education

0.50

1.00

 

 

Experience

0.95

0.55

1.00

 

Income

0.60

0.65

0.89

1.00

A)    Age should be excluded from the regression.

B)    Education may be unnecessary.

C)   Income is not needed.

D)   Experience may be a redundant variable.


作者: aianjie    时间: 2009-6-26 14:58

 

The correct answer is D

The correlation coefficient of experience with age and income, respectively, is close to +1.00. This indicates a problem of multicollinearity and should be addressed by excluding experience as an independent variable.



作者: aianjie    时间: 2009-6-26 14:58

 

AIM 6: Define, calculate and interpret the variance and standard errors in a multilinear regression.

In the multivariable regression model: Y = B0 + B1 × Xli + B2 × X2i + εi, the formula for the standard errors of the estimated coefficients includes the variance of εi, which is represented by: σ2. Since the term is:

A) not known with certainty, the expression σ2 is replaced with: .

B) not known with certainty, the standard errors of the coefficients cannot be estimated.

C) not known with certainty, the expression σ2 is replaced with: .

D) known with certainty, the standard errors of the coefficients are known with certainty.


作者: aianjie    时间: 2009-6-26 14:59

 

The correct answer is C

The expression:  is an estimate of σ2 when there are two independent variables and n-3 degrees of freedom.



作者: aianjie    时间: 2009-6-26 14:59

 

AIM 8: Interpret regression results.

1、Consider the following analysis of variance (ANOVA) table:

Source

Sum of squares

Degrees of freedom

Mean square

Regression

556

1

556

Error

679

50

13.5

Total

1,235

51

 

The R2 for this regression is:

A)    0.82.

B)    0.55.

C)   0.67.

D)   0.45.


作者: aianjie    时间: 2009-6-26 14:59

 

The correct answer is D

R2 = RSS/SST = 556/1,235 = 0.45.

 


作者: aianjie    时间: 2009-6-26 15:00

 

2、A regression equation with 4 independent variables is estimated using 20 data points. The R2 is 0.46. An analyst is testing to see whether all of the coefficients are equal to zero. The p-value for the test is:

A) lower than 0.025.

B) between 0.05 and 0.10.

C) greater than 0.10.

D) between 0.025 and 0.05.


作者: aianjie    时间: 2009-6-26 15:00

 

The correct answer is D

To solve this problem, one can assume any value for the total sum of squares. In this case, assume its 1. The regression sum of squares is R2 multiplied by the total sum of squares, which is 0.46. The residual sum of squares is the difference between the total sum of squares and the regression sum of squares, which is 1 ? 0.46 = 0.54. The numerator degrees of freedom is equal to the number of independent variables, which is 4, and the mean regression sum of squares is the regression sum of squares divided by the numerator degrees of freedom, which is 0.46 / 4 = 0.115. The denominator degrees of freedom is the number of observations minus the number of independent variables, minus 1, which is 20 ? 4 ? 1 = 15. The mean squared error is the residual sum of squares divided by the denominator degrees of freedom, which is 0.54 / 15 = 0.036. The F-statistic is the ratio of the mean regression sum of squares to the mean squared error, which is 0.115 / 0.036 = 3.19, which is in between the F-values (with four numerator degrees of freedom and 15 denominator degrees of freedom) of 3.06 for a p-value of 0.05 (calculated using the F-table at 5%) and 3.80 for a p-value of 0.025 (calculated using the F-table at 2.5%).



作者: aianjie    时间: 2009-6-26 15:00

 

3、A dependent variable is regressed against a single independent variable across 100 observations. The mean squared error is 2.807, and the mean regression sum of squares is 117.9. What is the correlation coefficient between the two variables?

A) 0.55.

B) 0.30.

C) 0.99.

D) 0.65.


作者: aianjie    时间: 2009-6-26 15:01

 

The correct answer is A

The correlation coefficient is the square root of the R2, which can be found by dividing the regression sum of squares by the total sum of squares. The regression sum of squares is the mean regression sum of squares multiplied by the number of independent variables, which is 1, so the regression sum of squares is equal to 117.9. The residual sum of squares is the mean squared error multiplied by the denominator degrees of freedom, which is the number of observations minus the number of independent variables, minus 1, which is equal to 100 ? 1 ? 1 = 98. The residual sum of squares is then 2.807 × 98 = 275.1. The total sum of squares is the sum of the regression sum of squares and the residual sum of squares, which is 117.9 + 275.1 = 393.0. The R2 = 117.9 / 393.0 = 0.3, so the correlation is the square root of 0.3 = 0.55.



作者: aianjie    时间: 2009-6-26 15:01

 

4、Erica Basenj, CFA, has been given an assignment by her boss. She has been requested to review the following regression output to answer questions about the relationship between the monthly returns of the Toffee Investment Management (TIM) High Yield Bond Fund and the returns of the index (independent variable).

Regression Statistics

< >>

< >>

< >>

< >>

< >>

R2

??

< >>

< >>

< >>

< >>

Standard Error

??

< >>

< >>

< >>

< >>

Observations

20

< >>

< >>

< >>

< >>

< >>

< >>

< >>

< >>

< >>

< >>

ANOVA

< >>

< >>

< >>

< >>

< >>

< >>

df

SS

MS

F

Significance F

Regression

1

23,516

23,516

?

?

Residual

18

?

7

< >>

< >>

Total

19

23,644

< >>

< >>

< >>

< >>

< >>

< >>

< >>

< >>

< >>

Regression Equation

< >>

< >>

< >>

< >>

< >>

< >>

< >>

Coefficients

Std. Error

t-statistic

P-value

Intercept

< >>

5.2900

1.6150

?

?

Slope

< >>

0.8700

0.0152

?

?

What is the value of the correlation coefficient?

A)    ?0.9973.

B)    0.8700.

C)   0.9973.

D)   ?0.8700.


作者: aianjie    时间: 2009-6-26 15:01

 

The correct answer is C

R2 is the correlation coefficient squared, taking into account whether the relationship is positive or negative. Since the value of the slope is positive, the TIM fund and the index are positively related. R2 is calculated by taking the (RSS / SST) = 0.99459. (0.99459)1/2 = 0.9973.


作者: aianjie    时间: 2009-6-26 15:03

 

What is the sum of squared errors (SSE)?

A) 23,515.

B) 23,644.

C) 3,283.

D) 128.


作者: aianjie    时间: 2009-6-26 15:03

 

The correct answer is D

SSE = SST ? RSS = 23,644 ? 23,516 = 128

What is the value of R2?

A) 0.0055.

B) 5.2900.

C) 0.9946. 

D) 0.9471.


作者: aianjie    时间: 2009-6-26 15:05

 

The correct answer is D

SSE = SST ? RSS = 23,644 ? 23,516 = 128


作者: aianjie    时间: 2009-6-26 15:05

 

What is the value of R2?

A) 0.0055.

B) 5.2900.

C) 0.9946. 

D) 0.9471.


作者: aianjie    时间: 2009-6-26 15:06

 

The correct answer is C

R2 = RSS / SST = 23,516 / 23,644 = 0.9946.


作者: aianjie    时间: 2009-6-26 15:06

 

Is the intercept term statistically significant at the 5% level of significance and the 1% level of significance, respectively?

1%   5%

A)               Yes    No

B)               No     No

C)               Yes    Yes

D)               No     Yes


作者: aianjie    时间: 2009-6-26 15:06

 

The correct answer is C

The test statistic is t = b / std error of b = 5.29 / 1.615 = 3.2755.

Critical t-values are ± 2.101 for the degrees of freedom = n ? k ? 1 = 18 for alpha = 0.05. For alpha = 0.01, critical t-values are ± 2.878. At both levels (two-tailed tests) we can reject H0 that b = 0.


作者: aianjie    时间: 2009-6-26 15:07

 

What is the value of the F-statistic?

A) 0.0003. 

B) 0.9945. 

C) 182. 

D) 3,359. 


作者: aianjie    时间: 2009-6-26 15:07

 

The correct answer is D

F = mean square regression / mean square error = 23,516 / 7 = 3,359.


作者: aianjie    时间: 2009-6-26 15:08

 

Heteroskedasticity can be defined as:

A) error terms that are dependent. 

B) a regression that changes over time. 

C) nonconstant variance of the error terms. 

D) independent variables that are correlated with each other. 


作者: aianjie    时间: 2009-6-26 15:08

 

The correct answer is C

Heteroskedasticity occurs when the variance of the residuals is not the same across all observations in the sample. Autocorrelation refers to dependent error terms. Nonstationarity is when the regression changes over time.

 


作者: aianjie    时间: 2009-6-26 15:12

 

5、A study of a sample of incomes (in thousands of dollars) of 35 individuals shows that income is related to age and years of education. The following table shows the regression results:

   

Coefficient

Standard Error

t-statistic

P-value

Intercept

5.65

1.27

4.44

0.01

Age

0.53

?

1.33

0.21

Years of Education

2.32

0.41

?

0.01

 

 

 

 

 

Anova

df

SS

MS

F

Regression

?

215.10

?

?

Error

?

115.10

?

 

Total

?

?

 

 

The standard error for the coefficient of age and t-statistic for years of education are:

A)    0.40; 5.66.

B)    0.53; 2.96.

C)   0.20; 1.96.

D)   0.32; 1.65.


作者: aianjie    时间: 2009-6-26 15:14

 

The correct answer is A

standard error for the coefficient of age = coefficient / t-value = 0.53 / 1.33 = 0.40

t-statistic for the coefficient of education = coefficient / standard error = 2.32 / 0.41 = 5.66


作者: aianjie    时间: 2009-6-26 15:15

 

Mean square regression (MSR) and mean square error (MSE) are:

A) 102.10; 7.11.

B) 7.38; 3.42.

C) 107.55; 3.60.

D) 6.72; 3.58.


作者: aianjie    时间: 2009-6-26 15:15

 

The correct answer is C

df for Regression = k = 2

df for Error = n – k – 1 = 35 – 2 – 1 = 32

MSR = RSS / df = 215.10 / 2 = 107.55

MSE = SSE / df = 115.10 / 32 = 3.60


作者: aianjie    时间: 2009-6-26 15:16

 

What is the R2 for the regression?

A) 62%. 

B) 76%. 

C) 65%. 

D) 82%. 


作者: aianjie    时间: 2009-6-26 15:16

 

The correct answer is C

SST = RSS + SSE

= 215.10 + 115.10 = 330.20

R2 = RSS / SST = 215.10 / 330.20 = 0.65


作者: aianjie    时间: 2009-6-26 15:16

 

What is the predicted income of a 40-year-old person with 16 years of education?

A) $63,970. 

B) $62,120. 

C) $52,780. 

D) $74,890.


作者: aianjie    时间: 2009-6-26 15:17

 

The correct answer is A

income  = 5.65 + 0.53 (age) + 2.32 (education)

        = 5.65 + 0.53 (40) + 2.32 (16)= 63.97 or $63,970


作者: aianjie    时间: 2009-6-26 15:17

 

What is the F-value?

A) 14.36. 

B) 2.16.

C) 29.88.

D) 1.88.


作者: aianjie    时间: 2009-6-26 15:18

 

The correct answer is C

F = MSR / MSE = 107.55 / 3.60 = 29.88

 


作者: aianjie    时间: 2009-6-26 15:18

 

6、Which statement is most accurate? Assume a 5 percent level of significance. The F-statistic is:

Analysis of Variance Table (ANOVA)

Source

Degrees of
freedom (df)

Sum of
Squares

Mean Square
(SS/df)

F-statistic

Regression

5

18,500

3,700

 

Error

94

600.66

6.39

 

Total

99

19,100.66

 

 

A)    0.0017 and the regression is said to be statistically significant.

B)    0.0017 and the regression is said to be statistically insignificant.

C)   579.03 and the regression is said to be statistically insignificant.

D)   579.03 and the regression is said to be statistically significant.


作者: aianjie    时间: 2009-6-26 15:20

 

The correct answer is D

F =3,700/6.39 = 579.03 which is significant since the critical F value is between 2.29 and 2.37. The critical F value is found by using a 5% level of significance F-table and looking up the value that corresponds with 5 = k = the number of independent variables in the numerator and 100 _ 5 _ 1 = 94 df in the denominator resulting in a critical value between 2.29 and 2.37.

 


作者: aianjie    时间: 2009-6-26 15:20

 

7、Consider the following analysis of variance (ANOVA) table:

Source

Sum of squares

Degrees of freedom

Mean square

  Regression

   500

 1

500

  Error

   750

50

  15

  Total

1,250

51

 

The R2 and the F-statistic are, respectively:

A)    R2 = 0.40 and F = 33.333.

B)    R2 = 0.67 and F = 0.971.

C)   R2 = 0.40 and F = 0.971.

D)   R2 = 0.67 and F = 33.333.


作者: aianjie    时间: 2009-6-26 15:21

 

The correct answer is A

R2 = 500 / 1,250 = 0.40. The F-statistic is 500 / 15 = 33.33.

 


作者: aianjie    时间: 2009-6-26 15:22

 

8、Consider the following analysis of variance (ANOVA) table:

Source

Sum of squares

Degrees of freedom

Mean square

 Regression

200

  1

200

 Error

400

40

  10

 Total

600

41

 

The R2 and the F-statistic are, respectively:

A)    R2 = 50% and F = 2.0.

B)    R2 = 33% and F = 20.0.

C)   R2 = 50% and F = 0.952.

D)   R2 = 33% and F = 2.0.


作者: aianjie    时间: 2009-6-26 15:22

 

The correct answer is B

R2 = 200 / 600 = 0.333. The F-statistic is 200 / 10 = 20.

 


作者: aianjie    时间: 2009-6-26 15:24

 

9、Cynthia Jones is Director of Marketing at Vancouver Industries, a large producer of athletic apparel and accessories. Approximately three years ago, Vancouver experienced increased competition in the marketplace, and consequently sales for that year declined nearly 20 percent. At that time, Jones proposed a new marketing campaign for the company, aimed at positioning Vancouver’s product lines toward a younger target audience. Although the new marketing effort was significantly more costly than previous marketing campaigns, Jones assured her superiors that the resulting increase in sales would more than justify the additional expense. Jones was given approval to proceed with the implementation of her plan.

Three years later, in preparation for an upcoming shareholders meeting, the CEO of Vancouver has asked Jones for an evaluation of the marketing campaign. Sales have increased since the inception of the new marketing campaign nearly three years ago, but the CEO is questioning whether the increase is due to the marketing expenditures or can be attributed to other factors. Jones is examining the following data on the firm's aggregate revenue and marketing expenditure over the last 10 quarters. Jones plans to demonstrate the effectiveness of marketing in boosting sales revenue. She chooses to utilize a simple linear regression model. The output is as follows:

 

Aggregate Revenue (Y)

Advertising Expenditure (X)

Y2

XY

X2

 

300

7.5

90,000

2,250

56.25

320

9.0

102,400

2,880

81.00

310

8.5

96,100

2,635

72.25

335

8.2

112,225

2,747

67.24

350

9.0

122,500

3,150

81.00

400

8.5

160,000

3,400

72.25

430

10.0

184,900

4,300

100.00

390

10.5

152,100

4,095

110.25

380

9.0

144,400

3,420

81.00

430

11.0

184,900

4,730

121.00

TOTAL

3,645

91.2

1,349,525

33,607

842.24

Slope coefficient = 34.74 Standard error of slope coefficient = 9.916629313 Standard error of intercept = 92.2840128

ANOVA

 

Df

SS

MS

Regression

1

12,665.125760

12,665.12576

Residual

8

8,257.374238

1,032.17178

Total

9

20,922.5

 

For the questions below, assume that the regression model was estimated using a large number of observations.

What is the upper limit of the 95 percent confidence interval for the slope coefficient?

A)    62.84.

B)    54.18.

C)   46.05.

D)   36.70.


作者: aianjie    时间: 2009-6-26 15:24

 

The correct answer is B

Upper Limit = coefficient + (1.96 x standard error of the coefficient) = 34.74 + (1.96 x 9.917) = 54.18


作者: aianjie    时间: 2009-6-26 15:24

 

What is the lower limit of the 95 percent confidence interval for the slope coefficient?

A) 32.78.

B) 15.30.

C) 12.24.

D) 19.89.


作者: aianjie    时间: 2009-6-26 15:25

 

The correct answer is B

Lower Limit = Coefficient - (1.96 x Standard Error of the coefficient)

 = 34.74 - (1.96 x 9.917)

 = 34.74 - 19.4373 = 15.30


作者: aianjie    时间: 2009-6-26 15:26

 

Which of the following is the CORRECT value of the correlation coefficient between aggregate revenue and advertising expenditure?

A) 0.9500.

B) 0.3947.

C) 0.7780.

D) 0.6053.


作者: aianjie    时间: 2009-6-26 15:26

 

The correct answer is C

The R2 = (SST - SSE)/SST = RSS/SST = (20,922.5 - 8,257.374) / 20,922.5 = 0.6053.

The correlation coefficient is the square root of the R2 in a simple linear regression which is the square root of 0.6053 = 0.7780.

Which of the following reports the CORRECT value and interpretation of the R2 for this regression? The R2 is:

A) 0.3947 indicating that the variability of advertising expenditure explains about 39.47% percent of the variability of aggregate revenue.

B) 0.6053 indicating that the variability of advertising expenditure explains about 60.53% of the variability in aggregate revenue.

C) 0.6053 indicating that the variability of aggregate revenue explains about 60.53% of the variability in advertising expenditure.

D) 0.3947 indicating that the variability of aggregate revenue explains about 39.47% percent of the variability of advertising expenditure


作者: aianjie    时间: 2009-6-26 15:26

 

The correct answer is B

The R2 = (SST - SSE)/SST = (20,922.5 - 8,257.374) / 20,922.5 = 0.6053.

The interpretation of this R2 is that 60.53% of the variation in aggregate revenue (Y) is explained by the variation in advertising expenditure (X).


作者: aianjie    时间: 2009-6-26 15:27

 

What is the y-intercept term, b0?

A) 47.6712. 

B) 34.7400.

C) 92.2840.

D) 512.3600.


作者: aianjie    时间: 2009-6-26 15:27

 

The correct answer is A

The mean of the aggregate revenue (Y) is 3,645/10 = 364.50 and of the advertising expenditure (X) is 91.2/10 = 9.12. The y-intercept, b0 = MeanY – Slope * MeanX = 364.50 – 34.74 * 9.12 = 47.6712.


作者: aianjie    时间: 2009-6-26 15:28

 

What is the calculated F-statistic?

A) 12.2700.

B) 92.2840.

C) 1,032.1717.

D) 0.1250.


作者: aianjie    时间: 2009-6-26 15:28

 

The correct answer is A

The computed value of the F-Statistic = MSR/MSE = 12,665.12576 / 1,032.17178 = 12.27, where MSR and MSE are from the ANOVA table.


作者: aianjie    时间: 2009-6-26 15:28

 

10、Given the results from a multiple regression function: 24 ? 2 × X1i + 4 × X2i, which of the following are known to be true? Based upon the equation:

?            if the observed dependent variable equals 30 and X1i = 1, then X2i = 2.

?            if both independent variables equal zero, the fitted value for the dependent variable is 24.

?            there is a negative relationship between the dependent variable and X1i.

?            given X1i = 2 and X2i = 1 the fitted value for the dependent variable is 24.

A) II and III only.

B) I and III only.

C) II, III, and IV only.

D) I, II, III, and IV.


作者: aianjie    时间: 2009-6-26 15:29

 

The correct answer is C

A researcher cannot make inferences about the independent variables from observed values of the dependent variable.

 


作者: aianjie    时间: 2009-6-26 15:29

 

11、John Conner, FRM and Chris Bond, FRM are planning to compute the coefficient of determination from a regression analysis. Conner asserts the following formula is appropriate for a two-variable regression, but it will have to be adjusted for degrees of freedom when additional variables are added to the equation: .

Bond asserts that the following formula is appropriate, and it does not have to be adjusted for degrees of freedom: .

What, if any, error have the analysts made in their assertions?

A) There are no errors in the assertions.

B) Bond’s formula has an error, but Bond is correct in that it does not have to be adjusted for degrees of freedom.

C) Conner’s formula has an error, but Conner is correct in that it must be adjusted for degrees of freedom.

D) Conner’s formula is correct, but Conner is not correct in asserting that it must be adjusted for degrees of freedom.


作者: aianjie    时间: 2009-6-26 15:30

 

The correct answer is D

The formulas are both correct, and it is not necessary to adjust for degrees of freedom

 


作者: aianjie    时间: 2009-6-26 15:30

 

12、Werner Baltz, CFA, has regressed 30 years of data to forecast future sales for National Motor Company based on the percent change in gross domestic product (GDP) and the change in price of a U.S. gallon of fuel at retail. The results are presented below. Note: results must be multiplied by $1,000,000:

Coefficient Estimates

< > td>

< > td>

< > td>

< > td>

< > td>

Standard Error

Predictor

Coefficient

of the Coefficient

Intercept

78

13.710

?1 GDP

30.22

12.120

?2$ Fuel

?412.39

183.981

 

 

Analysis of Variance Table (ANOVA)

< > td>

< > td>

< > td>

< > td>

Source

Degrees of Freedom

Sum of Squares

Mean Square

Regression

 

291.30

145.65

Error

27

132.12

 

Total

29

423.42

 

< > td>

In 2002, if GDP rises 2.2% and the price of fuels falls $0.15, Baltz’s model will predict Company sales in 2002 to be (in $ millions) closest to:

A)    $128.

B)    $82.

C)   $206.

D)   $254.


作者: aianjie    时间: 2009-6-26 15:30

 

The correct answer is D

Sales will be closest to $78 + ($30.22 × 2.2) + [(?412.39) × (?$0.15)] = $206.34 million.

 


作者: aianjie    时间: 2009-6-26 15:31

 

13、Baltz proceeds to test the hypothesis that none of the independent variables has significant explanatory power. He concludes that, at a 5% level of significance:

A) at least one of the independent variables has explanatory power, because the calculated F-statistic exceeds its critical value. 

B) all of the independent variables have explanatory power, because the calculated F-statistic exceeds its critical value. 

C) none of the independent variables has explanatory power, because the calculated F-statistic does not exceed its critical value. 

D) at least one of the independent variables has explanatory power, because the calculated F-statistic does not exceed its critical value. 


作者: aianjie    时间: 2009-6-26 15:31

 

The correct answer is A

From the ANOVA table, the calculated F-statistic is (mean square regression / mean square error) = 145.65 / 4.89 = 29.7853. From the F distribution table (2 df numerator, 27 df denominator) the F-critical value may be interpolated to be 3.36. Because 29.7853 is greater than 3.36, Baltz rejects the null hypothesis and concludes that at least one of the independent variables has explanatory power.

 


作者: aianjie    时间: 2009-6-26 15:32

 

14、Baltz then tests the individual variables, at a 5% level of significance, to determine whether sales are explained by individual changes in GDP and fuel prices. Baltz concludes that:

A) neither GDP nor fuel price changes explain changes in sales.

B) only GDP changes explain changes in sales.

C) both GDP and fuel price changes explain changes in sales.

D) only fuel price changes explain changes in sales.


作者: aianjie    时间: 2009-6-26 15:32

 

The correct answer is C

From the ANOVA table, the calculated t-statistics are (30.22 / 12.12) = 2.49 for GDP and (?412.39 / 183.981) = ?2.24 for fuel prices. These values are both outside the t-critical value at 27 degrees of freedom of ±2.052. Therefore, Baltz is able to reject the null hypothesis that these coefficients are equal to zero, and concludes that each variable is important in explaining sales.

 


作者: aianjie    时间: 2009-6-26 15:33

 

15、Consider the following regression equation:

Salesi = 20.5 + 1.5 R&Di + 2.5 ADVi – 3.0 COMPi

where Sales is dollar sales in millions, R&D is research and development expenditures in millions, ADV is dollar amount spent on advertising in millions, and COMP is the number of competitors in the industry.

Which of the following is NOT a correct interpretation of this regression information?

A) If a company spends $1 more on R&D (holding everything else constant), sales are expected to increase by $1.5 million.

B) If R&D and advertising expenditures are $1 million each and there are 5 competitors, expected sales are $9.5 million.

C) One more competitor will mean $3 million less in sales (holding everything else constant).

D) Increasing advertising dollars by $1 million (holding everything else constant), will result in $2.5 million additional sales.


作者: aianjie    时间: 2009-6-26 15:33

 

The correct answer is A

If a company spends $1 million more on R&D (holding everything else constant), sales are expected to increase by $1.5 million. Always be aware of the units of measure for the different variables.

 


作者: aianjie    时间: 2009-6-26 15:33

 

16、Consider the following regression equation:

Salesi = 10.0 + 1.25 R&Di + 1.0 ADVi – 2.0 COMPi + 8.0 CAPi

where Sales is dollar sales in millions, R&D is research and development expenditures in millions, ADV is dollar amount spent on advertising in millions, COMP is the number of competitors in the industry, and CAP is the capital expenditures for the period in millions of dollars.

Which of the following is NOT a correct interpretation of this regression information?

A) If a company spends $1 million more on capital expenditures (holding everything else constant), Sales are expected to increase by $8.0 million.

B) One more competitor will mean $2 million less in Sales (holding everything else constant).

C) If R&D and advertising expenditures are $1 million each, there are 5 competitors, and capital expenditures are $2 million, expected Sales are $8.25 million.

D) Increasing advertising dollars by $1 million (holding everything else constant), will result in $1 million additional Sales.


作者: aianjie    时间: 2009-6-26 15:34

 

The correct answer is C

Predicted sales = $10 + 1.25 + 1 – 10 + 16 = $18.25 million.

 


作者: aianjie    时间: 2009-6-26 15:34

 

17、Henry Hilton, CFA, is undertaking an analysis of the bicycle industry. He hypothesizes that bicycle sales (SALES) are a function of three factors: the population under 20 (POP), the level of disposable income (INCOME), and the number of dollars spent on advertising (ADV). All data are measured in millions of units. Hilton gathers data for the last 20 years. Which of the follow regression equations correctly represents Hilton’s hypothesis?

A) SALES = α x β1 POP x β2 INCOME x β3 ADV x ε.

B) INCOME = α + β1 POP + β2 SALES + β3 ADV + ε.

C) SALES = α + β1 POP + β2 INCOME + β3 ADV + ε.

D) INCOME = α + β1 POP + β2 ADV + ε.


作者: aianjie    时间: 2009-6-26 15:34

 

The correct answer is C

SALES is the dependent variable. POP, INCOME, and ADV should be the independent variables (on the right hand side) of the equation (in any order). Regression equations are additive.

 


作者: aianjie    时间: 2009-6-26 15:35

 

18、Henry Hilton, CFA, is undertaking an analysis of the bicycle industry.  He hypothesizes that bicycle sales (SALES) are a function of three factors: the population under 20 (POP), the level of disposable income (INCOME), and the number of dollars spent on advertising (ADV).  All data are measured in millions of units.  Hilton gathers data for the last 20 years and estimates the following equation (standard errors in parentheses): 

SALES = α + 0.004 POP + 1.031 INCOME + 2.002 ADV

(0.005)

(0.337)

(2.312)

 

The critical t-statistic for a 95 percent confidence level is 2.120.  Which of the independent variables is statistically different from zero at the 95 percent confidence level?

A)    INCOME and ADV.

B)    POP and ADV.

C)   INCOME only.

D)   ADV only.


作者: aianjie    时间: 2009-6-26 15:35

 

The correct answer is C

The calculated test statistic is coefficient/standard error. Hence, the t-stats are 0.8 for POP, 3.059 for INCOME, and 0.866 for ADV. Since the t-stat for INCOME is the only one greater than the critical t-value of 2.120, only INCOME is significantly different from zero.

 


作者: aianjie    时间: 2009-6-26 15:35

 

19、Henry Hilton, CFA, is undertaking an analysis of the bicycle industry.  He hypothesizes that bicycle sales (SALES) are a function of three factors: the population under 20 (POP), the level of disposable income (INCOME), and the number of dollars spent on advertising (ADV).  All data are measured in millions of units.  Hilton gathers data for the last 20 years and estimates the following equation (standard errors in parentheses):

SALES = 0.000  +  0.004 POP + 1.031 INCOME + 2.002 ADV

(0.113)

(0.005)

(0.337)

(2.312)

 

For next year, Hilton estimates the following parameters: (1) the population under 20 will be 120 million, (2) disposable income will be $300,000,000, and (3) advertising expenditures will be $100,000,000.  Based on these estimates and the regression equation, what are predicted sales for the industry for next year?

A)    $557,143,000.

B)    $509,980,000.

C)   $656,991,000.

D)   $669,471,000.


作者: aianjie    时间: 2009-6-26 15:36

 

The correct answer is B

Predicted sales for next year are:

SALES = α + 0.004 (120) + 1.031 (300) + 2.002 (100) = 509,980,000.

 


作者: aianjie    时间: 2009-6-26 15:36

 

20、Consider a study of 100 university endowment funds that was conducted to determine if the funds’ annual risk-adjusted returns could be explained by the size of the fund and the percentage of fund assets that are managed to an indexing strategy. The equation used to model this relationship is:

ARARi = b0 + b1Sizei + b2Indexi + ei

Where:

ARARi

=

the average annual risk-adjusted percent returns for the fund i over the 1998-2002 time period.

Sizei

=

the natural logarithm of the average assets under management for fund i.

Indexi

=

the percentage of assets in fund i that were managed to an indexing strategy.

The table below contains a portion of the regression results from the study.

Partial Results from Regression ARAR on Size and Extent of Indexing

 

Coefficients

Standard Error

t-Statistic

Intercept

???

0.55

?5.2

Size

0.6

0.18

???

Index

1.1

???

2.1

Which of the following is the most accurate interpretation of the slope coefficient for size? ARAR:

A)    will change by 1.0% when the natural logarithm of assets under management changes by 0.6, holding index constant.

B)    will change by 0.6% when the natural logarithm of assets under management changes by 1.0, holding index constant.

C)   and index will change by 1.1% when the natural logarithm of assets under management changes by 1.0.

D)   will change by 1.1% when the natural logarithm of assets under management changes by 1.0 and index changes by 0.6.


作者: aianjie    时间: 2009-6-26 15:36

 

The correct answer is B

A slope coefficient in a multiple linear regression model measures how much the dependent variable changes for a one-unit change in the independent variable, holding all other independent variables constant. In this case, the independent variable size (= ln average assets under management) has a slope coefficient of 0.6, indicating that the dependent variable ARAR will change by 0.6% return for a one-unit change in size, assuming nothing else changes. Pay attention to the units on the dependent variable.

Which of the following is the estimated standard error of the regression coefficient for index?

A) 1.91.

B) 2.31.

C) 0.52.

D) 1.00.


作者: aianjie    时间: 2009-6-26 15:37

 

The correct answer is C

The t-statistic for testing the null hypothesis H0:  βi = 0 is t = (bi ? 0) / σi, where βi is the population parameter for independent variable i, bi is the estimated coefficient, and σi is the coefficient standard error.  Using the information provided, the estimated coefficient standard error can be computed as bIndex / t = σIndex = 1.1 / 2.1 = 0.5238.


作者: aianjie    时间: 2009-6-26 15:37

 

Which of the following is the t-statistic for size?

A) 0.30.

B) ?0.12.

C) 0.70.

D) 3.33.


作者: aianjie    时间: 2009-6-26 15:37

 

The correct answer is D

The t-statistic for testing the null hypothesis H0:  βi = 0 is t = (bi ? 0) / σi, where βi is the population parameter for independent variable i, bi is the estimated coefficient, and σi is the coefficient standard error.  Using the information provided, the t-statistic for size can be computed as t = bSize / σSize = 0.6 / 0.18 = 3.3333.


作者: aianjie    时间: 2009-6-26 15:37

 

Which of the following is the estimated intercept for the regression?

A) ?0.11.

B) ?9.45.

C) ?2.86.

D) ?4.65.


作者: aianjie    时间: 2009-6-26 15:38

 

The correct answer is C

The t-statistic for testing the null hypothesis H0:  βi = 0 is t = (bi ? 0) / σi, where βi is the population parameter for independent variable i, bi is the estimated parameter, and σi is the parameter’s standard error.  Using the information provided, the estimated intercept can be computed as b0 = t × σ0 = ?5.2 × 0.55 = ?2.86.


作者: lianai    时间: 2009-6-26 15:45

 

Which of the following statements is most accurate regarding the significance of the regression parameters at a 5% level of significance?

A) All of the parameter estimates are significantly different than zero at the 5% level of significance.

B) The parameter estimates for the intercept are significantly different than zero. The slope coefficients for index and size are not significant.

C) The parameter estimates for the intercept and the independent variable size are significantly different than zero. The coefficient for index is not significant.

D) None of the regression parameters are significantly different than zero at the 5% level of significance.


作者: lianai    时间: 2009-6-26 15:45

 

The correct answer is A

At 5% significance and 97 degrees of freedom (100 ? 3), the critical t-value is slightly greater than, but very close to, 1.984.  The t-statistic for the intercept and index are provided as ?5.2 and 2.1, respectively, and the t-statistic for size is computed as 0.6 / 0.18 = 3.33.  The absolute value of the all of the regression intercepts is greater than tcritical = 1.984.  Thus, it can be concluded that all of the parameter estimates are significantly different than zero at the 5% level of significance.


作者: lianai    时间: 2009-6-26 15:46

 

21、Consider the following estimated regression equation, with the standard errors of the slope coefficients as noted:

Salesi = 10.0 + 1.25 R&Di + 1.0 ADVi – 2.0 COMPi + 8.0 CAPi

where the standard error for the estimated coefficient on R&D is 0.45, the standard error for the estimated coefficient on ADV is 2.2 , the standard error for the estimated coefficient on COMP is 0.63, and the standard error for the estimated coefficient on CAP is 2.5.

The equation was estimated over 40 companies. Using a 5 percent level of significance, which of the estimated coefficients are significantly different from zero?

A) R&D, ADV, COMP, and CAP.

B) R&D, COMP, and CAP only.

C) COMP and CAP only.

D) ADV and CAP only.


作者: lianai    时间: 2009-6-26 15:47

 

The correct answer is B

The critical t-values for 40-4-1 = 35 degrees of freedom and a 5 percent level of significance are ± 2.03.


作者: lianai    时间: 2009-6-26 15:48

 

22、 Milky Way, Inc. is a large manufacturer of children’s toys and games based in the United States. Their products have high name brand recognition, and have been sold in retail outlets throughout the United States for nearly fifty years. The founding management team was bought out by a group of investors five years ago. The new management team, led by Russell Stepp, decided that Milky Way should try to expand its sales into the Western European market, which had never been tapped by the former owners. Under Stepp’s leadership, additional personnel are hired in the Research and Development department, and a new marketing plan specific to the European market is implemented. Being a new player in the European market, Stepp knows that it will take several years for Milky Way to establish its brand name in the marketplace, and is willing to make the expenditures now in exchange for increased future profitability.

Now, five years after entering the European market, Stepp is reviewing the results of his plan. Sales in Europe have slowly but steadily increased over since Milky Way’s entrance into the market, but profitability seems to have leveled out. Stepp decides to hire a consultant, Ann Hays, CFA, to review and evaluate their European strategy. One of Hays’ first tasks on the job is to perform a regression analysis on Milky Way’s European sales. She is seeking to determine whether the additional expenditures on research and development and marketing for the European market should be continued in the future.

Hays begins by establishing a relationship between the European sales of Milky Way (in millions of dollars) and the two independent variables, the number of dollars (in millions) spent on research and development (R&D) and marketing (MKTG). Based upon five years of monthly data, Hays constructs the following estimated regression equation:

Estimated Sales = 54.82 + 5.97 (MKTG) + 1.45 (R&D)

Additionally, Hays calculates the following regression estimates:

 

Coefficient

Standard Error

Intercept

54.82

3.165

MKTG

5.97

1.825

R&D

1.45

0.987

Hays begins the analysis by determining if both of the independent variables are statistically significant. To test whether a coefficient is statistically significant means to test whether it is statistically significantly different from:

A)    the upper tail critical value.

B)    slope coefficient.

C)   zero.

D)   the lower tail critical value.


作者: lianai    时间: 2009-6-26 15:48

 

The correct answer is C

The magnitude of the coefficient reveals nothing about the importance of the independent variable in explaining the dependent variable. Therefore, it must be determined if each independent variable is statistically significant. The null hypothesis is that the slope coefficient for each independent variable equals zero.


作者: lianai    时间: 2009-6-26 15:49

 

The t-statistic for the marketing variable is calculated to be:

A) 3.271. 

B) 1.886. 

C) 1.469. 

D) 17.321. 


作者: lianai    时间: 2009-6-26 15:49

 

The correct answer is A

The t-statistic for the marketing coefficient is calculated as follows: (5.97– 0.0) / 1.825 = 3.271

Hays formulates a test structure where the decision rule is to reject the null hypothesis if the calculated test statistic is either larger than the upper tail critical value or lower than the lower tail critical value. At a 5 percent significance level with 57 degrees of freedom, assume that the two-tailed critical t-values are tc = ±2.004. Based on this information, Hays makes the following conclusions:

?            Point 1: The intercept term is statistically significant.

?            Point 2: Both independent variables contribute to explaining states for Milky Way, Inc.

?            Point 3: Both independent variables have slope coefficients that are significantly different from zero at the 5% significance level.

?            Point 4: If an F-test were being used, the null hypothesis would be rejected.

Which of Hays’ conclusions are CORRECT?

A) Points 1 and 2. 

B) Points 1 and 4. 

C) Points 2 and 3. 

D) Points 2 and 4. 


作者: lianai    时间: 2009-6-26 15:50

 

The correct answer is B

Hays’ Point 1 is correct. The t-statistic for the intercept term is (54.82 – 0) / 3.165 = 17.32, which is greater than the critical value of 2.004, so we can conclude that the intercept term is statistically significant.

Hays’ Points 2 and 3 are incorrect. The t-statistic for the R&D term is (1.45 – 0) / 0.987 = 1.469, which is not greater than the critical value of 2.004. This means that the R&D term does not have a slope coefficient that is greater than 0 at the 5 percent significant level. It also means that only MKTG can be said to contribute to explaining sales for Milky Way, Inc.

Hays’ Point 4 is correct. An F-test tests whether at least one of the independent variables is significantly different from zero, where the null hypothesis is that all none of the independent variables are significant. Since we know that MKTG is a significant variable (t-statistic of 3.271), we can reject the hypothesis that none of the variables are significant.






欢迎光临 FRM论坛 (http://bbs.frmspace.com/) Powered by Discuz! 7.2