Regression Testing

Can someone help me with this assignment

1.

In a regression model with one independent variable X, the intercept or constant (b0) represents the

  1. A) predicted value of the dependent variable Y when the X = 0.
  2. B) estimated average per unit change in the dependent variable for every unit change in X.
  3. C) predicted or fitted value of dependent variable.
  4. D) Average variation around the sample regression line.

2.

In a regression model with one independent variable X, the slope coefficient (b1) represents the

  1. A) predicted value of the dependent variable Y when the X = 0.
  2. B) estimated average per unit change in the dependent variable for every unit change in X.
  3. C) predicted or fitted value of dependent variable.
  4. D) average variation around the sample regression line.

3.

If one examines the relationship between two variables, the correlation (r) and the slope coefficient (b1) in a simple regression model

  1. A) may have opposite signs.
  2. B) must have the same signs.
  3. C) must have opposite signs.
  4. D) are equal.

4.

If one examines the relationship between two variables, the correlation (r) and the slope coefficient (b1) in a multiple regression model

  1. A) may have opposite signs.
  2. B) must have the same signs.
  3. C) must have opposite signs.
  4. D) are equal.

5.

Suppose you were given data on number of radio ads and sales for a local department store. You had observations for 20 months. The correlation between ads and sales is 0.40. Does this indicate a positive and significant relationship between ads and sales at the 0.005 level of significance?

  1. A) Yes
  2. B) No
  3. C) Not enough information to determine significance

6.

R2 detects the strength of the relationship between the dependent variable and any individual independent variable.

  1. A) True
  2. B) False

7.

When an explanatory variable is dropped from a multiple regression model, R2 can increase.

  1. A) True
  2. B) False

8.

If one tests to see if the value of R-squared is statistically different from zero, one should use a

  1. A) t-statistic.
  2. B) F-statistic.
  3. C) z-statistic.
  4. D) correlation coefficient (r).

9.

In a multiple regression model, the adjusted R2

  1. A) can be negative.
  2. B) must be positive.
  3. C) has to be larger than R2.
  4. D) can be larger than 1.

10.

In a multiple regression model, the intercept or constant (b0)

  1. A) forces the regression line to go through the origin (point where dependent variable is zero when the independent variable is zero).
  2. B) is typically unimportant in interpreting the model so can be omitted from model with little effect on results.
  3. C) measures the portion of variation in the dependent variable that is explained by the variation in all the independent variables.
  4. D) provides flexibility so the least squares method can estimate the best line fit for data in the relevant range of firm operations.

11.

Which of the following is NOT a part of regression specification?

  1. A) Selecting the dependent variable.
  2. B) Identifying important independent variables.
  3. C) Explaining expected signs for the coefficients of each independent variable.
  4. D) Forecasting the expected value of R2 for your model.

Situation 7.1.1:

A large national bank charges local companies for using their services. A bank official reported the results of a regression analysis designed to predict the bank’s charges (Y) — measured in dollars per month — for services rendered to local companies. One independent variable used to predict service charge to a company is the company’s annual sales revenue (X) — measured in millions of dollars. Data for 21 companies who use the bank’s services were used to fit this model:

(Y) = b0 + b1X

The results of the simple linear regression are provided below.

 

Model Variable Coefficient t-statistic (absolute value)
B0 Constant -270 1.25
B1 X 200 1.75
R2 = 0.63

 

 

12.

Referring to Situation 7.1.1, interpret the estimate of b0.

  1. A) All companies will be charged at least $270 by the bank.
  2. B) There is no practical interpretation since bank charges of less than $0 is a nonsensical value.
  3. C) About 95% of the observed service charges fall within $270 of the least squares line.
  4. D) For every $1 million increase in sales revenue, we expect a service charge to decrease $2,700.

13.

Referring to Situation 7.1.1, interpret b1 using the appropriate t-test (Hint: first determine whether this should be a one-tail or two-tail test).

  1. A) There is insufficient information to determine the statistical significance of sales revenue.
  2. B) There is sufficient evidence (at the a = 0.05) to conclude that sales revenue (X) is a significant predictor of service charge (Y).
  3. C) There is insufficient evidence (at the a = 0.10) to conclude that sales revenue (X) is a significant predictor of service charge (Y).
  4. D) For every $1 million increase in sales revenue, we expect the service charge to decrease by $70.

14.

Referring to Situation 7.1.1, what is the percentage of the total variation in bank charges explained by variations in company sales revenue?

  1. A) 175%
  2. B) 125%
  3. C) 95%
  4. D) 63%

15.

Referring to Situation 7.1.1, what is the fitted value of bank charges when company sales revenue is $10m?

  1. A) $1260
  2. B) $1730
  3. C) $2000
  4. D) $6300

Situation 7.1.2:

An economist is interested to see how consumption for an economy (in $ billions) is influenced by gross domestic product ($ billions) and aggregate price (consumer price index). The economist used observations from 15 industrialized countries to estimate the relationship. The Microsoft Excel output of this regression is partially reproduced below.

 

Variable Coefficient Standard Error
Constant -0.086 0.567
GDP 0.765 0.057
Price -0.0006 0.0028
R2 = 0.96; Adjusted R2 = 0.93
F = 18.63

16.

Referring to Situation 7.1.2, one economy in the sample had an aggregate consumption level of $3 billion, a GDP of $3.5 billion, and an aggregate price level of 125. What is the residual for this data point?

  1. A) $2.52 billion
  2. B) $0.48 billion
  3. C) – $1.33 billion
  4. D) – $2.52 billion

17.

Which of the independent variables (price, GDP) in Situation 7.1.2 is statistically significant and, therefore, important to explaining variations in consumption (Hint: you are provided standard errors not t-statistics in the table)?

  1. A) GDP only
  2. B) Price only
  3. C) Both GDP and Price
  4. D) Neither GDP or Price

18.

Referring to Situation 7.1.2, interpret the test to determine whether the model (all independent variables taken together) have statistically significant explanatory power.

  1. A) Since the R2 indicates that variation in all independent variables explain 96% of the variation in the dependent variables, the model can be interpreted as having significant explanatory power.
  2. B) Using a = 0.05 and F = 3.59, the null hypothesis is rejected and the model can be interpreted as having significant explanatory power.
  3. C) Using a = 0.05 and F2,12 = 3.89, the null hypothesis is rejected and the model can be interpreted as having significant explanatory power.
  4. D) Using a = 0.05 and t12 = 1.7823, the null hypothesis is rejected and the model can be interpreted as having significant explanatory power.

19.

The economist in Situation 7.1.2 decided to add a new variable, the percentage of GDP devoted to agriculture, to the model. The new results included an R2 = 0.98 and Adjusted R2 = 0.92. How do you interpret these changes?

  1. A) Since R2 increased by more than adjusted R2 decreased, the new variable was an important addition to the model.
  2. B) The change in adjusted R2 implies that the benefit of the information added by the new variables is less than the cost in lost degrees of freedom caused by adding the new variable so the new variable is not an important addition to the model.
  3. C) One cannot evaluate the importance of the new variable to the model without knowing either the standard error or t-statistic for the variable’s coefficient.

20.

There are several reasons why choices between alternative regression models should NOT be based on which model has the highest R2. Which of the following is NOT one of those reasons?

  1. A) Models using time series data will have higher R2 than models using cross-sectional data.
  2. B) Some variables are inherently more unstable and thus harder to predict.
  3. C) R2 indicates relationships not causality.
  4. D) R2 captures the individual effects of variables but not the effects of all variables.