![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
Principle Foundations Home Page
|
|||||
Simple Regression Analysis |
|||||
|
|||||
(bivariate
model) Simple regression is used for testing hypothesis about the relationship between a dependent variable, Y, and and independent or explanatory variable, X, and for prediction. This is to be contrasted with multiple regression analysis, in which there are not one, but two or more independent or explanatory variables. Linear
regression analysis assumes that there is an approximate linear
relationship between X and Y (i.e. the set of random sample values of X
and Y fall on or near a straight line). This is to be contrasted with
non-linear regression analysis. A
scatter diagram is a figure in which each pair of independent - dependent
observations is plotted as a point in the XY plane. Its purpose is to
determine (by inspection) if there exists an approximate linear
relationship between the dependent variable, Y, and the independent or
explanatory variable, X.
The
two-variable linear model, or simple regression analysis, is used for
testing hypotheses about the relationship between independent variable, Y,
and an independent or explanatory variable, X, and for prediction. Simple
linear regression analysis usually begins by plotting the set of XY values
on a scatter diagram and determining by inspection if there exists an
approximate linear relationship.
Yi = b0 + b1Xi
(1.5) Since the
points are unlikely to fall precisely on the line, exact linear
relationship in Eq.(1.5) must be modified to include a random disturbance,
error, or stochastic term, ui :
Yi = b0+b1Xi+ui
(1.6) The error term is assumed to be ;
Example: For a given level of the independent variable (Xi=income), the expected level of the dependent variable (Yi=consumption) will be: E(Yi/Xi)=β0+β1Xi
Assumptions of the classical linear regression model (OLS):
E(ui) = 0 Because of
this assumption, Eq (1.5) gives the average value of Y. Specifically,
since X is assumed fixed, the value of Y in Eq (1.6) varies above and
below its mean as u exceeds or is smaller than 0. Since the average value
of u is assumed to be 0, Eq (1.5) gives the average value of Y.
E(ui)2 =
This
assumption ensures that each observation is equally reliable, so that
estimates of the regression coefficients are efficient and tests of
hypothesis about them are not biased. These first three assumptions about
the error term can be summarized as u ~ N(0,
E(uiuj)
= 0
for
This ensures that the average
value of Y depends only on X and not on u, and it is once again required
in order to have efficient estimates of the regression coefficients and
unbiased tests of their significance.
E(Xiui) = 0 This assumption is made to simplify the analysis.
Copyright
© 2002
Back
to top Evgenia Vogiatzi <<Previous Next>> |