Tuesday, January 3, 2017

CFA Level 2 - Quantitative Methods

Linear Regression

regression equation : Yi = b0 + b1Xi + εii = 1, …, n 

 (Yib0b1Xi)2 
    means (dependent variable – predicted value of dependent variable)2


Standard Error of Estimates

εi , the residual term in the regression.

SEE=i=1n(Yib0b1Xi)2n21/2=i=1n(εi)2n21/2
∑             i=1n(εi)2


It is the sum of squared residuals


Coefficient of determination

If we call i=1n(YiY)2 the total variation of Y and i=1n(YiYi)2 the unexplained variation from the regression,

then we can measure the explained variation from the regression using the following equation:

R2=ExplainedvariationTotalvariation=TotalvariationUnexplainedvariationTotalvariation=1UnexplainedvariationTotalvariation

Hypothesis testing

We will use a 95 percent confidence interval for our test, or we could say that the test has a significance level of 0.05.

The number of degrees of freedom equals the number of observations minus the number of parameters estimated

t-test of significance:
t=b1b1sb1
The b1 is from the null hypothesis (hypothesized population value of the regression coefficient).
The b1bar is the estimated regression coefficient.
.
The tis critical t-value at the 0.05/0.01 significance level, from the book. The t is calculated. If t is greater than tc, then we can reject the null hypothesis.
s is the standard error of the regression.

For example, the t-statistic is 2.50, and at the 0.05 significance level, tc = 2.00; thus we reject the null hypothesis because t > tc. This statement is equivalent to saying that we are 95 percent confident that the interval for the coefficient does not contain the value b1.

Often, financial analysts report the p-value or probability value for a particular hypothesis. The p-value is the smallest level of significance at which the null hypothesis can be rejected.

For example, if the p-value is 0.005, we can reject the hypothesis that the true parameter is equal to 0 at the 0.5 percent significance level (99.5 percent confidence).

Analysis of Variance (ANOVA)

Total SS = Regression SS + Residual SS

Regression SS:
i=1n(YiYi)2  
Total SS:
 i=1n(YiY)2 

F-test
The F-test for determining whether the slope coefficient equals to 0 is based on an F-statistic.
The F-statistic measures how well the regression equation explains the variation in the dependent variable. 
If the independent variable explains little of the variation in the dependent variable, the value of the F-statistic will be very small.

F statistic: Regression MSS /  Residual MSS
MSS: means sum of squares (SS divided by df)

F=RSS/1SSE/(n2)=MeanregressionsumofsquaresMeansquarederror  

Confidence Interval of Regression Coefficient (Interval Prediction)
   CI = coeff + tc * SE
tc is from the confidence level. For large number of df:
90% confidence interval will have alpha value of 0.10 (two tail), and tc of 1.645
95% confidence interval will have alpha value of 0.05 (two tail), and tc of 1.96
99% confidence interval will have alpha value of 0.01 (two tail), and tc of 2.576

The end.




No comments:

Post a Comment