2023年10月9日星期一

Multiple linear regression (1)

In multiple regression, each coefficient is interpreted as the estimated change in y corresponding to a one unit change in a variable, when all other variables are held constant. For example: 



So in this example, dollar 9000 is an estimate of the expected increase in sales y, corresponding to a dollar 1000 increase in capital investment x1, when marketing expenditures x2 are held constant.

New considerations:

  • Adding more independent variables to a multiple regression procedure dose not mean the regression will be "better" or offer better predictions; in fact, it can make things worse, i.e., OVERFITTING.
  • The addition of more independent variables creates more relationships among them. So not only are the independent variables potentially related to the dependent variable, they are also potentially related to each other. When this happens, it is called MULTICOLLINEARITY.
  • The ideal is for all of the independent variables to be correlated with the dependent variable but NOT with each other.

Multiple linear regression lingo



Low bias? Low variance?

-- It depends
-- Typically unbiased is better
-- Sometimes the variance is so out of control. You sacrifice a little bias to fix the variance
  • Ridge regression
  • Lasso


没有评论:

发表评论

85年前的4月,他写了卡利古拉。

  所以首先要闭上嘴巴——不要观众了,学着自我评判。专注保养身体之余亦不忘追求人生的意义。放下一切身段,致力于一种双重的解放——对于金钱以及对于自己的虚荣和怯懦。生活要有规律。花两年时间来想通一件事其实不算浪费人生。要把之前那些习惯改掉,先全心全力地汲取教训,然后再耐心地去学习。...