» Coefficient of Determination (R²)

Coefficient of Determination (R²)

1900
  • Karl Pearson
Statistician analyzing regression model data in an office setting.

A statistic indicating the goodness of fit of a model, representing the proportion of the variance in the dependent variable that is predictable from the independent variable(s). An R² of 1 indicates a perfect fit, while 0 indicates no linear relationship. It is calculated as [latex]R^2 \equiv 1 – \frac{SS_{res}}{SS_{tot}}[/latex], where [latex]SS_{res}[/latex] is the residual sum of squares.

The coefficient of determination, R-squared, is a key metric for evaluating regression models. It provides an intuitive measure of how much of the variability in the outcome is captured by the model. It is derived from two key components. The first is the Total Sum of Squares ([latex]SS_{tot} = \sum_i (y_i – \bar{y})^2[/latex]), which measures the total variance in the dependent variable [latex]y[/latex]. The second is the Residual Sum of Squares ([latex]SS_{res} = \sum_i (y_i – \hat{y}_i)^2[/latex]), which measures the variance left unexplained by the model, where [latex]\hat{y}_i[/latex] is the predicted value.

The formula [latex]R^2 = 1 – SS_{res}/SS_{tot}[/latex] can be interpreted as the percentage of total variance that is ‘explained’ by the regression model. For instance, an R² of 0.75 means that 75% of the variability in the outcome can be accounted for by the predictors in the model. In simple linear regression, R² is simply the square of Pearson’s correlation coefficient (r) between the observed and predicted values.

However, R² has a significant limitation: it never decreases when a new predictor variable is added to the model, even if the new variable is irrelevant. This can be misleading and encourage overfitting. To counteract this, the Adjusted R-squared is often used. It modifies the R² value to account for the number of predictors in the model, providing a more accurate measure of goodness of fit for multiple regression.

UNESCO Nomenclature: 1209
- 统计资料

类型

抽象系统

中断

实质性

使用方法

广泛使用

前体

  • Concept of variance and standard deviation
  • Method of least squares
  • Pearson’s product-moment correlation coefficient
  • 方差分析 (方差分析)原则

应用

  • evaluating the performance of predictive models in science and engineering
  • model selection in econometrics and social sciences
  • quantifying the proportion of variance explained by a set of predictors
  • validating financial models for risk assessment

专利:

NA

潜在的创新想法

级别需要会员

您必须是!!等级!!会员才能访问此内容。

立即加入

已经是会员? 在此登录
Related to: r-squared, coefficient of determination, goodness of fit, model evaluation, explained variance, sum of squares, regression diagnostics, statistical significance, adjusted r-squared, correlation.

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注

迎接新挑战
机械工程师、项目、工艺工程师或研发经理
有效的产品开发

可在短时间内接受新的挑战。
通过 LinkedIn 联系我
塑料金属电子集成、成本设计、GMP、人体工程学、中高容量设备和耗材、精益制造、受监管行业、CE 和 FDA、CAD、Solidworks、精益西格玛黑带、医疗 ISO 13485

我们正在寻找新的赞助商

 

您的公司或机构从事技术、科学或研究吗?
> 给我们发送消息 <

接收所有新文章
免费,无垃圾邮件,电子邮件不分发也不转售

或者您可以免费获得完整会员资格以访问所有受限制的内容>这里<

历史背景

(如果日期不详或不相关,例如 "流体力学",则对其显著出现的时间作了四舍五入的估计)。

相关发明、创新和技术原理

滚动至顶部

你可能还喜欢