NALDC Record Details:

Using R² to compare least-squares fit models: When it must fail

Permanent URL:
http://handle.nal.usda.gov/10113/55052
Abstract:
R² can be used correctly to select from among competing least-squares fit models when the data are fitted in common form and with common weighting. However, when models are compared by fitting data that have been mathematically transformed in different ways, R² is a flawed statistic, even when the data are properly weighted in accord with the transformations. The reason is that in its most commonly used form, R² can be expressed in terms of the excess variance (s²) and the total variance in y (s_y²): the first is either invariant or approximately so with proper weighting, but the second can vary substantially in data transformations. When given data are analyzed “as is” with different models and fixed weights, s_y² remains constant and R² is a valid statistic. However, then s², and χ² in weighted fitting, are arguably better metrics for such comparisons.
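The abstract's argument can be illustrated with a minimal unweighted sketch. It assumes the textbook definition R² = 1 - SS_res/SS_tot, which may or may not be the exact form used in the article, and it omits the weighting the abstract discusses. The data values and the helper name `fit_stats` are made up for illustration and do not come from the article. Under that assumption, R² = 1 - (n - p) s² / ((n - 1) s_y²), so any change in the total variance in y changes R² even if s² were held fixed.

```python
import numpy as np

def fit_stats(x, y, degree=1):
    """Unweighted polynomial least-squares fit; returns s^2, s_y^2 and R^2.

    Hypothetical helper for illustration only, not the article's procedure.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = y.size
    p = degree + 1                               # number of fitted parameters
    coeffs = np.polyfit(x, y, degree)
    resid = y - np.polyval(coeffs, x)
    ss_res = np.sum(resid ** 2)                  # residual sum of squares
    ss_tot = np.sum((y - y.mean()) ** 2)         # total sum of squares about the mean
    return {
        "n": n,
        "p": p,
        "s2": ss_res / (n - p),                  # excess (residual) variance
        "sy2": ss_tot / (n - 1),                 # total variance in y
        "r2": 1.0 - ss_res / ss_tot,
    }

# Made-up data: an exponential trend with small alternating perturbations.
x = np.linspace(1.0, 10.0, 10)
y = 2.0 * np.exp(0.3 * x) * (1.0 + 0.05 * (-1.0) ** np.arange(10))

as_is_linear = fit_stats(x, y, degree=1)             # data fitted "as is"
as_is_quadratic = fit_stats(x, y, degree=2)          # same data, different model
log_linear = fit_stats(x, np.log(y), degree=1)       # transformed data

for name, st in [("linear, y", as_is_linear),
                 ("quadratic, y", as_is_quadratic),
                 ("linear, ln y", log_linear)]:
    print(f"{name:14s} s2={st['s2']:.4g}  sy2={st['sy2']:.4g}  R2={st['r2']:.4f}")
```

The two fits to the untransformed data share the same total variance in y, so their R² values rank the models in the same order as their residual sums of squares. The fit to ln y has a different total variance, so its R² is not on the same scale as the other two.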
Author(s):
Joel Tellinghuisen, Carl H. Bolster
Note:
USDA Scientist Submission
Source:
Chemometrics and intelligent laboratory systems 2011 February 15 v.105 no.2
Language:
English
Publisher:
Elsevier B.V.
Year:
2011
Collection:
Journal Articles, USDA Authors, Peer-Reviewed