R: Maximum Likelihood (ML) Shrinkage in Simple Linear Regression

YonX {RXshrink}

R Documentation

Maximum Likelihood (ML) Shrinkage in Simple Linear Regression

Description

Compute and display Normal-theory ML Shrinkage statistics when a y-Outcome Variable is regressed upon a SINGLE x-Variable (i.e. p = 1). This illustration is usefull in regression pedagogy. The OLS (BLUE) estimate is a scalar in these simple cases, so the MSE optimal Shrinkage factor, dMSE, is also a scalar less than +1 and greater than 0 when cor(y,x) differs from Zero. The corresponding m-Extent of Optimal Shrinkage is marked by the "purple" vertical dashed-line on all YonX() TRACE Diagnostics.

Usage

  YonX(form, data, delmax = 0.999999)

Arguments

`form`	A regression formula [y ~ x] suitable for use with lm().
`data`	Data frame containing observations on both variables in the formula.
`delmax`	Maximum allowed value for Shrinkage delta-factor that is strictly less than 1. (default = 0.999999, which prints as 1 when rounded to fewer than 6 decimal places.)

Details

Since only a single x-Variable is being used, these "simple" models are (technically) NOT "Ill-conditioned". Of course, the y-Outcome may be nearly multi-collinear with the given x-Variable, but this simply means that the model then has low "lack-of-fit". In fact, the OLS estimate can never have the "wrong" numerical sign in these simple p = 1 models! Furthermore, since "risk" estimates are scalar-valued, no "exev" TRACE is routinely displayed; its content duplicates information in the "rmse" TRACE. Similarly, no "infd" TRACE is displayed because any "inferior direction" COSINE would be either: +1 ("upwards") when an estimate is decreasing, or -1 ("downwards") when an estimate is increasing. The m-Extent of shrinkage is varied from 0.000 to 1.000 in 1000 "steps" of size 0.001.

Value

An output list object of class YonX:

`data`	Name of the data.frame object specified as the second argument.
`form`	The regression formula specified as the first argument to YonX() must have only ONE right-hand-side X-variable in calls to YonX().
`p`	Number of X-variables MUST be p = 1 in YonX().
`n`	Number of complete observations after removal of all missing values.
`r2`	Numerical value of R-square goodness-of-fit statistic.
`s2`	Numerical value of the residual mean square estimate for error.
`prinstat`	Vector of five Principal Statistics: eigval, sv, b0, rho & tstat.
`yxnam`	Character Names of "Y" and "X" data vectors.
`yvec`	"Y" vector of data values.
`xvec`	"X" vector of data values.
`coef`	Vector of Shrinkage regression Beta-coefficient estimates: delta * B0.
`rmse`	Vector of Relative MSE Risk estimates starting with the rmse of the OLS estimate.
`spat`	Vector of Shrinkage (multiplicative) delta-factors: 1.000 to 0.000 by -0.001.
`qrsk`	Vector of Quatratic Relative MSE Risk estimates with minimum at delta = dMSE.
`exev`	Vector of Excess Eigenvalues = Difference in MSE Risk: OLS minus GRR.
`mlik`	Normal-theory Likelihood ...for Maximum Likelihood estimation of Shrinkage m-Extent.
`sext`	Listing of summary statistics for all M-extents-of-shrinkage.
`mUnr`	Unrestricted optimal m-Extent of Shrinkage from the dMSE estimate; mUnr = 1 - dMSE.
`mClk`	Most Likely Observed m-Extent of Shrinkage: best multiple of (1/steps) <= 1.
`minC`	Minimum Observed Value of CLIK Normal-theory -2*log(Likelihood-Ratio).
`minE`	Minimum Observed Value of EBAY (Empirical Bayes) criterion.
`minR`	Minimum Observed Value of RCOF (Random Coefficients) criterion.
`minRR`	Minimum Relative Risk estimate.
`mRRm`	m-Extent of the Minimum Relative Risk estimate.
`mReql`	m-Extent where the "qrsk" estimate is first >= the observed OLS RR at m = 0.
`Phi2ML`	Maximum Likelihood estimate of the Phi-Squared noncentrality parameter of the F-ratio for testing H: true beta-coefficient = zero.
`Phi2UB`	Unbiased Phi-Squared noncentrality estimate. This estimate can be negative.
`dALT`	This Maximim Likelihood estimate of Optimal Shrinkage has serious Downward Bias.
`dMSE`	Best Estimate of Optimal Shrinkage Delta-factor from the "Correct Range" adjustment to the Unbiased Estimate of the NonCentrality of the F-ratio for testing Beta = 0.

Author(s)

Bob Obenchain <wizbob@att.net>

References

Obenchain RL. (1978) Good and Optimal Ridge Estimators. Annals of Statistics 6, 1111-1121. doi:10.1214/aos/1176344314

Obenchain RL. (2022) Efficient Generalized Ridge Regression. Open Statistics 3: 1-18. doi:10.1515/stat-2022-0108

Obenchain RL. (2022) RXshrink_in_R.PDF RXshrink package vignette-like document, Version 2.1. http://localcontrolstatistics.org

Examples

  data(haldport)
  form <- heat ~ p4caf
  YXobj <- YonX(form, data=haldport)
  YXobj
  plot(YXobj)

[Package RXshrink version 2.3 Index]