DataLab is a compact statistics package aiming at exploratory data analysis. Please visit the DataLab Web site for more information....


Create a Ridge Regression Model

Command: Math -> Multiple Regression -> Ridge Regression -> Create Model...

Ridge regression is a method to cope with parameter instabilities when some of the descriptor variables are highly correlated. See the statistical background for more information.

The command Math/Multiple Regression/Ridge Regression... (toolbar button ) serves to calculate a ridge regression model. For that purpose first the variables have to be specified by clicking the corresponding variable list. The independent variables (descriptors) are at the left, the dependent variable (target variable) can be selected at the right. After clicking the variable list the variable selection dialog allows to select the desired variables.

In order to calculate the MLR/RR model click the button "Calculate" (). After that the most important diagrams for checking the regression are displayed in the diagnostic window tabs. These diagrams contain the plot of the estimated vs. the actual target values, the distribution of the residuals, and the residuals plotted against an arbitrary independent variable or against the object number. The details on the results can be viewed in the "Details" tab. Further, the ridge trace and the associated levels of significance of the model parameters are displayed in the "Ridge Trace" tab, and the cross validation for different levels of the ridge parameter Lambda can be performed on the "Cross Validation" tab.

The ridge trace (see figure above) shows the estimated parameters over the value of Lambda. It offers a second tab showing the signficance of the estimated parameters over Lambda (figure below). The 5% level of significance is displayed as a gray dotted line. Significant parameters are below this level.

You can store the model on the disk () to be able to apply the model to other datasets by means of the command Apply Model... (button ).(1)

In order to make the variable selection easier the detection of multicollinearities () can be started directly from the RR window.



(1) Please note that the model calculated by ridge regression is formally equivalent to an MLR model. Thus the same dialog, both for MLR and ridge regression, can be used to apply the model.