| DataLab is a compact statistics package aiming at exploratory data analysis. Please visit the DataLab Web site for more information.... | 

Home  Features of DataLab  Tools  Splitting a Data Set | 
||||||
See also: Transpose Data Matrix, Randomisation of the Data, Sorting the Data, Resizing the Data Matrix, Serializing the Data Matrix 
![]()  | 
||||||
Splitting a Data Set
 
 During data analysis it is often necessary to create two or more disjoint subsets from a common set of data, which then can be used as training and test sets. DataLab therefore provides three ways of creating such subsets: (1) splitting of the variables (columns), (2) splitting of the objects (rows), and (3) the creation of a test and a training set. The size of the datasets can be controlled by the scroll bar in the left center. The mode of selection can either be random, blocked or interleaved. 
 The names of the subsets are created automatically from the file template by appending decimal numbers with three places. The process of subset creation is started by clicking the command Do It. 
 
  | 
||||||