|
DATA MINING
Desktop Survival Guide by Graham Williams |
|
|||
|
A dataset is usually more complex than a simple vector. Indeed, often
we have several vectors making up the dataset, and refer to this as a
matrix. A matrix is a data structure containing items all of the same
data type. We construct a matrix with the XnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)R functionsR functionsmatrix and
XnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)R functionsR functionsc functions. Rows and columns of a matrix can have
names, and the functions XnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)R functionsR functionscolnames and
XnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)R functionsR functionsrownames will list the current names. However, you can
also assign a new list of names to these functions!
> ds <- matrix(c(52, 37, 59, 42, 36, 46, 38, 21, 18, 32, 10, 67),
nrow=3, byrow=T)
> colnames(ds) <- c("Low", "Medium", "High","VHigh")
> rownames(ds) <- c("Married","Prev.Married","Single")
> ds
Low Medium High VHigh
Married 52 37 59 42
Prev.Married 36 46 38 21
Single 18 32 10 67
|
Of course, manually creating datasets in this way is only useful for
small data collections. A slightly easier approach is to manually
modify and add to the dataset using a simple spreadsheet-like
interface through the XnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)R functionsR functionsedit function or through the
XnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)R functionsR functionsfix function which will also assign the results of the
edit back to the variable being edited. Note that normally the
XnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)R functionsR functionsedit function returns , and thus prints to the screen if
it is not assigned, the datasets. To avoid the dataset being printed
to the screen, when you do not assign XnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)R functionsR functionsedit to a variable
because all you wanted to do was browse the dataset, use the
XnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)R functionsR functionsinvisible function.
> ds <- edit(ds) > fix(ds) > invisible(edit(ds)) |
The XnullXXnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)XnullXR functionsR functionscbind function combines each of its arguments,
column-wise (the c in the name is for column), into a
single data structure:
> age <- c(35, 23, 56, 18)
> gender <- c("m", "m", "f", "f")
> people <- cbind(age, gender)
> people
age gender
[1,] "35" "m"
[2,] "23" "m"
[3,] "56" "f"
[4,] "18" "f"
|
The XnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)R functionsR functionsrbind function similarly combines its argument, but in a row-wise manner. The result will be the same as if we transpose the matrix with the XnullXR functionsR functions (R function)R functionsR libraries (R library)R functionsR option (R option)R functionsR packages (R package)R functionsDatasets (Dataset)R functionsR functionst function:
> t(people)
[,1] [,2] [,3] [,4]
age "35" "23" "56" "18"
gender "m" "m" "f" "f"
> people <- rbind(age, gender)
> people
[,1] [,2] [,3] [,4]
age "35" "23" "56" "18"
gender "m" "m" "f" "f"
|
Copyright © 2004-2008 Togaware Pty Ltd Support further development through the purchase of the PDF version of the book.