
dataclean {SuRF.vs}

R Documentation

dataclean

Description

This function (1) scales the count data (count data only) to proportions, (2) creates a data frame consisting of the proportion data, and (3) keeps a variable name list (the original variable names and their names in terms of X's, e.g. X1, X2, ...). Environmental data are the host genome and other information about the observations.

Usage

dataclean(X.c, X.o, y)

Arguments

`X.c`	data frame of count data from all levels (only the count data are row-scaled)
`X.o`	data frame of other environmental variables (no scaling is done here; these variables are scaled together with the proportion data in the LASSO step)
`y`	a vector representing the outcome (0 or 1 for the binomial model)

Value

data.Xy: a data frame containing all variables, named X1, X2, ..., Xp, with the binary outcome (called status) in the last column; this data frame is used by the other functions for data analysis
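
Examples

The three steps in the Description can be sketched in base R as follows. This is a minimal illustration and not the package's own implementation. The toy data and the column names (taxonA, taxonB, taxonC, age) are hypothetical, as is the helper object var.names. It also assumes that the columns of X.o are appended unscaled after the proportion columns.

```r
# Hypothetical toy inputs; none of these values come from the package.
X.c <- data.frame(taxonA = c(10, 0, 5),
                  taxonB = c(30, 20, 5),
                  taxonC = c(60, 80, 10))
X.o <- data.frame(age = c(34, 51, 47))
y <- c(0, 1, 1)

# Step 1: row-scale the counts so that each observation sums to 1.
X.prop <- as.data.frame(prop.table(as.matrix(X.c), margin = 1))

# Step 2: build one data frame from the proportions and the
# unscaled environmental variables (assumed column order).
X.all <- cbind(X.prop, X.o)

# Step 3: keep a lookup of original names against X1, X2, ..., Xp,
# then rename the columns accordingly.
var.names <- data.frame(original = names(X.all),
                        X.name = paste0("X", seq_along(X.all)),
                        stringsAsFactors = FALSE)
names(X.all) <- var.names$X.name

# Append the binary outcome as the last column, called status.
data.Xy <- cbind(X.all, status = y)
```

With real data, the corresponding call would be dataclean(X.c, X.o, y), as given under Usage.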

[Package SuRF.vs version 1.1.0.1 Index]