dataclean {SuRF.vs} R Documentation

dataclean

Description

This function (1) scales the count data (count data only) to proportions, (2) creates a data frame consisting of the proportion data, and (3) keeps a variable name list that pairs the original variable names with their names in terms of X's (e.g. X1, X2, ...). Environmental data (host genome and other information about the observations) are supplied separately and are not scaled by this function.

Usage

dataclean(X.c, X.o, y)

Arguments

X.c

a data frame of count data from all levels (only the count data are row-scaled)
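
As a rough illustration of what row scaling means here, the base R sketch below divides each row of a small made-up count table by its row total, so that every row sums to 1. The object names and values (`counts`, `props`) are hypothetical, and this is not necessarily how dataclean carries out the step internally.

```r
# Hypothetical count table: 3 observations (rows), 3 count variables (columns).
counts <- data.frame(taxonA = c(10, 0, 5),
                     taxonB = c(30, 20, 5),
                     taxonC = c(60, 80, 10))

# Divide every row by its row total to turn counts into proportions.
# Assumes no row sums to zero; such a row would yield NaN values.
props <- as.data.frame(prop.table(as.matrix(counts), margin = 1))

# Each row of the scaled table should now sum to 1.
print(rowSums(props))
```

Scaling by row rather than by column keeps each observation's composition comparable regardless of its total count.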

X.o

a data frame of other environmental variables (no scaling is done here; these variables will be scaled together with the proportion data in the LASSO step)

y

a vector representing the outcome (0 or 1 for a binomial model)

Value

data.Xy: a data frame containing all variables, named X1, X2, ..., Xp, with the binary outcome (called status) in the last column; this data frame is used by the other functions for data analysis
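
To make the described layout concrete, the following base R sketch assembles a data frame of that shape by hand: generic names X1, ..., Xp, a lookup table from generic to original names, and the outcome as a final column named status. All input values and the object names `props`, `env`, and `name_list` are hypothetical. It also assumes that "all variables" covers both the proportion and the environmental variables. How dataclean actually stores the name list is not stated on this page, so this is an illustration under those assumptions rather than the function's implementation.

```r
# Hypothetical inputs: row-scaled proportions, one environmental variable, outcome.
props <- data.frame(taxonA = c(0.1, 0.0, 0.25),
                    taxonB = c(0.3, 0.2, 0.25),
                    taxonC = c(0.6, 0.8, 0.50))
env <- data.frame(age = c(34, 51, 47))
y <- c(0, 1, 1)

# Assumption: proportion and environmental variables are combined,
# with the environmental variables left unscaled.
X <- cbind(props, env)

# Keep a lookup between the original names and the generic X names.
name_list <- data.frame(original = names(X),
                        generic  = paste0("X", seq_len(ncol(X))),
                        stringsAsFactors = FALSE)
names(X) <- name_list$generic

# Append the binary outcome as the last column, named status.
data.Xy <- cbind(X, status = y)
print(names(data.Xy))
```

Keeping the lookup table alongside the renamed data makes it possible to map any selected X variable back to its original name after the analysis.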


[Package SuRF.vs version 1.1.0.1 Index]