Title Unsupervised Feature Transformation
UFT_func(Data, Seed)
Data | The original data to transform. It should be a data frame and may contain both categorical and numerical features. UFT_func splits the data and drops the features that are already numerical. |
---|---|
Seed | A seed value, used so that results are reproducible. |
The output of UFT_func is a data frame of numerical columns that replace the original categorical features.
X <- data.frame(
  "ID" = as.numeric(c(1, 2, 3, 4)),
  "Age" = c(21, 15, 44, 14),
  "Sex" = c("Male", "Male", "Female", "Female"),
  "Name" = c("John", "Samer", "Sara", "Carolina"),
  Smoking = c("Yes", "No", "No", "Yes")
)
uft_x <- UFT_func(Data = X, Seed = 11)
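To see which columns of an input would count as "already numerical" before calling the function, a minimal base R sketch such as the following may help. It only illustrates the split described under Data; it is not the code inside UFT_func, and the variable names `is_num`, `numeric_cols`, and `categorical_cols` are hypothetical.

```r
# Illustration only: this is NOT the implementation of UFT_func.
# It rebuilds the example data and separates numeric from categorical columns.
X <- data.frame(
  ID = as.numeric(c(1, 2, 3, 4)),
  Age = c(21, 15, 44, 14),
  Sex = c("Male", "Male", "Female", "Female"),
  Name = c("John", "Samer", "Sara", "Carolina"),
  Smoking = c("Yes", "No", "No", "Yes")
)

# Flag every column that is already numeric.
is_num <- vapply(X, is.numeric, logical(1))

# Names of the numeric columns and of the remaining (categorical) columns.
numeric_cols <- names(X)[is_num]
categorical_cols <- names(X)[!is_num]

print(numeric_cols)
print(categorical_cols)
```

In this example, `ID` and `Age` are numeric, so under the description above only `Sex`, `Name`, and `Smoking` would be candidates for transformation.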