Perform feature selection using the CATSCORE method
select_features.catscore.Rd
This function selects features from the input data matrix X using the CATSCORE method and the provided feature selection specification.
Usage
# S3 method for catscore
select_features(obj, X, Y, ranking.score = c("entropy", "avg", "max"), ...)
Arguments
- obj
The feature selection specification created by
feature_selector()
.- X
The input data matrix.
- Y
The response variable.
- ranking.score
The feature score to use. Supported scores are "entropy", "avg", or "max". Default is "entropy".
Details
The CATSCORE method computes a correlation adjusted t-test for every column in the matrix using sda.ranking
from the sda
package.
See also
feature_selector
for creating a feature selection specification.
Examples
fsel <- feature_selector("catscore", "top_k", 1000)
X <- as.data.frame(matrix(rnorm(100 * 10), 100, 10))
Y <- rep(letters[1:5], 20)
selected_features <- select_features(fsel, X, Y, ranking.score = "entropy")
#> selecting features via catscore
#> retaining 10 features in matrix with 10 columns