feature engineer
count features
- class autox.autox_competition.feature_engineer.fe_count.FeatureCount[source]
Convert categorical features into the number of occurrences.
cross features
- class autox.autox_competition.feature_engineer.fe_cross.FeatureCross(importance_type='split')[source]
synthetic feature formed by multiplying (crossing) two features.
- fit(X, y, objective, category_cols, top_k=10, used_cols=[])[source]
- Parameters:
X – {array-like, sparse matrix} of shape (n_samples, n_features). Training vector, where n_samples is the number of samples and n_features is the number of features.
y – array-like of shape (n_samples,). Target vector relative to X.
objective – str, objective equal to ‘binary’ or ‘regression’.
category_cols – list, column names of categorical features.
top_k – int, keep the top_k importance cross features, default top_k = 10.
used_cols – list, columns will be used for training model, default top_k = 10.