importFrom by ::.
multi_strata() provides a vector of stratification groups based on a data frame, which can then be passed to partition() or create_folds(). Each stratification group contains "similar" data rows, where similarity is based either on a k-means cluster analysis or on all combinations of binned columns. Thanks to kapsner for the idea and the help with the implementation.
This is a documentation and maintenance update only, with the following changes:
create_folds() creates in-sample indices by default. To generate out-of-sample indices, set invert = TRUE.
create_folds and partition have received a shuffle option to shuffle rows within folds/partitions. The default is FALSE.
create_folds and partition can no longer return empty folds/partitions. This affects only extremely small data sets.
create_timefolds now also allows moving-window training data, not just extending-window data.
Reduced the minimum required R version from 3.5 to 3.1.
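
The difference between in-sample and out-of-sample indices can be illustrated with a minimal base R sketch. This is not the package's implementation; the variable names (fold_id, in_sample, out_sample) and the sizes n and k are made up for illustration. In-sample indices are the rows used for fitting, and out-of-sample indices are their complement within each fold.

```r
# Base R sketch (not splitTools internals): in-sample vs. out-of-sample
# fold indices. All names and sizes here are illustrative assumptions.
set.seed(1)
n <- 10  # number of rows
k <- 5   # number of folds

# Assign each row to one of k folds at random, with balanced fold sizes
fold_id <- sample(rep_len(seq_len(k), n))

# Out-of-sample indices: the rows held out in each fold
out_sample <- lapply(seq_len(k), function(i) which(fold_id == i))

# In-sample indices: the complement, i.e. the rows used for training
in_sample <- lapply(out_sample, function(idx) setdiff(seq_len(n), idx))

# Every row is held out exactly once across the k folds
stopifnot(identical(sort(unlist(out_sample)), seq_len(n)))
```

With the package itself, the same distinction is controlled by the invert argument of create_folds() described above.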
Added type = "blocked" to create_folds and partition to allow for blocked splitting.
Added function create_timefolds for cross-validation of time series data.
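
As a rough illustration of these two ideas, the following base R sketch builds contiguous blocks of row indices and then derives time-series folds from them. It is not the package's implementation. The names (blocks, time_folds) are hypothetical, and the "moving" window is assumed here to be the single preceding block, which may differ from what create_timefolds does.

```r
# Base R sketch (not splitTools internals): blocked folds and
# time-series training windows. Names and sizes are illustrative.
n <- 12  # number of rows, assumed to be in time order
k <- 4   # number of blocks

# Blocked splitting: consecutive rows stay together, so each block is a
# contiguous range of indices
blocks <- split(seq_len(n), ceiling(seq_len(n) * k / n))

# Time-series folds: evaluate on block i, train only on earlier rows
time_folds <- lapply(2:k, function(i) {
  list(
    # extending window: all blocks before block i
    extending = unlist(blocks[seq_len(i - 1)], use.names = FALSE),
    # moving window (assumption): only the block directly before block i
    moving = blocks[[i - 1]],
    # rows used for evaluation
    outsample = blocks[[i]]
  )
})

# Training rows always precede the evaluation rows
stopifnot(all(vapply(time_folds, function(f) max(f$extending) < min(f$outsample), logical(1))))
```

Keeping the training rows strictly before the evaluation rows is what distinguishes this scheme from ordinary k-fold cross-validation, where folds are drawn without regard to order.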
Added vignette
This is the initial CRAN release.