Convert a the data into a torch::dataset() which the vaeac model creates batches from.
Arguments
- X
A torch_tensor contain the data of shape N x p, where N and p are the number of observations and features, respectively.
- one_hot_max_sizes
A torch tensor of dimension
n_featurescontaining the one hot sizes of then_featuresfeatures. That is, if theith feature is a categorical feature with 5 levels, thenone_hot_max_sizes[i] = 5. While the size for continuous features can either be0or1.
Details
This function creates a torch::dataset() object that represent a map from keys to data samples.
It is used by the torch::dataloader() to load data which should be used to extract the
batches for all epochs in the training phase of the neural network. Note that a dataset object
is an R6 instance, see https://r6.r-lib.org/articles/Introduction.html, which is classical
object-oriented programming, with self reference. I.e, vaeac_dataset() is a subclass
of type torch::dataset().