![](../logo.png)
Build the training dataset
BuildTrainingSet.Rd
Use a Gaussian mixture model (with random parameters) to generate a traning dataset from the reference single-cell data
Arguments
- count
single-cell count matrix (features x cells)
- latent
matrix of single-cell latent space (cells x dims)
- max.iter
size of the training dataset (default = 10,000)
- max.cent
max number of centers in the Gaussian (default = 5)
- step
manually parallelize building the training dataset
- dims
number of dimensions from latent (default = ncol(latent))
- min.cent
min number of centers in the Gaussian (default = 1)
- n
number of cells to be chosen to create the training dataset (default is half the number of cells in the count matrix)
- sigma_min_cells
min number of cells that should be captured by the standard deviation of the Gaussian
- sigma_max_cells
max number of cells that should be captured by the standard deviation of the Gaussian
- verbose
logical indicating whether to print progress (default = TRUE)