Code by TomMakesThings

Imports

Citation for Splatter and the original paper.

Get Dataset to Replicate

Use R to download the dataset sc_10x from Luyi Tian's GitHub and open as dataset.

Get the gene counts from dataset and view an extract.

Create Simulated Datasets

Splat Simulation

Simulate a single population of cells that replicates the variation of sc_10x.

Simulate 4 groups of cells seeded by sc_10x.

Get the group labels of each cell.

Compare Simulations

Plot a graph comparing the simulated data to the original dataset based upon the distribution of mean expression. This demonstrates how well the Splatter was able to immitate the data.

Save Data to File

Using R, save the simulated group data and the matching labels as CSV files.

Now download the files using Python.

View the simulated group data and the labels of each cell as DataFrames.