Hi,
Not an issue a but few questions regarding processing and output.
I am using cellout on scgen embedding using sciplex3 data. Basically I want to predict for an unseen single-cell tumor data, what drug can make it more closer to normal cell type. These tumors are not like the cancer type used in sciplex3 data, so i am sure there is huge issue with correct predictions. However, I still want to give it a try.
- I have 5 samples, but I cannot figure out for how to make predictions for each sample separately, without training the data again. Currently I am combining all the data together. This results in total cells for holdout sample to be ~20k.
- When I say test=0.6, does it mean 60% of holdout sample is used for perturbation prediction? But it doesn't seem so. How can I predict perturbed cells-state for more cells?
- Is it possible to run scGen part only to obtain the embedding rather than running the entire training process using scgen model as it is quite time consuming and I don't need scgen output.
- I am obtaining the prediction in data_space, and hence getting a matrix of predicted expression, which I am hoping are log transformed. However, I see a lot of negative values. Shall I make expression value less than 0 to 0? Is it z-scored?
Thanks for your help!
Hi,
Not an issue a but few questions regarding processing and output.
I am using cellout on scgen embedding using sciplex3 data. Basically I want to predict for an unseen single-cell tumor data, what drug can make it more closer to normal cell type. These tumors are not like the cancer type used in sciplex3 data, so i am sure there is huge issue with correct predictions. However, I still want to give it a try.
Thanks for your help!