- 0.1 Identify Amtrine+ / Amtrine+GFP+ columns (transduced population) in original spreadsheet
- 0.2 Extract GeoMean / Percentage data for each marker for Amtrine+ / Amtrine+GFP+
- 0.3 Write output files for each individual marker including: plate and well position and Flow readout (MFI or percentage)
- 0.4 Match shRNA names to position
Script: 0_dataPrep.py
Source files: Screen_markers
Output files:
1_0_Raw
1_1_shRNAmatched
- 1.1 Calculate percentile and Z-score of each sample
- 1.1.1 Approach 1: Calculate normal distribution percentile and Z-score based on total population
- 1.1.2 Approach 2: Calculate normal distribution percentile and Z-score based on control plate
- 1.1.1 Approach 1: Calculate normal distribution percentile and Z-score based on total population
- 1.2 Calculate stats of un-normalized data / normalized data (mean and standard deviation)
Script: 1_dataConversion.R
Source files: 1_1_shRNAmatched
Output files:
1_2_normtoall_ZP
1_2_normtocontrol_ZP/Amt
1_2_normtocontrol_ZP/Amt
- 2.1 Test consistency of effect of shRNAs targeting the same gene for each marker
- 2.1.1 Calculate p-value of shRNAs targeting the same gene v.s. all samples with two sample t-test (<- from percentile calculated in last step)
- 2.1.2 Calculate average Z score and percentile of shRNA targeting the same gene
- 2.1.1 Calculate p-value of shRNAs targeting the same gene v.s. all samples with two sample t-test (<- from percentile calculated in last step)
- 2.2 Compile data into spreadsheets to summarize average Z score, percentile and p-value of each gene for different markers
- 2.3 Plot gene average Z-score heatmaps for each marker
Script: 2_convertedDataAnalysis.R
Source files: 1_2_normtocontrol_ZP/Sep
Output files:
2_0_t-test_by_gene/0_Amt_sep
2_0_t-test_by_gene/0_AmtGFP_sep
2_0_t-test_by_gene/1_Amt_compile
2_0_t-test_by_gene/1_AmtGFP_compile
- 3.1 Cluster shRNAs by Z-score for each marker with t-sne clustering
- 3.2 Examine clustering of shRNAs targeting the same gene with the t-sne clustering result
Script: 3_Scikit_learn_byshRNA.py; 3_tsne_cbs-plot_byshRNA.R
Source files: 1_2_normtocontrol_ZP/Compiled_*
Output files:
1_2_normtocontrol_ZP/Compiled_* /per*_ clusterbyshRNA (per*: perplexity)
- 4.1 Cluster genes by average Z-score for each marker with t-sne clustering (per*: perplexity)
- 4.2 Plot genes which significantly up/down regulate markers when knocked down (by marker)
- 4.3 Cluster extraction by louvain method; pathway analysis of extracted clusters
Script: 3_Scikit_learn_byGene.py; 3_tsne_cbs-plot_byGene.R; 4_Tsne_cluster_extraction.R
Source files: 2_0_t-test_by_gene/1_Amt_compile
Output files:
t-SNE coordinates
0. Bubble plot source
1. Annotated bubble plots
1. Non-annotated bubble plots
2. Cluster extraction and plots
3. Extracted cluster pathway analysis