micromamba create -f scCirclehunter.yaml
micromamba activate scCirclehunter
python setup.py installCirclehunter2 can also be used for bulk ATAC-seq if the -b option is not specified. If the -b CB option is provided, read counts for each ecDNA segment will be output and saved in an .h5ad file. Since similar operations are performed in the downstream analysis of Step 2, we recommend running the following command directly for both bulk and single-cell ATAC-seq.
The result of the following command can be found in GBM4349_circlehunter2_ecDNA.bed.
circlehunter2 -p 16 --blacklist /share/references/hg38/blacklist/hg38.blacklist.sorted.bed /mnt/2w/data2/andy/scATAC-Seq/rawdata/cellranger/GBM4349/outs/possorted_bam.bam /home/andy/Projects/circlehunter2/workspace/dev/data/GBM4349_circlehunter2_ecDNA.bedinput BAM file have to be coordinate sorted, index of the BAM file should present with a .bai suffix, more options please check
circlehunter2 -hcirlcehunter2 output an extended BED format file with columns:
chrom: contig name of which the segment come fromstart: segment startend: segment endname: No. of this segment, format asecDNA_{ecDNA_no}_{total_segments}_{segment_no}, you can use these info to form a complete ecDNA circlescore: no data in this columns, always a.strand: orientation of this segmentstart_ci: confidence interval of segment startend_ci: confidence interval of segment endstart_peak: discordant enrich region of segment startend_peak: discordant enrich region of segment endstart_cross: reads counts of which cross the segment start breakpoint and disjoint the breakpointend_cross: reads counts of which cross the segment end breakpoint and disjoint the breakpointlinked_reads: reads counts of which connected this segment and the next onedepth_mean: mean of depth of this region, maybe false positive if this is smaller thancutoff[^cutoff]high_coverage: fraction of region that depth is higher thancutoff, maybe false positive if this is pretty small
[cutoff]: cutoff is a user input params -c or determine by circlehunter2 automatically if not set by user, which is default to the inverse survival of poisson distribution use mean of the whole genome depth as cutoff will be log in the stderr.
Refer to the example in demo/scCirclehunter_downstream_demo.html for assigning ecDNA to cell populations. The functions utilized are available in the scripts directory.
