- Build a pipeline create transcript level count matrix from SRR file only with Biocontainers
- This pipeline can be used any environment containing docker/docker-compse. (Win/Mac/Linux)
Biocontainers
|--Dockerfiles
|--input
|--output
|--STARidx
|--docker-components
| |- docker-compose-XXX.yml
| |-.env
|--wait-for-it.sh
- All processes are cotroled in
docker-compose.yml
wait-for-it.sh dominates the run order of containers written in docker-compose
SRR file
↓ 1.(parallel-fastq-dump)
FastQ
↓ 2.(fastp)
FastQ (quarity checked.trimed)
↓ 3.(STAR)
BAM/SAM
↓ 4.(RSEM)
count data
- create docker image for RSEM
docker build -t biocontainer/rsem -f Dockerfiles/rsem
- Create STAR index (refer to this section )
- Create RSEM index (refer to this section)
- Modify variables in
.env
- This repository has
docker-compose-XXX.yml file for each bioinformatics tools
- XXX: sratoolkit,fastq-dump,fastp,STAR,RSEM
- Each tools can be run independently with below command