GitHub - Guan06/DADA2_pipeline: The pipeline for amplicon sequencing analysis based mainly on DADA2.

A pipeline for processing amplicon sequencing data, suitable for bacterial, fungal and oomycete community profiling. Usearch is required for preprocessing the data, specifically for demultiplexing.

To run the workflow:

Prepare your input files and put them in the data_dir (defined later in config.sh).

Name sequence fastq files as:

prefix_forward_reads.fastq.gz  
prefix_reverse_reads.fastq.gz  
prefix_barcodes.fastq.gz
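
The naming convention above can be checked before starting a run. The following is a minimal sketch, not part of the pipeline: the function name `check_inputs` is hypothetical, and it only assumes the three file suffixes listed above.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not shipped with the pipeline): report which of the
# three expected input files are missing for a given prefix.
# Usage: check_inputs <data_dir> <prefix>
check_inputs() {
    local data_dir=$1 prefix=$2 missing=0 suffix f
    for suffix in forward_reads reverse_reads barcodes; do
        f="${data_dir}/${prefix}_${suffix}.fastq.gz"
        if [ ! -f "$f" ]; then
            echo "missing: $f" >&2
            missing=1
        fi
    done
    # Exit status 0 if all three files exist, 1 otherwise.
    return "$missing"
}
```

For example, `check_inputs "$data_dir" prefix && echo "inputs OK"` prints the confirmation only when all three files are present.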

Prepare the mapping files and name them as:

prefix_mapping.txt (for Bacteria)
prefix_mapping_ITSf.txt (for Fungi if applicable)
prefix_mapping_ITSo.txt (for Oomycetes if applicable)

Validate the prepared mapping file:

./activate.sh
validate_mapping_file.py -m $data_dir/prefix_mapping.txt -o ./

Edit the config file:

Define the working_dir and data_dir in the config file (./config.sh).
Define the profiled kingdom and, if applicable (for fungi and oomycetes), the amplification primer set.
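
A quick sanity check on the two paths can catch typos before any step is run. This is a sketch under assumptions: it supposes that config.sh defines `working_dir` and `data_dir` as plain shell variables that can be sourced, and the function name `check_config` is hypothetical. The names of the kingdom and primer settings are not given here, so they are not checked.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not shipped with the pipeline): verify that the two
# paths expected from config.sh are set, and that data_dir exists.
check_config() {
    local name
    for name in working_dir data_dir; do
        # ${!name} is bash indirect expansion: the value of the variable
        # whose name is stored in $name.
        if [ -z "${!name:-}" ]; then
            echo "config error: $name is not set" >&2
            return 1
        fi
    done
    # The input files are placed in data_dir, so it must already exist.
    if [ ! -d "$data_dir" ]; then
        echo "config error: data_dir=$data_dir is not a directory" >&2
        return 1
    fi
}
```

Assuming config.sh can be sourced, the check would be run as `source ./config.sh && check_config`.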
Run the scripts step by step or all together (MiSeq data is used as the example here):

./step1_demultiplex.sh

Check the output in the folder (Bacteria is used as the example here):

less $working_dir/01.split_fq/$l_list_miseq/Bac_forward/split_library_log.txt

This file shows the number of reads per sample. For samples with fewer than 10 reads (or 20, to be stricter), remove the demultiplexed files, which are in the out subfolder of the same folder (e.g. for sample OD1):

rm $working_dir/01.split_fq/$l_list_miseq/Bac_forward/out/OD1.fastq ## forward read file of this sample

rm $working_dir/01.split_fq/$l_list_miseq/Bac_reverse/out/OD1.fastq ## reverse read file of the same sample
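
When many samples are involved, the low-read samples can be listed by counting reads in the demultiplexed files directly instead of reading the log by eye. The sketch below is not part of the pipeline: `low_read_samples` is a hypothetical name, and it assumes the files in out/ are uncompressed FASTQ with four lines per read.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not shipped with the pipeline): print the names of
# samples whose demultiplexed FASTQ file holds fewer than min_reads reads.
# Usage: low_read_samples <dir_with_fastq> [min_reads]   (default: 10)
low_read_samples() {
    local dir=$1 min_reads=${2:-10} f lines
    for f in "$dir"/*.fastq; do
        [ -e "$f" ] || continue          # skip if the glob matched nothing
        lines=$(wc -l < "$f")
        # Four lines per record: header, sequence, separator, qualities.
        if [ $((lines / 4)) -lt "$min_reads" ]; then
            basename "$f" .fastq
        fi
    done
}

# Dry-run usage (prints the rm commands instead of running them):
#   fwd=$working_dir/01.split_fq/$l_list_miseq/Bac_forward/out
#   rev=$working_dir/01.split_fq/$l_list_miseq/Bac_reverse/out
#   for s in $(low_read_samples "$fwd" 10); do
#       echo rm "$fwd/$s.fastq" "$rev/$s.fastq"
#   done
```

Printing the commands first makes it possible to review the list against split_library_log.txt before anything is deleted.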

Then run the remaining steps as follows:

./step2_dada2.sh
./step3_get_ASV_table.sh

or

./all_in_one.sh

Repository files:

Databases
tax_oomycetes
README.md
activate.sh
all_in_one.sh
config.sh
dada2_bacteria.R
dada2_bacteria_454.R
dada2_fungi.R
dada2_oomycetes.R
get_ASV_tab_bacteria.R
get_ASV_tab_fungi.R
get_ASV_tab_oomycetes.R
step1_demultiplex.sh
step1_demultiplex_454.sh
step2_dada2.sh
step2_dada2_454.sh
step3_get_ASV_table.sh
