Usage¶
Command overview¶
circfull <command> [options]
Supported commands:
RG: Reference-guided detectionDNSC: De novo self-correction to consensus sequencecRG: Reference-guided detection using DNSC-corrected readsstrand: Identify transcript strand of circFL-seq readssRG: Reference-guided detection with stranded readsmRG: Merge circRNA results from RG and cRG/sRGanno: Annotate full-length circRNAsgeneExp: Quantify gene expression to evaluate linear RNA residualsvar: Call variants in circRNAs
Common workflow¶
Inputs¶
rawFq=M1.example.fq
cleanFq=M1.clean.fq
gtfFile=gencode.v19.annotation.sort.gtf.gz
genome=hg19.fa
thread=80
outDir=circFL_example
1. Trim/split raw reads¶
porechop-runner.py -t $thread -i $rawFq -o $cleanFq --barcode_threshold 95 --check_reads 1000
2. Detect full-length circRNA from clean reads (RG)¶
circfull RG -f $cleanFq -g $genome -a $gtfFile -t $thread -o $outDir
3. Identify strand information¶
circfull strand -f $rawFq -F $cleanFq -a $gtfFile -l 100 -t $thread -r $outDir -o $outDir
4. Detect from stranded reads (sRG)¶
circfull sRG -g $genome -a $gtfFile -s $outDir -t $thread -o $outDir
5. Build consensus with DNSC¶
circfull DNSC -f $cleanFq -t $thread -o $outDir
circfull DNSC -f $cleanFq -t $thread -o $outDir -c
6. Detect from DNSC consensus (cRG)¶
circfull cRG -f $outDir -g $genome -a $gtfFile -t $thread -o $outDir
7. Merge and filter circRNAs (mRG)¶
rmsk=rmsk.bed.gz
circfull mRG -f $cleanFq -g $genome -r $outDir -c $outDir -s $outDir -t $thread -o $outDir
8. Annotate full-length circRNAs¶
circfull anno -b ${outDir}/mRG/circFL_Normal_pass.bed -a $gtfFile -o ${outDir}/mRG/circFL_Normal_pass_anno.txt
9. Quantify gene expression¶
umgtfFile=gencode.v19.annotation.gtf
gene_ref=ref_gene.fa
createGeneRef $umgtfFile $genome $gene_ref
circfull geneExp -f $cleanFq -r $gene_ref -o $outDir -t $thread
10. Call variants¶
circfull var -f $cleanFq -g $genome -r $outDir -t $thread -o $outDir
Key command options¶
RG¶
circfull RG -f fastq -g genome -a anno [-u] [-t threads] [-r] [-o output]
DNSC¶
circfull DNSC -f fastq [-t threads] [-c] [-m mem] [-o output]
cRG¶
circfull cRG -f DNSC -g genome -a anno [-t threads] [-u] [-o output]
strand¶
circfull strand -f fastq -F fastq2 -a anno [-r RG_dir] [-l hang_len] [-t threads] [-o output]
sRG¶
circfull sRG -g genome -a anno [-s strandDir] [-t threads] [-r] [-o output]
mRG¶
circfull mRG -f fastq -g genome [-r RG] [-c cRG] [-s sRG] [-t threads] [-o output]
anno¶
circfull anno -b bed -a anno [-o output]
geneExp¶
circfull geneExp -f fastq -r ref [-t threads] [-o output]
var¶
circfull var -f fastq -g genome [-r RG_dir] [-t threads] [-o output]
Note: Higher thread counts increase memory usage.