脚本路径:
/TJPROJ6/NC_BG_SH/personal_dir/liuying/WGBS_RNA
测试路径:
/TJPROJ6/NC_BG_SH/personal_dir/liuying/WGBS_RNA/test3
准备文件:
rna_seq文件夹准备:
甲基化分析路径如果不存在,需要重新准备:
使用/TJPROJ6/NC_BG_SH/personal_dir/liuying/WGBS_RNA/new_associate_prepare.pl时
rna_seq文件夹准备: *在关联分析执行路径下mkdir -p rna_seq/Diff_TR/Diff,将转录的结果中*Differential/1.deglist下比较组合目录软链过来 *通过转录结果gene_fpkm.xls计算每组所有生物学重复fpkm的平均值:rna_seq/Diff_TR/Diff/rowmeans_fpkm.xls *甲基化分析路径,需要重新准备: *在关联分析路径下mkdir -p methydir,将甲基化释放的CX_report(目录名称为CX_report,下层放cx文件)和解压好的结果文件软链到该路径下
prepare.sh
#perl /TJPROJ6/NC_BG_SH/personal_dir/liuying/WGBS_RNA/associate_prepare.pl \ perl /TJPROJ6/NC_BG_SH/personal_dir/liuying/WGBS_RNA/new_associate_prepare.pl \ -methydir /TJPROJ6/NC_BG_SH/shouhou/202205/X101SC21114961-Z01-J001_asso/WGBS/EPI/TJPROJ7/EPI/Projects/WGBS_2021_Q4/X101SC21114961-Z01-J001-B1-36_kongxincai_WGBS_20211215 \##甲基化路径,需包含2号文件夹和结果文件 -refdir /TJPROJ6/NC_BG_SH/shouhou/202205/X101SC21114961-Z01-J001_asso/rna_seq \##rna_seq准备文件 -Genomedir /TJPROJ1/RNA/reference_data/Plant/Ipomoea_aquatica/kehu_provide/WGBS/Genome_Reg/all \##WGBS参考基因组路径,到Genome_reg/all -outdir /TJPROJ6/NC_BG_SH/personal_dir/liuying/WGBS_RNA/test3/Prepare \##prepare输出路径 -fai /TJPROJ1/RNA/reference_data/Plant/Ipomoea_aquatica/kehu_provide/WGBS/Chr.genome.fa.fai \##参考基因组fai文件 -group CQ1:CQ2:CQ3,CQD1:CQD2:CQD3 \##同WGBS run.sh,组名在WGBS和转录组中都有且一致 -groupname CQ,CQD \#同WGBS run.sh,组名在WGBS和转录组中都有且一致 -compare 2:1 \同WGBS run.sh,保证比较组合在WGBS和转录组中都有且case和control的顺序一致 -report_features promoter,utr5,exon,intron,utr3,genebody,upstream_2k,downstream_2k \ ##在WGBS run.sh的features基础上增加genebody、upstream_2k、downstream2k -Biorepeat yes \ ##是否有组内生物学重复
run.sh
#perl /TJPROJ6/NC_BG_SH/personal_dir/liuying/WGBS_RNA/Association_Pipeline_V2.pl \ perl /TJPROJ6/NC_BG_SH/personal_dir/liuying/WGBS_RNA/new_Association_Pipeline_V2.pl \该pipline适配转录的环境变量 --project X101SC21114961-Z01-J001_Ipomoea_aquatica,海南大学甲基化和chip测序分析技术服务(委托)合同,X101SC21114961-Z01-J001-B1 \ --preparedir /TJPROJ6/NC_BG_SH/personal_dir/liuying/WGBS_RNA/test3/Prepare \##prepare文件夹 --species ath \##kegg物种, --group CQ1:CQ2:CQ3,CQD1:CQD2:CQD3 \同prepare.sh --groupname CQ,CQD \同prepare.sh --compare 2:1 \同prepare.sh --fa /TJPROJ1/RNA/reference_data/Plant/Ipomoea_aquatica/kehu_provide/WGBS/Chr.genome.fa \##WGBS参考基因组fa --gtf /TJPROJ1/RNA/reference_data/Plant/Ipomoea_aquatica/kehu_provide/WGBS/gene.gtf \##参考基因组gtf --chrlist /TJPROJ1/RNA/reference_data/Plant/Ipomoea_aquatica/kehu_provide/WGBS/chrlist \#同WGBS --goann /TJPROJ1/RNA/reference_data/Plant/Ipomoea_aquatica/kehu_provide/WGBS/go.txt \#同WGBS --features promoter,utr5,exon,intron,utr3,genebody,upstream_2k,downstream_2k \##同prepare.sh
*本地运行run.sh后会生成sjm_Associate_analysis.sh,sh sjm_Associate_analysis.sh进行关联分析。