COG注释参考COG注释
cog数据库位置:/TJPROJ2/GB/PUBLIC/database/GB_TR/mRNA/noref/cog
使用/TJPROJ7/GB_TR/PUBLIC/source/ncRNA/gb_tr_man_pipline/bin/DisGeNET脚本
使用方法
/TJPROJ7/GB_TR/PUBLIC/source/ncRNA/gb_tr_man_pipline/bin/DisGeNET --disgenet /TJPROJ13/GB_TR/reference_data/Animal/Homo_sapiens/Homo_sapiens_Ensemble_94/Homo_sapiens_Ensemble_94_disgenet.txt --diffgene /TJPROJ6/NC_BG_SH/shouhou/202302/X101SC22113551-Z01-F001_reanalysis/test/Differential/1.deglist/IFvsMF/IFvsMF_deg_all.xls --diffresult /TJPROJ6/NC_BG_SH/shouhou/202302/X101SC22113551-Z01-F001_reanalysis/test/Differential/1.deglist/IFvsMF/IFvsMF_deg.xls --enrich normal --prefix /TJPROJ6/NC_BG_SH/shouhou/202302/X101SC22113551-Z01-F001_reanalysis/test/Enrichment/5.DisGeNET/IFvsMF/ALL/IFvsMF.all
需要整理COG的注释文件格式如下
gene_id | entrez_id | gene_name | disease_id | disease_name |
ENSG00000121410 | 1 | A1BG | C0001418 | Adenocarcinoma |
第一列是gene_id,第二列不用,第三列是COG注释的序列name;第四列是COGid;第5列是COGidterm的description