1. hifi结合hic数据进行初步组装
hifiasm -o YC.asm -t 60 --h1 /home/lx_sky6/yt/20230106_baimaike/0106_hic/Unknown_BB398-05H0001_good_1.fq.gz --h2 /home/lx_sky6/yt/20230106_baimaike/0106_hic/Unknown_BB398-05H0001_good_2.fq.gz /home/lx_sky6/yt/20230106_baimaike/0105_YC_hifi/BMK220921-BB398-02P0001/cell/BMK220921-BB398-02P0001.ccs.fastq.gz
运行结束后生成的一系列文件中,我们只需要关注如下几项 (prefix表示前缀)
prefix.bp.r_utg.gfa: haplotype-resolved raw unitig graph,记录所有的单倍型信息
prefix.bp.p_utg.gfa: 在raw unitig graph基础上过滤小的bubble,
prefix.bp.p_ctg.gfa: 主要contig的assembly graph(主要看这个)
prefix.bp.hap1.p_ctg.gfa: haplotype1的部分分型的contig graph
prefix.bp.hap2.p_ctg.gfa: haplotype2的部分分型的contig graph
awk '/^S/{print ">"$2;print $3}' YC.asm.hic.p_ctg.gfa > ../YC.asm.hic.p_ctg.fasta # get primary contigs in FASTA
初步查看组装完整性
export PATH=/home/lx_sky6/software/miniconda3/envs/busco/bin/:$PATH
cd /home/lx_sky6/yt/20230106_baimaike/0105_YC_hifi/1-assembly/busco
/home/lx_sky6/software/miniconda3/envs/busco/bin/busco -m geno -i /home/lx_sky6/yt/20230106_baimaike/0105_YC_hifi/YC.asm.hic.p_ctg.fasta -o viridiplantae_based_yc --offline -l /home/lx_sky6/yxl/zangyaohongjingtian/annotation/xy/braker/busco_downloads/viridiplantae_odb10 --cpu 40
hifi+hic效果是真不错呀