From 生信技能树
方法一、点开execl表格,找到fastq.MD5,对于了我们的fastq_aspera。
vim md5.txt
(rna) Apr7 16:15:45 ~/project/Human-16-Asthma-Trans/data/rawdata
$ cat md5.txt
9850074d512ab9d6db SRR1039508_2.fastq.gz
(rna) Apr7 16:32:11 ~/project/Human-16-Asthma-Trans/data/rawdata
$ vim md5.txt
(rna) Apr7 16:34:00 ~/project/Human-16-Asthma-Trans/data/rawdata
$ md5sum -c md5.txt
SRR1039508_2.fastq.gz: FAILED
SRR1039523_2.fastq.gz: FAILED
SRR1039515_1.fastq.gz: FAILED
md5sum: WARNING: 3 computed checksums did NOT match
方法二、生成md5与fasq一一对应
$ head -n 1 filereport_read_run_PRJNA229998_tsv.txt | tr '\t' '\n' | cat -n | less
(rna) Apr7 16:45:17 ~/project/Human-16-Asthma-Trans/data/rawdata
$ less -S filereport_read_run_PRJNA229998_tsv.txt |awk -F'\t' 'NR>1 {print$8}' |less -S |tr ';' '\n' |less -S
(rna) Apr7 16:49:15 ~/project/Human-16-Asthma-Trans/data/rawdata
$ less -S filereport_read_run_PRJNA229998_tsv.txt |awk -F'\t' 'NR>1 {print$8}' |less -S |tr ';' '\n' |less -S >md51
$ less -S filereport_read_run_PRJNA229998_tsv.txt |awk -F'\t' 'NR>1 {print$10}' |tr ';' '\n' |awk -F'/' '{print$7}' >md54 #以‘/’为分隔符,我们需要的数据在第7个,命名为md54
$ paste -d' ' md51 md54 |grep '_' #-d 指定分隔符为两个空格,然后合并md51 和md 54 ,之后再抓取‘_’
$ paste -d' ' md51 md54 |grep '_' >md5.txt
$ nohup md5sum -c md5.txt >check & #放到后台,生成check文件
$ cat check #看check里面的结果
生成md5值
$ md5sum filereport_read_run_PRJNA229998_tsv.txt
a7473f130f55843ef0349081e42fafa4 filereport_read_run_PRJNA229998_tsv.txt