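OK, this is one of this week's (2016.10.21-28) study tasks: install bowtie2 and learn how to use it and how to set its parameters.
So, time to chew through the docs: the official manual, Version 2.2.9, http://bowtie-bio.sourceforge.net/bowtie2/manual.shtml
What follows is my summary. I don't produce documentation, I just haul it around~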
Bowtie 2 is well suited to aligning reads from about 50 bp up to hundreds or thousands of bp against long reference sequences. Bowtie 2 indexes the genome with an FM Index
(based on the Burrows-Wheeler Transform, or BWT). Output is in SAM format. It is already integrated into many tools, such as
TopHat (a fast splice junction mapper for RNA-seq reads),
Cufflinks (transcriptome assembly and isoform quantitation from RNA-seq reads),
Crossbow (a cloud-enabled tool for analyzing resequencing data),
Myrna (a cloud-enabled tool for aligning RNA-seq reads and measuring differential gene expression).
Differences between Bowtie 1 and 2: Bowtie 2's command-line arguments and genome index format are both different from Bowtie 1's.
1. Bowtie 1 came out earlier and works well for reads shorter than 50 bp, while Bowtie 2 does better on reads longer than 50 bp.
2. Bowtie 2 supports gapped alignment. Number of gaps and gap lengths are not restricted, except by way of the configurable scoring scheme.
3. Bowtie 2 supports local alignment (local: some characters at the ends of the read may be omitted/trimmed) as well as global alignment (end-to-end: all read characters participate).
4. Bowtie 2 has no upper limit on read length, whereas Bowtie 1 cannot go beyond about 1000 bp.
5. Bowtie 2 allows alignments to overlap ambiguous characters (e.g. `N`s) in the reference. Bowtie 1 does not.
6. Bowtie 2 cannot align colorspace reads.
7. Bowtie 2's paired-end alignment is more flexible. For pairs that do not align in a paired fashion, it tries to find unpaired alignments for each mate.
8. Bowtie 2 reports a spectrum of mapping qualities, whereas Bowtie 1 reports either 0 or high.
MUMmer: aligns two very large sequences (e.g. two genomes).
NUCmer, BLAT/BLAST, Bowtie 2: very sensitive alignment to a relatively short reference sequence (e.g. a bacterial genome).
Installing bowtie2: download bowtie2-2.2.9-linux-x86_64.zip directly, unzip it, and add the extracted directory to your PATH.
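A quick way to check whether the PATH change took effect is sketched below. This is only a sanity check, not part of the official install steps; the helper names `find_tool` and `tool_version` are my own, and the only bowtie2 feature assumed is the `--version` flag.

```python
import shutil
import subprocess


def find_tool(name):
    """Return the full path of an executable found on PATH, or None if it is missing."""
    return shutil.which(name)


def tool_version(name):
    """Return the first line printed by `<name> --version`, or None if the tool is missing."""
    path = find_tool(name)
    if path is None:
        return None
    result = subprocess.run([path, "--version"], capture_output=True, text=True)
    lines = result.stdout.splitlines()
    return lines[0] if lines else ""


if __name__ == "__main__":
    # Prints the location of bowtie2, or a hint that PATH still needs fixing.
    print(find_tool("bowtie2") or "bowtie2 is not on PATH yet")
```

If this prints the hint instead of a path, the extracted directory is probably not on PATH in the current shell yet.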
Scores: a higher score means a more similar alignment
--ma : match bonus
--mp : mismatch penalty
--np : penalty for having N in either the read or the ref
--rdg : affine read gap penalty
--rfg : affine ref gap penalty
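To see how these options fit on a command line, here is a small sketch that only builds the argument list and does not run anything. The values written out are the defaults as I read them in the 2.2.9 manual, so double-check them against your version; the function name and the file names (`my_index`, `reads.fq`, `out.sam`) are hypothetical.

```python
def bowtie2_local_command(index, reads, out_sam):
    """Build a bowtie2 --local argument list with the scoring options spelled out."""
    return [
        "bowtie2", "--local",
        "--ma", "2",     # match bonus (only used in --local mode)
        "--mp", "6,2",   # maximum,minimum mismatch penalty
        "--np", "1",     # penalty for an N in the read or the ref
        "--rdg", "5,3",  # read gap: open penalty, extension penalty
        "--rfg", "5,3",  # ref gap: open penalty, extension penalty
        "-x", index,     # basename of the genome index
        "-U", reads,     # unpaired reads file
        "-S", out_sam,   # SAM output file
    ]


if __name__ == "__main__":
    # Hypothetical file names, just to show the resulting command line.
    print(" ".join(bowtie2_local_command("my_index", "reads.fq", "out.sam")))
```

The list form can be handed to `subprocess.run` as-is once real index and read files exist.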
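End-to-end alignment example: by default, a mismatch at a high-quality position costs -6 and a length-2 gap costs -11 (gap open -5, plus -3 for each of the two extensions). If these are the only two problems in a 50 bp read, the total score is -17. So the best possible score is 0, meaning the read and the reference are identical.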
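The arithmetic in this example can be written as a tiny function. This is a simplified sketch, not bowtie2's actual scoring code: it assumes every mismatch sits at a high-quality position and that read gaps and reference gaps cost the same. The function and parameter names are mine.

```python
def end_to_end_score(mismatches, gap_lengths,
                     mismatch_penalty=6, gap_open=5, gap_extend=3):
    """Score of an end-to-end alignment: 0 minus all penalties.

    mismatches  -- number of mismatches (assumed to be at high-quality positions)
    gap_lengths -- list with the length of each gap
    """
    penalty = mismatches * mismatch_penalty
    for length in gap_lengths:
        # Affine gap cost: one open penalty plus one extension penalty per gap position.
        penalty += gap_open + gap_extend * length
    return -penalty


if __name__ == "__main__":
    # One mismatch and one length-2 gap, as in the example above.
    print(end_to_end_score(1, [2]))  # -17
```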
Default min score threshold (can be changed with --score-min):
-0.6 + -0.6*L, where L is the read length
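To get a feel for the numbers, the default end-to-end threshold can be computed per read length. A minimal sketch with function names of my own, assuming an alignment counts as valid when its score is no less than the threshold:

```python
def min_score_end_to_end(read_length):
    """Default end-to-end minimum score: -0.6 + -0.6 * L."""
    return -0.6 + -0.6 * read_length


def is_valid_end_to_end(score, read_length):
    """An alignment is kept if its score is not below the threshold."""
    return score >= min_score_end_to_end(read_length)


if __name__ == "__main__":
    print(round(min_score_end_to_end(50), 1))  # -30.6
    print(is_valid_end_to_end(-17, 50))        # True
```

So for a 50 bp read, the score of -17 from the end-to-end example is comfortably above the cutoff.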
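Local alignment example: penalties are the same as above, but each match earns a bonus of +2. In the same situation (49 matching positions in the 50 bp read) the score is 2*49 - 6 - 11 = 81.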
Default min score threshold:
20 + 8*ln(L), where L is the read length
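The local-mode numbers can be checked the same way. Again a simplified sketch rather than bowtie2's real code: mismatches are assumed to be at high-quality positions, read and reference gaps are assumed to cost the same, and all names are mine.

```python
import math


def local_score(matches, mismatches, gap_lengths,
                match_bonus=2, mismatch_penalty=6, gap_open=5, gap_extend=3):
    """Score of a local alignment: match bonuses minus all penalties."""
    score = matches * match_bonus - mismatches * mismatch_penalty
    for length in gap_lengths:
        # Affine gap cost: one open penalty plus one extension penalty per gap position.
        score -= gap_open + gap_extend * length
    return score


def min_score_local(read_length):
    """Default local minimum score: 20 + 8 * ln(L)."""
    return 20 + 8 * math.log(read_length)


if __name__ == "__main__":
    score = local_score(matches=49, mismatches=1, gap_lengths=[2])
    print(score)                          # 81
    print(score >= min_score_local(50))   # True
```

For a 50 bp read the threshold works out to a little over 51, so the example alignment with score 81 passes.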