bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-11_CDS_annotation_glimmer3.pl_2_6
Length=463
Score E
Sequences producing significant alignments: (Bits) Value
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 239 2e-68
gi|496050829|ref|WP_008775336.1| hypothetical protein 225 6e-63
gi|490418709|ref|WP_004291032.1| hypothetical protein 219 7e-61
gi|494822885|ref|WP_007558293.1| hypothetical protein 179 3e-46
gi|575094321|emb|CDL65708.1| unnamed protein product 154 2e-37
gi|494308783|ref|WP_007173938.1| hypothetical protein 151 9e-37
gi|647452987|ref|WP_025792807.1| hypothetical protein 150 2e-36
gi|575094354|emb|CDL65742.1| unnamed protein product 145 2e-34
gi|490477384|ref|WP_004347761.1| capsid protein 142 1e-33
gi|496521299|ref|WP_009229582.1| capsid protein 130 7e-30
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 239 bits (611), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 166/483 (34%), Positives = 232/483 (48%), Gaps = 57/483 (12%)
Query 1 MSDFNPLNRARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVP 60
MS L + S R+ FDLS K FTAKVGE+LP + PG+K+ I FTRT P
Sbjct 1 MSSVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQP 60
Query 61 VNTAAYTRIKEYYDFYAVPLRLISRALPQAFTQMTD--YItsaasstansstltsVPFVS 118
VN+AAY+R++EYYDFY VP RL+ P FT M D + SS S F
Sbjct 61 VNSAAYSRLREYYDFYFVPYRLLWNMAPTFFTNMPDPHHAADLVSSVNLSQRHPWFTFFD 120
Query 119 QTIFNAFFQTANAGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKKYLG 178
+ + + + ++ G V S KLL+ L YG K Y
Sbjct 121 IMEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYG------------FGKDYES 168
Query 179 VDSLNDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGAGN-- 236
V +D+D+ ++ + P LAYQKI D+F + QW+ Y YN+DY G +
Sbjct 169 VKVPSDSDDIVL-------SPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGF 221
Query 237 ----IGLVTD------MVQLRYANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsLLYY 286
D M L Y N+ KDYF GMLP +QYG V+V
Sbjct 222 HIPMSSFTNDAFKNPTMFDLNYCNFQKDYFTGMLPRAQYGDVSVA--------------- 266
Query 287 EPsssataalqsaggssssvrlsqtvsssQGIRL---NSD----LSALSIRATEYLQRWK 339
P S+ + +S + G+ + NS+ LS L++R E LQ+W+
Sbjct 267 SPIFGDLDIGDSSSLTFASAPQQGANTIQSGVLVVNNNSNTTAGLSVLALRQAECLQKWR 326
Query 340 EIVQFSSKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLDADSSQASIA 399
EI Q DY QM F + + H Y+GGW+S ++I+EVVNTNL D +QA I
Sbjct 327 EIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDISEVVNTNLTGD-NQADIQ 385
Query 400 GKGISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAFDHS 459
GKG + +G+ + ++ +EH IIMC+YH +P+LDW++ A Q T +D+ P FD
Sbjct 386 GKGTGTLNGNKVDFE-SSEHGIIMCIYHCLPLLDWSINRIARQNFKTTFTDYAIPEFDSV 444
Query 460 APQ 462
Q
Sbjct 445 GMQ 447
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 225 bits (573), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 157/490 (32%), Positives = 225/490 (46%), Gaps = 62/490 (13%)
Query 1 MSDFNPLNRARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVP 60
M++ L R T R+ FDLSSK+ FTAK GE+LP +PG+K+ I FTRT P
Sbjct 1 MANIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEVLPGDKWSIDLKSFTRTQP 60
Query 61 VNTAAYTRIKEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVPFVSQT 120
+NTAA+ R++EYYDFY VP L+ TQM D + +P +Q
Sbjct 61 LNTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYD---------NPQHATSYIPSANQA 111
Query 121 IFNAFFQTANAG----------DQPNT----RDDAGLPIVYGSCKLLDMLGYGSMIASNN 166
+ G D T ++ G G+ KLL+ LGYG+
Sbjct 112 LAGVMPNVTCKGIADYLNLVAPDVTTTNSYEKNYFGYSRSLGTAKLLEYLGYGNFYTYAT 171
Query 167 PSKAAITKKYLGVDSLNDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAY 226
TK PL ++ +N LAYQKIY D +SQWEK +
Sbjct 172 SKNNTWTKS------------PL--SSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCF 217
Query 227 NVDYWSGAGNIGLVTD-------------MVQLRYANYPKDYFMGMLPSSQYGSVAVLPs 273
NVDY SG + + D M LRY N+ KD F G+LP QYG A +
Sbjct 218 NVDYLSGTVDSAMTIDSMITGQGFAPFYNMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNV 277
Query 274 issssdsrsLLYYEPsssataalqsaggssssvrlsqtvsssQGIRLNSDLSALSIRATE 333
S+ S + P + + Q + + + L++R E
Sbjct 278 NLSNVLSAQYMVQTPDGDPVGGSPFSSTGVNL----------QTVNGSGTFTVLALRQAE 327
Query 334 YLQRWKEIVQFSSKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLDADS 393
+LQ+WKEI Q +KDY DQ+ + + E S Y+GG ++ ++INEVVN N+ S
Sbjct 328 FLQKWKEITQSGNKDYKDQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNITG-S 386
Query 394 SQASIAGKGISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQ 453
+ A IAGKG+ +G +++D G + +IMC+YH++P+LD+ P T +DF
Sbjct 387 NAADIAGKGVVVGNGR-ISFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDFAI 445
Query 454 PAFDHSAPQS 463
P FD +S
Sbjct 446 PEFDRVGMES 455
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 219 bits (558), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 162/478 (34%), Positives = 231/478 (48%), Gaps = 47/478 (10%)
Query 1 MSDFNPLNRARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVP 60
M++ L R R+ FDLS KK FTAK GE+LP + +PG+ ++I+ FTRT P
Sbjct 1 MANIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQP 60
Query 61 VNTAAYTRIKEYYDFYAVPLRLISRALPQAFTQMTDYItsaass--tansstltsVPFVS 118
VNTAA+ RI+EYYDF+ VP L+ TQM D A S T N +P+++
Sbjct 61 VNTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHAVSIDPTRNFVLSGEMPYMT 120
Query 119 Q----TIFNAFFQTANAGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITK 174
+ NA + D + G S KLL+ LGYG+ +
Sbjct 121 SEAIASYINALSTASALADYKSNY--FGYNRSKSSVKLLEYLGYGNY------------E 166
Query 175 KYLGVDSLNDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGA 234
+L D N A PL+ + + L LAYQKIY DF+ +SQWE+ +NVDY G+
Sbjct 167 SFL-TDDWNTA--PLMANLNHNIFGL--LAYQKIYSDFYRDSQWERVSPSTFNVDYLDGS 221
Query 235 G---NIGLVTDMVQ------LRYANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsLLY 285
+ T+ Q LRY N+ KD F G+LP QYG AV + +L
Sbjct 222 SMNLDNAYSTEFYQNYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGKLTLSN 281
Query 286 YEPsssataalqsaggssssvrlsqtvsssQGIRLNSDLSALSIRATEYLQRWKEIVQFS 345
+ G+S + + DLS L +R E+LQ+WKEI Q
Sbjct 282 FS-----------TVGTSPTTASGTATKNLPAFDTVGDLSILVLRQAEFLQKWKEITQSG 330
Query 346 SKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLDADSSQASIAGKGISS 405
+KDY DQ+ +G+ + Y+GG SS I+INEV+NTN+ S+ A IAGKG+
Sbjct 331 NKDYKDQLEKHWGVSVGDGFSELCTYLGGVSSSIDINEVINTNITG-SAAADIAGKGVGV 389
Query 406 NSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAFDHSAPQS 463
+G + ++ + +IMC+YH +P+LD+ P +D+ P FD QS
Sbjct 390 ANGE-INFNSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQS 446
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 179 bits (455), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 132/488 (27%), Positives = 228/488 (47%), Gaps = 47/488 (10%)
Query 1 MSDFNPLNRARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVP 60
M++ + R R+ +DL+ K FTAK G ++P +W +P + + F RT P
Sbjct 8 MANIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQP 67
Query 61 VNTAAYTRIKEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVPFVSQT 120
+NTAA+ R++ Y+DFY VP R + P A TQM + + +VP +
Sbjct 68 LNTAAFARMRGYFDFYFVPFRQMWNKFPTAITQMRTNL----LHASGPVLADNVPLSDEL 123
Query 121 IFNAFFQTAN-AGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKKYLGV 179
+ Q A+ ++++ G + C +L+ LGYG + G
Sbjct 124 PYFTAEQVADYIVSLADSKNQFGYYRAWLVCIILEYLGYGDFY--------PYIVEAAGG 175
Query 180 DSLNDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGAGNIGL 239
+ A P++ + + P AYQKIY DF +QWE+ +N+DY SG+ + L
Sbjct 176 EGATWATRPML--NNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISGSAD-SL 232
Query 240 VTD-----------MVQLRYANYPKDYFMGMLPSSQYGSVAVLPsissssdsr------- 281
D + +RY+N+ +D G +P +QYG + +P S
Sbjct 233 QLDFTVEGFKDSFNLFDMRYSNWQRDLLHGTIPQAQYGEASAVPVSGSMQVVEGPTPPAF 292
Query 282 -------sLLYYEPsssataalqsaggssssvrlsqtvsssQGIRLNSD----LSALSIR 330
+ L + ++ A S R+ + +++ G+ + D +S L++R
Sbjct 293 TTGQDGVAFLNGNVTIQGSSGYLQAQTSVGESRILRFNNTNSGLIVEGDSSFGVSILALR 352
Query 331 ATEYLQRWKEIVQFSSKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLD 390
E Q+WKE+ S +DY Q+ A +G + + ++G + ++INEVVN N+
Sbjct 353 RAEAAQKWKEVALASEEDYPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNIT 412
Query 391 ADSSQASIAGKGISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISD 450
+++ A IAGKG S +G ++ ++ G ++ I+MCV+H +P LD+ + T+T + D
Sbjct 413 GENA-ADIAGKGTMSGNG-SINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLD 470
Query 451 FPQPAFDH 458
FP P FD
Sbjct 471 FPIPEFDK 478
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 154 bits (389), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 144/499 (29%), Positives = 228/499 (46%), Gaps = 71/499 (14%)
Query 16 RSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVPVNTAAYTRIKEYYDF 75
R+SFDLS + +FTAKVGE+LPC+ Q PG+ ++SS +FTRT P+ + A+TR++E +
Sbjct 19 RNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPLQSNAFTRLRENVQY 78
Query 76 YAVPLRLISRALPQAFTQMT------DYItsaasstansstltsVPFVSQTIFNAFF--- 126
+ VP + + MT D A+S N T +P V+ +A+
Sbjct 79 FFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVTTQMPCVNYKTLHAYLLKF 138
Query 127 ---QTANAGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKKYLGVDSLN 183
T + + G S KLL +LGYG N P + A K + D N
Sbjct 139 INRSTVGSDGSVGPEFNRGCYRHAESAKLLQLLGYG-----NFPEQFANFK--VNNDKHN 191
Query 184 DAD---NPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGAGNIGLV 240
+ + Y S ++ LAY KI D + QW+ + A NVDY + + L
Sbjct 192 QSGQNFKDVTYNNSPYLSIFRLLAYHKICNDHYLYRQWQPYNASLCNVDYLTPNSSSLLS 251
Query 241 TD-----------------MVQLRYANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsL 283
D ++ +R++N P DYF G+LP+SQ+GS +V+ ++ ++
Sbjct 252 IDDALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQFGSESVVNLNLGNASGSAV 311
Query 284 L------------------YYEPsssataalqsaggssssvrlsqtvsssQGIRLNS--- 322
L E +++A +S+ +S + S + +N+
Sbjct 312 LNGTTSKDSGRWRTTTGEWEMEQRVASSANGNLKLDNSNGTFISHDHTFSGNVAINTSLS 371
Query 323 -DLSALSIRATEYLQRWKEIVQFSSKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVINI 381
+LS +++R Q++KEI + D+ Q+ A FGIK P+ +S +IGG SS+INI
Sbjct 372 GNLSIIALRNALAAQKYKEIQLANDVDFQSQVEAHFGIK-PDEKNENSLFIGGSSSMINI 430
Query 382 NEVVNTNLDADSSQ---ASIAGKGISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTG 438
NE +N NL D+ A+ G G +S TY +++ +Y P+LD+ G
Sbjct 431 NEQINQNLSGDNKATYGAAPQGNGSASIKFTAKTYG------VVIGIYRCTPVLDFAHLG 484
Query 439 QAPQLTVTAISDFPQPAFD 457
L T SDF P D
Sbjct 485 IDRTLFKTDASDFVIPEMD 503
>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=553
Score = 151 bits (382), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 123/465 (26%), Positives = 207/465 (45%), Gaps = 48/465 (10%)
Query 10 ARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVPVNTAAYTRI 69
R + +R++FDLS + LFTA G +LP IP + I++ F RT+P+NTAA+ +
Sbjct 11 TRPNRNRNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASM 70
Query 70 KEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVPFVS-QTIFNAFFQT 128
+ Y+F+ VP + Q T M D+ +SA S ++ VP+ + ++FN+
Sbjct 71 RGVYEFFFVPYHQLWAQFDQFITGMNDFHSSANKSIQGGTSPLQVPYFNVDSVFNSLNTG 130
Query 129 ANAGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKKYLGVDSLNDADNP 188
+G + DD YG+ +LLD+LGYG S + D+++ N
Sbjct 131 KESG--SGSTDDLQYKFKYGAFRLLDLLGYGRKFDSFGTAYP---------DNVSGLKNN 179
Query 189 LVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGA-GNIGLVTDMVQLR 247
L Y S LAY KIY D++ NS +E ++N D + G + +V D+ +LR
Sbjct 180 LDYNCS----VFRILAYNKIYQDYYRNSNYENFDTDSFNFDKFKGGLVDAKVVADLFKLR 235
Query 248 YANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsLLYYEPsssataalqsaggssssvr 307
Y N DYF + S L + + + A
Sbjct 236 YRNAQTDYFTNLRQSQ-------------------LFSFTTAFEDVDNINIAPRDYVKSD 276
Query 308 lsqtvsssQGIRLNS---DLSALSIRATEYLQRWKEIVQFSSKDYSDQMAAQFGIKAPEY 364
S + G+ +S D S S+RA + + + + K + DQM A +G++ P+
Sbjct 277 GSNFTRVNFGVDTDSSEGDFSVSSLRAAFAVDKLLSVTMRAGKTFQDQMRAHYGVEIPDS 336
Query 365 MGNHSHYIGGWSSVININEVVNTNLDADSS-------QASIAGKGISSNSGHTLTYDCGA 417
+Y+GG+ S + +++V T+ + +AGKG S G + +D
Sbjct 337 RDGRVNYLGGFDSDMQVSDVTQTSGTTATEYKPEAGYLGRVAGKGTGSGRGR-IVFDA-K 394
Query 418 EHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAFDHSAPQ 462
EH ++MC+Y VP + ++ T P + D+ P F++ Q
Sbjct 395 EHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENLGMQ 439
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 150 bits (380), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 142/474 (30%), Positives = 212/474 (45%), Gaps = 63/474 (13%)
Query 6 PLNRARISTHRSSFDLSSKKLFTAKVGEILP--CYWQIAIPGNKYRISSDWFTRTVPVNT 63
P + R++ R+ FDLSS+++F+AK G++LP C W++ P ++ S RT +NT
Sbjct 2 PAPKPRLA--RNGFDLSSRRIFSAKAGQLLPIGC-WEVN-PSEHFKFSVQDLVRTTTLNT 57
Query 64 AAYTRIKEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVPFVSQTIFN 123
A+Y R+KEYY F+ V R +L Q F Q + S L V T +N
Sbjct 58 ASYARMKEYYHFFFVSYR----SLWQWFDQFI------VGTNNPHSALNGVKKNGTTNYN 107
Query 124 AFFQTANAGD--------QPNTRDDAGLPIVYGSCKLLDMLGYGSMIASN--NPSKAAIT 173
+ D + + D G G+ KLL+ML YG N +
Sbjct 108 QICSSVPTFDLGKLITRLKTSDMDSQGFNYSEGAAKLLNMLNYGVTNKGKFMNLENLITS 167
Query 174 KKYLGVDSLNDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSG 233
YL S +D + +Y V+ LAYQKI+ DF+ N W ++NVD ++
Sbjct 168 TSYL--PSKDDKEPSSIYACK--VSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVDDYAD 223
Query 234 AGNIGLVTDM----VQLRYANYPKDYFMGMLPSSQYG-SVAVLPsissssdsrsLLYYEP 288
N+ + D+ Q+RY Y KD+ M P+ Y + LP + + L
Sbjct 224 DSNLTIEPDVALKFCQMRYRPYAKDWLTSMKPTPNYSDGIFNLPEYVRGNGNVIL----- 278
Query 289 sssataalqsaggssssvrlsqtvsssQGIRLNSDLSALSIRATEYLQRWKEIVQFSSK- 347
+ S +VS G S S +RA L + E + ++
Sbjct 279 ----------------TNNKSGSVSLDSGTVSPSSFSVNDLRAAFALDKMLEATRRANGL 322
Query 348 DYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLDA--DSSQASI---AGKG 402
DY+ Q+ A FG K PE N + ++GG+ + I ++EVV+TN +A D S ASI GKG
Sbjct 323 DYASQIEAHFGFKVPESRANDARFLGGFDNSIVVSEVVSTNGNAASDGSHASIGDLGGKG 382
Query 403 ISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAF 456
I S S T+ +D EH IIMC+Y P ++N + P F QP F
Sbjct 383 IGSMSSGTIEFDS-TEHGIIMCIYSVAPQSEYNASYLDPFNRKLTREQFYQPEF 435
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 145 bits (365), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 96/268 (36%), Positives = 139/268 (52%), Gaps = 30/268 (11%)
Query 16 RSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVPVNTAAYTRIKEYYDF 75
R+ FDLS KK FTAK GE+LP ++ +PG+ + I+ FTRT P+NT+A+ R++EYYDF
Sbjct 12 RNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTSAFARMREYYDF 71
Query 76 YAVPLRLISRALPQAFTQMTDYItsaa--sstansstltsVP-FVSQTIFNAFFQTANAG 132
Y VP + TQM + A+ + N+ +P F S+ I + A A
Sbjct 72 YFVPFEQMWNKFDSCITQMNANVQHASGPTLDDNTPLSGRMPYFTSEQIADYLNDQATAA 131
Query 133 DQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKKYLGVDSLNDADNPLVYQ 192
++ G +CKLL LGYG + ++ + K PL+Y
Sbjct 132 ----RKNPFGFNRSTLTCKLLQYLGYGDYNSFDSETNTWSAK-------------PLLYN 174
Query 193 TSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGAGNI-----GLVTD---MV 244
++ P LAYQKIY DF+ +QWEK +N+DY G ++ GL +D
Sbjct 175 LE--LSPFPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKGTSDLQMDLTGLPSDDNNFF 232
Query 245 QLRYANYPKDYFMGMLPSSQYGSVAVLP 272
+RY NY KD F G+LP +QYGS +V+P
Sbjct 233 DIRYCNYQKDMFHGVLPVAQYGSASVVP 260
Score = 105 bits (263), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 85/137 (62%), Gaps = 2/137 (1%)
Query 327 LSIRATEYLQRWKEIVQFSSKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVN 386
L++R E+LQ+WKE+ +DY Q+ +GIK +++ + + Y+GG ++ ++INEV+N
Sbjct 354 LALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCATSLDINEVIN 413
Query 387 TNLDADSSQASIAGKGISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVT 446
N+ D++ A IAGKG + +G ++ ++ E+ IIMC+YH +P++D+ +G T+
Sbjct 414 NNITGDNA-ADIAGKGTFTGNG-SIRFESKGEYGIIMCIYHVLPIVDYVGSGVDHSCTLV 471
Query 447 AISDFPQPAFDHSAPQS 463
+ FP P D +S
Sbjct 472 DATSFPIPELDQIGMES 488
>gi|490477384|ref|WP_004347761.1| capsid protein [Prevotella buccalis]
gi|281300712|gb|EFA93043.1| putative capsid protein (F protein) [Prevotella buccalis ATCC
35310]
Length=552
Score = 142 bits (358), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 130/475 (27%), Positives = 208/475 (44%), Gaps = 58/475 (12%)
Query 1 MSDFNPLNRA-RISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTV 59
MS PL +A R + R++FDLS K LFTA G +LP IP + I + F R +
Sbjct 1 MSKKIPLIKASRANRPRNAFDLSQKHLFTAHAGMLLPVMTLDLIPHDHVSIQATDFMRCL 60
Query 60 PVNTAAYTRIKEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVP-FVS 118
P+N+AA+ ++ Y+F+ VP + Q T M DY + S S + +P F
Sbjct 61 PMNSAAFMSMRSVYEFFFVPYSQLWHPFDQFITGMNDYRSVLQSDLYKSKSPLVIPSFKR 120
Query 119 QTIFNAFFQTANAG---DQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKK 175
+ ++ F A G Q N D G + +LLD+LGYG + ++ S+ K
Sbjct 121 KELYELF--NAPGGFLNQQSNQPDIFGFKSRFNFLRLLDLLGYGVYVNADGSSRIDAFSK 178
Query 176 YLGVDSLNDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGA- 234
L ++ ++ AYQKIY DF+ N+ +E ++++D + +
Sbjct 179 LL--------------DDTEKLSIFRLAAYQKIYSDFYRNTTYEAVDVSSFSLDNITDSI 224
Query 235 GNIGLVTDMVQLRYANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsLLYYEPsssata 294
I LRY N DYF + P+ P + S + Y P ++ +
Sbjct 225 SAINAFKRFGTLRYRNAQLDYFTNLRPT---------PLFDLDNPSLNSFYNTPGNADSV 275
Query 295 alqsaggssssvrlsqtvsssQGIRLNSD-LSALSIRATEYLQRWKEIVQFSSKDYSDQM 353
++ S + +L+SD L+ SIR L + I Q + K Y++Q+
Sbjct 276 SIDSDSNAV-------------NFQLDSDLLTVQSIRNAFALDKLMRITQRAGKTYAEQI 322
Query 354 AAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLDADSSQ-----------ASIAGKG 402
A FG + E +YIGG+ S I + +V + S + + GK
Sbjct 323 KAHFGFEVSEGRDGRVNYIGGFDSNIQVGDVTQMSGTTASPEQGVSIKHGGYLGRVTGKA 382
Query 403 ISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAFD 457
S SGH + +D EH I+MC+Y VP + ++ T P +T + DF P F+
Sbjct 383 QGSGSGH-IEFDA-HEHGILMCIYSLVPDMQYDATRIDPFVTKLSRGDFFMPEFE 435
>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon
317 str. F0108]
Length=541
Score = 130 bits (328), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 126/467 (27%), Positives = 199/467 (43%), Gaps = 62/467 (13%)
Query 10 ARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVPVNTAAYTRI 69
+R + RS+FDLS K L+TA G +LP + + RI + F RT+P+N+AA+ +
Sbjct 12 SRANRPRSAFDLSQKHLYTAPAGALLPVLSVDLMFHDHIRIQAQDFMRTMPMNSAAFISM 71
Query 70 KEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVPFVSQTIFNAFFQTA 129
+ Y+F+ VP + Q T M DY +S SS A L SVP V F +
Sbjct 72 RGVYEFFFVPYSQLWHPYDQFITSMNDYRSSVVSSAAGDKALDSVPNVKLADMYKFVR-- 129
Query 130 NAGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKKYLGVDSLNDADNPL 189
+ +D G P SC+L+D+LGYG I S SK + PL
Sbjct 130 ----ERTDKDIFGYPHSNNSCRLMDLLGYGKPITS---SKTPV---------------PL 167
Query 190 VYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSG--AGNIGLVTDMVQLR 247
+Y + VN LAY KIY D++ N+ +E Y++N+D+ G + L
Sbjct 168 LY--TGNVNLFRLLAYNKIYSDYYRNTTYEGVDVYSFNIDHKKGTFVPTADEFKKYLNLH 225
Query 248 YANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsLLYYEPsssataalqsaggssssvr 307
Y N P D++ + P+ + +I S S S L +P+ SA + +
Sbjct 226 YRNAPLDFYTNLRPTPLF-------TIGSDSFSSVLQLSDPTGSAGFSADGNSAKLNMAS 278
Query 308 lsqtvsssQGIRLNSDLSALSIRATEYLQRWKEIVQFSSKDYSDQMAAQFGIKAPEYMGN 367
L+ +IR+ L + I + K Y++Q+ A FG+ E
Sbjct 279 PDV-------------LNVSAIRSAFALDKLLSISMRAGKTYAEQIEAHFGVTVSEGRDG 325
Query 368 HSHYIGGWSSVININEVVNT------------NLDADSSQASIAGKGISSNSGHTLTYDC 415
+Y+GG+ S + + +V T N I GKG S G + +D
Sbjct 326 QVYYLGGFDSNVQVGDVTQTSGTTNPNVSEVGNAKLAGYLGKITGKGTGSGYGE-IQFDA 384
Query 416 GAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAFDHSAPQ 462
E ++MC+Y VP + ++ P + D+ P F++ Q
Sbjct 385 -KEPGVLMCIYSVVPAMQYDCMRLDPFVAKQTRGDYFIPEFENLGMQ 430
Lambda K H a alpha
0.319 0.133 0.403 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 3145328611953