bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-5_CDS_annotation_glimmer3.pl_2_7
Length=748
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094326|emb|CDL65712.1| unnamed protein product 374 6e-114
gi|547920049|ref|WP_022322420.1| capsid protein VP1 114 3e-23
gi|649569140|gb|KDS75238.1| capsid family protein 108 6e-22
gi|649555287|gb|KDS61824.1| capsid family protein 108 1e-21
gi|492501782|ref|WP_005867318.1| hypothetical protein 105 3e-20
gi|639237429|ref|WP_024568106.1| hypothetical protein 104 3e-20
gi|530695351|gb|AGT39907.1| major capsid protein 97.4 7e-18
gi|609718276|emb|CDN73650.1| conserved hypothetical protein 97.1 9e-18
gi|495508923|ref|WP_008233569.1| capsid protein 93.6 1e-17
gi|313766923|gb|ADR80650.1| putative major coat protein 96.3 2e-17
>gi|575094326|emb|CDL65712.1| unnamed protein product [uncultured bacterium]
Length=758
Score = 374 bits (959), Expect = 6e-114, Method: Compositional matrix adjust.
Identities = 205/373 (55%), Positives = 262/373 (70%), Gaps = 16/373 (4%)
Query 371 LTSDKPVDLTLGSS---PYYNSGSAN-KDKQIKISAYSFRAYEGIYNAYIRDNRNNPYYV 426
+T DK +D+ +GSS PYY GSAN DK IK+SAY FRAYE IYNAYIR+ RNNP+ +
Sbjct 330 ITFDK-LDVFIGSSGKYPYY--GSANMSDKAIKLSAYPFRAYEAIYNAYIRNTRNNPFVL 386
Query 427 NGQVQYNKWIPTYDGGADQ-NIYELRYANWEKDFLTTAVQSPQQGTAPLVGIttytetve 485
NG+ YN+WI T GG+D +LR+ANW+ D TTA+ +PQQG APLVG+TTY
Sbjct 387 NGKKTYNRWITTDAGGSDTLTPRDLRFANWQSDAYTTALTAPQQGVAPLVGLTTYEIRSV 446
Query 486 ttSDDGTPVTRELSRIALVDEDGKKYQVSFDSDSEGLKGVSYVELDNEVKLRQPRNLIDV 545
+D G VT A+VDE+G Y+V F+S+ E LKGV+Y L + ++L+
Sbjct 447 --NDAGHEVTT--VNTAIVDEEGNAYKVDFESNGEALKGVNYTPLKAGEAVNM-QSLVSP 501
Query 546 VTSGISINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDI 605
VTSGISIND RNVNAYQ++LELN +G+SY++IIEGRF+V VRYD L MPE+ GG +RDI
Sbjct 502 VTSGISINDFRNVNAYQRYLELNQFRGFSYKEIIEGRFDVNVRYDALNMPEYLGGITRDI 561
Query 606 EMHSISQTVDQDLDGSQTYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLP 665
++ I+QTV+ GS +Y +LGSQSG+A G++ ++ FCDEESIVMGI+ V P+P
Sbjct 562 VVNPITQTVETT--GSGSYVGSLGSQSGLATCFGNTDGSISVFCDEESIVMGIMYVMPMP 619
Query 666 VYTQLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWY 725
VY LLPK TYR LD + PEF+HIG+QPI KE+ P+Q D D + VFGY RPWY
Sbjct 620 VYDSLLPKWLTYRERLDSFNPEFDHIGYQPIYAKELGPMQCVQDDIDP-NTVFGYQRPWY 678
Query 726 EYVQKYDQAHGLF 738
EYV K D+AHGLF
Sbjct 679 EYVAKPDRAHGLF 691
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 64/129 (50%), Positives = 90/129 (70%), Gaps = 4/129 (3%)
Query 5 AFDATLDVNNEIKVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMV 64
F+ D+ N++K N+FDWSH NN TT +GR+TPVF ELVP + S+RI P FGL+FMPM+
Sbjct 4 VFNKIGDIKNDVKRNSFDWSHDNNFTTDLGRITPVFTELVPPNSSIRIKPEFGLRFMPMM 63
Query 65 FPIQTRLRARMMFFKYPLRALWDGYRDFI-GNFREDLEEPYLDLNTV--TRLDAMAKTGS 121
FPIQT+++A + F+K PLR LW Y DFI + E+ + PY+ ++ + +A +G
Sbjct 64 FPIQTKMKAYLSFYKVPLRTLWADYMDFISSDNTEEFQPPYMSFDSTDYSEGGTLAPSG- 122
Query 122 LGDYLGLPT 130
LGDY G+PT
Sbjct 123 LGDYFGIPT 131
>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/359 (31%), Positives = 156/359 (43%), Gaps = 46/359 (13%)
Query 399 KISAYSFRAYEGIYNAYIRD-NRNNPYYVNGQVQYNKWIPTYDGGADQ--NIYELRYANW 455
++SA FRAY+ IYN Y RD N P + + T GG DQ + LR W
Sbjct 159 QVSALPFRAYQLIYNEYYRDQNLTEP------IDFTLGSGTTVGG-DQLMALMSLRRRAW 211
Query 456 EKDFLTTAVQSPQQG---TAPLVGIttytetvettSDDGTPVTRELSRIALVDEDGKKYQ 512
EKD+ T+A+ Q+G T P+ G + V D R E+G Y
Sbjct 212 EKDYFTSALPWLQRGPEVTVPVQGAGGSMDVVYERQSDSQKWVDSSGREF---ENGHAYD 268
Query 513 VSF----DSDSEGLKGVS------YVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQ 562
++ D +S + V+ ELD L+ ++V GI+INDLR NA Q
Sbjct 269 ITMARANDPNSALMVAVNGGTNNRAPELDPNGTLK-----VNVDEMGININDLRTSNALQ 323
Query 563 KFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQ 622
++ E N R G Y + I F V+ L P+F GG I + + QT D Q
Sbjct 324 RWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGRMPISVSEVLQTSSTDETSPQ 383
Query 623 TYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLD 682
G +GI + + +E ++GI+ +TP Y Q +P+ FT +D
Sbjct 384 ANMAGHGISAGI-------NNGFKHYFEEHGYIIGIMSITPRSGYQQGVPRDFTKFDNMD 436
Query 683 HYQPEFNHIGFQPILYKE--VCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFR 739
Y PEF H+ Q I +E V AY++G FGY + EY +AHG FR
Sbjct 437 FYFPEFAHLSEQEIKNQELFVSEDAAYNNG------TFGYTPRYAEYKYHPSEAHGDFR 489
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/124 (30%), Positives = 60/124 (48%), Gaps = 10/124 (8%)
Query 17 KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM 76
+ N F+ S+ + LT +G + P+ C V + R+ ++ P+V P+ R+
Sbjct 14 RRNAFNLSYESKLTLNMGELVPIMCMPVVSGDKFRVKTESLVRLAPLVAPMMHRVNVFTH 73
Query 77 FFKYPLRALWDGYRDFI--GNFREDLEE-PYLDLNTVTRLDAMAKT-------GSLGDYL 126
+F P R +W+ + DFI G ED+ P + +N + L + A SL DYL
Sbjct 74 YFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSLWDYL 133
Query 127 GLPT 130
GLPT
Sbjct 134 GLPT 137
>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str.
3999B T(B) 6]
Length=390
Score = 108 bits (270), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 95/346 (27%), Positives = 147/346 (42%), Gaps = 30/346 (9%)
Query 399 KISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIYELRYANWEKD 458
K+SA FRAY IYN Y RD + +++ Y + ++++L WEKD
Sbjct 6 KVSALPFRAYHLIYNEYYRDQN-----LTSELEITLDSGNYQLPVNSSLWQLHRRAWEKD 60
Query 459 FLTTAVQSPQQGTAPLVGIttytetvettSDD--GTPVTRELSRIALVDEDGKKYQVSFD 516
+ T+A+ Q+G V I E + +T R + + S
Sbjct 61 YFTSALPWVQRGPEVTVPINGGGEIPVEMKEGFAAQKITTFPDRKPISGSEVLYSAPSVL 120
Query 517 SDSE--GLKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLELNMRKGYS 574
S + +KG + +E DN V ++ G++IND+R NA Q++ E N R G
Sbjct 121 SYGQIGSIKGQALIEPDNFV--------VNTDQMGVNINDIRTSNALQRWFERNARSGSR 172
Query 575 YRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGI 634
Y + I F V+ L P+F GG I + + QT D Q G +G+
Sbjct 173 YIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGV 232
Query 635 AGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQ 694
+ +E +MGI+ + P Y Q +PK F +D Y PEF H+G Q
Sbjct 233 -------NHGFTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQ 285
Query 695 PILYKEVCPLQAYSDGPDTLSD-VFGYNRPWYEYVQKYDQAHGLFR 739
I +E+ Y + D ++ FGY + EY ++ HG FR
Sbjct 286 EIKNEEL-----YLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFR 326
>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=541
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 95/346 (27%), Positives = 147/346 (42%), Gaps = 30/346 (9%)
Query 399 KISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIYELRYANWEKD 458
K+SA FRAY IYN Y RD + +++ Y + ++++L WEKD
Sbjct 157 KVSALPFRAYHLIYNEYYRDQN-----LTSELEITLDSGNYQLPVNSSLWQLHRRAWEKD 211
Query 459 FLTTAVQSPQQGTAPLVGIttytetvettSDD--GTPVTRELSRIALVDEDGKKYQVSFD 516
+ T+A+ Q+G V I E + +T R + + S
Sbjct 212 YFTSALPWVQRGPEVTVPINGGGEIPVEMKEGFAAQKITTFPDRKPISGSEVLYSAPSVL 271
Query 517 SDSE--GLKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLELNMRKGYS 574
S + +KG + +E DN V ++ G++IND+R NA Q++ E N R G
Sbjct 272 SYGQIGSIKGQALIEPDNFV--------VNTDQMGVNINDIRTSNALQRWFERNARSGSR 323
Query 575 YRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGI 634
Y + I F V+ L P+F GG I + + QT D Q G +G+
Sbjct 324 YIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGV 383
Query 635 AGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQ 694
+ +E +MGI+ + P Y Q +PK F +D Y PEF H+G Q
Sbjct 384 -------NHGFTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQ 436
Query 695 PILYKEVCPLQAYSDGPDTLSD-VFGYNRPWYEYVQKYDQAHGLFR 739
I +E+ Y + D ++ FGY + EY ++ HG FR
Sbjct 437 EIKNEEL-----YLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFR 477
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/120 (31%), Positives = 58/120 (48%), Gaps = 6/120 (5%)
Query 17 KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM 76
+ N F+ S+ N LT G + P+ C+ V R+N ++ P+V P+ R+
Sbjct 14 RRNVFNLSYENKLTVNAGELIPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVDVFTH 73
Query 77 FFKYPLRALWDGYRDFIGNFREDLEEP----YLDLNTVTRLDAMAK--TGSLGDYLGLPT 130
+F P R +W+ + DFI + + P Y +TV +A GSL DYLGLP+
Sbjct 74 YFFVPNRLIWNKWEDFITKGVDGTDSPVFPTYSFPSTVDTANAHNSFGDGSLWDYLGLPS 133
>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis
CL09T03C24]
Length=538
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 99/346 (29%), Positives = 145/346 (42%), Gaps = 34/346 (10%)
Query 399 KISAYSFRAYEGIYNAYIRD-NRNNPYYVNGQVQYNKWIPTYDGGADQNIYELRYANWEK 457
++SA FRAY+ IYN Y RD N P + N I + LR WEK
Sbjct 158 QVSALPFRAYQLIYNEYYRDQNLTKPI----EFSLNSGI-VLSADEVTRLLTLRRRTWEK 212
Query 458 DFLTTAVQSPQQG---TAPLVGIttytetvettSDDGTPVTRELSRIALVDEDGKKYQVS 514
D+ T+A+ Q+G T P+ G G + L A D +
Sbjct 213 DYFTSALPWVQRGPEVTVPIQG-------------SGGNLDVTLKNDAHADTYRMPGTSN 259
Query 515 FDSDSEGLKGVSYVELDNEVKLRQPRNL-IDVVTSGISINDLRNVNAYQKFLELNMRKGY 573
+ + L G + + + +P N ++V G+SINDLR NA Q++ E N R G
Sbjct 260 RPAGAMQLVGGALIAGGTDGAYLEPDNFQVNVDELGVSINDLRTSNALQRWFERNARSGS 319
Query 574 SYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSG 633
Y + I F V+ L P+F GG I + + QT D Q G +G
Sbjct 320 RYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSATDSTSPQANMAGHGISAG 379
Query 634 IAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGF 693
+ + + +E ++GI+ + P Y Q +PK F +D Y PEF H+G
Sbjct 380 V-------NHGFKRYFEEHGYIIGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGE 432
Query 694 QPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFR 739
Q I +EV Q P + + FGY + EY ++ HG FR
Sbjct 433 QEIKNEEVYLQQT----PASNNGTFGYTPRYAEYKYSMNEVHGDFR 474
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/121 (31%), Positives = 55/121 (45%), Gaps = 7/121 (6%)
Query 17 KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM 76
+ N F+ S+ N LT G + P+ C+ V R+N ++ P+V P+ R+
Sbjct 14 RRNVFNLSYENKLTANAGELVPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVDVFTH 73
Query 77 FFKYPLRALWDGYRDFIGNFREDLEEPYL-------DLNTVTRLDAMAKTGSLGDYLGLP 129
+F P R LW+ + DFI + + P D T + GSL DYLGLP
Sbjct 74 YFFVPNRLLWNQWEDFITKGVDGTDTPVFPKIALRPDWVNPTSAAVLLDDGSLWDYLGLP 133
Query 130 T 130
T
Sbjct 134 T 134
>gi|639237429|ref|WP_024568106.1| hypothetical protein [Elizabethkingia anophelis]
Length=546
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 102/356 (29%), Positives = 150/356 (42%), Gaps = 31/356 (9%)
Query 399 KISAYSFRAYEGIYNAYIRD-NRNNPYYV--NGQVQ------YNKWIPTYDGGADQNIYE 449
++S F AY+ I++ Y RD N + +V NG + N W P+ Q +++
Sbjct 138 RVSMLPFLAYQKIWDEYYRDENLIDSVFVDKNGDKRELFIDGINYWNPSLPYEFRQ-LFD 196
Query 450 LRYANWEKDFLTTAVQSPQQGTAPLVGIttytetvettSDDGTPVTRE----LSRIALVD 505
++ W D+ T+A+ Q+G A V + + G ++ LS
Sbjct 197 IKKRAWHHDYFTSALPFAQKGAA--VKMPLQMTADLFYNPGGNTFVKKPDGSLSHTGFRL 254
Query 506 EDGKKYQVSFDSDSEGLKGVSYVELDNEVKLRQPRNL-IDVVT-SGISINDLRNVNAYQK 563
EDG V D + S N V + NL +D+ T SG +INDLR Q+
Sbjct 255 EDG---SVPADGIGHLMVETSSTGNSNPVNIDNSSNLGVDLKTASGSTINDLRRAFKLQE 311
Query 564 FLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQT 623
+LE N R G Y + I F VK L PEF GG I + + Q D Q
Sbjct 312 WLEKNARAGSRYAESILSFFGVKTSDGRLQRPEFLGGNKTPILISEVLQQSSTDSTTPQG 371
Query 624 YAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDH 683
G G G F +E V+G++ V P Y+Q +P+HF+ D+
Sbjct 372 NMAGHGISVGKEG-------GFSKFFEEHGYVIGLMSVIPKTSYSQGIPRHFSKFDKFDY 424
Query 684 YQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFR 739
+ P+F HIG QP+ KE+ A + G VFGY + EY HG F+
Sbjct 425 FWPQFEHIGEQPVYNKEIF---AKNVGDYDSGGVFGYVPRYSEYKYSPSTIHGDFK 477
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 55/120 (46%), Gaps = 4/120 (3%)
Query 12 VNNEIKVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRL 71
V+ K + F+ S+ + G + P+ C+ + + INP+ + PM+ P+ +
Sbjct 8 VSKAPKSSTFNMSYDRKFSMNFGDLVPIHCQEIVPGDKISINPQHMTRLAPMLAPVMHEV 67
Query 72 RARMMFFKYPLRALWDGYRDFIGNFREDLEEPYLDLNTVTRLDAMAKTGSLGDYLGLPTT 131
+ +F P R LW + FI + L+ L + V L SLGDYLGLP T
Sbjct 68 NVFIHYFFVPNRILWKNWEAFITGGQSGLDAHMLPV--VQNLP--VPKSSLGDYLGLPLT 123
>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 93/353 (26%), Positives = 144/353 (41%), Gaps = 26/353 (7%)
Query 389 SGSANKDKQIKISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIY 448
+G + I ++ RAY I+N + RD +Q + + DG Y
Sbjct 137 AGQVDAGSSISHNSLFTRAYNLIWNEWFRDE---------NLQDSVVVDKGDGPDTYTDY 187
Query 449 ELRYANWEKDFLTTAVQSPQQGTAPLVGIttytetvettSDDGTPV-TRELSRIALVDED 507
L D+ T+A+ PQ+G A V + +D G P RE+S +
Sbjct 188 TLLRRGKRHDYFTSALPWPQKGDA--VTLPLGGSANVVYNDTGDPAYIREVSTGNVWTTP 245
Query 508 GKKYQVSFDSDSEGLKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLEL 567
++ S ++ G V ++ + + +IN +R Q+ LE
Sbjct 246 SRE---SVSKEANGNMSVPTGSVNAQYDPNGSLVADLSTATAATINAIRQSFQIQRLLER 302
Query 568 NMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKA 627
+ R G Y +I+ F V + PE+ GG S I ++ ++Q G+ T
Sbjct 303 DARGGTRYTEIVRSHFGVISPDARMQRPEYLGGGSAPIIVNPVAQQSASGASGTDTPLGT 362
Query 628 LGS-QSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQP 686
LG+ +G+A SG E +V+G+ V Y Q L + F+ D + P
Sbjct 363 LGAVGTGLA-----SGHGFASSFTEHGVVVGLCSVRADLTYQQGLHRMFSRSTRYDFFFP 417
Query 687 EFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFR 739
F+H+G QPIL KE+ Y+ G T DVFGY W EY K Q GL R
Sbjct 418 VFSHLGEQPILNKEL-----YATGTSTDDDVFGYQEAWAEYRYKPSQVTGLMR 465
Score = 39.7 bits (91), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 32/129 (25%), Positives = 55/129 (43%), Gaps = 3/129 (2%)
Query 15 EIKVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRAR 74
EI + FD G + P+ + V S+ + + +FP+ +
Sbjct 20 EIPRSKFDAQKTLKTAFDSGYLVPILVDEVLPGDSMNLRMTAFTRLATPLFPVMDNMYLD 79
Query 75 MMFFKYPLRALWDGYRDFIGNFREDLEEPYLD--LNTVTRLDAMAKTGSLGDYLGLPTTL 132
FF P R LW ++ F+G R+ + +D + T+T + SL DY+GLPT
Sbjct 80 TFFFFVPNRLLWSNWQRFMGE-RDPDPDSSIDYTIPTMTSPNGGYAVNSLQDYMGLPTAG 138
Query 133 FGKFGTTLS 141
G+++S
Sbjct 139 QVDAGSSIS 147
>gi|609718276|emb|CDN73650.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=537
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 155/355 (44%), Gaps = 40/355 (11%)
Query 400 ISAYSFRAYEGIYNAY----------IRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIYE 449
++ F AY+ I++ + RD+ NP + + +P Y + +++
Sbjct 139 VNLLPFLAYQKIWDEFYRDENLIQPLFRDSNGNPVKMFNDGINDHNLPPYSKFTE--LFK 196
Query 450 LRYANWEKDFLTTAVQSPQQGTAPLVGIttytetvettSDDGTPVTRELSRIALVDEDGK 509
+R W D+ T+A+ Q+G A + I P+T E+ + +
Sbjct 197 MRKRAWHHDYFTSALPFAQKGNAVKIPIF---------PQGNVPLTYEMGSQTFIKDMAG 247
Query 510 KYQVSFD--SDSEG-LKGVSYVELDNEVKLRQPRNL-IDVVTSGIS-INDLRNVNAYQKF 564
+ D SD G L+ VS + L +NL +++ + +S +NDLR Q++
Sbjct 248 NPAPNKDLRSDVNGNLQDVS----GQPLSLDPSKNLKLNMASENVSTVNDLRRAFKLQEW 303
Query 565 LELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTY 624
LE N R G Y + I F VK L PEF GG I IS+ + Q S T
Sbjct 304 LEKNARAGSRYAESILSFFGVKTSDGRLQRPEFLGGNKSPI---MISEVLQQSATDSTTP 360
Query 625 AKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHY 684
+ GI G+ D G F +E V+G++ V P Y+Q +P+HF+ D++
Sbjct 361 QGNMAGH-GI-GIGKDGG--FSRFFEEHGYVIGLMSVIPKTSYSQGIPRHFSKSDKFDYF 416
Query 685 QPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFR 739
P+F HIG QP+ KE+ D D+ + VFGY + EY HG F+
Sbjct 417 WPQFEHIGEQPVYNKEI--FAKNIDAFDSEA-VFGYLPRYSEYKFSPSTVHGDFK 468
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 63/136 (46%), Gaps = 11/136 (8%)
Query 17 KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM 76
K + F+ S+ + G + P+ C+ V + INP+ + PM+ P+ + +
Sbjct 13 KSSTFNMSYDRKFSMNFGDLVPIHCQEVIPGDKISINPQHMTRLAPMIAPVMHEVNVFIH 72
Query 77 FFKYPLRALWDGYRDFIGNFREDLEEPYLDLNTVTRLDAM-AKTGSLGDYLGLPTTLFGK 135
+F P R +W + FI E LD + + R+ + GSL D+LGLP T G+
Sbjct 73 YFFVPNRIIWSNWEQFITG-----GESGLDQHLMPRVGNLPVSKGSLADHLGLPLTT-GR 126
Query 136 FGTTLSVATTGHIFGL 151
F +V G ++ L
Sbjct 127 F----AVGNAGVLYNL 138
>gi|495508923|ref|WP_008233569.1| capsid protein [Richelia intracellularis]
gi|471330407|emb|CCH67259.1| capsid protein [Richelia intracellularis HH01]
Length=230
Score = 93.6 bits (231), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/190 (34%), Positives = 90/190 (47%), Gaps = 11/190 (6%)
Query 551 SINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSI 610
+IN +RN Q+ LE + R G Y +I+ F V L PE+ GG S I ++ I
Sbjct 19 TINAIRNSFQIQRLLERDARGGTRYTEIVRSHFGVISPDARLQRPEYLGGGSAPITVNPI 78
Query 611 SQTVDQDLDGSQTYAKALGS-QSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQ 669
+QT + GS T ALG+ +G+A +G E +++G+ V Y Q
Sbjct 79 AQTSASTVTGSDTPLGALGAVGTGLA-----NGHGFSTSFTEHGVILGLASVRADLTYQQ 133
Query 670 LLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQ 729
L + ++ + D Y P F H+G Q +L KE+ Y DG S VFGY W EY
Sbjct 134 GLHRMWSRQTRYDFYFPVFAHLGEQAVLNKEI-----YCDGTANDSGVFGYQERWAEYRY 188
Query 730 KYDQAHGLFR 739
K Q GL R
Sbjct 189 KPSQVTGLMR 198
>gi|313766923|gb|ADR80650.1| putative major coat protein [Uncultured Microviridae]
Length=542
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 150/354 (42%), Gaps = 31/354 (9%)
Query 399 KISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDG-GAD--QNIYELRYANW 455
++SA FRAY I+N + RD +P G G D YEL
Sbjct 148 EMSALPFRAYNLIFNEWFRDQ-----------NLQDSVPVETGNGPDIPWTQYELLKRGK 196
Query 456 EKDFLTTAVQSPQQGTA---PLVGIttytetvettSDDGTPVTRELSRIALVDEDGKKYQ 512
D+ T+A+ PQ+GTA PL G+ T +G + D D Y
Sbjct 197 RHDYFTSALPWPQKGTAVSSPLTGLAPVTGIGRL---NGAAWDTGSQDVHETDNDVVTYA 253
Query 513 VSFDSDSEGLKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLELNMRKG 572
+ G ++++ + ++ V GI+IND R Q+ LE + R G
Sbjct 254 QNIGGTGGGNDKNFMMQVEGSGRPLVYADMAQAV-GGININDFRLAYQTQRLLERDARGG 312
Query 573 YSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQS 632
Y + ++ + V L PEF GG S + M +++QT Q G+QT A G +
Sbjct 313 TRYVETLKNHWGVTSPDFRLQRPEFLGGGSDRVNMQNVAQTTHQ---GTQTVYDAKGQEV 369
Query 633 GIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIG 692
I +G G + + F E +++G++ V Y+Q L ++++ R D Y P + IG
Sbjct 370 SIGEAQGTHGFS-KSFT-EHGVIIGLVNVRSDITYSQGLERYWSKRTRYDFYYPVLSQIG 427
Query 693 FQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRRPEQYEI 746
Q +L KE+ D +VFGY + EY K GLF++ +I
Sbjct 428 EQTVLNKEIYYQNTSQD-----QEVFGYQERYAEYRFKPSMVTGLFKQDRAVDI 476
Lambda K H a alpha
0.319 0.137 0.411 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 5871190766832