bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-8_CDS_annotation_glimmer3.pl_2_6
Length=290
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094326|emb|CDL65712.1| unnamed protein product 322 2e-100
gi|547920049|ref|WP_022322420.1| capsid protein VP1 124 1e-28
gi|649569140|gb|KDS75238.1| capsid family protein 121 4e-28
gi|649557305|gb|KDS63784.1| capsid family protein 118 6e-28
gi|649555287|gb|KDS61824.1| capsid family protein 120 2e-27
gi|492501782|ref|WP_005867318.1| hypothetical protein 120 2e-27
gi|639237429|ref|WP_024568106.1| hypothetical protein 112 1e-24
gi|609718276|emb|CDN73650.1| conserved hypothetical protein 108 3e-23
gi|495508923|ref|WP_008233569.1| capsid protein 97.4 2e-20
gi|530695351|gb|AGT39907.1| major capsid protein 99.4 4e-20
>gi|575094326|emb|CDL65712.1| unnamed protein product [uncultured bacterium]
Length=758
Score = 322 bits (825), Expect = 2e-100, Method: Compositional matrix adjust.
Identities = 162/290 (56%), Positives = 209/290 (72%), Gaps = 10/290 (3%)
Query 1 LKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLELNMRKGYSYRDIIEG 60
LKGV+Y L + ++L+ VTSGISIND RNVNAYQ++LELN +G+SY++IIEG
Sbjct 479 LKGVNYTPLKAGEAVNM-QSLVSPVTSGISINDFRNVNAYQRYLELNQFRGFSYKEIIEG 537
Query 61 RFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGIAGVRGDS 120
RF+V VRYD L MPE+ GG +RDI ++ I+QTV+ GS +Y +LGSQSG+A G++
Sbjct 538 RFDVNVRYDALNMPEYLGGITRDIVVNPITQTVETT--GSGSYVGSLGSQSGLATCFGNT 595
Query 121 GRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEV 180
++ FCDEESIVMGI+ V P+PVY LLPK TYR LD + PEF+HIG+QPI KE+
Sbjct 596 DGSISVFCDEESIVMGIMYVMPMPVYDSLLPKWLTYRERLDSFNPEFDHIGYQPIYAKEL 655
Query 181 CPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQKPQLAQSF 240
P+Q D D + VFGY RPWYEYV K D+AHGLF ++L NF+M R F+ P+L QSF
Sbjct 656 GPMQCVQDDIDP-NTVFGYQRPWYEYVAKPDRAHGLFLSSLRNFIMFRSFDNVPELGQSF 714
Query 241 LVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPISRVAIPRLD 290
V+ P V +VF+V TE++DKI GQI FDCTA+LPISRV +PRL+
Sbjct 715 TVMQPGSVNNVFSV------TEVSDKILGQIHFDCTAQLPISRVVVPRLE 758
>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553
Score = 124 bits (311), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 86/270 (32%), Positives = 128/270 (47%), Gaps = 22/270 (8%)
Query 22 IDVVTSGISINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFS 81
++V GI+INDLR NA Q++ E N R G Y + I F V+ L P+F GG
Sbjct 304 VNVDEMGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGR 363
Query 82 RDIEMHSISQTVDQDLDGSQTYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVT 141
I + + QT D Q G +GI + + +E ++GI+ +T
Sbjct 364 MPISVSEVLQTSSTDETSPQANMAGHGISAGI-------NNGFKHYFEEHGYIIGIMSIT 416
Query 142 PLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQPILYKE--VCPLQAYSDGPDTLSDVFGY 199
P Y Q +P+ FT +D Y PEF H+ Q I +E V AY++G FGY
Sbjct 417 PRSGYQQGVPRDFTKFDNMDFYFPEFAHLSEQEIKNQELFVSEDAAYNNG------TFGY 470
Query 200 NRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADD 259
+ EY +AHG FR NLS + ++R+F KP L +F+ P+ VFA ++ +D
Sbjct 471 TPRYAEYKYHPSEAHGDFRGNLSFWHLNRIFEDKPNLNTTFVECKPS--NRVFATSETED 528
Query 260 GTELADKIYGQIWFDCTAKLPISRVAIPRL 289
DK + Q++ D A + + P L
Sbjct 529 -----DKFWVQMYQDVKALRLMPKYGTPML 553
>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str.
3999B T(B) 6]
Length=390
Score = 121 bits (303), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 134/290 (46%), Gaps = 28/290 (10%)
Query 1 LKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLELNMRKGYSYRDIIEG 60
+KG + +E DN V ++ G++IND+R NA Q++ E N R G Y + I
Sbjct 128 IKGQALIEPDNFV--------VNTDQMGVNINDIRTSNALQRWFERNARSGSRYIEQILS 179
Query 61 RFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGIAGVRGDS 120
F V+ L P+F GG I + + QT D Q G +G+
Sbjct 180 HFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGV------- 232
Query 121 GRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEV 180
+ +E +MGI+ + P Y Q +PK F +D Y PEF H+G Q I +E+
Sbjct 233 NHGFTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEEL 292
Query 181 CPLQAYSDGPDTLSD-VFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQKPQLAQS 239
Y + D ++ FGY + EY ++ HG FR N++ + ++R+F +KP L +
Sbjct 293 -----YLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFRGNMAFWHLNRIFKEKPNLNTT 347
Query 240 FLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPISRVAIPRL 289
F+ +P+ VFA + D DK + QI+ D A + + P L
Sbjct 348 FVECNPS--NRVFATAETSD-----DKYWVQIYQDIKALRLMPKYGTPML 390
>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=245
Score = 118 bits (296), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 79/263 (30%), Positives = 123/263 (47%), Gaps = 20/263 (8%)
Query 28 GISINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMH 87
G++IND+R NA Q++ E N R G Y + I F V+ L P+F GG I +
Sbjct 2 GVNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVS 61
Query 88 SISQTVDQDLDGSQTYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYT 147
+ QT D Q G +G+ + +E +MGI+ + P Y
Sbjct 62 EVLQTSSTDSTSPQANMAGHGISAGV-------NHGFTRYFEEHGYIMGIMSIRPRTGYQ 114
Query 148 QLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSD-VFGYNRPWYEY 206
Q +PK F +D Y PEF H+G Q I +E+ Y + D ++ FGY + EY
Sbjct 115 QGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEEL-----YLNESDAANEGTFGYTPRYAEY 169
Query 207 VQKYDQAHGLFRTNLSNFLMHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELADK 266
++ HG FR N++ + ++R+F +KP L +F+ +P+ VFA + D DK
Sbjct 170 KYSQNEVHGDFRGNMAFWHLNRIFKEKPNLNTTFVECNPS--NRVFATAETSD-----DK 222
Query 267 IYGQIWFDCTAKLPISRVAIPRL 289
+ QI+ D A + + P L
Sbjct 223 YWVQIYQDIKALRLMPKYGTPML 245
>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=541
Score = 120 bits (301), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 134/290 (46%), Gaps = 28/290 (10%)
Query 1 LKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLELNMRKGYSYRDIIEG 60
+KG + +E DN V ++ G++IND+R NA Q++ E N R G Y + I
Sbjct 279 IKGQALIEPDNFV--------VNTDQMGVNINDIRTSNALQRWFERNARSGSRYIEQILS 330
Query 61 RFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGIAGVRGDS 120
F V+ L P+F GG I + + QT D Q G +G+
Sbjct 331 HFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGV------- 383
Query 121 GRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEV 180
+ +E +MGI+ + P Y Q +PK F +D Y PEF H+G Q I +E+
Sbjct 384 NHGFTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEEL 443
Query 181 CPLQAYSDGPDTLSD-VFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQKPQLAQS 239
Y + D ++ FGY + EY ++ HG FR N++ + ++R+F +KP L +
Sbjct 444 -----YLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFRGNMAFWHLNRIFKEKPNLNTT 498
Query 240 FLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPISRVAIPRL 289
F+ +P+ VFA + D DK + QI+ D A + + P L
Sbjct 499 FVECNPS--NRVFATAETSD-----DKYWVQIYQDIKALRLMPKYGTPML 541
>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis
CL09T03C24]
Length=538
Score = 120 bits (300), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 82/274 (30%), Positives = 129/274 (47%), Gaps = 19/274 (7%)
Query 17 QPRNL-IDVVTSGISINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPE 75
+P N ++V G+SINDLR NA Q++ E N R G Y + I F V+ L P+
Sbjct 283 EPDNFQVNVDELGVSINDLRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQ 342
Query 76 FFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGIAGVRGDSGRALECFCDEESIVM 135
F GG I + + QT D Q G +G+ + + +E ++
Sbjct 343 FLGGGRTPISVSEVLQTSATDSTSPQANMAGHGISAGV-------NHGFKRYFEEHGYII 395
Query 136 GILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSD 195
GI+ + P Y Q +PK F +D Y PEF H+G Q I +EV Q P + +
Sbjct 396 GIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEEVYLQQT----PASNNG 451
Query 196 VFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQKPQLAQSFLVIDPAQVTDVFAVT 255
FGY + EY ++ HG FR N++ + ++R+F++ P L +F+ +P+ VFA
Sbjct 452 TFGYTPRYAEYKYSMNEVHGDFRGNMAFWHLNRIFSESPNLNTTFVECNPSN--RVFATA 509
Query 256 KADDGTELADKIYGQIWFDCTAKLPISRVAIPRL 289
+ D DK + Q++ D A + + P L
Sbjct 510 ETSD-----DKYWIQLYQDVKALRLMPKYGTPML 538
>gi|639237429|ref|WP_024568106.1| hypothetical protein [Elizabethkingia anophelis]
Length=546
Score = 112 bits (281), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 129/283 (46%), Gaps = 20/283 (7%)
Query 11 NEVKLRQPRNL-IDVVT-SGISINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRY 68
N V + NL +D+ T SG +INDLR Q++LE N R G Y + I F VK
Sbjct 278 NPVNIDNSSNLGVDLKTASGSTINDLRRAFKLQEWLEKNARAGSRYAESILSFFGVKTSD 337
Query 69 DELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGIAGVRGDSGRALECFC 128
L PEF GG I + + Q D Q G G G F
Sbjct 338 GRLQRPEFLGGNKTPILISEVLQQSSTDSTTPQGNMAGHGISVGKEG-------GFSKFF 390
Query 129 DEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSD 188
+E V+G++ V P Y+Q +P+HF+ D++ P+F HIG QP+ KE+ A +
Sbjct 391 EEHGYVIGLMSVIPKTSYSQGIPRHFSKFDKFDYFWPQFEHIGEQPVYNKEIF---AKNV 447
Query 189 GPDTLSDVFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQK--PQLAQSFLVIDPA 246
G VFGY + EY HG F+ L + + R+F+ P+L + F+ ++ +
Sbjct 448 GDYDSGGVFGYVPRYSEYKYSPSTIHGDFKDTLYFWHLGRIFDSSAPPKLNRDFIEVNKS 507
Query 247 QVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPISRVAIPRL 289
++ +FAV + +DK Y ++ TAK +S P
Sbjct 508 GLSRIFAV------EDNSDKFYCHLYQKITAKRKMSYFGDPSF 544
>gi|609718276|emb|CDN73650.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=537
Score = 108 bits (271), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 135/281 (48%), Gaps = 20/281 (7%)
Query 13 VKLRQPRNL-IDVVTSGIS-INDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDE 70
+ L +NL +++ + +S +NDLR Q++LE N R G Y + I F VK
Sbjct 271 LSLDPSKNLKLNMASENVSTVNDLRRAFKLQEWLEKNARAGSRYAESILSFFGVKTSDGR 330
Query 71 LLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGIAGVRGDSGRALECFCDE 130
L PEF GG I IS+ + Q S T + GI G+ D G F +E
Sbjct 331 LQRPEFLGGNKSPI---MISEVLQQSATDSTTPQGNMAGH-GI-GIGKDGG--FSRFFEE 383
Query 131 ESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSDGP 190
V+G++ V P Y+Q +P+HF+ D++ P+F HIG QP+ KE+ D
Sbjct 384 HGYVIGLMSVIPKTSYSQGIPRHFSKSDKFDYFWPQFEHIGEQPVYNKEI--FAKNIDAF 441
Query 191 DTLSDVFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVF--NQKPQLAQSFLVIDPAQV 248
D+ + VFGY + EY HG F+ +L + + R+F ++ P L QSF+ D +
Sbjct 442 DSEA-VFGYLPRYSEYKFSPSTVHGDFKDDLYFWHLGRIFDTDKPPVLNQSFIECDKNAL 500
Query 249 TDVFAVTKADDGTELADKIYGQIWFDCTAKLPISRVAIPRL 289
+ +FAV DD DK Y ++ TAK +S P
Sbjct 501 SRIFAV--EDD----TDKFYCHLYQKITAKRKMSYFGDPSF 535
>gi|495508923|ref|WP_008233569.1| capsid protein [Richelia intracellularis]
gi|471330407|emb|CCH67259.1| capsid protein [Richelia intracellularis HH01]
Length=230
Score = 97.4 bits (241), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/197 (34%), Positives = 93/197 (47%), Gaps = 11/197 (6%)
Query 30 SINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSI 89
+IN +RN Q+ LE + R G Y +I+ F V L PE+ GG S I ++ I
Sbjct 19 TINAIRNSFQIQRLLERDARGGTRYTEIVRSHFGVISPDARLQRPEYLGGGSAPITVNPI 78
Query 90 SQTVDQDLDGSQTYAKALGS-QSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQ 148
+QT + GS T ALG+ +G+A +G E +++G+ V Y Q
Sbjct 79 AQTSASTVTGSDTPLGALGAVGTGLA-----NGHGFSTSFTEHGVILGLASVRADLTYQQ 133
Query 149 LLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQ 208
L + ++ + D Y P F H+G Q +L KE+ Y DG S VFGY W EY
Sbjct 134 GLHRMWSRQTRYDFYFPVFAHLGEQAVLNKEI-----YCDGTANDSGVFGYQERWAEYRY 188
Query 209 KYDQAHGLFRTNLSNFL 225
K Q GL R+ S L
Sbjct 189 KPSQVTGLMRSTSSGTL 205
>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539
Score = 99.4 bits (246), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 78/270 (29%), Positives = 117/270 (43%), Gaps = 20/270 (7%)
Query 25 VTSGISINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDI 84
+ +IN +R Q+ LE + R G Y +I+ F V + PE+ GG S I
Sbjct 281 TATAATINAIRQSFQIQRLLERDARGGTRYTEIVRSHFGVISPDARMQRPEYLGGGSAPI 340
Query 85 EMHSISQTVDQDLDGSQTYAKALGS-QSGIAGVRGDSGRALECFCDEESIVMGILIVTPL 143
++ ++Q G+ T LG+ +G+A SG E +V+G+ V
Sbjct 341 IVNPVAQQSASGASGTDTPLGTLGAVGTGLA-----SGHGFASSFTEHGVVVGLCSVRAD 395
Query 144 PVYTQLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPW 203
Y Q L + F+ D + P F+H+G QPIL KE+ Y+ G T DVFGY W
Sbjct 396 LTYQQGLHRMFSRSTRYDFFFPVFSHLGEQPILNKEL-----YATGTSTDDDVFGYQEAW 450
Query 204 YEYVQKYDQAHGLFRTNLSNFL----MHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADD 259
EY K Q GL R+ + L + + F P L +F + D V V AV +
Sbjct 451 AEYRYKPSQVTGLMRSTAAGTLDAWHLAQNFGSLPTLNSTF-IEDTPPVDRVVAVGSEAN 509
Query 260 GTELADKIYGQIWFDCTAKLPISRVAIPRL 289
G + + FD P+ ++P L
Sbjct 510 GQQFIFDAF----FDINMARPMPMYSVPGL 535
Lambda K H a alpha
0.322 0.141 0.422 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1498703630544