bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-7_CDS_annotation_glimmer3.pl_2_6
Length=648
Score E
Sequences producing significant alignments: (Bits) Value
gi|547312923|ref|WP_022044635.1| putative uncharacterized protein 82.0 1e-13
gi|492501782|ref|WP_005867318.1| hypothetical protein 76.3 2e-11
gi|649557305|gb|KDS63784.1| capsid family protein 69.7 5e-10
gi|547920049|ref|WP_022322420.1| capsid protein VP1 72.0 6e-10
gi|649569140|gb|KDS75238.1| capsid family protein 70.5 1e-09
gi|649555287|gb|KDS61824.1| capsid family protein 70.5 2e-09
gi|599088027|gb|AHN52939.1| major capsid protein 65.9 9e-09
gi|599087961|gb|AHN52906.1| major capsid protein 65.1 1e-08
gi|599087475|gb|AHN52663.1| major capsid protein 64.7 2e-08
gi|599088021|gb|AHN52936.1| major capsid protein 64.3 3e-08
>gi|547312923|ref|WP_022044635.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
gi|524208404|emb|CCZ76639.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
Length=338
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/330 (29%), Positives = 138/330 (42%), Gaps = 35/330 (11%)
Query 350 GLCLKTYNSDLLQNWINTEWIDGVNGISEVTAIDV---TDGKLTMDALNLQQKVYNMLNR 406
GL Y+ DL N I V I + A+D+ T + + L L+ K+ N ++R
Sbjct 11 GLLSVPYSPDLFGNIIKQGSSPAVE-IEVMNALDLNISTGFSVAVPELRLRTKIQNWMDR 69
Query 407 IAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIV---FQEVISNSASGEQP-LGT 462
+ VSGG D T++ + P F G I + + + SASGE LG
Sbjct 70 LFVSGGRVGDVFRTLWGTKSSAIYVNKPDFLGVWQASINPSNVRAMANGSASGEDANLGQ 129
Query 463 LAGRGYDTGKQKGGHVKIK--VTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLDDIHKPA 520
LA D GH I EP M I + P YSQG ++ D P
Sbjct 130 LAAC-VDRYCDFSGHSGIDYYAKEPGTFMLITMLVPEPAYSQGLHPDLASISFGDDFNPE 188
Query 521 LDGIGYQ----------------DSLNWQRAWWDDNRMEKNGRL----QPSAGKTVAWLN 560
L+GIG+Q L+ + + W + G L S G+ VAW
Sbjct 189 LNGIGFQLVPRHRFSMMPRGFNFTGLDQEASPWFGH--TGTGVLVDPNMVSVGEEVAWSW 246
Query 561 YMTNINRTFGNFAINDNEAFMVLNRNYE-MSPSAGTNETKIADLT-TYIDPVKYNYIFAE 618
T+ +R G+FA N N + VL R + P GT + + T TYI+P+ + Y+F +
Sbjct 247 LRTDYSRLHGDFAQNGNYQYWVLTRRFTTYFPDDGTGFYQDGEYTGTYINPLDWQYVFVD 306
Query 619 TNLDAMNFWVQTKFDIKVRRLISAKQIPNL 648
L A NF FD+ V +SA +P L
Sbjct 307 QTLMAGNFAYYGTFDLNVTSSLSANYMPYL 336
>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis
CL09T03C24]
Length=538
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 111/270 (41%), Gaps = 23/270 (9%)
Query 382 IDVTDGKLTMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMS 441
++V + ++++ L + R A SG Y + + + F + R + P F GG
Sbjct 289 VNVDELGVSINDLRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGR 348
Query 442 TEIVFQEVISNSASGE-QPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDY 500
T I EV+ SA+ P +AG G G G K E +I+GI SI PR Y
Sbjct 349 TPISVSEVLQTSATDSTSPQANMAGHGISAGVNHG--FKRYFEEHGYIIGIMSIRPRTGY 406
Query 501 SQGNEFYNEFLTLD--DIHKPALDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAW 558
QG +F D D + P +G Q+ N + + NG G T +
Sbjct 407 QQGVP--KDFRKFDNMDFYFPEFAHLGEQEIKN-EEVYLQQTPASNNGTF----GYTPRY 459
Query 559 LNYMTNINRTFGNFAINDNEAFMVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAE 618
Y ++N G+F N AF LNR + SP+ T T+++ N +FA
Sbjct 460 AEYKYSMNEVHGDF--RGNMAFWHLNRIFSESPNLNT---------TFVECNPSNRVFAT 508
Query 619 TNLDAMNFWVQTKFDIKVRRLISAKQIPNL 648
+W+Q D+K RL+ P L
Sbjct 509 AETSDDKYWIQLYQDVKALRLMPKYGTPML 538
>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=245
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 101/248 (41%), Gaps = 23/248 (9%)
Query 404 LNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEVISNSASGE-QPLGT 462
R A SG Y + + + F + R + P F GG T I EV+ S++ P
Sbjct 18 FERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQAN 77
Query 463 LAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLD--DIHKPA 520
+AG G G G E +IMGI SI PR Y QG +F D D + P
Sbjct 78 MAGHGISAGVNHG--FTRYFEEHGYIMGIMSIRPRTGYQQGVP--KDFRKFDNMDFYFPE 133
Query 521 LDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAWLNYMTNINRTFGNFAINDNEAF 580
+G Q+ N + + +++ G G T + Y + N G+F N AF
Sbjct 134 FAHLGEQEIKN-EELYLNESDAANEGTF----GYTPRYAEYKYSQNEVHGDF--RGNMAF 186
Query 581 MVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAETNLDAMNFWVQTKFDIKVRRLI 640
LNR ++ P+ T T+++ N +FA +WVQ DIK RL+
Sbjct 187 WHLNRIFKEKPNLNT---------TFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLM 237
Query 641 SAKQIPNL 648
P L
Sbjct 238 PKYGTPML 245
>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 72/270 (27%), Positives = 112/270 (41%), Gaps = 23/270 (9%)
Query 382 IDVTDGKLTMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMS 441
++V + + ++ L + R A G Y + + + F + R + P F GG
Sbjct 304 VNVDEMGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGR 363
Query 442 TEIVFQEVISNSASGE-QPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDY 500
I EV+ S++ E P +AG G G G K E +I+GI SITPR Y
Sbjct 364 MPISVSEVLQTSSTDETSPQANMAGHGISAGINNG--FKHYFEEHGYIIGIMSITPRSGY 421
Query 501 SQGNEFYNEFLTLD--DIHKPALDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAW 558
QG +F D D + P + Q+ N Q + ++ NG G T +
Sbjct 422 QQGVP--RDFTKFDNMDFYFPEFAHLSEQEIKN-QELFVSEDAAYNNGTF----GYTPRY 474
Query 559 LNYMTNINRTFGNFAINDNEAFMVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAE 618
Y + + G+F N +F LNR +E P+ T T+++ N +FA
Sbjct 475 AEYKYHPSEAHGDF--RGNLSFWHLNRIFEDKPNLNT---------TFVECKPSNRVFAT 523
Query 619 TNLDAMNFWVQTKFDIKVRRLISAKQIPNL 648
+ + FWVQ D+K RL+ P L
Sbjct 524 SETEDDKFWVQMYQDVKALRLMPKYGTPML 553
>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str.
3999B T(B) 6]
Length=390
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 101/248 (41%), Gaps = 23/248 (9%)
Query 404 LNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEVISNSASGE-QPLGT 462
R A SG Y + + + F + R + P F GG T I EV+ S++ P
Sbjct 163 FERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQAN 222
Query 463 LAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLD--DIHKPA 520
+AG G G G E +IMGI SI PR Y QG +F D D + P
Sbjct 223 MAGHGISAGVNHG--FTRYFEEHGYIMGIMSIRPRTGYQQGVP--KDFRKFDNMDFYFPE 278
Query 521 LDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAWLNYMTNINRTFGNFAINDNEAF 580
+G Q+ N + + +++ G G T + Y + N G+F N AF
Sbjct 279 FAHLGEQEIKN-EELYLNESDAANEGTF----GYTPRYAEYKYSQNEVHGDF--RGNMAF 331
Query 581 MVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAETNLDAMNFWVQTKFDIKVRRLI 640
LNR ++ P+ T T+++ N +FA +WVQ DIK RL+
Sbjct 332 WHLNRIFKEKPNLNT---------TFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLM 382
Query 641 SAKQIPNL 648
P L
Sbjct 383 PKYGTPML 390
>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=541
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 101/248 (41%), Gaps = 23/248 (9%)
Query 404 LNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEVISNSASGE-QPLGT 462
R A SG Y + + + F + R + P F GG T I EV+ S++ P
Sbjct 314 FERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQAN 373
Query 463 LAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLD--DIHKPA 520
+AG G G G E +IMGI SI PR Y QG +F D D + P
Sbjct 374 MAGHGISAGVNHG--FTRYFEEHGYIMGIMSIRPRTGYQQGVP--KDFRKFDNMDFYFPE 429
Query 521 LDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAWLNYMTNINRTFGNFAINDNEAF 580
+G Q+ N + + +++ G G T + Y + N G+F N AF
Sbjct 430 FAHLGEQEIKN-EELYLNESDAANEGTF----GYTPRYAEYKYSQNEVHGDF--RGNMAF 482
Query 581 MVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAETNLDAMNFWVQTKFDIKVRRLI 640
LNR ++ P+ T T+++ N +FA +WVQ DIK RL+
Sbjct 483 WHLNRIFKEKPNLNT---------TFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLM 533
Query 641 SAKQIPNL 648
P L
Sbjct 534 PKYGTPML 541
>gi|599088027|gb|AHN52939.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=219
Score = 65.9 bits (159), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 68/143 (48%), Gaps = 5/143 (3%)
Query 390 TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV 449
T++ L ++ +L R A SG Y + ++ F G N+M+ P F GG ST I V
Sbjct 77 TINQLRQAFQIQKLLERDARSGTRYAEIVKAHF-GVNFMDVTYRPEFLGGTSTPINVTSV 135
Query 450 ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN 508
S SG P GTLA G T GG TE +MGI S+ + Y QG N ++
Sbjct 136 PQTSESGTTPQGTLAAFG--TATVNGGGFTKSFTEHCIVMGIASVRADLTYQQGLNRMFS 193
Query 509 EFLTLDDIHKPALDGIGYQDSLN 531
T D + PAL IG Q LN
Sbjct 194 R-STRYDFYFPALAHIGEQAVLN 215
>gi|599087961|gb|AHN52906.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=210
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/145 (36%), Positives = 69/145 (48%), Gaps = 5/145 (3%)
Query 390 TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV 449
T++ L ++ +L R A SG Y + ++ F G N+M+ P F GG ST I V
Sbjct 68 TINQLRQAFQIQKLLERDARSGTRYSEIVKAHF-GVNFMDVTYRPEFLGGTSTPINVTSV 126
Query 450 ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN 508
S SG P GTLA G T GG TE +MGI S+ + Y QG N ++
Sbjct 127 PQTSESGTTPQGTLAAFG--TATINGGGFTKSFTEHCIVMGIASVRADLTYQQGLNRMFS 184
Query 509 EFLTLDDIHKPALDGIGYQDSLNWQ 533
T D + PAL IG Q LN +
Sbjct 185 R-STRYDFYFPALAHIGEQSVLNKE 208
>gi|599087475|gb|AHN52663.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=210
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/145 (36%), Positives = 69/145 (48%), Gaps = 5/145 (3%)
Query 390 TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV 449
T++ L ++ +L R A SG Y + ++ F G N+M+ P F GG ST I V
Sbjct 68 TINQLRQAFQIQKLLERDARSGTRYSEIVKAHF-GVNFMDVTYRPEFLGGTSTPINVTSV 126
Query 450 ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN 508
S SG P GTLA G T GG TE +MGI S+ + Y QG N ++
Sbjct 127 PQTSESGTTPQGTLAAFG--TATINGGGFTKSFTEHCILMGIASVRADLTYQQGLNRMFS 184
Query 509 EFLTLDDIHKPALDGIGYQDSLNWQ 533
T D + PAL IG Q LN +
Sbjct 185 R-STRYDFYFPALAHIGEQSVLNKE 208
>gi|599088021|gb|AHN52936.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=220
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 69/145 (48%), Gaps = 5/145 (3%)
Query 390 TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV 449
T++ L ++ +L R A SG Y + ++ F G N+M+ P F GG ST + V
Sbjct 78 TINQLRQAFQIQKLLERDARSGTRYSEIVKAHF-GVNFMDVTYRPEFLGGSSTPVNVTSV 136
Query 450 ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN 508
S SG P GTLA G T GG TE +MGI S+ + Y QG N ++
Sbjct 137 PQTSESGTTPQGTLAAFG--TATINGGGFTKSFTEHCIVMGIASVRADLTYQQGLNRMFS 194
Query 509 EFLTLDDIHKPALDGIGYQDSLNWQ 533
T D + PAL IG Q LN +
Sbjct 195 R-STRYDFYFPALAHIGEQSVLNKE 218
Lambda K H a alpha
0.315 0.132 0.393 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 4913515649712