bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-26_CDS_annotation_glimmer3.pl_2_7
Length=173
Score E
Sequences producing significant alignments: (Bits) Value
gi|492501782|ref|WP_005867318.1| hypothetical protein 71.6 2e-11
gi|649557305|gb|KDS63784.1| capsid family protein 68.9 2e-11
gi|649569140|gb|KDS75238.1| capsid family protein 68.9 7e-11
gi|649555287|gb|KDS61824.1| capsid family protein 68.9 1e-10
gi|547920049|ref|WP_022322420.1| capsid protein VP1 68.2 2e-10
gi|647452987|ref|WP_025792807.1| hypothetical protein 53.5 2e-05
gi|494610271|ref|WP_007368517.1| capsid protein 50.8 1e-04
gi|496521299|ref|WP_009229582.1| capsid protein 44.3 0.015
gi|565841287|ref|WP_023924568.1| hypothetical protein 43.9 0.022
gi|547312923|ref|WP_022044635.1| putative uncharacterized protein 42.7 0.045
>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis
CL09T03C24]
Length=538
Score = 71.6 bits (174), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/168 (33%), Positives = 82/168 (49%), Gaps = 13/168 (8%)
Query 7 GLKIKCTEPCMIMALGSITPRIDYSQG-NKWWTRLQNMDDFHKPTLDAIGFQELIaeeaa 65
G K E I+ + SI PR Y QG K + + NMD F+ P +G QE+ EE
Sbjct 383 GFKRYFEEHGYIIGIMSIRPRTGYQQGVPKDFRKFDNMD-FYFPEFAHLGEQEIKNEEVY 441
Query 66 awsteatGNHELVYQSLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEENSDHTI 125
T A+ N + G P + EY +NE +GDF M AF LNR++ E+ +
Sbjct 442 LQQTPASNN-----GTFGYTPRYAEYKYSMNEVHGDFRGNM--AFWHLNRIFSESPNLN- 493
Query 126 ANASTYIDPTIYNKIFAESRLSSQNFWVQVAFDVTARRVMSAKQIPNL 173
+T+++ N++FA + S +W+Q+ DV A R+M P L
Sbjct 494 ---TTFVECNPSNRVFATAETSDDKYWIQLYQDVKALRLMPKYGTPML 538
>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=245
Score = 68.9 bits (167), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/168 (32%), Positives = 77/168 (46%), Gaps = 13/168 (8%)
Query 7 GLKIKCTEPCMIMALGSITPRIDYSQG-NKWWTRLQNMDDFHKPTLDAIGFQELIaeeaa 65
G E IM + SI PR Y QG K + + NMD F+ P +G QE I E
Sbjct 90 GFTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMD-FYFPEFAHLGEQE-IKNEEL 147
Query 66 awsteatGNHELVYQSLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEENSDHTI 125
+ N + G P + EY NE +GDF M AF LNR+++E +
Sbjct 148 YLNESDAANE----GTFGYTPRYAEYKYSQNEVHGDFRGNM--AFWHLNRIFKEKPNLN- 200
Query 126 ANASTYIDPTIYNKIFAESRLSSQNFWVQVAFDVTARRVMSAKQIPNL 173
+T+++ N++FA + S +WVQ+ D+ A R+M P L
Sbjct 201 ---TTFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLMPKYGTPML 245
>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str.
3999B T(B) 6]
Length=390
Score = 68.9 bits (167), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 76/161 (47%), Gaps = 13/161 (8%)
Query 14 EPCMIMALGSITPRIDYSQG-NKWWTRLQNMDDFHKPTLDAIGFQELIaeeaaawsteat 72
E IM + SI PR Y QG K + + NMD F+ P +G QE I E +
Sbjct 242 EHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMD-FYFPEFAHLGEQE-IKNEELYLNESDA 299
Query 73 GNHELVYQSLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEENSDHTIANASTYI 132
N + G P + EY NE +GDF M AF LNR+++E + +T++
Sbjct 300 ANE----GTFGYTPRYAEYKYSQNEVHGDFRGNM--AFWHLNRIFKEKPNLN----TTFV 349
Query 133 DPTIYNKIFAESRLSSQNFWVQVAFDVTARRVMSAKQIPNL 173
+ N++FA + S +WVQ+ D+ A R+M P L
Sbjct 350 ECNPSNRVFATAETSDDKYWVQIYQDIKALRLMPKYGTPML 390
>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=541
Score = 68.9 bits (167), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 76/161 (47%), Gaps = 13/161 (8%)
Query 14 EPCMIMALGSITPRIDYSQG-NKWWTRLQNMDDFHKPTLDAIGFQELIaeeaaawsteat 72
E IM + SI PR Y QG K + + NMD F+ P +G QE I E +
Sbjct 393 EHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMD-FYFPEFAHLGEQE-IKNEELYLNESDA 450
Query 73 GNHELVYQSLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEENSDHTIANASTYI 132
N + G P + EY NE +GDF M AF LNR+++E + +T++
Sbjct 451 ANE----GTFGYTPRYAEYKYSQNEVHGDFRGNM--AFWHLNRIFKEKPNLN----TTFV 500
Query 133 DPTIYNKIFAESRLSSQNFWVQVAFDVTARRVMSAKQIPNL 173
+ N++FA + S +WVQ+ D+ A R+M P L
Sbjct 501 ECNPSNRVFATAETSDDKYWVQIYQDIKALRLMPKYGTPML 541
>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553
Score = 68.2 bits (165), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/168 (32%), Positives = 79/168 (47%), Gaps = 13/168 (8%)
Query 7 GLKIKCTEPCMIMALGSITPRIDYSQG-NKWWTRLQNMDDFHKPTLDAIGFQELIaeeaa 65
G K E I+ + SITPR Y QG + +T+ NMD F+ P + QE+ +E
Sbjct 398 GFKHYFEEHGYIIGIMSITPRSGYQQGVPRDFTKFDNMD-FYFPEFAHLSEQEIKNQELF 456
Query 66 awsteatGNHELVYQSLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEENSDHTI 125
A N + G P + EY +E +GDF L+F LNR++E+ +
Sbjct 457 VSEDAAYNN-----GTFGYTPRYAEYKYHPSEAHGDFRGN--LSFWHLNRIFEDKPNLN- 508
Query 126 ANASTYIDPTIYNKIFAESRLSSQNFWVQVAFDVTARRVMSAKQIPNL 173
+T+++ N++FA S FWVQ+ DV A R+M P L
Sbjct 509 ---TTFVECKPSNRVFATSETEDDKFWVQMYQDVKALRLMPKYGTPML 553
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 53.5 bits (127), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/198 (25%), Positives = 83/198 (42%), Gaps = 36/198 (18%)
Query 8 LKIKCTEPCMIMALGSITPRIDYSQG-----NKWWTRLQNMDDFHKPTLDAIGFQELI-- 60
++ TE +IM + S+ P+ +Y+ N+ TR Q F++P +G+Q LI
Sbjct 391 IEFDSTEHGIIMCIYSVAPQSEYNASYLDPFNRKLTREQ----FYQPEFADLGYQALIGS 446
Query 61 ----aeeaaawsteatGNHELVYQSLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRV 116
+ + EL LG Q + EY T + +GDF +G L++ C R
Sbjct 447 DLICSTLGMNEKQAGFSDIELNNNLLGYQVRYNEYKTARDLVFGDFESGKSLSYWCTPRF 506
Query 117 --------------------YEENSDHTI-ANASTYIDPTIYNKIFAESRLSSQNFWVQV 155
Y + + + ++ + YI+P + N IF S + + +F V
Sbjct 507 DFGYGDTEKKIAPENKGGADYRKKGNRSHWSSRNFYINPNLVNPIFLTSAVQADHFIVNS 566
Query 156 AFDVTARRVMSAKQIPNL 173
DV A R MS + +L
Sbjct 567 FLDVKAVRPMSVTGLSSL 584
>gi|494610271|ref|WP_007368517.1| capsid protein [Prevotella multiformis]
gi|324988543|gb|EGC20506.1| putative capsid protein (F protein) [Prevotella multiformis DSM
16608]
Length=531
Score = 50.8 bits (120), Expect = 1e-04, Method: Composition-based stats.
Identities = 47/189 (25%), Positives = 84/189 (44%), Gaps = 31/189 (16%)
Query 14 EPCMIMALGSITPRIDYSQGNKW--WTRLQNMDDFHKPTLDAIGFQELIaeeaaawstea 71
E +IM + S+ P+ +Y+ G + + R +DF +P +G+Q ++ + + +
Sbjct 345 EHGIIMCIYSVVPQTEYN-GTYFDPFNRKLRREDFFQPEFADLGYQPVVTSDLISTYLDN 403
Query 72 -----------------tGNHELVYQSLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLN 114
+ E + LG Q + EY T + +G+F +G+ L++ C
Sbjct 404 PVPDGPEKQKRLAAGYPLSSIEANNRLLGWQVRYNEYKTSRDLVFGEFESGLSLSYWCSP 463
Query 115 RVYE-----ENSDHTIAN-----ASTYIDPTIYNKIFAESRLSSQNFWVQVAFDVTARRV 164
R Y+ + D + N A Y++P+I N IF S + + +F V FDV A R
Sbjct 464 R-YDFGFDGKAGDKKLVNSPWSPAHFYVNPSILNTIFLVSAVKADHFLVNSFFDVKAVRP 522
Query 165 MSAKQIPNL 173
MS + L
Sbjct 523 MSVSGLAGL 531
>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon
317 str. F0108]
Length=541
Score = 44.3 bits (103), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 34/141 (24%), Positives = 62/141 (44%), Gaps = 10/141 (7%)
Query 4 SGRG-LKIKCTEPCMIMALGSITPRIDYS-QGNKWWTRLQNMDDFHKPTLDAIGFQELIa 61
SG G ++ EP ++M + S+ P + Y + Q D+ P + +G Q ++
Sbjct 375 SGYGEIQFDAKEPGVLMCIYSVVPAMQYDCMRLDPFVAKQTRGDYFIPEFENLGMQPIVP 434
Query 62 eeaaawsteatGNHELVYQSLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEENS 121
+ + S G QP + EY T + +G FA G PL++ + R ++
Sbjct 435 AFVSLNRAKD--------NSYGWQPRYSEYKTAFDINHGQFANGEPLSYWSIARARGSDT 486
Query 122 DHTIANASTYIDPTIYNKIFA 142
+T A+ I+P + +FA
Sbjct 487 LNTFNVAALKINPHWLDSVFA 507
>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens
CC14M]
Length=656
Score = 43.9 bits (102), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 38/162 (23%), Positives = 71/162 (44%), Gaps = 5/162 (3%)
Query 14 EPCMIMALGSITPRIDY-SQGNKWWTRLQNMDDFHKPTLDAIGFQELIaeeaaawsteat 72
E +IM + SI P++DY ++ + R + +D+ +P + +G Q +I + A
Sbjct 492 EHGLIMCIYSIAPQVDYDARELDPFNRKFSREDYFQPEFENLGMQPVIQSDLCLCINSAK 551
Query 73 GNHELVYQS-LGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEENSDHTIANASTY 131
+ + + LG ++EY T + +G+F +G L+ + ++
Sbjct 552 SDSSDQHNNVLGYSARYLEYKTARDIIFGEFMSGGSLSAWATPKNNYTFEFGKLSLPDLL 611
Query 132 IDPTIYNKIFA---ESRLSSQNFWVQVAFDVTARRVMSAKQI 170
+DP + IFA +S+ F V FDV A R M +
Sbjct 612 VDPKVLEPIFAVKYNGSMSTDQFLVNSYFDVKAIRPMQVNDM 653
>gi|547312923|ref|WP_022044635.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
gi|524208404|emb|CCZ76639.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
Length=338
Score = 42.7 bits (99), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 48/197 (24%), Positives = 77/197 (39%), Gaps = 35/197 (18%)
Query 7 GLKIKCTEPCMIMALGSITPRIDYSQG-NKWWTRLQNMDDFHKPTLDAIGFQELIaeeaa 65
G+ EP M + + P YSQG + + DDF+ P L+ IGFQ + +
Sbjct 145 GIDYYAKEPGTFMLITMLVPEPAYSQGLHPDLASISFGDDFN-PELNGIGFQLVPRHRFS 203
Query 66 awste---------------atGNHELV---YQSLGKQPSWIEYTTDVNETYGDFAAGMP 107
TG LV S+G++ +W TD + +GDFA
Sbjct 204 MMPRGFNFTGLDQEASPWFGHTGTGVLVDPNMVSVGEEVAWSWLRTDYSRLHGDFAQNGN 263
Query 108 LAFMCLNRVYE-----------ENSDHTIANASTYIDPTIYNKIFAESRLSSQNFWVQVA 156
+ L R + ++ ++T TYI+P + +F + L + NF
Sbjct 264 YQYWVLTRRFTTYFPDDGTGFYQDGEYT----GTYINPLDWQYVFVDQTLMAGNFAYYGT 319
Query 157 FDVTARRVMSAKQIPNL 173
FD+ +SA +P L
Sbjct 320 FDLNVTSSLSANYMPYL 336
Lambda K H a alpha
0.320 0.134 0.416 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 428836147623