Bit-score color key: <40, 40-50, 50-80, 80-200, >200
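The color key above can be read as a binning rule for bit scores. The following is a minimal Python sketch, not part of the BLAST output. It assumes half-open bins (a score of exactly 40 falls in 40-50, and a score of 200 or more falls in the top bin), which the legend itself does not specify. The function name `score_bin` is hypothetical.

```python
def score_bin(bits):
    """Return the legend bin label for a bit score.

    Assumption: bins are half-open, [low, high). The legend does not
    say which bin a boundary value such as 40 or 200 belongs to.
    """
    if bits < 40:
        return "<40"
    if bits < 50:
        return "40-50"
    if bits < 80:
        return "50-80"
    if bits < 200:
        return "80-200"
    return ">200"


# Bit scores taken from the hit list in this report.
for bits in (59.7, 48.1, 40.8):
    print(bits, score_bin(bits))
```

Under this assumption every hit in this report falls in either the 40-50 or the 50-80 bin.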

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-11_CDS_annotation_glimmer3.pl_2_1
Length=119
                                                                  Score   E
Sequences producing significant alignments:                       (Bits)  Value
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein  59.7   6e-08
gi|496050829|ref|WP_008775336.1| hypothetical protein              58.5   1e-07
gi|490418709|ref|WP_004291032.1| hypothetical protein              57.8   3e-07
gi|575094354|emb|CDL65742.1| unnamed protein product               54.7   3e-06
gi|494822885|ref|WP_007558293.1| hypothetical protein              53.9   5e-06
gi|565841287|ref|WP_023924568.1| hypothetical protein              48.1   4e-04
gi|506223300|ref|WP_015743075.1| hypothetical protein              42.7   0.013
gi|494610271|ref|WP_007368517.1| capsid protein                    42.4   0.029
gi|494308783|ref|WP_007173938.1| hypothetical protein              41.2   0.075
gi|494306153|ref|WP_007173049.1| hypothetical protein              40.8   0.084
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 59.7 bits (143), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 60/123 (49%), Gaps = 15/123 (12%)
Query 7 LNLQNNPGRNVSGALGYNLRYWQWKSNIDTVHAGFRAGAAYQSWVAPL-DGW-------- 57
L++ P S +GY RY K++ID +H F SWV+PL D +
Sbjct 456 FGLEDLPSDPSSINMGYVPRYADLKTSIDEIHGSFIDTLV--SWVSPLTDSYISAYRQAC 513
Query 58 -NVLTSSGAWSYQSMKVRPQQLNSIFVPQVDSANCSVAFDQLLCNVNFQVYAVQNLDRNG 116
+ S +Y KV P +++IF + DS ++ DQLL N F + AV+N D NG
Sbjct 514 KDAGFSDITMTYNFFKVNPHIVDNIFGVKADS---TINTDQLLINSYFDIKAVRNFDYNG 570
Query 117 LPY 119
LPY
Sbjct 571 LPY 573
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 58.5 bits (140), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/133 (34%), Positives = 68/133 (51%), Gaps = 19/133 (14%)
Query 1 MQSVPSLNLQN--NPGRNV-SGALGYNLRYWQWKSNIDTVHAGFRAGAAYQSWVAPLDGW 57
M+SVP ++L N NV S LGY RY +K+++D+ F+ +SWV D
Sbjct 453 MESVPLVSLMNPLQSSYNVGSSILGYAPRYISYKTDVDSSVGAFKT--TLKSWVMSYDNQ 510
Query 58 NVLT----------SSGAW-SYQSMKVRPQQLNSIFVPQVDSANCSVAFDQLLCNVNFQV 106
+V+ S G +Y + KV P ++ +F +A+ S+ DQ LC+ F V
Sbjct 511 SVINQLNYQDDPNNSPGTLVNYTNFKVNPNCVDPLFAV---AASNSIDTDQFLCSSFFDV 567
Query 107 YAVQNLDRNGLPY 119
V+NLD +GLPY
Sbjct 568 KVVRNLDTDGLPY 580
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 57.8 bits (138), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 65/142 (46%), Gaps = 30/142 (21%)
Query 1 MQSVPSLNLQNNPGRNVSGA----LGYNLRYWQWKSNIDTVHAGFRAGAAYQSWVA---- 52
MQS+P + L N P R+ + A LGY RY +K+++D GF+ SWV
Sbjct 444 MQSMPLVQLMN-PLRSFANASGLVLGYVPRYIDYKTSVDQSVGGFKR--TLNSWVISYGN 500
Query 53 --------------PLDGWNVLTSSGAWSYQSMKVRPQQLNSIFVPQV-DSANCSVAFDQ 97
P++ + S ++ KV P L+ IF Q D N DQ
Sbjct 501 ISVLKQVTLPNDAPPIEPSEPVPSVAPMNFTFFKVNPDCLDPIFAVQAGDDTNT----DQ 556
Query 98 LLCNVNFQVYAVQNLDRNGLPY 119
LC+ F + AV+NLD +GLPY
Sbjct 557 FLCSSFFDIKAVRNLDTDGLPY 578
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 54.7 bits (130), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/136 (31%), Positives = 65/136 (48%), Gaps = 23/136 (17%)
Query 1 MQSVPSLNLQNNPGRNVSGA----LGYNLRYWQWKSNIDTVHAGFRAGAAYQSWVAPLDG 56
M+SVP + N + + + LGY RY WK+++D F + ++W P+ G
Sbjct 486 MESVPLVRAMNPVKESDTPSADTFLGYAPRYIDWKTSVDRSVGDF--ADSLRTWCLPV-G 542
Query 57 WNVLTSSGAWSYQS-------------MKVRPQQLNSIFVPQVDSANCSVAFDQLLCNVN 103
LTS+ + ++ S KV P ++ +F DS +V D+ LC+
Sbjct 543 DKELTSANSLNFPSNPNVEPDSIAAGFFKVNPSIVDPLFAVVADS---TVKTDEFLCSSF 599
Query 104 FQVYAVQNLDRNGLPY 119
F V V+NLD NGLPY
Sbjct 600 FDVKVVRNLDVNGLPY 615
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 53.9 bits (128), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/111 (32%), Positives = 55/111 (50%), Gaps = 17/111 (15%)
Query 21 LGYNLRYWQWKSNIDTVHAGFRAGAAYQSWVAPLDGWNVLTSSG----------AWSYQS 70
GY +Y+ WK+ +D FR + ++W+ P D +L + A S ++
Sbjct 508 FGYAPQYYNWKTTLDKSMGEFRR--SLKTWIIPFDDEALLAADSVDFPDNPNVEADSVKA 565
Query 71 --MKVRPQQLNSIFVPQVDSANCSVAFDQLLCNVNFQVYAVQNLDRNGLPY 119
KV P L+++F + AN + DQ LC+ F V V++LD NGLPY
Sbjct 566 GFFKVSPSVLDNLFAVK---ANSDLNTDQFLCSTLFDVNVVRSLDPNGLPY 613
>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens
CC14M]
Length=656
Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/97 (29%), Positives = 47/97 (48%), Gaps = 5/97 (5%)
Query 21 LGYNLRYWQWKSNIDTVHAGFRAGAAYQSWVAPLDGWNVLTSSGAWSYQSMKVRPQQLNS 80
LGY+ RY ++K+ D + F +G + +W P + + G S + V P+ L
Sbjct 562 LGYSARYLEYKTARDIIFGEFMSGGSLSAWATPKNNYTF--EFGKLSLPDLLVDPKVLEP 619
Query 81 IFVPQVDSANCSVAFDQLLCNVNFQVYAVQNLDRNGL 117
IF + N S++ DQ L N F V A++ + N +
Sbjct 620 IFAVKY---NGSMSTDQFLVNSYFDVKAIRPMQVNDM 653
>gi|506223300|ref|WP_015743075.1| hypothetical protein [Candidatus Methylomirabilis oxyfera]
gi|392373567|ref|YP_003205400.1| hypothetical protein DAMO_0481 [Candidatus Methylomirabilis oxyfera]
gi|258591260|emb|CBE67557.1| protein of unknown function [Candidatus Methylomirabilis oxyfera]
Length=234
Score = 42.7 bits (99), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/85 (31%), Positives = 43/85 (51%), Gaps = 1/85 (1%)
Query 33 NIDTVHAGFRAGAAYQSWVAPLDGWNVLTSSGAWSYQSMKVRPQQLNSIFVPQVDSANCS 92
+I+ V A R GAA+ + PL VL S A +++ K+R Q + V + +
Sbjct 86 SIENVIAAMRRGAAFDYLLKPLQDLTVLEVSVARAFEIRKLRAQAREAFQVGAIRELAVT 145
Query 93 VAFDQLLCNVNFQVYAVQNLDRNGL 117
A D++L +N +V+ L RNG+
Sbjct 146 -ASDRILNPLNIISLSVERLTRNGM 169
>gi|494610271|ref|WP_007368517.1| capsid protein [Prevotella multiformis]
gi|324988543|gb|EGC20506.1| putative capsid protein (F protein) [Prevotella multiformis DSM
16608]
Length=531
Score = 42.4 bits (98), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 30/104 (29%), Positives = 47/104 (45%), Gaps = 13/104 (13%)
Query 21 LGYNLRYWQWKSNIDTVHAGFRAGAAYQSWVAPLDGWNVLTSSG-------AWSYQSMKV 73
LG+ +RY ++K++ D V F +G + W +P + +G WS V
Sbjct 431 LGWQVRYNEYKTSRDLVFGEFESGLSLSYWCSPRYDFGFDGKAGDKKLVNSPWSPAHFYV 490
Query 74 RPQQLNSIFVPQVDSANCSVAFDQLLCNVNFQVYAVQNLDRNGL 117
P LN+IF+ +V D L N F V AV+ + +GL
Sbjct 491 NPSILNTIFLV------SAVKADHFLVNSFFDVKAVRPMSVSGL 528
>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=553
Score = 41.2 bits (95), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 46/99 (46%), Gaps = 7/99 (7%)
Query 21 LGYNLRYWQWKSNIDTVHAGFRAGAAYQSW-VAPLDGWNVLTSSGAWSYQSMKVRPQQLN 79
LGY RY ++K+ +D H F A SW V+ W T+ K+ P LN
Sbjct 459 LGYQPRYSEYKTALDVNHGQFAQSDALSSWSVSRFRRW---TTFPQLEIADFKIDPGCLN 515
Query 80 SIFVPQVDSANCSVAFDQLLCNVNFQVYAVQNLDRNGLP 118
SIF VD N + A D + NF + V ++ +G+P
Sbjct 516 SIF--PVD-YNGTEANDCVYGGCNFNIVKVSDMSVDGMP 551
>gi|494306153|ref|WP_007173049.1| hypothetical protein [Prevotella bergensis]
gi|270333881|gb|EFA44667.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=519
Score = 40.8 bits (94), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 7/100 (7%)
Query 20 ALGYNLRYWQWKSNIDTVHAGFRAGAAYQSW-VAPLDGWNVLTSSGAWSYQSMKVRPQQL 78
LGY RY ++K+ +D H F A SW V+ W T+ K+ P L
Sbjct 424 VLGYQPRYSEYKTALDINHGQFAQNDALSSWSVSRFRRW---TTFPQLEIADFKIDPGCL 480
Query 79 NSIFVPQVDSANCSVAFDQLLCNVNFQVYAVQNLDRNGLP 118
NS+F + N + + D + NF + V ++ +G+P
Sbjct 481 NSVFPVEF---NGTESTDCVFGGCNFNIVKVSDMSVDGMP 517
Lambda      K        H        a         alpha
   0.318    0.131    0.423    0.792     4.96
Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6
Effective search space used: 440495117073
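The gapped Lambda and K values reported above relate each raw score (the number in parentheses on a "Score =" line) to its bit score through the standard Karlin-Altschul normalization, S' = (Lambda * S - ln K) / ln 2. The following is a minimal Python sketch, not part of the BLAST output. It assumes the rounded gapped parameters printed in this report (Lambda = 0.267, K = 0.0410) apply to every hit. The function name `bit_score` is hypothetical.

```python
import math

# Gapped Karlin-Altschul parameters as printed in this report (rounded).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score.

    Uses S' = (lambda * S - ln K) / ln 2.
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, bit score reported by BLAST) pairs copied from this report.
for raw, reported in ((143, 59.7), (140, 58.5), (94, 40.8)):
    print(raw, round(bit_score(raw), 2), reported)
```

Because the printed parameters are rounded, the computed values are expected to agree with the reported bit scores only to about one decimal place.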