bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-2_CDS_annotation_glimmer3.pl_2_7
Length=155
Score E
Sequences producing significant alignments: (Bits) Value
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 111 1e-25
gi|575094354|emb|CDL65742.1| unnamed protein product 108 3e-24
gi|494822885|ref|WP_007558293.1| hypothetical protein 102 3e-22
gi|496050829|ref|WP_008775336.1| hypothetical protein 101 5e-22
gi|490418709|ref|WP_004291032.1| hypothetical protein 87.4 4e-17
gi|575094321|emb|CDL65708.1| unnamed protein product 73.6 3e-12
gi|517172762|ref|WP_018361580.1| hypothetical protein 65.1 2e-09
gi|565841287|ref|WP_023924568.1| hypothetical protein 64.3 4e-09
gi|496521299|ref|WP_009229582.1| capsid protein 57.4 6e-07
gi|490477384|ref|WP_004347761.1| capsid protein 55.1 3e-06
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 111 bits (278), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/159 (40%), Positives = 95/159 (60%), Gaps = 6/159 (4%)
Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAV-PAITLFNSNAFDNDLESDFDFLGYNP 59
LD++I+ Q T+ D IPEFD++GM+ + P+ +F +D S +GY P
Sbjct 417 LDWSINRIARQNFKTTFTDYAIPEFDSVGMQQLYPSEMIFGLEDLPSDPSSIN--MGYVP 474
Query 60 RYWPWKSKIDRVHGAFLTTLKDWVAPIDDFYLNRWFAS---GGSSQASISWPFFKVNPNT 116
RY K+ ID +HG+F+ TL WV+P+ D Y++ + + G S ++++ FFKVNP+
Sbjct 475 RYADLKTSIDEIHGSFIDTLVSWVSPLTDSYISAYRQACKDAGFSDITMTYNFFKVNPHI 534
Query 117 LDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
+D+IF V ADST +DQLLIN K VR +G+PY
Sbjct 535 VDNIFGVKADSTINTDQLLINSYFDIKAVRNFDYNGLPY 573
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 108 bits (269), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 65/160 (41%), Positives = 87/160 (54%), Gaps = 6/160 (4%)
Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNSNAFDNDLESDFDFLGYNPR 60
+DY SG D PIPE D IGME+VP + N ++D S FLGY PR
Sbjct 457 VDYVGSGVDHSCTLVDATSFPIPELDQIGMESVPLVRAMNP-VKESDTPSADTFLGYAPR 515
Query 61 YWPWKSKIDRVHGAFLTTLKDWVAPIDDFYLNRW----FASGGSSQA-SISWPFFKVNPN 115
Y WK+ +DR G F +L+ W P+ D L F S + + SI+ FFKVNP+
Sbjct 516 YIDWKTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNFPSNPNVEPDSIAAGFFKVNPS 575
Query 116 TLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
+D +FAV ADST ++D+ L + KVVR L +G+PY
Sbjct 576 IVDPLFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY 615
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 102 bits (254), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/162 (40%), Positives = 88/162 (54%), Gaps = 7/162 (4%)
Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNS-NAFDNDLESDFD-FLGYN 58
LDY S T+V D PIPEFD IGME VP I N D D + + + GY
Sbjct 452 LDYITSAPHFGTTLTNVLDFPIPEFDKIGMEQVPVIRGLNPVKPKDGDFKVSPNLYFGYA 511
Query 59 PRYWPWKSKIDRVHGAFLTTLKDWVAPIDDFYL----NRWFASGGSSQA-SISWPFFKVN 113
P+Y+ WK+ +D+ G F +LK W+ P DD L + F + +A S+ FFKV+
Sbjct 512 PQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLAADSVDFPDNPNVEADSVKAGFFKVS 571
Query 114 PNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
P+ LD++FAV A+S +DQ L + VVR L +G+PY
Sbjct 572 PSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSLDPNGLPY 613
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 101 bits (252), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 63/163 (39%), Positives = 87/163 (53%), Gaps = 14/163 (9%)
Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNSNAFDNDLESDFD----FLG 56
LDY + + D IPEFD +GME+VP ++L N L+S ++ LG
Sbjct 424 LDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLMNP------LQSSYNVGSSILG 477
Query 57 YNPRYWPWKSKIDRVHGAFLTTLKDWVAPIDDF----YLNRWFASGGSSQASISWPFFKV 112
Y PRY +K+ +D GAF TTLK WV D+ LN S +++ FKV
Sbjct 478 YAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNYQDDPNNSPGTLVNYTNFKV 537
Query 113 NPNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
NPN +D +FAVAA ++ ++DQ L + KVVR L DG+PY
Sbjct 538 NPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLPY 580
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 87.4 bits (215), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 58/167 (35%), Positives = 81/167 (49%), Gaps = 15/167 (9%)
Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNS-NAFDNDLESDFDFLGYNP 59
LDY D L + D IPEFD +GM+++P + L N +F N + LGY P
Sbjct 415 LDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLMNPLRSFAN---ASGLVLGYVP 471
Query 60 RYWPWKSKIDRVHGAFLTTLKDWVAPIDDFYLNRWF-----------ASGGSSQASISWP 108
RY +K+ +D+ G F TL WV + + + + S A +++
Sbjct 472 RYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQVTLPNDAPPIEPSEPVPSVAPMNFT 531
Query 109 FFKVNPNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
FFKVNP+ LD IFAV A +DQ L + K VR L DG+PY
Sbjct 532 FFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY 578
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 73.6 bits (179), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/168 (35%), Positives = 79/168 (47%), Gaps = 18/168 (11%)
Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGME-------AVPAITLFNSNAFDNDLESDFD 53
LD+A G D L T D IPE D+IGM+ A PA AF S D
Sbjct 478 LDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAPAPYNDEFKAFRVGDGSSPD 537
Query 54 F---LGYNPRYWPWKSKIDRVHGAFLTTLKDWVAPI--DDFYLNRWFASGGSSQASISWP 108
GY PRY +K+ DR +GAF +LK WV I D N W ++ A I+ P
Sbjct 538 MSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQNNVW-----NTWAGINAP 592
Query 109 -FFKVNPNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
F P+ + ++F V++ + + DQL + C R LS+ G+PY
Sbjct 593 NMFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPY 640
>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568
Score = 65.1 bits (157), Expect = 2e-09, Method: Composition-based stats.
Identities = 42/139 (30%), Positives = 74/139 (53%), Gaps = 12/139 (9%)
Query 19 DLPIPEFDNIGMEAVPAITL---FNSNAFDNDLESDFDFLGYNPRYWPWKSKIDRVHGAF 75
D +PEF+N+GM+ + A + +N+N ++ ++ + G+ PRY +K+ +D HG F
Sbjct 437 DFFVPEFENLGMQPLFAKNISYKYNNNTANSRIK-NLGAFGWQPRYSEYKTALDINHGQF 495
Query 76 LTTLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWESDQLL 135
+ P+ + + R + G S ++ + FK+NP LD +FAV + T +DQ+
Sbjct 496 VHQ-----EPLSYWTVAR---ARGESMSNFNISTFKINPKWLDDVFAVNYNGTELTDQVF 547
Query 136 INCDVSCKVVRPLSQDGMP 154
C + V +S DGMP
Sbjct 548 GGCYFNIVKVSDMSIDGMP 566
>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens
CC14M]
Length=656
Score = 64.3 bits (155), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 72/143 (50%), Gaps = 15/143 (10%)
Query 16 SVEDLPIPEFDNIGMEAVPAITL---FNSNAFDNDLESDFDFLGYNPRYWPWKSKIDRVH 72
S ED PEF+N+GM+ V L NS D+ + + + LGY+ RY +K+ D +
Sbjct 521 SREDYFQPEFENLGMQPVIQSDLCLCINSAKSDSSDQHN-NVLGYSARYLEYKTARDIIF 579
Query 73 GAFLT--TLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWE 130
G F++ +L W P +++ F G +S P V+P L+ IFAV + +
Sbjct 580 GEFMSGGSLSAWATPKNNY----TFEFG-----KLSLPDLLVDPKVLEPIFAVKYNGSMS 630
Query 131 SDQLLINCDVSCKVVRPLSQDGM 153
+DQ L+N K +RP+ + M
Sbjct 631 TDQFLVNSYFDVKAIRPMQVNDM 653
>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon
317 str. F0108]
Length=541
Score = 57.4 bits (137), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 67/137 (49%), Gaps = 16/137 (12%)
Query 19 DLPIPEFDNIGME-AVPAITLFNSNAFDNDLESDFDFLGYNPRYWPWKSKIDRVHGAFLT 77
D IPEF+N+GM+ VPA N A DN G+ PRY +K+ D HG F
Sbjct 418 DYFIPEFENLGMQPIVPAFVSLN-RAKDNSY-------GWQPRYSEYKTAFDINHGQFAN 469
Query 78 TLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWESDQLLIN 137
P+ + + R A G + + + K+NP+ LDS+FAV + T +D +
Sbjct 470 G-----EPLSYWSIAR--ARGSDTLNTFNVAALKINPHWLDSVFAVNYNGTEVTDCMFGY 522
Query 138 CDVSCKVVRPLSQDGMP 154
+ + V +++DGMP
Sbjct 523 AHFNIEKVSDMTEDGMP 539
>gi|490477384|ref|WP_004347761.1| capsid protein [Prevotella buccalis]
gi|281300712|gb|EFA93043.1| putative capsid protein (F protein) [Prevotella buccalis ATCC
35310]
Length=552
Score = 55.1 bits (131), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 69/140 (49%), Gaps = 15/140 (11%)
Query 16 SVEDLPIPEFDNIGMEAVPAITLFNSNAFDNDLESDFD-FLGYNPRYWPWKSKIDRVHGA 74
S D +PEF+++GM+ P T + S D+ + + F G+ PRY +K+ +D HG
Sbjct 425 SRGDFFMPEFEDLGMQ--PLQTRYIS-----DIRTQTEKFKGWQPRYSEYKTSLDINHGQ 477
Query 75 FLTTLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWESDQL 134
F P+ + + R A G + + K+NP LDSIFAV + T +D +
Sbjct 478 FANG-----QPLSYWTVGRGRA--GETLETFDIASLKINPKWLDSIFAVNYNGTQITDCV 530
Query 135 LINCDVSCKVVRPLSQDGMP 154
C + + V +S++G P
Sbjct 531 FGGCQFNVQKVSDMSENGEP 550
Lambda K H a alpha
0.320 0.137 0.444 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 435859188405