bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-34_CDS_annotation_glimmer3.pl_2_1
Length=299
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094321|emb|CDL65708.1| unnamed protein product 246 1e-72
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 155 1e-39
gi|490418709|ref|WP_004291032.1| hypothetical protein 119 1e-26
gi|575094354|emb|CDL65742.1| unnamed protein product 119 1e-26
gi|496050829|ref|WP_008775336.1| hypothetical protein 116 1e-25
gi|494822885|ref|WP_007558293.1| hypothetical protein 104 1e-21
gi|647452987|ref|WP_025792807.1| hypothetical protein 93.6 6e-18
gi|565841287|ref|WP_023924568.1| hypothetical protein 88.6 3e-16
gi|649557305|gb|KDS63784.1| capsid family protein 76.6 3e-13
gi|575094297|emb|CDL65693.1| unnamed protein product 78.2 6e-13
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 246 bits (628), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 135/292 (46%), Positives = 179/292 (61%), Gaps = 17/292 (6%)
Query 16 FSGKLTADASMK----ISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIKPKVDSRVSV 71
FSG + + S+ I ALR+A A QKYKEIQ +ND DF +QV AHFGIKP + S+
Sbjct 360 FSGNVAINTSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQVEAHFGIKPDEKNENSL 419
Query 72 FIGGDDKTLSINPQVNTNFQNGGEPEIKAIGIGDLSAGCKFTSTTYGMIIGIYRAIPQLD 131
FIGG ++IN Q+N N + A G+ SA KFT+ TYG++IGIYR P LD
Sbjct 420 FIGGSSSMININEQINQNLSGDNKATYGAAPQGNGSASIKFTAKTYGVVIGIYRCTPVLD 479
Query 132 YSHVGIDRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPY--GYQTSPLD 189
++H+GIDR LFKTDASDF IPE+DSIGMQ +RCE++AP ND + G +SP D
Sbjct 480 FAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAP-APYNDEFKAFRVGDGSSP-D 537
Query 190 MSVTYGYSPRYAELKSARDYYEGGFCGTYSTWVTG--YDESFLSGWRRNRGSVSVTDYDS 247
MS TYGY+PRY+E K++ D Y G FC + +WVTG +D + W G +
Sbjct 538 MSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQNNVWNTWAGI-------N 590
Query 248 IEDLFKCRASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGLPYSK 299
++F CR ++ +F+ + +DD+L +G VN C A R S YGLPYS
Sbjct 591 APNMFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPYSN 642
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 155 bits (393), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 99/277 (36%), Positives = 142/277 (51%), Gaps = 15/277 (5%)
Query 24 ASMKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIKPKVD-SRVSVFIGGDDKTLSI 82
A + + ALR A LQK++EI S D+ Q+ HF + P S ++GG L I
Sbjct 309 AGLSVLALRQAECLQKWREIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDI 368
Query 83 NPQVNTNFQNGGEPEIKAIGIGDLSAG-CKFTSTTYGMIIGIYRAIPQLDYSHVGIDRNL 141
+ VNTN + +I+ G G L+ F S+ +G+I+ IY +P LD+S I R
Sbjct 369 SEVVNTNLTGDNQADIQGKGTGTLNGNKVDFESSEHGIIMCIYHCLPLLDWSINRIARQN 428
Query 142 FKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLD-MSVTYGYSPRY 200
FKT +D+ IPE DS+GMQ Y E+ + GL DL P D S+ GY PRY
Sbjct 429 FKTTFTDYAIPEFDSVGMQQLYPSEM---IFGLEDL---------PSDPSSINMGYVPRY 476
Query 201 AELKSARDYYEGGFCGTYSTWVTGYDESFLSGWRRNRGSVSVTDYDSIEDLFKCRASLLY 260
A+LK++ D G F T +WV+ +S++S +R+ +D + FK ++
Sbjct 477 ADLKTSIDEIHGSFIDTLVSWVSPLTDSYISAYRQACKDAGFSDITMTYNFFKVNPHIVD 536
Query 261 PIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGLPY 297
IF + T+N D+LLI S AVR F GLPY
Sbjct 537 NIFGVKADSTINTDQLLINSYFDIKAVRNFDYNGLPY 573
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 119 bits (298), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/294 (30%), Positives = 126/294 (43%), Gaps = 46/294 (16%)
Query 26 MKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIKPKVD-SRVSVFIGGDDKTLSINP 84
+ I LR A LQK+KEI S + D+ Q+ H+G+ S + ++GG ++ IN
Sbjct 309 LSILVLRQAEFLQKWKEITQSGNKDYKDQLEKHWGVSVGDGFSELCTYLGGVSSSIDINE 368
Query 85 QVNTNFQNGGEPEIKAIGIGDLSAGCKFTST-TYGMIIGIYRAIPQLDYSHVGIDRNLFK 143
+NTN +I G+G + F S YG+I+ IY +P LDY+ +D K
Sbjct 369 VINTNITGSAAADIAGKGVGVANGEINFNSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLK 428
Query 144 TDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLDMSVTYGYSPRYAEL 203
+++D+ IPE D +GMQ+ +L PL + + GY PRY +
Sbjct 429 VNSTDYAIPEFDRVGMQSMPLVQLMNPLRSFANA------------SGLVLGYVPRYIDY 476
Query 204 KSARDYYEGGFCGTYSTWVTGYDESFLSGWRRNRGSVSV-------TDYDSIE------- 249
K++ D GGF T ++WV Y G++SV D IE
Sbjct 477 KTSVDQSVGGFKRTLNSWVISY------------GNISVLKQVTLPNDAPPIEPSEPVPS 524
Query 250 ------DLFKCRASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGLPY 297
FK L PIF Q N D+ L S AVR GLPY
Sbjct 525 VAPMNFTFFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY 578
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 119 bits (298), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 90/276 (33%), Positives = 129/276 (47%), Gaps = 15/276 (5%)
Query 26 MKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIK-PKVDSRVSVFIGGDDKTLSINP 84
+ I ALR A LQK+KE+ S + D+ +Q+ H+GIK S + ++GG +L IN
Sbjct 351 VPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCATSLDINE 410
Query 85 QVNTNFQNGGEPEIKAIGIGDLSAGCKFTST-TYGMIIGIYRAIPQLDYSHVGIDRNLFK 143
+N N +I G + +F S YG+I+ IY +P +DY G+D +
Sbjct 411 VINNNITGDNAADIAGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIVDYVGSGVDHSCTL 470
Query 144 TDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLDMSVTYGYSPRYAEL 203
DA+ FPIPELD IGM+ S PL+ + P +P GY+PRY +
Sbjct 471 VDATSFPIPELDQIGME-------SVPLV---RAMNPVKESDTP-SADTFLGYAPRYIDW 519
Query 204 KSARDYYEGGFCGTYSTWVTGY-DESFLSGWRRNRGSVSVTDYDSI-EDLFKCRASLLYP 261
K++ D G F + TW D+ S N S + DSI FK S++ P
Sbjct 520 KTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNFPSNPNVEPDSIAAGFFKVNPSIVDP 579
Query 262 IFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGLPY 297
+F TV D+ L S VR + GLPY
Sbjct 580 LFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY 615
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 116 bits (290), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/302 (31%), Positives = 140/302 (46%), Gaps = 33/302 (11%)
Query 6 LASTG--LQLVGFSGKLTADASMKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIKP 63
+STG LQ V SG T + ALR A LQK+KEI S + D+ Q+ H+ +
Sbjct 302 FSSTGVNLQTVNGSGTFT------VLALRQAEFLQKWKEITQSGNKDYKDQIEKHWNVSV 355
Query 64 -KVDSRVSVFIGGDDKTLSINPQVNTNFQNGGEPEIKAIGIGDLSAGCKFTS-TTYGMII 121
+ S +S+++GG +L IN VN N +I G+ + F + YG+I+
Sbjct 356 GEAYSEMSLYLGGTTASLDINEVVNNNITGSNAADIAGKGVVVGNGRISFDAGERYGLIM 415
Query 122 GIYRAIPQLDYSHVGIDRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGL-NDLLVP 180
IY ++P LDY+ ++ K +++DF IPE D +GM+ S PL+ L N L
Sbjct 416 CIYHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGME-------SVPLVSLMNPLQSS 468
Query 181 YGYQTSPLDMSVTYGYSPRYAELKSARDYYEGGFCGTYSTWVTGYDESFLSGWRR----- 235
Y +S L GY+PRY K+ D G F T +WV YD +
Sbjct 469 YNVGSSIL------GYAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNYQDDP 522
Query 236 NRGSVSVTDYDSIEDLFKCRASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGL 295
N ++ +Y + FK + + P+F S +++ D+ L S VR GL
Sbjct 523 NNSPGTLVNYTN----FKVNPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGL 578
Query 296 PY 297
PY
Sbjct 579 PY 580
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 104 bits (260), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 88/307 (29%), Positives = 141/307 (46%), Gaps = 29/307 (9%)
Query 2 HLLKLASTGLQLVGFSGKLTADASMKIS--ALRSATALQKYKEIQNSNDPDFAAQVLAHF 59
+L+ +T L+ + D+S +S ALR A A QK+KE+ +++ D+ +Q+ AH+
Sbjct 325 RILRFNNTNSGLI-----VEGDSSFGVSILALRRAEAAQKWKEVALASEEDYPSQIEAHW 379
Query 60 GIK-PKVDSRVSVFIGGDDKTLSINPQVNTNFQNGGEPEIKAIGIGDLSAGCKF-TSTTY 117
G K S + ++G + LSIN VN N +I G + F Y
Sbjct 380 GQSVNKAYSDMCQWLGSINIDLSINEVVNNNITGENAADIAGKGTMSGNGSINFNVGGQY 439
Query 118 GMIIGIYRAIPQLDYSHVGIDRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDL 177
G+++ ++ +PQLDY T+ DFPIPE D IGM E + GLN +
Sbjct 440 GIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIPEFDKIGM------EQVPVIRGLNPV 493
Query 178 LVPYG-YQTSPLDMSVTYGYSPRYAELKSARDYYEGGFCGTYSTWVTGYDESFLSGWRRN 236
G ++ SP ++ +GY+P+Y K+ D G F + TW+ +D+ L
Sbjct 494 KPKDGDFKVSP---NLYFGYAPQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLA---- 546
Query 237 RGSVSVTDYDSIE------DLFKCRASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPF 290
SV D ++E FK S+L +F + + +N D+ L ++ VR
Sbjct 547 ADSVDFPDNPNVEADSVKAGFFKVSPSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSL 606
Query 291 SMYGLPY 297
GLPY
Sbjct 607 DPNGLPY 613
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 93.6 bits (231), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 86/292 (29%), Positives = 137/292 (47%), Gaps = 26/292 (9%)
Query 24 ASMKISALRSATALQKYKE-IQNSNDPDFAAQVLAHFGIK-PKVDSRVSVFIGGDDKTLS 81
+S ++ LR+A AL K E + +N D+A+Q+ AHFG K P+ + + F+GG D ++
Sbjct 296 SSFSVNDLRAAFALDKMLEATRRANGLDYASQIEAHFGFKVPESRANDARFLGGFDNSIV 355
Query 82 INPQVNTNFQNGGEPEIKAIG------IGDLSAGC-KFTSTTYGMIIGIYRAIPQLDYSH 134
++ V+TN + +IG IG +S+G +F ST +G+I+ IY PQ +Y+
Sbjct 356 VSEVVSTNGNAASDGSHASIGDLGGKGIGSMSSGTIEFDSTEHGIIMCIYSVAPQSEYNA 415
Query 135 VGIDRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLDMSVTY 194
+D K F PE +G Q +L +G+N+ G+ L+ ++
Sbjct 416 SYLDPFNRKLTREQFYQPEFADLGYQALIGSDLICSTLGMNEKQA--GFSDIELNNNLL- 472
Query 195 GYSPRYAELKSARDYYEGGFCG--TYSTWVT-----GYDESFLSGWRRNRGSVSVTDYDS 247
GY RY E K+ARD G F + S W T GY ++ N+G +
Sbjct 473 GYQVRYNEYKTARDLVFGDFESGKSLSYWCTPRFDFGYGDTEKKIAPENKGGADYRKKGN 532
Query 248 IEDL----FKCRASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGL 295
F +L+ PIF+ + V D ++ S AVRP S+ GL
Sbjct 533 RSHWSSRNFYINPNLVNPIFL---TSAVQADHFIVNSFLDVKAVRPMSVTGL 581
>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens
CC14M]
Length=656
Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 127/277 (46%), Gaps = 37/277 (13%)
Query 31 LRSATALQKYKE-IQNSNDPDFAAQVLAHFGIK-PKVDSRVSVFIGGDDKTLSINPQVNT 88
+R+ AL+K E + +N D++ Q+ AHFG K P+ + FIGG D +SI+ V T
Sbjct 396 IRAMFALEKMLERTRAANGLDYSNQIAAHFGFKVPESRKNCASFIGGFDNQISISEVVTT 455
Query 89 NFQNGGEP----------EIKAIGIGDLSAG-CKFTSTTYGMIIGIYRAIPQLDYSHVGI 137
+ NG ++ GIG +++G + +G+I+ IY PQ+DY +
Sbjct 456 S--NGSVDGTASTGSVVGQVFGKGIGAMNSGHISYDVKEHGLIMCIYSIAPQVDYDAREL 513
Query 138 DRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLDMSVTYGYS 197
D K D+ PE +++GMQ + +L L + S + GYS
Sbjct 514 DPFNRKFSREDYFQPEFENLGMQPVIQSDLC--------LCINSAKSDSSDQHNNVLGYS 565
Query 198 PRYAELKSARDYYEGGFC--GTYSTWVTGYDESFLSGWRRNRGSVSVTDYDSIEDLFKCR 255
RY E K+ARD G F G+ S W T + + G +S+ D
Sbjct 566 ARYLEYKTARDIIFGEFMSGGSLSAWATPKN-----NYTFEFGKLSLPD-------LLVD 613
Query 256 ASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSM 292
+L PIF +++G+++ D+ L+ S A+RP +
Sbjct 614 PKVLEPIFAVKYNGSMSTDQFLVNSYFDVKAIRPMQV 650
>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=245
Score = 76.6 bits (187), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 58/202 (29%), Positives = 97/202 (48%), Gaps = 23/202 (11%)
Query 26 MKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIKPKVDSRVS--VFIGGDDKTLSIN 83
+ I+ +R++ ALQ++ E + + Q+L+HFG++ D+R+ F+GG +S++
Sbjct 3 VNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSS-DARLQRPQFLGGGRTPISVS 61
Query 84 PQVNTNFQNGGEPEIKAIGIGDLSAGCKFTSTTY----GMIIGIYRAIPQLDYSHVGIDR 139
+ T+ + P+ G G +SAG T Y G I+GI P+ Y G+ +
Sbjct 62 EVLQTSSTDSTSPQANMAGHG-ISAGVNHGFTRYFEEHGYIMGIMSIRPRTGYQQ-GVPK 119
Query 140 NLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLDMSVTYGYSPR 199
+ K D DF PE +G Q E+ + LN+ S T+GY+PR
Sbjct 120 DFRKFDNMDFYFPEFAHLGEQ-----EIKNEELYLNE---------SDAANEGTFGYTPR 165
Query 200 YAELKSARDYYEGGFCGTYSTW 221
YAE K +++ G F G + W
Sbjct 166 YAEYKYSQNEVHGDFRGNMAFW 187
>gi|575094297|emb|CDL65693.1| unnamed protein product [uncultured bacterium]
Length=630
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 120/292 (41%), Gaps = 26/292 (9%)
Query 25 SMKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIK-PKVDSRVSVFIGGDDKTLSIN 83
++ +S LR+ A K I + AQ LAHFG K P+ S ++GG + L I+
Sbjct 343 TLNVSQLRALYATDKLLRITQFAGKHYDAQTLAHFGKKVPQGVSGEVYYLGGQSQRLQIS 402
Query 84 PQVNTNFQNG----------GEPEIKAIGIGDLSAGCKFTSTTYGMIIGIYRAIPQLDYS 133
P T +G GE +A + F + +G+++ IY A+P+ +YS
Sbjct 403 PI--TALSSGQTSDGSDTVFGEQGARAASVTQGQKPFTFEAPCHGILMAIYSAVPEANYS 460
Query 134 HVGIDRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGL-NDLLVPYGYQTSPLDMSV 192
IDR ++DF PELD+IGM Y E S P L + PY S D +
Sbjct 461 CDAIDRINTLAYSNDFYKPELDNIGMSPLYSYEFSVPGYTLFRNPPTPY----SSDDAAQ 516
Query 193 TYGYSPRYAELKSARDYYEGGFCGTYSTWVTGYDESFLSGWRR----NRGSVSVTDYDSI 248
+ G+ RY+ K+ D G T +W D L R N S+ +
Sbjct 517 SLGWQFRYSWFKTKVDRTCGALNRTLRSWCPKRDYLALGLQSRPQLFNYASLYYVSPSYL 576
Query 249 EDLFKCRAS----LLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGLP 296
+ LF + YP+ + S D L+ TC S YGLP
Sbjct 577 DGLFYLNFAPPIDYRYPVDSDMSSIAFETDPLIHDMQITCYKTSVMSTYGLP 628
Lambda K H a alpha
0.318 0.137 0.409 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1574515238976