bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-2_CDS_annotation_glimmer3.pl_2_8
Length=342
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094354|emb|CDL65742.1| unnamed protein product 234 1e-67
gi|496050829|ref|WP_008775336.1| hypothetical protein 233 2e-67
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 226 7e-65
gi|490418709|ref|WP_004291032.1| hypothetical protein 218 5e-62
gi|494822885|ref|WP_007558293.1| hypothetical protein 208 6e-58
gi|575094321|emb|CDL65708.1| unnamed protein product 157 9e-40
gi|565841287|ref|WP_023924568.1| hypothetical protein 131 3e-30
gi|647452987|ref|WP_025792807.1| hypothetical protein 112 4e-24
gi|517172762|ref|WP_018361580.1| hypothetical protein 110 1e-23
gi|496521299|ref|WP_009229582.1| capsid protein 110 2e-23
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 234 bits (598), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 145/373 (39%), Positives = 212/373 (57%), Gaps = 35/373 (9%)
Query 2 GVLPNSQFGDIAVIDIEGGLNIPASRIS--LSSNNRPTIGIKVGAQVSSPNNCSITNSSG 59
GVLP +Q+G +V+ I G LN+ ++ S + + P G + V+ N + N S
Sbjct 246 GVLPVAQYGSASVVPINGQLNVISNGDSGPIFKTSTPDPGTPGTSYVTVGGNIGVDNRSF 305
Query 60 NLSTGDILSVGIPA--ASYKLQSSFN----------------------VLALRQAESLQK 95
+S G L+VG A + Y S+ + +LALRQAE LQK
Sbjct 306 GVS-GSTLNVGKSADPSGYGFPSNASTRSLLWENPNLIIENNQGFYVPILALRQAEFLQK 364
Query 96 YREITQSVDTNYRDQIKAHFGVNVPASDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVI 155
++E++ S + +Y+ QI+ H+G+ V SH A+Y+GG A +LDI+EV+NNN+ GD A I
Sbjct 365 WKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCATSLDINEVINNNITGDNAADI 424
Query 156 YGKGVGTGTGSMRYTTGSKYCILMCIYHCMPVLDYDISGQHPQLLATSVDELPIPEFDNI 215
GKG TG GS+R+ + +Y I+MCIYH +P++DY SG PIPE D I
Sbjct 425 AGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIVDYVGSGVDHSCTLVDATSFPIPELDQI 484
Query 216 GMEGVPLVQLVNSNLYKTNKSVKIDSILGYNPRYYAWKSNIDRIHGAFTTTLQDWVSPVD 275
GME VPLV+ +N K + + D+ LGY PRY WK+++DR G F +L+ W PV
Sbjct 485 GMESVPLVRAMNP--VKESDTPSADTFLGYAPRYIDWKTSVDRSVGDFADSLRTWCLPVG 542
Query 276 DSFLYS--TFGTPSSGSF----VTWPFFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGC 329
D L S + PS+ + + FFKVNP+ +D +FAV +DST ++D+FL +S+
Sbjct 543 DKELTSANSLNFPSNPNVEPDSIAAGFFKVNPSIVDPLFAVVADSTVKTDEFLCSSFFDV 602
Query 330 KVVRPLSRDGVPY 342
KVVR L +G+PY
Sbjct 603 KVVRNLDVNGLPY 615
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 233 bits (594), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/346 (40%), Positives = 193/346 (56%), Gaps = 32/346 (9%)
Query 2 GVLPNSQFGDIAVIDIEGGLNIPASRISLSSNNRPTIGIKVGAQVSSPNNCSITNSSGNL 61
GVLP Q+GD A +++ + A + + + P G S+ N N SG
Sbjct 262 GVLPRQQYGDTAAVNVNLSNVLSAQYMVQTPDGDPVGGSPFS---STGVNLQTVNGSG-- 316
Query 62 STGDILSVGIPAASYKLQSSFNVLALRQAESLQKYREITQSVDTNYRDQIKAHFGVNVPA 121
+F VLALRQAE LQK++EITQS + +Y+DQI+ H+ V+V
Sbjct 317 -------------------TFTVLALRQAEFLQKWKEITQSGNKDYKDQIEKHWNVSVGE 357
Query 122 SDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVIYGKGVGTGTGSMRYTTGSKYCILMCI 181
+ S M+ Y+GG +LDI+EVVNNN+ G A I GKGV G G + + G +Y ++MCI
Sbjct 358 AYSEMSLYLGGTTASLDINEVVNNNITGSNAADIAGKGVVVGNGRISFDAGERYGLIMCI 417
Query 182 YHCMPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLYKTNKSVKIDS 241
YH +P+LDY +P + + IPEFD +GME VPLV L+N N S
Sbjct 418 YHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLMNPLQSSYNVG---SS 474
Query 242 ILGYNPRYYAWKSNIDRIHGAFTTTLQDWVSPVDDSFL-----YSTFGTPSSGSFVTWPF 296
ILGY PRY ++K+++D GAF TTL+ WV D+ + Y S G+ V +
Sbjct 475 ILGYAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNYQDDPNNSPGTLVNYTN 534
Query 297 FKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVPY 342
FKVNPN +D +FAV + ++ ++DQFL +S+ KVVR L DG+PY
Sbjct 535 FKVNPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLPY 580
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 226 bits (576), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 146/352 (41%), Positives = 202/352 (57%), Gaps = 42/352 (12%)
Query 2 GVLPNSQFGDIAVID-IEGGLNIPASRISLSSNNRPTIG---IKVGAQVSSPNNCSITNS 57
G+LP +Q+GD++V I G L+I S SL+ + P G I+ G V + N +N+
Sbjct 253 GMLPRAQYGDVSVASPIFGDLDIGDSS-SLTFASAPQQGANTIQSGVLVVNNN----SNT 307
Query 58 SGNLSTGDILSVGIPAASYKLQSSFNVLALRQAESLQKYREITQSVDTNYRDQIKAHFGV 117
+ LS VLALRQAE LQK+REI QS +Y+ Q++ HF V
Sbjct 308 TAGLS---------------------VLALRQAECLQKWREIAQSGKMDYQTQMQKHFNV 346
Query 118 NVPASDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVIYGKGVGTGTGSMRYTTGSKYCI 177
+ A+ S +Y+GG NLDISEVVN NL GD +A I GKG GT G+ S++ I
Sbjct 347 SPSATLSGHCKYLGGWTSNLDISEVVNTNLTGDNQADIQGKGTGTLNGNKVDFESSEHGI 406
Query 178 LMCIYHCMPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLYKTNKSV 237
+MCIYHC+P+LD+ I+ Q T+ + IPEFD++GM+ QL S + + +
Sbjct 407 IMCIYHCLPLLDWSINRIARQNFKTTFTDYAIPEFDSVGMQ-----QLYPSEMIFGLEDL 461
Query 238 KID--SI-LGYNPRYYAWKSNIDRIHGAFTTTLQDWVSPVDDSFLYSTFGTPSSGSF--- 291
D SI +GY PRY K++ID IHG+F TL WVSP+ DS++ + F
Sbjct 462 PSDPSSINMGYVPRYADLKTSIDEIHGSFIDTLVSWVSPLTDSYISAYRQACKDAGFSDI 521
Query 292 -VTWPFFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVPY 342
+T+ FFKVNP+ +DNIF VK+DST +DQ L+NSY K VR +G+PY
Sbjct 522 TMTYNFFKVNPHIVDNIFGVKADSTINTDQLLINSYFDIKAVRNFDYNGLPY 573
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 218 bits (556), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 134/354 (38%), Positives = 195/354 (55%), Gaps = 42/354 (12%)
Query 2 GVLPNSQFGDIAVIDIEGGLNIPASRISLSSNNRPTIGIKVGAQVSSPNNCSITNSSGNL 61
GVLP+ Q+G+ AV I P L+ +N T+G +SP S T ++ NL
Sbjct 254 GVLPHQQYGETAVASI-----TPDVTGKLTLSNFSTVG-------TSPTTASGT-ATKNL 300
Query 62 STGDILSVGIPAASYKLQSSFNVLALRQAESLQKYREITQSVDTNYRDQIKAHFGVNVPA 121
D +VG ++L LRQAE LQK++EITQS + +Y+DQ++ H+GV+V
Sbjct 301 PAFD--TVG----------DLSILVLRQAEFLQKWKEITQSGNKDYKDQLEKHWGVSVGD 348
Query 122 SDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVIYGKGVGTGTGSMRYTTGSKYCILMCI 181
S + Y+GG++ ++DI+EV+N N+ G A I GKGVG G + + + +Y ++MCI
Sbjct 349 GFSELCTYLGGVSSSIDINEVINTNITGSAAADIAGKGVGVANGEINFNSNGRYGLIMCI 408
Query 182 YHCMPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLYKTNKSVKIDS 241
YHC+P+LDY P L + + IPEFD +GM+ +PLVQL+N N S
Sbjct 409 YHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLMNPLRSFANAS---GL 465
Query 242 ILGYNPRYYAWKSNIDRIHGAFTTTLQDWV-------------SPVDDSFLYSTFGTPSS 288
+LGY PRY +K+++D+ G F TL WV P D + + PS
Sbjct 466 VLGYVPRYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQVTLPNDAPPIEPSEPVPSV 525
Query 289 GSFVTWPFFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVPY 342
+ + FFKVNP+ LD IFAV++ +DQFL +S+ K VR L DG+PY
Sbjct 526 AP-MNFTFFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY 578
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 208 bits (530), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 129/358 (36%), Positives = 197/358 (55%), Gaps = 23/358 (6%)
Query 2 GVLPNSQFGDIAVIDIEGGLNIPASRISLSSNNRPTIGIKVGAQVSSPNNCSITNSSGNL 61
G +P +Q+G+ + + + G + + + P N +I SSG L
Sbjct 262 GTIPQAQYGEASAVPVSGSMQV------VEGPTPPAFTTGQDGVAFLNGNVTIQGSSGYL 315
Query 62 ----STGD--ILSVGIPAASYKLQ--SSFNV--LALRQAESLQKYREITQSVDTNYRDQI 111
S G+ IL + ++ SSF V LALR+AE+ QK++E+ + + +Y QI
Sbjct 316 QAQTSVGESRILRFNNTNSGLIVEGDSSFGVSILALRRAEAAQKWKEVALASEEDYPSQI 375
Query 112 KAHFGVNVPASDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVIYGKGVGTGTGSMRYTT 171
+AH+G +V + S M Q++G I +L I+EVVNNN+ G+ A I GKG +G GS+ +
Sbjct 376 EAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNITGENAADIAGKGTMSGNGSINFNV 435
Query 172 GSKYCILMCIYHCMPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLY 231
G +Y I+MC++H +P LDY S H T+V + PIPEFD IGME VP+++ +N
Sbjct 436 GGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIPEFDKIGMEQVPVIRGLNPVKP 495
Query 232 KT-NKSVKIDSILGYNPRYYAWKSNIDRIHGAFTTTLQDWVSPVDDSFLYS--TFGTPS- 287
K + V + GY P+YY WK+ +D+ G F +L+ W+ P DD L + + P
Sbjct 496 KDGDFKVSPNLYFGYAPQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLAADSVDFPDN 555
Query 288 ---SGSFVTWPFFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVPY 342
V FFKV+P+ LDN+FAVK++S +DQFL ++ VVR L +G+PY
Sbjct 556 PNVEADSVKAGFFKVSPSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSLDPNGLPY 613
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 157 bits (398), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 111/366 (30%), Positives = 177/366 (48%), Gaps = 36/366 (10%)
Query 2 GVLPNSQFGDIAVIDIEGG-------LNIPASRISLSSNNRPTIG---IKVGAQVSSPNN 51
GVLP SQFG +V+++ G LN S+ S R T G ++ S+ N
Sbjct 286 GVLPTSQFGSESVVNLNLGNASGSAVLNGTTSKDS--GRWRTTTGEWEMEQRVASSANGN 343
Query 52 CSITNSSGNLSTGDILSVGIPAASYKLQSSFNVLALRQAESLQKYREITQSVDTNYRDQI 111
+ NS+G + D G A + L + +++ALR A + QKY+EI + D +++ Q+
Sbjct 344 LKLDNSNGTFISHDHTFSGNVAINTSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQV 403
Query 112 KAHFGVNVPASDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVIYGKGVGTGTGSMRYTT 171
+AHFG+ P + + +IGG + ++I+E +N NL GD +A G G+ S+++T
Sbjct 404 EAHFGIK-PDEKNENSLFIGGSSSMININEQINQNLSGDNKATYGAAPQGNGSASIKFTA 462
Query 172 GSKYCILMCIYHCMPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLY 231
+ Y +++ IY C PVLD+ G L T + IPE D+IGM+ ++ Y
Sbjct 463 KT-YGVVIGIYRCTPVLDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAPAPY 521
Query 232 KTN---------KSVKIDSILGYNPRYYAWKSNIDRIHGAFTTTLQDWVSPVDDSFLYST 282
S + GY PRY +K++ DR +GAF +L+ WV+ ++
Sbjct 522 NDEFKAFRVGDGSSPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGIN------- 574
Query 283 FGTPSSGSFVTWP------FFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLS 336
F + + TW F P+ + N+F V S + + DQ V C R LS
Sbjct 575 FDAIQNNVWNTWAGINAPNMFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLS 634
Query 337 RDGVPY 342
R G+PY
Sbjct 635 RYGLPY 640
>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens
CC14M]
Length=656
Score = 131 bits (329), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 90/264 (34%), Positives = 143/264 (54%), Gaps = 26/264 (10%)
Query 87 LRQAESLQKYREITQSVD-TNYRDQIKAHFGVNVPASDSHMAQYIGGIARNLDISEVVN- 144
+R +L+K E T++ + +Y +QI AHFG VP S + A +IGG + ISEVV
Sbjct 396 IRAMFALEKMLERTRAANGLDYSNQIAAHFGFKVPESRKNCASFIGGFDNQISISEVVTT 455
Query 145 NNLQGDGEAV-------IYGKGVGT-GTGSMRYTTGSKYCILMCIYHCMPVLDYDISGQH 196
+N DG A ++GKG+G +G + Y ++ ++MCIY P +DYD
Sbjct 456 SNGSVDGTASTGSVVGQVFGKGIGAMNSGHISYDV-KEHGLIMCIYSIAPQVDYDARELD 514
Query 197 PQLLATSVDELPIPEFDNIGMEGVPLVQ---LVNSNLYKTNKSVKIDSILGYNPRYYAWK 253
P S ++ PEF+N+GM+ P++Q + N K++ S + +++LGY+ RY +K
Sbjct 515 PFNRKFSREDYFQPEFENLGMQ--PVIQSDLCLCINSAKSDSSDQHNNVLGYSARYLEYK 572
Query 254 SNIDRIHGAFTT--TLQDWVSPVDDSFLYSTFGTPSSGSFVTWPFFKVNPNTLDNIFAVK 311
+ D I G F + +L W +P ++ FG ++ P V+P L+ IFAVK
Sbjct 573 TARDIIFGEFMSGGSLSAWATPKNNYTF--EFGK------LSLPDLLVDPKVLEPIFAVK 624
Query 312 SDSTWESDQFLVNSYVGCKVVRPL 335
+ + +DQFLVNSY K +RP+
Sbjct 625 YNGSMSTDQFLVNSYFDVKAIRPM 648
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 112 bits (280), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 90/290 (31%), Positives = 141/290 (49%), Gaps = 33/290 (11%)
Query 80 SSFNVLALRQAESLQKYREITQSVD-TNYRDQIKAHFGVNVPASDSHMAQYIGGIARNLD 138
SSF+V LR A +L K E T+ + +Y QI+AHFG VP S ++ A+++GG ++
Sbjct 296 SSFSVNDLRAAFALDKMLEATRRANGLDYASQIEAHFGFKVPESRANDARFLGGFDNSIV 355
Query 139 ISEVV--NNNLQGDGEAVIYGKGVGTGTGSMRYTT----GSKYCILMCIYHCMPVLDYDI 192
+SEVV N N DG G G G GSM T +++ I+MCIY P +Y+
Sbjct 356 VSEVVSTNGNAASDGSHASIGDLGGKGIGSMSSGTIEFDSTEHGIIMCIYSVAPQSEYNA 415
Query 193 SGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLYKTNKSVKI------DSILGYN 246
S P + ++ PEF ++G + + L+ S L K +++LGY
Sbjct 416 SYLDPFNRKLTREQFYQPEFADLGYQALIGSDLICSTLGMNEKQAGFSDIELNNNLLGYQ 475
Query 247 PRYYAWKSNIDRIHGAFTT--TLQDWVSPVDDSFLY--------------STFGTPSSGS 290
RY +K+ D + G F + +L W +P D F Y + + + S
Sbjct 476 VRYNEYKTARDLVFGDFESGKSLSYWCTPRFD-FGYGDTEKKIAPENKGGADYRKKGNRS 534
Query 291 FVTWPFFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGV 340
+ F +NPN ++ IF S ++D F+VNS++ K VRP+S G+
Sbjct 535 HWSSRNFYINPNLVNPIFLT---SAVQADHFIVNSFLDVKAVRPMSVTGL 581
>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568
Score = 110 bits (276), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 81/282 (29%), Positives = 125/282 (44%), Gaps = 35/282 (12%)
Query 79 QSSFNVLALRQAESLQKYREITQSVDTNYRDQIKAHFGVNVPASDSHMAQYIGGIARNLD 138
++ +V +R A +L+K +T Y++Q++AHFG++V YIGG N+
Sbjct 301 RTMISVADIRNAFALEKLASVTMRAGKTYKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQ 360
Query 139 ISEVVNNNLQGDGEAV--------------IYGKGVGTGTGSMRYTTGSKYCILMCIYHC 184
+ +V Q G V GK G+G+G +R+ ++ ILMCIY
Sbjct 361 VGDVT----QSSGTTVTGTKDTSFGGYLGRTTGKATGSGSGHIRF-DAKEHGILMCIYSL 415
Query 185 MPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLYKTNKS---VKIDS 241
+P + YD P + + +PEF+N+GM+ PL S Y N + +K
Sbjct 416 VPDVQYDSKRVDPFVQKIERGDFFVPEFENLGMQ--PLFAKNISYKYNNNTANSRIKNLG 473
Query 242 ILGYNPRYYAWKSNIDRIHGAFTTT--LQDWVSPVDDSFLYSTFGTPSSGSFVTWPFFKV 299
G+ PRY +K+ +D HG F L W S F + FK+
Sbjct 474 AFGWQPRYSEYKTALDINHGQFVHQEPLSYWTVARARGESMSNFNIST---------FKI 524
Query 300 NPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVP 341
NP LD++FAV + T +DQ Y V +S DG+P
Sbjct 525 NPKWLDDVFAVNYNGTELTDQVFGGCYFNIVKVSDMSIDGMP 566
>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon
317 str. F0108]
Length=541
Score = 110 bits (275), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 94/338 (28%), Positives = 150/338 (44%), Gaps = 53/338 (16%)
Query 28 ISLSSNNRPTIGIKVGA-------QVSSPNNCSITNSSGNLSTGDILSVGIPAASYKLQS 80
+ +N RPT +G+ Q+S P + ++ GN + ++ S +
Sbjct 231 LDFYTNLRPTPLFTIGSDSFSSVLQLSDPTGSAGFSADGNSAKLNMASPDV--------- 281
Query 81 SFNVLALRQAESLQKYREITQSVDTNYRDQIKAHFGVNVPASDSHMAQYIGGIARNLDIS 140
NV A+R A +L K I+ Y +QI+AHFGV V Y+GG N+ +
Sbjct 282 -LNVSAIRSAFALDKLLSISMRAGKTYAEQIEAHFGVTVSEGRDGQVYYLGGFDSNVQVG 340
Query 141 EV------VNNNLQGDGEA-------VIYGKGVGTGTGSMRYTTGSKYCILMCIYHCMPV 187
+V N N+ G A I GKG G+G G +++ +LMCIY +P
Sbjct 341 DVTQTSGTTNPNVSEVGNAKLAGYLGKITGKGTGSGYGEIQFDAKEP-GVLMCIYSVVPA 399
Query 188 LDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLV-QLVNSNLYKTNKSVKIDSILGYN 246
+ YD P + + + IPEF+N+GM+ P+V V+ N K N G+
Sbjct 400 MQYDCMRLDPFVAKQTRGDYFIPEFENLGMQ--PIVPAFVSLNRAKDNS-------YGWQ 450
Query 247 PRYYAWKSNIDRIHGAFTT--TLQDW-VSPVDDSFLYSTFGTPSSGSFVTWPFFKVNPNT 303
PRY +K+ D HG F L W ++ S +TF + K+NP+
Sbjct 451 PRYSEYKTAFDINHGQFANGEPLSYWSIARARGSDTLNTFNVAA---------LKINPHW 501
Query 304 LDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVP 341
LD++FAV + T +D ++ + V ++ DG+P
Sbjct 502 LDSVFAVNYNGTEVTDCMFGYAHFNIEKVSDMTEDGMP 539
Lambda K H a alpha
0.316 0.134 0.399 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 2000070484950