bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-12_CDS_annotation_glimmer3.pl_2_3
Length=613
Score E
Sequences producing significant alignments: (Bits) Value
gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 426 6e-138
gi|490418709|ref|WP_004291032.1| hypothetical protein 416 3e-134
gi|575094354|emb|CDL65742.1| unnamed protein product 415 4e-133
gi|496050829|ref|WP_008775336.1| hypothetical protein 409 2e-131
gi|494822885|ref|WP_007558293.1| hypothetical protein 362 7e-113
gi|575094321|emb|CDL65708.1| unnamed protein product 259 1e-73
gi|517172762|ref|WP_018361580.1| hypothetical protein 204 4e-54
gi|647452987|ref|WP_025792807.1| hypothetical protein 194 1e-50
gi|494308783|ref|WP_007173938.1| hypothetical protein 193 3e-50
gi|575094339|emb|CDL65730.1| unnamed protein product 192 6e-50
>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573
Score = 426 bits (1095), Expect = 6e-138, Method: Compositional matrix adjust.
Identities = 268/624 (43%), Positives = 359/624 (58%), Gaps = 62/624 (10%)
Query 1 MASYTGMSNLQNHPHRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQP 60
M+S ++ L+N R+GFD+ KNAFTAKVGELLP+ PGDK+ + FTRTQP
Sbjct 1 MSSVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQP 60
Query 61 VETSAYTRLREYFDFYAVPLRLLWKSAPSVLTQMQDVNKIQALSLTQNLSLGTFLPSIPL 120
V ++AY+RLREY+DFY VP RLLW AP+ T M D + L + NLS P
Sbjct 61 VNSAAYSRLREYYDFYFVPYRLLWNMAPTFFTNMPDPHHAADLVSSVNLS-----QRHPW 115
Query 121 SVLSDAM-YLLNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGR-WWS 178
D M YL N S + KN FGF R +L KLL+YL YG G+ + S
Sbjct 116 FTFFDIMEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYG-------FGKDYES 168
Query 179 TSVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvsp 238
V S +D + ++ FPLLAYQKI +D+FR QW+++ P YN+DY G S
Sbjct 169 VKVPSDSD---------DIVLSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSS 219
Query 239 slissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLG 298
+ S + D +K+ TMFDL YCN+ KD G+LP +Q+GDV+V P GD +
Sbjct 220 GFHIPMSSFTNDAFKNPTMFDLNYCNFQKDYFTGMLPRAQYGDVSVAS-PIFGDLD---- 274
Query 299 TDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAE 358
+G +S++T +AP N I + + S+ + +VLALRQAE
Sbjct 275 -------IGDSSSLTFASAP-------QQGANTIQSGVLVVNNNSNTTAGLSVLALRQAE 320
Query 359 ALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEG 418
LQ+W+EI+QSG DY+ Q++KHF V LS C Y+GG + NLDISEVVN NL T
Sbjct 321 CLQKWREIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDISEVVNTNL-TGD 379
Query 419 DTAVIAGKGVGAGNGS-FEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIP 477
+ A I GKG G NG+ ++ ++EH ++MCIYH +PLLD+++ Q T IP
Sbjct 380 NQADIQGKGTGTLNGNKVDFESSEHGIIMCIYHCLPLLDWSINRIARQNFKTTFTDYAIP 439
Query 478 EFDNIGMEVL-PMTQVFN-----SPKASIVNLFNAGYNPRYFNWKTKLDVINGAFTTTLK 531
EFD++GM+ L P +F S +SI N GY PRY + KT +D I+G+F TL
Sbjct 440 EFDSVGMQQLYPSEMIFGLEDLPSDPSSI----NMGYVPRYADLKTSIDEIHGSFIDTLV 495
Query 532 SWVSPVTESLLSGW--FCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDT 589
SWVSP+T+S +S + C KD D + M Y FFKVNP ++D IFGV ADST +T
Sbjct 496 SWVSPLTDSYISAYRQAC----KDAGFSD--ITMTYNFFKVNPHIVDNIFGVKADSTINT 549
Query 590 DQLLVNSYIGCYVVRNLSRDGVPY 613
DQLL+NSY VRN +G+PY
Sbjct 550 DQLLINSYFDIKAVRNFDYNGLPY 573
>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM
20697]
Length=578
Score = 416 bits (1070), Expect = 3e-134, Method: Compositional matrix adjust.
Identities = 251/622 (40%), Positives = 344/622 (55%), Gaps = 53/622 (9%)
Query 1 MASYTGMSNLQNHPHRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQP 60
MA+ + +++N P R+GFD+ K FTAK GELLPV +PGD ++ N++ FTRTQP
Sbjct 1 MANIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQP 60
Query 61 VETSAYTRLREYFDFYAVPLRLLWKSAPSVLTQMQDVNKIQALSL--TQNLSLGTFLPSI 118
V T+A+ R+REY+DF+ VP LLW A +VLTQM D N A+S+ T+N L +P +
Sbjct 61 VNTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYD-NPQHAVSIDPTRNFVLSGEMPYM 119
Query 119 PLSVLSDAMYLLNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWWS 178
++ +N S + N FG++RS KLL YLGYGN S T W
Sbjct 120 TSEAIAS---YINALSTASALADYKSNYFGYNRSKSSVKLLEYLGYGNY-ESFLTDDW-- 173
Query 179 TSVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvsp 238
T + N N+F LLAYQKIY DF+R SQWE +PS++NVDY G
Sbjct 174 ----------NTAPLMANLNHNIFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDG--- 220
Query 239 slissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLG 298
S ++ + S +++++ FDL+YCNW KD+ GVLP+ Q+G+ AV I + L
Sbjct 221 SSMNLDNAYSTEFYQNYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGKLTL- 279
Query 299 TDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAE 358
S+ S+VG + S TA L A D + ++L LRQAE
Sbjct 280 --SNFSTVGTSPTTASGTATKNLPAFDTVGD-------------------LSILVLRQAE 318
Query 359 ALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEG 418
LQ+WKEI+QSG+ DY++Q+ KH+GV + S +CTY+GGVS ++DI+EV+N N+ T
Sbjct 319 FLQKWKEITQSGNKDYKDQLEKHWGVSVGDGFSELCTYLGGVSSSIDINEVINTNI-TGS 377
Query 419 DTAVIAGKGVGAGNGSFEYTTT-EHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIP 477
A IAGKGVG NG + + + ++MCIYH +PLLDYT D L ++ IP
Sbjct 378 AAADIAGKGVGVANGEINFNSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIP 437
Query 478 EFDNIGMEVLPMTQVFNSPKASIVNL--FNAGYNPRYFNWKTKLDVINGAFTTTLKSWVS 535
EFD +GM+ +P+ Q+ N P S N GY PRY ++KT +D G F TL SWV
Sbjct 438 EFDRVGMQSMPLVQLMN-PLRSFANASGLVLGYVPRYIDYKTSVDQSVGGFKRTLNSWVI 496
Query 536 PVTESLLSGWFCFGYNKDDAAPDTKV----IMNYKFFKVNPSVLDPIFGVNADSTWDTDQ 591
+ + P V MN+ FFKVNP LDPIF V A +TDQ
Sbjct 497 SYGNISVLKQVTLPNDAPPIEPSEPVPSVAPMNFTFFKVNPDCLDPIFAVQAGDDTNTDQ 556
Query 592 LLVNSYIGCYVVRNLSRDGVPY 613
L +S+ VRNL DG+PY
Sbjct 557 FLCSSFFDIKAVRNLDTDGLPY 578
>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615
Score = 415 bits (1066), Expect = 4e-133, Method: Compositional matrix adjust.
Identities = 251/646 (39%), Positives = 359/646 (56%), Gaps = 72/646 (11%)
Query 7 MSNLQNHPHRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQPVETSAY 66
M++++N P R+GFD+ K FTAK GELLPV + +PGD + N+ FTRTQP+ TSA+
Sbjct 3 MADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTSAF 62
Query 67 TRLREYFDFYAVPLRLLWKSAPSVLTQMQ-DVNKIQALSLTQNLSLGTFLPSIPLSVLSD 125
R+REY+DFY VP +W S +TQM +V +L N L +P ++D
Sbjct 63 ARMREYYDFYFVPFEQMWNKFDSCITQMNANVQHASGPTLDDNTPLSGRMPYFTSEQIAD 122
Query 126 AMYLLNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWWSTSVSSKN 185
LN ++ +++ KN FGF+RS L KLL YLGYG+ S +S WS
Sbjct 123 ---YLNDQA-----TAARKNPFGFNRSTLTCKLLQYLGYGDYNSFDSETNTWSA------ 168
Query 186 DASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvspslissll 245
+ + N ++ FPLLAYQKIY DF+R++QWE +NPS++N+DY G S +
Sbjct 169 -----KPLLYNLELSPFPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKGTSDLQMDLTG 223
Query 246 svspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAV------LDIPDSGDSNVV--- 296
S D FD++YCN+ KDM GVLP +Q+G +V L++ +GDS +
Sbjct 224 LPSDD----NNFFDIRYCNYQKDMFHGVLPVAQYGSASVVPINGQLNVISNGDSGPIFKT 279
Query 297 ------------------LGTDSHKSSVGIASAITSKTAP-----FPLFALDASP--ENP 331
+G D+ V ++ K+A FP A S ENP
Sbjct 280 STPDPGTPGTSYVTVGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENP 339
Query 332 IPINsklrldlsslksQFTVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALS 391
I ++ +LALRQAE LQ+WKE+S SG+ DY+ QI KH+G+K+ LS
Sbjct 340 NLI------IENNQGFYVPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLS 393
Query 392 NMCTYIGGVSRNLDISEVVNNNLATEGDTAVIAGKGVGAGNGSFEYTTT-EHCVVMCIYH 450
+ Y+GG + +LDI+EV+NNN+ T + A IAGKG GNGS + + E+ ++MCIYH
Sbjct 394 HQARYLGGCATSLDINEVINNNI-TGDNAADIAGKGTFTGNGSIRFESKGEYGIIMCIYH 452
Query 451 AVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFNSPKASIVNLFNA--GY 508
+P++DY +G D + DA S PIPE D IGME +P+ + N K S + GY
Sbjct 453 VLPIVDYVGSGVDHSCTLVDATSFPIPELDQIGMESVPLVRAMNPVKESDTPSADTFLGY 512
Query 509 NPRYFNWKTKLDVINGAFTTTLKSWVSPVTESLLSGWFCFGYNKD-DAAPDTKVIMNYKF 567
PRY +WKT +D G F +L++W PV + L+ + + + PD+ + F
Sbjct 513 APRYIDWKTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNFPSNPNVEPDS---IAAGF 569
Query 568 FKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY 613
FKVNPS++DP+F V ADST TD+ L +S+ VVRNL +G+PY
Sbjct 570 FKVNPSIVDPLFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY 615
>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580
Score = 409 bits (1051), Expect = 2e-131, Method: Compositional matrix adjust.
Identities = 254/621 (41%), Positives = 356/621 (57%), Gaps = 49/621 (8%)
Query 1 MASYTGMSNLQNHPHRSGFDIGRKNAFTAKVGELLPVY-WDISMPGDKYRFNIEYFTRTQ 59
MA+ + +L+N R+GFD+ K FTAK GELLPV W++ +PGDK+ +++ FTRTQ
Sbjct 1 MANIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEV-LPGDKWSIDLKSFTRTQ 59
Query 60 PVETSAYTRLREYFDFYAVPLRLLWKSAPSVLTQMQDVNKIQALSL--TQNLSLGTFLPS 117
P+ T+A+ R+REY+DFY VP LLW A +VLTQM D N A S + N +L +P+
Sbjct 60 PLNTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYD-NPQHATSYIPSANQALAGVMPN 118
Query 118 IPLSVLSDAMYLLNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWW 177
+ ++D + L+ T +S KN FG+ RS KLL YLGYGN
Sbjct 119 VTCKGIADYLNLVAPDVTT--TNSYEKNYFGYSRSLGTAKLLEYLGYGNFY--------- 167
Query 178 STSVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvs 237
T +SKN+ N +N++ +LAYQKIY D R SQWE +PS +NVDY +G
Sbjct 168 -TYATSKNNTWTKSPLSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSGTV 226
Query 238 pslissllsvspD-YWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVV 296
S ++ ++ + MFDL+YCNW KD+ GVLP Q+GD A +++ SNV+
Sbjct 227 DSAMTIDSMITGQGFAPFYNMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNV---NLSNVL 283
Query 297 LGTDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslks-QFTVLALR 355
+A + + D P P +S + S FTVLALR
Sbjct 284 -------------------SAQYMVQTPDGDPVGGSPFSSTGVNLQTVNGSGTFTVLALR 324
Query 356 QAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLA 415
QAE LQ+WKEI+QSG+ DY++QI KH+ V + +A S M Y+GG + +LDI+EVVNNN+
Sbjct 325 QAEFLQKWKEITQSGNKDYKDQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNI- 383
Query 416 TEGDTAVIAGKGVGAGNGSFEYTTTE-HCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESL 474
T + A IAGKGV GNG + E + ++MCIYH++PLLDYT + ++
Sbjct 384 TGSNAADIAGKGVVVGNGRISFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDF 443
Query 475 PIPEFDNIGMEVLPMTQVFNSPKASIVNLFNA--GYNPRYFNWKTKLDVINGAFTTTLKS 532
IPEFD +GME +P+ + N P S N+ ++ GY PRY ++KT +D GAF TTLKS
Sbjct 444 AIPEFDRVGMESVPLVSLMN-PLQSSYNVGSSILGYAPRYISYKTDVDSSVGAFKTTLKS 502
Query 533 WVSPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQL 592
WV + + +DD ++NY FKVNP+ +DP+F V A ++ DTDQ
Sbjct 503 WVMSYDNQSVINQLNY---QDDPNNSPGTLVNYTNFKVNPNCVDPLFAVAASNSIDTDQF 559
Query 593 LVNSYIGCYVVRNLSRDGVPY 613
L +S+ VVRNL DG+PY
Sbjct 560 LCSSFFDVKVVRNLDTDGLPY 580
>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM
17135]
Length=613
Score = 362 bits (929), Expect = 7e-113, Method: Compositional matrix adjust.
Identities = 230/634 (36%), Positives = 347/634 (55%), Gaps = 49/634 (8%)
Query 1 MASYTGMSNLQNHPHRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQP 60
MA+ M +++N P R+G+D+ +K FTAK G L+PV+W +P D ++ F RTQP
Sbjct 8 MANIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQP 67
Query 61 VETSAYTRLREYFDFYAVPLRLLWKSAPSVLTQMQDVNKIQALS--LTQNLSLGTFLPSI 118
+ T+A+ R+R YFDFY VP R +W P+ +TQM+ N + A L N+ L LP
Sbjct 68 LNTAAFARMRGYFDFYFVPFRQMWNKFPTAITQMR-TNLLHASGPVLADNVPLSDELPYF 126
Query 119 PLSVLSDAMYLLNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWWS 178
++D + L + KN FG+ R+ L +L YLGYG+ +
Sbjct 127 TAEQVADYIVSL----------ADSKNQFGYYRAWLVCIILEYLGYGDFYP-------YI 169
Query 179 TSVSSKNDASYTQRYIQNNY-VNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvs 237
+ A++ R + NN + FPL AYQKIY DF R++QWE SNPS++N+DY +G
Sbjct 170 VEAAGGEGATWATRPMLNNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISG-- 227
Query 238 pslissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVL 297
+ L + S +FD++Y NW +D+L G +P +Q+G+ + +P SG VV
Sbjct 228 SADSLQLDFTVEGFKDSFNLFDMRYSNWQRDLLHGTIPQAQYGEASA--VPVSGSMQVVE 285
Query 298 GTDSHKSSVG------IASAITSKTAPFPLFALDASPENPI-PINsklrldlsslksQF- 349
G + G + +T + + L A + E+ I N+ + S F
Sbjct 286 GPTPPAFTTGQDGVAFLNGNVTIQGSSGYLQAQTSVGESRILRFNNTNSGLIVEGDSSFG 345
Query 350 -TVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISE 408
++LALR+AEA Q+WKE++ + + DY QI H+G + +A S+MC ++G ++ +L I+E
Sbjct 346 VSILALRRAEAAQKWKEVALASEEDYPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINE 405
Query 409 VVNNNLATEGDTAVIAGKGVGAGNGSFEYTT-TEHCVVMCIYHAVPLLDYTLTGQDGQLL 467
VVNNN+ E + A IAGKG +GNGS + ++ +VMC++H +P LDY +
Sbjct 406 VVNNNITGE-NAADIAGKGTMSGNGSINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTT 464
Query 468 VTDAESLPIPEFDNIGMEVLPMTQVFNSPKAS------IVNLFNAGYNPRYFNWKTKLDV 521
+T+ PIPEFD IGME +P+ + N K NL+ GY P+Y+NWKT LD
Sbjct 465 LTNVLDFPIPEFDKIGMEQVPVIRGLNPVKPKDGDFKVSPNLY-FGYAPQYYNWKTTLDK 523
Query 522 INGAFTTTLKSWVSPV-TESLLSG-WFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIF 579
G F +LK+W+ P E+LL+ F N + A K FFKV+PSVLD +F
Sbjct 524 SMGEFRRSLKTWIIPFDDEALLAADSVDFPDNPNVEADSVKA----GFFKVSPSVLDNLF 579
Query 580 GVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY 613
V A+S +TDQ L ++ VVR+L +G+PY
Sbjct 580 AVKANSDLNTDQFLCSTLFDVNVVRSLDPNGLPY 613
>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642
Score = 259 bits (663), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 214/658 (33%), Positives = 317/658 (48%), Gaps = 76/658 (12%)
Query 6 GMSNLQNHPHRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQPVETSA 65
G+ L+N P R+ FD+ +N FTAKVGELLP + PGD + + YFTRT P++++A
Sbjct 9 GLHGLKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPLQSNA 68
Query 66 YTRLREYFDFYAVPLRLLWKSAPSVLTQMQ------DVNKIQALSLTQNLSLGTFLPSIP 119
+TRLRE ++ VP LWK S + M D+++I A SL N + T +P +
Sbjct 69 FTRLRENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRI-ASSLVGNQKVTTQMPCVN 127
Query 120 LSVLSDAMYLLNGRSWTPGNSSSLKNMF--GFDRSDLCYKLLSYLGYGNLISSESTGRWW 177
L + RS T G+ S+ F G R KLL LGYGN + +
Sbjct 128 YKTLHAYLLKFINRS-TVGSDGSVGPEFNRGCYRHAESAKLLQLLGYGNFPEQFANFKVN 186
Query 178 STSVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvs 237
+ + Y + Y+++F LLAY KI D + + QW+ N S NVDY T S
Sbjct 187 NDKHNQSGQNFKDVTYNNSPYLSIFRLLAYHKICNDHYLYRQWQPYNASLCNVDYLTPNS 246
Query 238 pslissllsvsp---DYWKSG--TMFDLKYCNWNKDMLMGVLPNSQFGDVAV--LDIPDS 290
SL+S ++ D K+ + D+++ N D GVLP SQFG +V L++ ++
Sbjct 247 SSLLSIDDALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQFGSESVVNLNLGNA 306
Query 291 GDSNVVLGTDSH-----KSSVG-------IASAITSK----TAPFPLFALDASPENPIPI 334
S V+ GT S +++ G +AS+ + + D + + I
Sbjct 307 SGSAVLNGTTSKDSGRWRTTTGEWEMEQRVASSANGNLKLDNSNGTFISHDHTFSGNVAI 366
Query 335 NsklrldlsslksQFTVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMC 394
N +SL +++ALR A A Q++KEI + D D++ Q+ HFG+K P +
Sbjct 367 N-------TSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQVEAHFGIK-PDEKNENS 418
Query 395 TYIGGVSRNLDISEVVNNNLATEGDTAVIAGKG-VGAGNGSFEYTTTEHCVVMCIYHAVP 453
+IGG S ++I+E +N NL+ GD G G G+ S ++T + VV+ IY P
Sbjct 419 LFIGGSSSMININEQINQNLS--GDNKATYGAAPQGNGSASIKFTAKTYGVVIGIYRCTP 476
Query 454 LLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFNSPKASIVNLFNA------- 506
+LD+ G D L TDA IPE D+IGM+ +V + A + F A
Sbjct 477 VLDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEV--AAPAPYNDEFKAFRVGDGS 534
Query 507 --------GYNPRYFNWKTKLDVINGAFTTTLKSWVSPVTESLLSG--WFCF-GYNKDDA 555
GY PRY +KT D NGAF +LKSWV+ + + W + G N
Sbjct 535 SPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQNNVWNTWAGIN---- 590
Query 556 APDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY 613
AP+ F P ++ +F V++ + D DQL V CY RNLSR G+PY
Sbjct 591 APN--------MFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPY 640
>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568
Score = 204 bits (520), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 178/621 (29%), Positives = 264/621 (43%), Gaps = 96/621 (15%)
Query 16 RSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQPVETSAYTRLREYFDF 75
R+ FDI +++ FTA G LLPV +P D N F RT P+ ++A+ +R ++F
Sbjct 18 RNAFDISQRHLFTAPAGALLPVLSLDLLPHDHVEINASDFMRTLPMNSAAFMSMRGVYEF 77
Query 76 YAVPLRLLWKSAPSVLTQMQDVNKIQALSLTQNLSLGTFLPSIPLSVLS-DAMYLLNGRS 134
Y VP + LW +T M D ++ + F P S +S D L++
Sbjct 78 YFVPYKQLWSGFDQFITGMSDY---------KSSFMYAFKGKTPPSCVSFDVQKLVD--- 125
Query 135 WTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWWSTSVSSKNDASYTQRYI 194
W N++ K++ GFD++ Y++L LGYG +S V N S T
Sbjct 126 WCKTNTA--KDIHGFDKNKGVYRILDLLGYGKYANS--------AGVPYTNPTSTTM--- 172
Query 195 QNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvspslissllsvspDYWKS 254
F LAYQKIY DF+R + +E S+NVD F G + W
Sbjct 173 --GKCTPFRGLAYQKIYNDFYRNTTYEEYQLESFNVDMFYGSGKVKETIPNEPWDYDW-- 228
Query 255 GTMFDLKYCNWNKDMLMGVLP---------NSQFGDVAVLDIPDSGDSNVVLGTDSHKSS 305
F L+Y N KD+L V P N QF DI NV GT ++ S
Sbjct 229 ---FTLRYRNAQKDLLTNVRPTPLFSIDDFNPQFF-TGGSDIVMEKGPNVTGGTHEYRDS 284
Query 306 VGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKE 365
V I EN + S ++ +V +R A AL++
Sbjct 285 VVIVGKNLK--------------ENGV----------DSKRTMISVADIRNAFALEKLAS 320
Query 366 ISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLAT--------- 416
++ Y+EQ+ HFG+ + + CTYIGG N+ + +V ++ T
Sbjct 321 VTMRAGKTYKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSF 380
Query 417 EGDTAVIAGKGVGAGNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPI 476
G GK G+G+G + EH ++MCIY VP + Y D + + +
Sbjct 381 GGYLGRTTGKATGSGSGHIRFDAKEHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFV 440
Query 477 PEFDNIGMEVLPMTQVF-----NSPKASIVNLFNAGYNPRYFNWKTKLDVINGAFTTTLK 531
PEF+N+GM+ L + N+ + I NL G+ PRY +KT LD+ +G F
Sbjct 441 PEFENLGMQPLFAKNISYKYNNNTANSRIKNLGAFGWQPRYSEYKTALDINHGQF----- 495
Query 532 SWVSPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQ 591
V + LS W A ++ N FK+NP LD +F VN + T TDQ
Sbjct 496 -----VHQEPLSYWTV-----ARARGESMSNFNISTFKINPKWLDDVFAVNYNGTELTDQ 545
Query 592 LLVNSYIGCYVVRNLSRDGVP 612
+ Y V ++S DG+P
Sbjct 546 VFGGCYFNIVKVSDMSIDGMP 566
>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584
Score = 194 bits (494), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 177/644 (27%), Positives = 300/644 (47%), Gaps = 120/644 (19%)
Query 16 RSGFDIGRKNAFTAKVGELLPV-YWDISMPGDKYRFNIEYFTRTQPVETSAYTRLREYFD 74
R+GFD+ + F+AK G+LLP+ W+++ P + ++F+++ RT + T++Y R++EY+
Sbjct 10 RNGFDLSSRRIFSAKAGQLLPIGCWEVN-PSEHFKFSVQDLVRTTTLNTASYARMKEYYH 68
Query 75 FYAVPLRLLWKSAPSVLTQMQD----VNKIQALSLTQNLSLGTFLPSIPLSVLSDAMYLL 130
F+ V R LW+ + + +N ++ T + + +P+ L
Sbjct 69 FFFVSYRSLWQWFDQFIVGTNNPHSALNGVKKNGTTNYNQICSSVPTFDL---------- 118
Query 131 NGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYG-----------NLISSESTGRWWST 179
G+ T +S + + GF+ S+ KLL+ L YG NLI+S S
Sbjct 119 -GKLITRLKTSDMDSQ-GFNYSEGAAKLLNMLNYGVTNKGKFMNLENLITSTSY------ 170
Query 180 SVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvsps 239
+ SK+D + Y V+ F LLAYQKI+ DF+R W S+ S+NVD + S
Sbjct 171 -LPSKDDKEPSSIYACK--VSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVDDYADDSNL 227
Query 240 lissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPD--SGDSNVVL 297
I +++ ++Y + KD L + P + D + ++P+ G+ NV+L
Sbjct 228 TIEPDVAL--------KFCQMRYRPYAKDWLTSMKPTPNYSD-GIFNLPEYVRGNGNVIL 278
Query 298 GTDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQA 357
T++ SV + S S ++ F+V LR A
Sbjct 279 -TNNKSGSVSLDSGTVSPSS-------------------------------FSVNDLRAA 306
Query 358 EALQRWKEISQSGDS-DYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVV--NNNL 414
AL + E ++ + DY QI HFG K+P++ +N ++GG ++ +SEVV N N
Sbjct 307 FALDKMLEATRRANGLDYASQIEAHFGFKVPESRANDARFLGGFDNSIVVSEVVSTNGNA 366
Query 415 ATEGDTAVI---AGKGVGA-GNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTD 470
A++G A I GKG+G+ +G+ E+ +TEH ++MCIY P +Y + D
Sbjct 367 ASDGSHASIGDLGGKGIGSMSSGTIEFDSTEHGIIMCIYSVAPQSEYNASYLDPFNRKLT 426
Query 471 AESLPIPEFDNIGMEVLPMTQV------FNSPKA--SIVNLFN--AGYNPRYFNWKTKLD 520
E PEF ++G + L + + N +A S + L N GY RY +KT D
Sbjct 427 REQFYQPEFADLGYQALIGSDLICSTLGMNEKQAGFSDIELNNNLLGYQVRYNEYKTARD 486
Query 521 VINGAFTT--TLKSWVSPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKF----------- 567
++ G F + +L W +P + F +G + AP+ K +Y+
Sbjct 487 LVFGDFESGKSLSYWCTPRFD------FGYGDTEKKIAPENKGGADYRKKGNRSHWSSRN 540
Query 568 FKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGV 611
F +NP++++PIF S D +VNS++ VR +S G+
Sbjct 541 FYINPNLVNPIF---LTSAVQADHFIVNSFLDVKAVRPMSVTGL 581
>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=553
Score = 193 bits (490), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 167/613 (27%), Positives = 266/613 (43%), Gaps = 92/613 (15%)
Query 15 HRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQPVETSAYTRLREYFD 74
+R+ FD+ +++ FTA G LLPV +P D N + F RT P+ T+A+ +R ++
Sbjct 16 NRNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASMRGVYE 75
Query 75 FYAVPLRLLWKSAPSVLTQMQDVNKIQALSLTQNLSLGTFLPSIPLSVLSDAMYLLN-GR 133
F+ VP LW +T M D + S +++ GT +P + LN G+
Sbjct 76 FFFVPYHQLWAQFDQFITGMNDFHS----SANKSIQGGTSPLQVPYFNVDSVFNSLNTGK 131
Query 134 SWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWWSTSVSS-KNDASYTQR 192
G++ L+ F + ++LL LGYG +S G + +VS KN+ Y
Sbjct 132 ESGSGSTDDLQYKFKYG----AFRLLDLLGYGRKF--DSFGTAYPDNVSGLKNNLDYN-- 183
Query 193 YIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvspslissllsvspDYW 252
++F +LAY KIYQD++R S +E + S+N D F G D
Sbjct 184 ------CSVFRILAYNKIYQDYYRNSNYENFDTDSFNFDKFKGGLV-----------DAK 226
Query 253 KSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVGIASAI 312
+F L+Y N D + + F + D N+ + + S G S
Sbjct 227 VVADLFKLRYRNAQTDYFTNLRQSQLFSFTTAFEDVD----NINIAPRDYVKSDG--SNF 280
Query 313 TSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKEISQSGDS 372
T F +D F+V +LR A A+ + ++
Sbjct 281 TRVN-----FGVDTDSSE----------------GDFSVSSLRAAFAVDKLLSVTMRAGK 319
Query 373 DYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNN--LATE-----GDTAVIAG 425
+++Q+R H+GV++P + Y+GG ++ +S+V + ATE G +AG
Sbjct 320 TFQDQMRAHYGVEIPDSRDGRVNYLGGFDSDMQVSDVTQTSGTTATEYKPEAGYLGRVAG 379
Query 426 KGVGAGNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGME 485
KG G+G G + EH V+MCIY VP + Y T D + D PEF+N+GM+
Sbjct 380 KGTGSGRGRIVFDAKEHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENLGMQ 439
Query 486 VLPMTQVFN----SPKASIVNLFNAGYNPRYFNWKTKLDVINGAFTTT--LKSWVSPVTE 539
L + + + PK ++ GY PRY +KT LDV +G F + L SW +
Sbjct 440 PLNSSYISSFCTTDPKNPVL-----GYQPRYSEYKTALDVNHGQFAQSDALSSW----SV 490
Query 540 SLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIG 599
S W F P ++ FK++P L+ IF V+ + T D +
Sbjct 491 SRFRRWTTF--------PQLEIAD----FKIDPGCLNSIFPVDYNGTEANDCVYGGCNFN 538
Query 600 CYVVRNLSRDGVP 612
V ++S DG+P
Sbjct 539 IVKVSDMSVDGMP 551
>gi|575094339|emb|CDL65730.1| unnamed protein product [uncultured bacterium]
Length=588
Score = 192 bits (489), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 187/658 (28%), Positives = 277/658 (42%), Gaps = 124/658 (19%)
Query 7 MSNLQNHP-HRS-----GFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQP 60
M+N+ P HR+ GFD+ +++ FT+ VG+LLPV++D PGDK R + FTRTQP
Sbjct 1 MANINQKPSHRANLSKNGFDMSQRHPFTSSVGQLLPVFYDYLNPGDKIRISANLFTRTQP 60
Query 61 VETSAYTRLREYFDFYAVPLRLLWKSAPSVLTQMQDVNKIQALSLTQNLSLGTFLPSIPL 120
++++A RL E+ +++ VP ++ SV + D N +L NL++ F S +
Sbjct 61 MKSTAMARLTEHIEYFFVPFEQMFSLFGSVFYGIDDYNS-SSLVKHNNLTM-PFFKSDAV 118
Query 121 SVLSDAMYL-----LNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGR 175
S +A Y +N + TP +M G R +L LGYG+L+ S
Sbjct 119 SAALEAAYTSFSSSINRKVLTP-------DMMGQPRVYGILRLSEMLGYGSLLLSNDNNL 171
Query 176 WWSTSVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTG 235
+S +F AYQKI+ DF+R + + SYNVDY G
Sbjct 172 LPHADMS------------------VFLFTAYQKIFNDFYRLDDYTSVQHKSYNVDYAQG 213
Query 236 vspslissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPN---------SQFGDVAVLD 286
+ +MF+L Y W KD V+PN S FG + D
Sbjct 214 QPIT--------------DNSMFELHYRPWKKDYFTNVIPNPYFSSVDNKSSFGGAGLFD 259
Query 287 IPDSGDSNVVLGTDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslk 346
P G S D + S +++ P+F +P+N
Sbjct 260 RP-VGLSITSFNFDG-SDFLQAPSDLSTMENNQPIF-------QELPVNLTSASSAG--- 307
Query 347 sQFTVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDI 406
+V LR A + I+Q Y Q HFG ++PQ +S YIGG S+ L I
Sbjct 308 --LSVSDLRYLYATDKLLRITQFAGKHYDAQTLAHFGKRVPQGVSGEVYYIGGQSQPLQI 365
Query 407 SEVVNNNLATEGDTAVIAGKGVG--AGNG--------SFEYTTTEHCVVMCIYHAVPLLD 456
S V + AT D+ + G +G AG G F + H V+M IY AVP D
Sbjct 366 SSV--ESTATTFDSGDVVGSVLGELAGKGYSQTGNQKDFSFEAPCHGVLMAIYSAVPEAD 423
Query 457 YTLTGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFNSPKASIVNLFNAGYNPRYFNWK 516
Y D + + PEFD++GME P ++ + N G+ RY K
Sbjct 424 YLDERIDYLNTLIQSNDFYKPEFDSLGMEPFPNYEL--DQYRMVGNNSRLGWRYRYSGLK 481
Query 517 TKLDVINGAFTTTLKSWVSPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLD 576
+K D+I+GAF TL+ WV+ +S A D + F ++P+ LD
Sbjct 482 SKPDLISGAFKYTLRDWVAVRNDSRY-------------AEDESWWQSAAFMYIDPAYLD 528
Query 577 PIFGV-----------NADSTWD-----------TDQLLVNSYIGCYVVRNLSRDGVP 612
IF + +A+ T+D D LL + YI CY +S G+P
Sbjct 529 NIFELSFTPRLYQQQDSANVTYDGTFIDRSLVYQRDPLLHDLYIKCYKSSAMSTYGLP 586
Lambda K H a alpha
0.318 0.134 0.414 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 4597148648223