bitscore colors: <40, 40-50 , 50-80, 80-200, >200
BLASTP 2.2.30+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 49,011,213 sequences; 17,563,301,199 total letters Query= Contig-2_CDS_annotation_glimmer3.pl_2_1 Length=206 Score E Sequences producing significant alignments: (Bits) Value gi|490418709|ref|WP_004291032.1| hypothetical protein 119 1e-27 gi|575094354|emb|CDL65742.1| unnamed protein product 110 1e-24 gi|496050829|ref|WP_008775336.1| hypothetical protein 105 7e-23 gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 104 2e-22 gi|494822885|ref|WP_007558293.1| hypothetical protein 92.8 2e-18 gi|575094321|emb|CDL65708.1| unnamed protein product 90.5 2e-17 gi|647452987|ref|WP_025792807.1| hypothetical protein 66.2 2e-09 gi|565841287|ref|WP_023924568.1| hypothetical protein 64.3 7e-09 gi|494308783|ref|WP_007173938.1| hypothetical protein 60.8 9e-08 gi|575094339|emb|CDL65730.1| unnamed protein product 59.7 2e-07 >gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii] gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 20697] Length=578 Score = 119 bits (297), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 68/173 (39%), Positives = 97/173 (56%), Gaps = 11/173 (6%) Query 1 MAHFTGLKQLQNHPHRSGFDIGAKNVFSAKCGELLPVYWDLGIPGCTYDIDIQYFTRTRP 60 MA+ LK ++N P R+GFD+ K F+AK GELLPV +PG T+ I+++ FTRT+P Sbjct 1 MANIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQP 60 Query 61 VQTAAYTRIREYFDFYAVPIDLIWKSFDASVIQMGETAPVQAKDI--LTALTVSGDLPYC 118 V TAA+ RIREY+DF+ VP DL+W + + QM + P A I +SG++PY Sbjct 61 VNTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDN-PQHAVSIDPTRNFVLSGEMPYM 119 Query 119 SLSDLGLSCFFASGSMSVPSLKSWQANNAYANIFGYIRGDVNYKLIHMLNYGN 171 + + S + ++ KS N FGY R + KL+ L YGN Sbjct 120 TSEAIASYINALSTASALADYKS--------NYFGYNRSKSSVKLLEYLGYGN 164 >gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium] Length=615 Score = 110 bits (276), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 59/166 (36%), Positives = 93/166 (56%), Gaps = 14/166 (8%) Query 7 LKQLQNHPHRSGFDIGAKNVFSAKCGELLPVYWDLGIPGCTYDIDIQYFTRTRPVQTAAY 66 + ++N P R+GFD+ K F+AK GELLPV + +PG +++I+++ FTRT+P+ T+A+ Sbjct 3 MADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTSAF 62 Query 67 TRIREYFDFYAVPIDLIWKSFDASVIQMGETAPVQAKDILTALT-VSGDLPYCSLSDLGL 125 R+REY+DFY VP + +W FD+ + QM + L T +SG +PY + Sbjct 63 ARMREYYDFYFVPFEQMWNKFDSCITQMNANVQHASGPTLDDNTPLSGRMPYFT------ 116 Query 126 SCFFASGSMSVPSLKSWQANNAYANIFGYIRGDVNYKLIHMLNYGN 171 S + + QA A N FG+ R + KL+ L YG+ Sbjct 117 -------SEQIADYLNDQATAARKNPFGFNRSTLTCKLLQYLGYGD 155 >gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4] gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4] Length=580 Score = 105 bits (262), Expect = 7e-23, Method: Compositional matrix adjust. Identities = 64/174 (37%), Positives = 100/174 (57%), Gaps = 12/174 (7%) Query 1 MAHFTGLKQLQNHPHRSGFDIGAKNVFSAKCGELLPVY-WDLGIPGCTYDIDIQYFTRTR 59 MA+ LK L+N R+GFD+ +K F+AK GELLPV W++ +PG + ID++ FTRT+ Sbjct 1 MANIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEV-LPGDKWSIDLKSFTRTQ 59 Query 60 PVQTAAYTRIREYFDFYAVPIDLIWKSFDASVIQMGETAPVQAKDILTAL--TVSGDLPY 117 P+ TAA+ R+REY+DFY VP +L+W + + QM + P A + + ++G +P Sbjct 60 PLNTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYDN-PQHATSYIPSANQALAGVMPN 118 Query 118 CSLSDLGLSCFFASGSMSVPSLKSWQANNAYANIFGYIRGDVNYKLIHMLNYGN 171 + G++ + + V + S++ N FGY R KL+ L YGN Sbjct 119 VTCK--GIADYLNLVAPDVTTTNSYE-----KNYFGYSRSLGTAKLLEYLGYGN 165 >gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185] gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185] Length=573 Score = 104 bits (260), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 62/170 (36%), Positives = 97/170 (57%), Gaps = 9/170 (5%) Query 1 MAHFTGLKQLQNHPHRSGFDIGAKNVFSAKCGELLPVYWDLGIPGCTYDIDIQYFTRTRP 60 M+ L L+N R+GFD+ KN F+AK GELLP+ PG ++I Q FTRT+P Sbjct 1 MSSVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQP 60 Query 61 VQTAAYTRIREYFDFYAVPIDLIWKSFDASVIQMGETAPVQAKDILTALTVSGDLPYCSL 120 V +AAY+R+REY+DFY VP L+W M + P A D+++++ +S P+ + Sbjct 61 VNSAAYSRLREYYDFYFVPYRLLWNMAPTFFTNMPD--PHHAADLVSSVNLSQRHPWFTF 118 Query 121 SDLGLSCFFASGSMSVPSLKSWQANNAYANIFGYIRGDVNYKLIHMLNYG 170 D+ + S+S + + +Q N FG+ R +++ KL++ LNYG Sbjct 119 FDI-MEYLGNLNSLS-GAYEKYQ-----KNFFGFSRVELSVKLLNYLNYG 161 >gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius] gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 17135] Length=613 Score = 92.8 bits (229), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 57/179 (32%), Positives = 91/179 (51%), Gaps = 24/179 (13%) Query 1 MAHFTGLKQLQNHPHRSGFDIGAKNVFSAKCGELLPVYWDLGIPGCTYDIDIQYFTRTRP 60 MA+ +K ++N P R+G+D+ K F+AK G L+PV+W +P + ++ F RT+P Sbjct 8 MANIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQP 67 Query 61 VQTAAYTRIREYFDFYAVPIDLIWKSFDASVIQM-----GETAPVQAKDILTALTVSGDL 115 + TAA+ R+R YFDFY VP +W F ++ QM + PV A ++ +S +L Sbjct 68 LNTAAFARMRGYFDFYFVPFRQMWNKFPTAITQMRTNLLHASGPVLADNV----PLSDEL 123 Query 116 PYCSLSDLGLSCFFASGSMSVPSLKSWQANNAYANIFGYIRGDVNYKLIHMLNYGNIIP 174 PY + + A +S+ K N FGY R + ++ L YG+ P Sbjct 124 PYFTAEQV------ADYIVSLADSK---------NQFGYYRAWLVCIILEYLGYGDFYP 167 >gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium] Length=642 Score = 90.5 bits (223), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 40/97 (41%), Positives = 61/97 (63%), Gaps = 0/97 (0%) Query 2 AHFTGLKQLQNHPHRSGFDIGAKNVFSAKCGELLPVYWDLGIPGCTYDIDIQYFTRTRPV 61 ++ GL L+N P R+ FD+ +N+F+AK GELLP + PG + + YFTRT P+ Sbjct 5 SNIMGLHGLKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPL 64 Query 62 QTAAYTRIREYFDFYAVPIDLIWKSFDASVIQMGETA 98 Q+ A+TR+RE ++ VP +WK FD+ V+ M + A Sbjct 65 QSNAFTRLRENVQYFFVPYSALWKYFDSQVLNMTKNA 101 >gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola] Length=584 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 50/176 (28%), Positives = 85/176 (48%), Gaps = 29/176 (16%) Query 16 RSGFDIGAKNVFSAKCGELLPV-YWDLGIPGCTYDIDIQYFTRTRPVQTAAYTRIREYFD 74 R+GFD+ ++ +FSAK G+LLP+ W++ P + +Q RT + TA+Y R++EY+ Sbjct 10 RNGFDLSSRRIFSAKAGQLLPIGCWEVN-PSEHFKFSVQDLVRTTTLNTASYARMKEYYH 68 Query 75 FYAVPIDLIWKSFDASVIQMGETAPVQAKDILTALTVSGDLPYCSLSDLGLSCFFASGSM 134 F+ V +W+ FD ++ G P A L + +G Y + Sbjct 69 FFFVSYRSLWQWFDQFIV--GTNNPHSA---LNGVKKNGTTNYNQICS------------ 111 Query 135 SVPS------LKSWQANNAYANIFGYIRGDVNYKLIHMLNYGNIIPNNMPALNIGN 184 SVP+ + + ++ + F Y G KL++MLNYG + N +N+ N Sbjct 112 SVPTFDLGKLITRLKTSDMDSQGFNYSEGAA--KLLNMLNYG--VTNKGKFMNLEN 163 >gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens] gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens CC14M] Length=656 Score = 64.3 bits (155), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 29/78 (37%), Positives = 47/78 (60%), Gaps = 2/78 (3%) Query 13 HPH--RSGFDIGAKNVFSAKCGELLPVYWDLGIPGCTYDIDIQYFTRTRPVQTAAYTRIR 70 HP+ R+G+D+ ++ +FSA G LLP+ PG + I +Q R +P+ TAA+ R + Sbjct 10 HPNLNRNGYDLSSRRIFSAPAGALLPIATWEANPGEKFRISVQDLVRAQPLNTAAFARCK 69 Query 71 EYFDFYAVPIDLIWKSFD 88 EY+ F+ VP +W+ D Sbjct 70 EYYHFFFVPYKSLWQHSD 87 >gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis] gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM 17361] Length=553 Score = 60.8 bits (146), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 49/178 (28%), Positives = 83/178 (47%), Gaps = 13/178 (7%) Query 7 LKQLQNHPHRSGFDIGAKNVFSAKCGELLPVYWDLGIPGCTYDIDIQYFTRTRPVQTAAY 66 +K + + +R+ FD+ +++F+A G LLPV IP +I+ Q F RT P+ TAA+ Sbjct 8 IKATRPNRNRNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAF 67 Query 67 TRIREYFDFYAVPIDLIWKSFDASVIQMGETAPVQAKDILTALTVSGDLPYCSLSDL--G 124 +R ++F+ VP +W FD + M + K I T +PY ++ + Sbjct 68 ASMRGVYEFFFVPYHQLWAQFDQFITGMNDFHSSANKSIQGG-TSPLQVPYFNVDSVFNS 126 Query 125 LSCFFASGSMSVPSLKSWQANNAYA--NIFGYIRGDVNYKLIHMLNYGNIIPNNMPAL 180 L+ SGS S L+ A+ ++ GY R ++G P+N+ L Sbjct 127 LNTGKESGSGSTDDLQYKFKYGAFRLLDLLGYGR--------KFDSFGTAYPDNVSGL 176 >gi|575094339|emb|CDL65730.1| unnamed protein product [uncultured bacterium] Length=588 Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 24/72 (33%), Positives = 46/72 (64%), Gaps = 0/72 (0%) Query 16 RSGFDIGAKNVFSAKCGELLPVYWDLGIPGCTYDIDIQYFTRTRPVQTAAYTRIREYFDF 75 ++GFD+ ++ F++ G+LLPV++D PG I FTRT+P+++ A R+ E+ ++ Sbjct 16 KNGFDMSQRHPFTSSVGQLLPVFYDYLNPGDKIRISANLFTRTQPMKSTAMARLTEHIEY 75 Query 76 YAVPIDLIWKSF 87 + VP + ++ F Sbjct 76 FFVPFEQMFSLF 87 Lambda K H a alpha 0.321 0.137 0.436 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 692426810790