bitscore colors: <40, 40-50 , 50-80, 80-200, >200
BLASTP 2.2.30+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 49,011,213 sequences; 17,563,301,199 total letters Query= Contig-13_CDS_annotation_glimmer3.pl_2_1 Length=378 Score E Sequences producing significant alignments: (Bits) Value gi|547226428|ref|WP_021963491.1| putative uncharacterized protein 59.3 2e-06 gi|494822881|ref|WP_007558289.1| hypothetical protein 58.5 3e-06 gi|639237431|ref|WP_024568108.1| hypothetical protein 53.9 6e-05 gi|575094344|emb|CDL65728.1| unnamed protein product 53.9 8e-05 gi|494610273|ref|WP_007368519.1| hypothetical protein 52.4 3e-04 gi|490418711|ref|WP_004291034.1| hypothetical protein 51.6 5e-04 gi|647452992|ref|WP_025792810.1| hypothetical protein 50.1 0.002 gi|575094372|emb|CDL65753.1| unnamed protein product 46.2 0.028 gi|575094301|emb|CDL65691.1| unnamed protein product 45.4 0.050 gi|547920047|ref|WP_022322418.1| putative uncharacterized protein 42.4 0.27 >gi|547226428|ref|WP_021963491.1| putative uncharacterized protein [Prevotella sp. CAG:1185] gi|524103380|emb|CCY83991.1| putative uncharacterized protein [Prevotella sp. CAG:1185] Length=416 Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 82/321 (26%), Positives = 128/321 (40%), Gaps = 64/321 (20%) Query 1 MSARSQRKANEMNFKINQMNNEFNAKEAEKARAFQLDMWNKE------------------ 42 ++SQ N+ N +I QMNNE+N + K + DM+N++ Sbjct 21 FGSKSQSDTNKTNLQIAQMNNEYNERMFNKQLEYNQDMFNQQVEYDQKKMEQQNNFNARM 80 Query 43 --------NAYNTPSAQRERMEQAGYNAYM-------------Npadagsasgmssttsa 81 YN+ AQR R+E AG N Y+ + + S + Sbjct 81 QNEAIGAQQVYNSAKAQRARLEAAGLNPYLMMSGGNAGAVSAVSGSSGSGGSPSPMGVNP 140 Query 82 sasspaVMQA--TDFSSLGEV-----------GVRLAQELKLFSEKKGLDIRNFSLKDYL 128 +S AVMQA DFS + + GVR AQ L + G I N + L Sbjct 141 PTASSAVMQAFRPDFSGVTGIIQTLLDIQAQKGVRDAQAFSLGEQASGFKIENKYKAEKL 200 Query 129 KAQIDKMRGETNWRNLSPEAI------RFNIMSGLEAAKIQMEGLKEQWINQKWSNNLLR 182 I + + N +N S E++ R M + +K Q E N +++ L+R Sbjct 201 LWDIYNSKADYNLKN-SQESLNNMSFARLQAMFSSDVSKAQREAE-----NAQFTGELIR 254 Query 183 ANVANSLLDAEGKTVINKYLDQQQQADLNVKAAHYEELLLRGQLHSREARESLSRELLNY 242 A A L KY DQ+ +L + +A L+ G+ +AR+++ L Sbjct 255 AQTACQQLQGLLGAKELKYYDQKVLQELAIMSAQQYSLVAAGKASEAQARQAIENALNLV 314 Query 243 TRANGQKISNKVAERTANNLI 263 + G K+ N V ++TAN LI Sbjct 315 EQREGIKVDNYVKQKTANALI 335 >gi|494822881|ref|WP_007558289.1| hypothetical protein [Bacteroides plebeius] gi|198272097|gb|EDY96366.1| hypothetical protein BACPLE_00802 [Bacteroides plebeius DSM 17135] Length=344 Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 27/56 (48%), Positives = 40/56 (71%), Gaps = 0/56 (0%) Query 9 ANEMNFKINQMNNEFNAKEAEKARAFQLDMWNKENAYNTPSAQRERMEQAGYNAYM 64 N+ N +I QM+NE+N ++ E+ + DMWN EN YN+ S+QR+R+E+AG N YM Sbjct 43 TNQANIQIAQMSNEYNREQLERQIEQEWDMWNAENEYNSASSQRKRLEEAGLNPYM 98 >gi|639237431|ref|WP_024568108.1| hypothetical protein [Elizabethkingia anophelis] Length=287 Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 25/46 (54%), Positives = 32/46 (70%), Gaps = 0/46 (0%) Query 19 MNNEFNAKEAEKARAFQLDMWNKENAYNTPSAQRERMEQAGYNAYM 64 MNN N K A + RAF LDMWN+ N YNTP AQ +R+++AG N + Sbjct 18 MNNSSNKKIARENRAFALDMWNRNNEYNTPLAQMQRLKEAGLNPNL 63 >gi|575094344|emb|CDL65728.1| unnamed protein product [uncultured bacterium] Length=368 Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 25/56 (45%), Positives = 39/56 (70%), Gaps = 2/56 (4%) Query 8 KANEMNFKINQMNNEFNAKEAEKARAFQLDMWNKENAYNTPSAQRERMEQAGYNAY 63 ++++ F + QM E+ ++EA+K R FQLDMWN+ N YN P Q +R+E+AG N + Sbjct 39 QSSDQRFALGQM--EWQSQEAQKQRDFQLDMWNRNNEYNKPDEQMKRLEEAGINPW 92 >gi|494610273|ref|WP_007368519.1| hypothetical protein [Prevotella multiformis] gi|324988545|gb|EGC20508.1| hypothetical protein HMPREF9141_0987 [Prevotella multiformis DSM 16608] Length=437 Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 70/260 (27%), Positives = 121/260 (47%), Gaps = 31/260 (12%) Query 5 SQRKANEMNFKINQMNNEFNAKEAEKARAFQLDMWNKENAYNTPSAQRERMEQAGYNAYM 64 +Q +ANE N +I + N+ + ++ AF MWN+ N YN+P+AQ +R AG N Y+ Sbjct 92 AQDRANETNLQIAREANQNQYQMFQEQNAFNERMWNQMNQYNSPAAQMQRYTDAGINPYI 151 Query 65 ---NpadagsasgmssttsasasspaVMQAT--------DFSSLGEVGVRLAQELKLFSE 113 N + S + S + VM AT F+ +G V + AQ ++ Sbjct 152 AAGNVQTGNAQSALQSAPAPQQHVAQVMPATGMGDAVQNSFAQIGNVISQFAQNQLALAQ 211 Query 114 KKGLDIRNFSLKDYLKAQIDKMRGET-NWRNLSPEAIRFNIMSGLEAAKIQMEGLKEQWI 172 K D + AQ+ K+ ET N N N + GL+ +I+ + L + Sbjct 212 AKKTDAEASWIDRLNSAQMGKLGAETLNIHNQ-------NSLLGLD-YQIKSDTLGNYKL 263 Query 173 NQKWSNNLLRANVANSLLDAEGKTVINKYLDQQQQADLNVKAAHYEELLLRGQLHSREAR 232 S + +A + N+L+DA+ + + + + ++KA + E+ +L +E Sbjct 264 LSDLS--VQQAALTNNLVDAQTRKAL--FESDLAMVESHIKAKYGEKQVL------QEIS 313 Query 233 ESLSRELLNYTRA-NGQKIS 251 ES+SR+ LNY A N +K++ Sbjct 314 ESVSRQYLNYVSAHNSEKLT 333 >gi|490418711|ref|WP_004291034.1| hypothetical protein [Bacteroides eggerthii] gi|217986638|gb|EEC52972.1| hypothetical protein BACEGG_02723 [Bacteroides eggerthii DSM 20697] Length=368 Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 32/88 (36%), Positives = 40/88 (45%), Gaps = 33/88 (38%) Query 10 NEMNFKINQMNNEFNAKEAEKA---------------------------------RAFQL 36 N+ N +I QMNN FN K +K +A+Q Sbjct 31 NQANKEIAQMNNAFNEKMFDKQIAYNKEMYQTQLGDQWKFYDDQKANAWKLYEDNKAYQT 90 Query 37 DMWNKENAYNTPSAQRERMEQAGYNAYM 64 +MWNK+N YN PSAQR R+E AG N YM Sbjct 91 EMWNKQNEYNDPSAQRARLEAAGLNPYM 118 >gi|647452992|ref|WP_025792810.1| hypothetical protein [Prevotella histicola] Length=424 Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust. Identities = 25/60 (42%), Positives = 39/60 (65%), Gaps = 0/60 (0%) Query 5 SQRKANEMNFKINQMNNEFNAKEAEKARAFQLDMWNKENAYNTPSAQRERMEQAGYNAYM 64 S RK N+ N +I + N+ N + +++ AF M+++ NAYNTPSAQ +R +AG N Y+ Sbjct 25 SNRKTNQTNLQIARETNQMNYQLFQESNAFNEKMYHEANAYNTPSAQMQRYAEAGINPYI 84 >gi|575094372|emb|CDL65753.1| unnamed protein product [uncultured bacterium] Length=385 Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust. Identities = 24/54 (44%), Positives = 34/54 (63%), Gaps = 0/54 (0%) Query 8 KANEMNFKINQMNNEFNAKEAEKARAFQLDMWNKENAYNTPSAQRERMEQAGYN 61 KA + N + + NEFNA EAEK RAFQ ++ + ++N+PS Q + M AG N Sbjct 58 KARQFNSQQTALQNEFNASEAEKNRAFQKSLYERSLSWNSPSNQLKMMADAGLN 111 >gi|575094301|emb|CDL65691.1| unnamed protein product [uncultured bacterium] Length=437 Score = 45.4 bits (106), Expect = 0.050, Method: Compositional matrix adjust. Identities = 21/43 (49%), Positives = 26/43 (60%), Gaps = 0/43 (0%) Query 23 FNAKEAEKARAFQLDMWNKENAYNTPSAQRERMEQAGYNAYMN 65 F +E R F MW N YNTP AQ++R+EQAG N Y+N Sbjct 54 FTHQENALQRDFARQMWKDTNDYNTPIAQKQRLEQAGMNPYVN 96 >gi|547920047|ref|WP_022322418.1| putative uncharacterized protein [Parabacteroides merdae CAG:48] gi|524592959|emb|CDD13571.1| putative uncharacterized protein [Parabacteroides merdae CAG:48] Length=259 Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust. Identities = 20/42 (48%), Positives = 27/42 (64%), Gaps = 0/42 (0%) Query 21 NEFNAKEAEKARAFQLDMWNKENAYNTPSAQRERMEQAGYNA 62 N + E EKA A ++MWN +N YN+P+AQ R+ QAG N Sbjct 8 NNWQTAENEKAYARSVEMWNMQNQYNSPTAQMSRLRQAGLNP 49 Lambda K H a alpha 0.312 0.126 0.355 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 2349684375798