BLASTP 2.2.24 [Aug-08-2010] Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= Eten_7813_orf1 (114 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 14,777,732 sequences; 5,058,227,080 total letters
Score E Sequences producing significant alignments: (bits) Value gi|225423682|ref|XP_002277005.1| PREDICTED: hypothetical protein... 37 0.93 gi|225423684|ref|XP_002277047.1| PREDICTED: hypothetical protein... 37 1.0 gi|154287242|ref|XP_001544416.1| conserved hypothetical protein ... 35 2.5 gi|122114583|ref|NP_001073657.1| G protein-coupled receptor kina... 35 3.2 gi|331216574|ref|XP_003320966.1| hypothetical protein PGTG_02008... 35 5.1 gi|327271996|ref|XP_003220772.1| PREDICTED: histone H2A deubiqui... 34 5.6 >gi|225423682|ref|XP_002277005.1| PREDICTED: hypothetical protein isoform 1 [Vitis vinifera] gi|297737977|emb|CBI27178.3| unnamed protein product [Vitis vinifera] Length = 489 Score = 37.0 bits (84), Expect = 0.93, Method: Composition-based stats. Identities = 23/85 (27%), Positives = 34/85 (40%), Gaps = 7/85 (8%) Query: 3 VTFFSLQLFGGFECSNCGERGNWKLNCKLESELQIKSSNFPFLLRKAQNLLLKHQTVHAC 62 + F L FEC +CGER N + ELQ + ++ +L Q V + Sbjct: 50 IPHFRKILLSAFECPHCGERNN---EVQFAGELQPRGCSYSLHFSAGDQKMLNRQVVKSE 106 Query: 63 VALSPLPHR----PADRHCGVLAGV 83 A +P P + CG L+ V Sbjct: 107 SATIKIPELDFEIPPEAQCGSLSTV 131 >gi|225423684|ref|XP_002277047.1| PREDICTED: hypothetical protein isoform 2 [Vitis vinifera] Length = 494 Score = 37.0 bits (84), Expect = 1.0, Method: Composition-based stats. Identities = 23/85 (27%), Positives = 34/85 (40%), Gaps = 7/85 (8%) Query: 3 VTFFSLQLFGGFECSNCGERGNWKLNCKLESELQIKSSNFPFLLRKAQNLLLKHQTVHAC 62 + F L FEC +CGER N + ELQ + ++ +L Q V + Sbjct: 50 IPHFRKILLSAFECPHCGERNN---EVQFAGELQPRGCSYSLHFSAGDQKMLNRQVVKSE 106 Query: 63 VALSPLPHR----PADRHCGVLAGV 83 A +P P + CG L+ V Sbjct: 107 SATIKIPELDFEIPPEAQCGSLSTV 131 >gi|154287242|ref|XP_001544416.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1] gi|150408057|gb|EDN03598.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1] Length = 527 Score = 35.4 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 23/72 (31%), Positives = 34/72 (47%), Gaps = 7/72 (9%) Query: 32 ESELQIKSSNFPFLLRKAQNLLLKHQTVHACVALSPLPHRPADRHCG-VLAGVPPGLVNH 90 E E Q++ +L K Q + ++ C+ PLPH+P V G+PP L++H Sbjct: 365 EEEYQVEE-----ILEKRQRKNKPPEYLNICLLTPPLPHQPTHSPAHPVTKGIPPHLLSH 419 Query: 91 -SIFRGFEKSCS 101 S RG CS Sbjct: 420 SSTVRGTRSPCS 431 >gi|122114583|ref|NP_001073657.1| G protein-coupled receptor kinase interacting ArfGAP 2 [Danio rerio] gi|120538643|gb|AAI29228.1| Zgc:158416 [Danio rerio] Length = 736 Score = 35.4 bits (80), Expect = 3.2, Method: Composition-based stats. Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 2/56 (3%) Query: 26 KLNCKLESELQIKSSNFPFLLRKAQNLLLKHQTVHACVALSPLPHRPADRHCGVLA 81 K+NC L EL++ S L + NL K QT ++ V +PHR A R C ++ Sbjct: 450 KVNCHLSEELRLMQSKLNSLQNENSNL--KWQTPNSLVRPQDVPHRAAPRGCRAMS 503 >gi|331216574|ref|XP_003320966.1| hypothetical protein PGTG_02008 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309299956|gb|EFP76547.1| hypothetical protein PGTG_02008 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 437 Score = 34.7 bits (78), Expect = 5.1, Method: Composition-based stats. Identities = 22/67 (32%), Positives = 28/67 (41%), Gaps = 21/67 (31%) Query: 10 LFGGFECSNCGERGNWKLNCKLESELQIKSSNFPFLLRKAQNLLLKHQTVHACVALSPLP 69 +F G CS CG GNW+L+C LR+ NL H ++PLP Sbjct: 376 IFTGKPCSYCGGLGNWRLSCP--------------RLRRDANLPPPH-------LVAPLP 414 Query: 70 HRPADRH 76 P RH Sbjct: 415 VSPVKRH 421 >gi|327271996|ref|XP_003220772.1| PREDICTED: histone H2A deubiquitinase MYSM1-like [Anolis carolinensis] Length = 820 Score = 34.3 bits (77), Expect = 5.6, Method: Composition-based stats. Identities = 14/55 (25%), Positives = 31/55 (56%) Query: 23 GNWKLNCKLESELQIKSSNFPFLLRKAQNLLLKHQTVHACVALSPLPHRPADRHC 77 G+++L K E + ++ + +L K + ++ K++ H+CV + + HR +D C Sbjct: 709 GSYRLPYKFEVQEMLEEPQWELVLEKTRWIIEKYRLSHSCVPMDKIFHRDSDLTC 763 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jul 22, 2011 4:42 PM Number of letters in database: 5,058,227,080 Number of sequences in database: 14,777,732 Lambda K H 0.325 0.140 0.457 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 14777732 Number of Hits to DB: 1,053,476,125 Number of extensions: 37296739 Number of successful extensions: 115632 Number of sequences better than 10.0: 8 Number of HSP's gapped: 116146 Number of HSP's successfully gapped: 8 Length of query: 114 Length of database: 5,058,227,080 Length adjustment: 81 Effective length of query: 33 Effective length of database: 3,861,230,788 Effective search space: 127420616004 Effective search space used: 127420616004 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 76 (33.9 bits)