bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-29_CDS_annotation_glimmer3.pl_2_1
Length=295
Score E
Sequences producing significant alignments: (Bits) Value
gi|575094486|emb|CDL65860.1| unnamed protein product 196 2e-56
gi|575096057|emb|CDL66940.1| unnamed protein product 122 6e-29
gi|575094545|emb|CDL65905.1| unnamed protein product 112 2e-25
gi|575094568|emb|CDL65929.1| unnamed protein product 97.4 4e-20
gi|575094495|emb|CDL65861.1| unnamed protein product 67.4 9e-10
gi|393707865|ref|YP_004732987.1| structural protein VP2 63.9 8e-09
gi|547839281|ref|WP_022246923.1| putative minor capsid protein 54.7 1e-05
gi|470147451|ref|XP_004309317.1| PREDICTED: minor spike protein ... 52.4 3e-05
gi|575094416|emb|CDL65791.1| unnamed protein product 52.0 2e-04
gi|568290031|gb|ETN78178.1| hypothetical protein NECAME_18237 48.5 5e-04
>gi|575094486|emb|CDL65860.1| unnamed protein product [uncultured bacterium]
Length=344
Score = 196 bits (498), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 130/252 (52%), Positives = 179/252 (71%), Gaps = 12/252 (5%)
Query 8 LQGIAGQNTSSSAKQAEELRTWQEQQAELARKYNSQEAQKNRDWQERMSSTAHQREVRDL 67
LQ I N + SA QA++ +Q Q L R++N EA+ +R WQERMS+TAHQRE++DL
Sbjct 68 LQSITASNNAWSAAQAQKQMDFQASQGALVRQFNHDEAELSRLWQERMSNTAHQREIKDL 127
Query 68 IAAGLNPVLsvtggsgaavtsgatass-sapsgamgsvDNSATGAVAglfgsllssflsl 126
AAGLNPVLS GGSGA VTSG+TAS S PSG+ G D S GA+ L GS + + S+
Sbjct 128 QAAGLNPVLSAMGGSGAPVTSGSTASGYSPPSGSKGDTDTSLAGALVSLLGSSMMAQASM 187
Query 127 EGTRVSAQSNQAIADKYTAMSKYTSELQAQTQLTSTNIQAMAQKYTADAHLAGTKYAADQ 186
T +SA++ +++ADKYTAMSK +E+Q +T L+++ I AMA ++YAAD+
Sbjct 188 ANTAMSARTQESVADKYTAMSKLVAEIQQETTLSASTISAMA-----------SRYAADR 236
Query 187 SAAAQKVSASIHAAAQKYGYNVQSMTQRDIAAFNAQVNKDLQKAGFKQEFDIKKAFPNNA 246
SA A KV+ASIHAAAQ+YGY+VQ+MTQRDIA+FNAQVNKDL + G++ +FDIK+A+P++
Sbjct 237 SADASKVAASIHAAAQRYGYDVQAMTQRDIASFNAQVNKDLAQMGYQHDFDIKEAYPSSM 296
Query 247 WNVFGGLGTQAV 258
+ L +++
Sbjct 297 AGLMASLFGESI 308
>gi|575096057|emb|CDL66940.1| unnamed protein product [uncultured bacterium]
Length=275
Score = 122 bits (305), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 89/168 (53%), Positives = 118/168 (70%), Gaps = 0/168 (0%)
Query 8 LQGIAGQNTSSSAKQAEELRTWQEQQAELARKYNSQEAQKNRDWQERMSSTAHQREVRDL 67
++GIA N++ +A+QAE R WQE Q A ++NS EA KNR WQE MS+TAHQREV+DL
Sbjct 28 MKGIAQANSAWNAEQAEIQRDWQEAQNAKAMQFNSMEAAKNRKWQEMMSNTAHQREVKDL 87
Query 68 IAAGLNPVLsvtggsgaavtsgatasssapsgamgsvDNSATGAVAglfgsllssflslE 127
+AAGLNPVLS G+GAAV SGATAS +GA G D S +GA+A L GS+LS+ +++
Sbjct 88 MAAGLNPVLSAMNGNGAAVGSGATASGVTSAGAKGEADTSTSGAIANLLGSILSASTAIQ 147
Query 128 GTRVSAQSNQAIADKYTAMSKYTSELQAQTQLTSTNIQAMAQKYTADA 175
V+A++ +A+ADKYTAMS+ +E+ L S I A A +Y ADA
Sbjct 148 AANVNARTQEAVADKYTAMSQIVAEINKAATLGSAGIHAGATRYAADA 195
>gi|575094545|emb|CDL65905.1| unnamed protein product [uncultured bacterium]
Length=325
Score = 112 bits (281), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/246 (44%), Positives = 151/246 (61%), Gaps = 11/246 (4%)
Query 4 WAGALQGIAGQNTSSSAKQAEELRTWQEQQAELARKYNSQEAQKNRDWQERMSSTAHQRE 63
+ G + +A N++ +A QA R WQ+QQ +A +++S EA KNRDWQ MS+TAHQRE
Sbjct 23 YLGQITRMASDNSAFNASQAAANRNWQQQQNNIAMQFSSAEAAKNRDWQSYMSNTAHQRE 82
Query 64 VRDLIAAGLNPVLsvtggsgaavtsgatasssapsgamgsvDNSATGAVAglfgsllssf 123
V DL AAGLNPVLS GG+GAAVTSGATA SG S D SAT A+ GL GSLL++
Sbjct 83 VADLKAAGLNPVLSAMGGNGAAVTSGATAQGYTSSGGQASADTSATAALVGLLGSLLNAQ 142
Query 124 lslEGTRVSAQSNQAIADKYTAMSKYTSELQAQTQLTSTNIQAM-----------AQKYT 172
S+ T +A +N ++ADKYT+ ++Y +++ S N+ A A KY
Sbjct 143 TSIANTATNAVANLSVADKYTSATRYAADVGYAGTSYSANVAAYASRFASNNALAASKYA 202
Query 173 ADAHLAGTKYAADQSAAAQKVSASIHAAAQKYGYNVQSMTQRDIAAFNAQVNKDLQKAGF 232
+D A +KYA+DQS A K ++ + + KY + ++ T RD+A FNA VN+DLQK
Sbjct 203 SDNSRAASKYASDQSYLASKFASILQSNTAKYNIDTRTATDRDLAEFNAAVNRDLQKNEI 262
Query 233 KQEFDI 238
+F +
Sbjct 263 DAKFSL 268
>gi|575094568|emb|CDL65929.1| unnamed protein product [uncultured bacterium]
Length=310
Score = 97.4 bits (241), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 42/67 (63%), Positives = 57/67 (85%), Gaps = 0/67 (0%)
Query 10 GIAGQNTSSSAKQAEELRTWQEQQAELARKYNSQEAQKNRDWQERMSSTAHQREVRDLIA 69
G++ +NT+ S ++AE LRTWQE+Q +A ++N+ EA+KNR+WQE MS+TAHQREV DL+A
Sbjct 48 GLSEKNTARSVQEAESLRTWQEEQNRIAMQFNAAEAEKNRNWQEIMSNTAHQREVNDLMA 107
Query 70 AGLNPVL 76
AGLNPVL
Sbjct 108 AGLNPVL 114
>gi|575094495|emb|CDL65861.1| unnamed protein product [uncultured bacterium]
Length=266
Score = 67.4 bits (163), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 62/151 (41%), Positives = 82/151 (54%), Gaps = 18/151 (12%)
Query 40 YNSQEAQKNRDWQERMSSTAHQREVRDLIAAGLNPVLsvtggsgaavtsgatasssapsg 99
YN+Q A++ +QERMSSTAHQREV+DLIAAGLNPVL S + +SAPSG
Sbjct 66 YNTQSAREQMAFQERMSSTAHQREVKDLIAAGLNPVL-----------SAGGSGASAPSG 114
Query 100 amgsvDNSATGAVAglfgsllssflslEGTRVSAQSNQAIADKYTAMSKYTSELQAQTQL 159
AM + D+S A A L+ + N+A D AM+KY+ ++ AQT L
Sbjct 115 AMATADSSMMSAKANAALQKRIVNAQLKNAK---DINKAQLDAQKAMNKYSVDVGAQTSL 171
Query 160 TSTNIQAMAQKY----TADAHLAGTKYAADQ 186
+ I A A K+ A A + G+ AA Q
Sbjct 172 ANAQISASASKFGAMQAAAASMYGSNLAAKQ 202
>gi|393707865|ref|YP_004732987.1| structural protein VP2 [Microviridae phi-CA82]
gi|311336637|gb|ADP89808.1| structural protein VP2 [Microviridae phi-CA82]
Length=234
Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 29/55 (53%), Positives = 42/55 (76%), Gaps = 0/55 (0%)
Query 22 QAEELRTWQEQQAELARKYNSQEAQKNRDWQERMSSTAHQREVRDLIAAGLNPVL 76
QA++ W +Q E + ++N+QEAQKNRDWQE+MS+TA QR+++D AGLNP+
Sbjct 11 QADKQNKWNAEQTEKSNQFNAQEAQKNRDWQEQMSNTALQRKMQDAEKAGLNPIF 65
>gi|547839281|ref|WP_022246923.1| putative minor capsid protein [Clostridium sp. CAG:306]
gi|524476581|emb|CDC18646.1| putative minor capsid protein [Clostridium sp. CAG:306]
Length=236
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/27 (85%), Positives = 26/27 (96%), Gaps = 0/27 (0%)
Query 50 DWQERMSSTAHQREVRDLIAAGLNPVL 76
D+QERMSSTAHQREV+DL AAGLNP+L
Sbjct 40 DFQERMSSTAHQREVKDLRAAGLNPIL 66
>gi|470147451|ref|XP_004309317.1| PREDICTED: minor spike protein H-like, partial [Fragaria vesca
subsp. vesca]
Length=139
Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/33 (64%), Positives = 31/33 (94%), Gaps = 0/33 (0%)
Query 44 EAQKNRDWQERMSSTAHQREVRDLIAAGLNPVL 76
EAQ+NR++QER+S++A+QR+V DL +AGLNP+L
Sbjct 25 EAQRNREFQERLSNSAYQRQVADLSSAGLNPML 57
>gi|575094416|emb|CDL65791.1| unnamed protein product [uncultured bacterium]
Length=311
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 29/40 (73%), Gaps = 0/40 (0%)
Query 37 ARKYNSQEAQKNRDWQERMSSTAHQREVRDLIAAGLNPVL 76
A++YN +EAQ R W + M TA+Q V+DL AAGLNP+L
Sbjct 144 AQRYNREEAQAERSWAQSMRQTAYQDTVKDLKAAGLNPIL 183
>gi|568290031|gb|ETN78178.1| hypothetical protein NECAME_18237 [Necator americanus]
Length=112
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/39 (59%), Positives = 28/39 (72%), Gaps = 0/39 (0%)
Query 37 ARKYNSQEAQKNRDWQERMSSTAHQREVRDLIAAGLNPV 75
A K N + ++ WQERMS+TAHQRE DL AAGLNP+
Sbjct 30 ANKANRKMMREQMAWQERMSNTAHQREQADLKAAGLNPI 68
Lambda K H a alpha
0.310 0.120 0.334 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 1550741951049