bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.30+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
49,011,213 sequences; 17,563,301,199 total letters
Query= Contig-32_CDS_annotation_glimmer3.pl_2_1
Length=411
Score E
Sequences producing significant alignments: (Bits) Value
gi|648626869|ref|WP_026318620.1| hypothetical protein 234 6e-71
gi|547312923|ref|WP_022044635.1| putative uncharacterized protein 63.5 8e-08
gi|517172762|ref|WP_018361580.1| hypothetical protein 60.8 7e-07
gi|547920049|ref|WP_022322420.1| capsid protein VP1 56.6 2e-05
gi|492501782|ref|WP_005867318.1| hypothetical protein 55.8 4e-05
gi|649557305|gb|KDS63784.1| capsid family protein 50.1 0.001
gi|649569140|gb|KDS75238.1| capsid family protein 50.8 0.001
gi|649555287|gb|KDS61824.1| capsid family protein 50.8 0.001
gi|494308783|ref|WP_007173938.1| hypothetical protein 50.1 0.002
gi|609718276|emb|CDN73650.1| conserved hypothetical protein 48.5 0.006
>gi|648626869|ref|WP_026318620.1| hypothetical protein [Alistipes onderdonkii]
Length=231
Score = 234 bits (597), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 120/231 (52%), Positives = 149/231 (65%), Gaps = 9/231 (4%)
Query 190 LSGGTRFRRRNYHFNDDGYFMEITSIVPRVYYPSYINPTSRQISLGQQYAPALDNIAMQG 249
+SGG F+RR +HFN+ GYFMEITS+VP V YP+Y+NPT Q +LGQ+YAPALDNI MQ
Sbjct 1 MSGGDSFKRRTFHFNESGYFMEITSVVPTVMYPNYLNPTLLQTNLGQRYAPALDNIQMQP 60
Query 250 LKASTVFGEVQ-NLGANTVTYANS--------TLSIPGFKLQESNYVGYEPAWSELMTAV 300
L T+ G N G+ + ++ + T+++ E VGY+PAW+ELMT V
Sbjct 61 LTVPTLLGNAYFNTGSGSYSHVLNHMGTGELRTVAVDKLSAAEGIAVGYQPAWAELMTGV 120
Query 301 SKPHGRLCNDLDYWVLSRDYGRNLASVMDTPAYSDFIKAAGTYVDELSLQRLTAFLKRIY 360
SKPHGRLCNDLDYW R YG L S D S F++ G VD L ++ A+LK Y
Sbjct 121 SKPHGRLCNDLDYWAFQRRYGTVLYSSNDAQDASVFLEELGNEVDTLDVETFNAWLKNTY 180
Query 361 VSPSSCPYILCGDFNYVFYDQRPTAENFVLDNVADIVVFREKSKVNVATTL 411
VS PYIL +NYVF D P A+NFVLDN A+I V+REKSKVNV TL
Sbjct 181 VSTDFVPYILPAMYNYVFADTDPNAQNFVLDNSAEISVYREKSKVNVPNTL 231
>gi|547312923|ref|WP_022044635.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
gi|524208404|emb|CCZ76639.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
Length=338
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 82/326 (25%), Positives = 123/326 (38%), Gaps = 70/326 (21%)
Query 101 AVDISVS-GQSVSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDNT-CPAFL 158
A+D+++S G SV++ + +++Q +MD F GGR D + + + K S P FL
Sbjct 41 ALDLNISTGFSVAVPELRLRTKIQNWMDRLFVSGGRVGDVFRTLWGTKSSAIYVNKPDFL 100
Query 159 GSDSFDMNVNTLY-----QTTGFEDNSSPLGAFSGQLSGGTRFRRRNYHFNDDGYFMEIT 213
G +N + + +G + N L A + + +Y+ + G FM IT
Sbjct 101 GVWQASINPSNVRAMANGSASGEDANLGQLAACVDRYCDFSGHSGIDYYAKEPGTFMLIT 160
Query 214 SIVPRVYYPSYINPTSRQISLGQQYAPALDNIAMQ-----------------GL--KAST 254
+VP Y ++P IS G + P L+ I Q GL +AS
Sbjct 161 MLVPEPAYSQGLHPDLASISFGDDFNPELNGIGFQLVPRHRFSMMPRGFNFTGLDQEASP 220
Query 255 VFGEVQNLGANTVTYANSTLSIPGFKLQESNYVGYEPAWSELMTAVSKPHGRLC--NDLD 312
FG G + N VG E AWS L T S+ HG +
Sbjct 221 WFGHT---GTGVLVDPNMV------------SVGEEVAWSWLRTDYSRLHGDFAQNGNYQ 265
Query 313 YWVLSRDYGRNLASVMDTPAYSDFIKAAGTYVDELSLQRLTAFLKRIYVSPSSCPYILCG 372
YWVL+R + D + + GTY++ L
Sbjct 266 YWVLTRRFTTYFPD--DGTGFYQDGEYTGTYINPL------------------------- 298
Query 373 DFNYVFYDQRPTAENFVLDNVADIVV 398
D+ YVF DQ A NF D+ V
Sbjct 299 DWQYVFVDQTLMAGNFAYYGTFDLNV 324
>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 64/261 (25%), Positives = 112/261 (43%), Gaps = 40/261 (15%)
Query 111 VSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQ--DNTCPAFLGSDSFDMNVN 168
+S+ +I A +++ + G + E+ F + + + D C G DS ++ V
Sbjct 304 ISVADIRNAFALEKLASVTMRAGKTYKEQMEAHFGISVEEGRDGRCTYIGGFDS-NIQVG 362
Query 169 TLYQT-----TGFEDNS------SPLGAFSGQLSGGTRFRRRNYHFNDDGYFMEITSIVP 217
+ Q+ TG +D S G +G SG RF + + G M I S+VP
Sbjct 363 DVTQSSGTTVTGTKDTSFGGYLGRTTGKATGSGSGHIRFDAKEH-----GILMCIYSLVP 417
Query 218 RVYYPSY-INPTSRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSI 276
V Y S ++P ++I G + P +N+ MQ L A + + N AN+
Sbjct 418 DVQYDSKRVDPFVQKIERGDFFVPEFENLGMQPLFAKNISYKYNNNTANS---------- 467
Query 277 PGFKLQESNYVGYEPAWSELMTAVSKPHGRLCND--LDYWVLSRDYGR-----NLASVMD 329
+++ G++P +SE TA+ HG+ + L YW ++R G N+++
Sbjct 468 ---RIKNLGAFGWQPRYSEYKTALDINHGQFVHQEPLSYWTVARARGESMSNFNISTFKI 524
Query 330 TPAYSDFIKAAGTYVDELSLQ 350
P + D + A EL+ Q
Sbjct 525 NPKWLDDVFAVNYNGTELTDQ 545
>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/222 (23%), Positives = 94/222 (42%), Gaps = 23/222 (10%)
Query 99 DAAVDISVSGQSVSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDN-TCPAF 157
+ + ++V +++ ++ ++ +QR+ + GG R + S F V+ S P F
Sbjct 299 NGTLKVNVDEMGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQF 358
Query 158 LGSDSFDMNVNTLYQTTGFEDNSSPLGAFSGQ-LSGGTRFRRRNYHFNDDGYFMEITSIV 216
LG ++V+ + QT+ D +SP +G +S G ++Y F + GY + I SI
Sbjct 359 LGGGRMPISVSEVLQTSS-TDETSPQANMAGHGISAGINNGFKHY-FEEHGYIIGIMSIT 416
Query 217 PRVYYPSYINPTSRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSI 276
PR Y + + Y P +++ Q +K +F + Y N T
Sbjct 417 PRSGYQQGVPRDFTKFDNMDFYFPEFAHLSEQEIKNQELFV------SEDAAYNNGTF-- 468
Query 277 PGFKLQESNYVGYEPAWSELMTAVSKPHGRLCNDLDYWVLSR 318
GY P ++E S+ HG +L +W L+R
Sbjct 469 -----------GYTPRYAEYKYHPSEAHGDFRGNLSFWHLNR 499
>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis
CL09T03C24]
Length=538
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 52/217 (24%), Positives = 95/217 (44%), Gaps = 23/217 (11%)
Query 104 ISVSGQSVSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDN-TCPAFLGSDS 162
++V VS+ ++ ++ +QR+ + G R + S F V+ S P FLG
Sbjct 289 VNVDELGVSINDLRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGR 348
Query 163 FDMNVNTLYQTTGFEDNSSPLGAFSGQ-LSGGTRFRRRNYHFNDDGYFMEITSIVPRVYY 221
++V+ + QT+ D++SP +G +S G + Y F + GY + I SI PR Y
Sbjct 349 TPISVSEVLQTSA-TDSTSPQANMAGHGISAGVNHGFKRY-FEEHGYIIGIMSIRPRTGY 406
Query 222 PSYINPTSRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSIPGFKL 281
+ R+ Y P ++ Q +K V+ + Q +N T+
Sbjct 407 QQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEEVYLQ-QTPASNNGTF------------ 453
Query 282 QESNYVGYEPAWSELMTAVSKPHGRLCNDLDYWVLSR 318
GY P ++E ++++ HG ++ +W L+R
Sbjct 454 ------GYTPRYAEYKYSMNEVHGDFRGNMAFWHLNR 484
>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=245
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 90/210 (43%), Gaps = 23/210 (11%)
Query 111 VSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDN-TCPAFLGSDSFDMNVNT 169
V++ +I ++ +QR+ + G R + S F V+ S P FLG ++V+
Sbjct 3 VNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSE 62
Query 170 LYQTTGFEDNSSPLGAFSGQ-LSGGTRFRRRNYHFNDDGYFMEITSIVPRVYYPSYINPT 228
+ QT+ D++SP +G +S G Y F + GY M I SI PR Y +
Sbjct 63 VLQTSS-TDSTSPQANMAGHGISAGVNHGFTRY-FEEHGYIMGIMSIRPRTGYQQGVPKD 120
Query 229 SRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSIPGFKLQESNYVG 288
R+ Y P ++ Q +K ++ ++ AN T+ G
Sbjct 121 FRKFDNMDFYFPEFAHLGEQEIKNEELYLN-ESDAANEGTF------------------G 161
Query 289 YEPAWSELMTAVSKPHGRLCNDLDYWVLSR 318
Y P ++E + ++ HG ++ +W L+R
Sbjct 162 YTPRYAEYKYSQNEVHGDFRGNMAFWHLNR 191
>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str.
3999B T(B) 6]
Length=390
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 90/210 (43%), Gaps = 23/210 (11%)
Query 111 VSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDN-TCPAFLGSDSFDMNVNT 169
V++ +I ++ +QR+ + G R + S F V+ S P FLG ++V+
Sbjct 148 VNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSE 207
Query 170 LYQTTGFEDNSSPLGAFSGQ-LSGGTRFRRRNYHFNDDGYFMEITSIVPRVYYPSYINPT 228
+ QT+ D++SP +G +S G Y F + GY M I SI PR Y +
Sbjct 208 VLQTSS-TDSTSPQANMAGHGISAGVNHGFTRY-FEEHGYIMGIMSIRPRTGYQQGVPKD 265
Query 229 SRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSIPGFKLQESNYVG 288
R+ Y P ++ Q +K ++ ++ AN T+ G
Sbjct 266 FRKFDNMDFYFPEFAHLGEQEIKNEELYLN-ESDAANEGTF------------------G 306
Query 289 YEPAWSELMTAVSKPHGRLCNDLDYWVLSR 318
Y P ++E + ++ HG ++ +W L+R
Sbjct 307 YTPRYAEYKYSQNEVHGDFRGNMAFWHLNR 336
>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 4]
gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B
T(B) 6]
Length=541
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/217 (24%), Positives = 92/217 (42%), Gaps = 23/217 (11%)
Query 104 ISVSGQSVSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDN-TCPAFLGSDS 162
++ V++ +I ++ +QR+ + G R + S F V+ S P FLG
Sbjct 292 VNTDQMGVNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGR 351
Query 163 FDMNVNTLYQTTGFEDNSSPLGAFSGQ-LSGGTRFRRRNYHFNDDGYFMEITSIVPRVYY 221
++V+ + QT+ D++SP +G +S G Y F + GY M I SI PR Y
Sbjct 352 TPISVSEVLQTSS-TDSTSPQANMAGHGISAGVNHGFTRY-FEEHGYIMGIMSIRPRTGY 409
Query 222 PSYINPTSRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSIPGFKL 281
+ R+ Y P ++ Q +K ++ ++ AN T+
Sbjct 410 QQGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEELYLN-ESDAANEGTF------------ 456
Query 282 QESNYVGYEPAWSELMTAVSKPHGRLCNDLDYWVLSR 318
GY P ++E + ++ HG ++ +W L+R
Sbjct 457 ------GYTPRYAEYKYSQNEVHGDFRGNMAFWHLNR 487
>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM
17361]
Length=553
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/228 (24%), Positives = 96/228 (42%), Gaps = 31/228 (14%)
Query 101 AVDISVSGQSVSMRNITFASRMQRYMDLAFAGGGRNSDFYESQFDVKL--SQDNTCPAFL 158
VD S S+ ++ A + + + + G D + + V++ S+D
Sbjct 286 GVDTDSSEGDFSVSSLRAAFAVDKLLSVTMRAGKTFQDQMRAHYGVEIPDSRDGRVNYLG 345
Query 159 GSDSFDMNVNTLYQTTG-----FEDNSSPLGAFSGQLSGGTRFRRRNYHFNDDGYFMEIT 213
G DS DM V+ + QT+G ++ + LG +G+ +G R R + + G M I
Sbjct 346 GFDS-DMQVSDVTQTSGTTATEYKPEAGYLGRVAGKGTGSGR-GRIVFDAKEHGVLMCIY 403
Query 214 SIVPRVYYP-SYINPTSRQISLGQQYAPALDNIAMQGLKASTVFGEVQNLGANTVTYANS 272
S+VP++ Y + ++P ++ + P +N+ MQ L +S + N V
Sbjct 404 SLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENLGMQPLNSSYISSFCTTDPKNPV----- 458
Query 273 TLSIPGFKLQESNYVGYEPAWSELMTAVSKPHGRLCND--LDYWVLSR 318
+GY+P +SE TA+ HG+ L W +SR
Sbjct 459 --------------LGYQPRYSEYKTALDVNHGQFAQSDALSSWSVSR 492
>gi|609718276|emb|CDN73650.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=537
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 51/233 (22%), Positives = 100/233 (43%), Gaps = 27/233 (12%)
Query 102 VDISVSGQSVSMRN-ITFASRMQRYMDLAFAGGGRNSDFYESQFDVKLSQDNTC-PAFLG 159
+ ++++ ++VS N + A ++Q +++ G R ++ S F VK S P FLG
Sbjct 279 LKLNMASENVSTVNDLRRAFKLQEWLEKNARAGSRYAESILSFFGVKTSDGRLQRPEFLG 338
Query 160 SDSFDMNVNTLYQTTGFEDNSSPLGAFSGQLSGGTRFRRRNYHFNDDGYFMEITSIVPRV 219
+ + ++ + Q + D+++P G +G G + + F + GY + + S++P+
Sbjct 339 GNKSPIMISEVLQQSA-TDSTTPQGNMAGHGIGIGKDGGFSRFFEEHGYVIGLMSVIPKT 397
Query 220 YYPSYINPTSRQISLGQQYA---PALDNIAMQGLKASTVFGEVQNLGANTVTYANSTLSI 276
SY R S ++ P ++I Q + +F +N+ A
Sbjct 398 ---SYSQGIPRHFSKSDKFDYFWPQFEHIGEQPVYNKEIFA--KNIDA------------ 440
Query 277 PGFKLQESNYVGYEPAWSELMTAVSKPHGRLCNDLDYWVLSRDYGRNLASVMD 329
GY P +SE + S HG +DL +W L R + + V++
Sbjct 441 ----FDSEAVFGYLPRYSEYKFSPSTVHGDFKDDLYFWHLGRIFDTDKPPVLN 489
Lambda K H a alpha
0.318 0.133 0.390 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 2665232623989