What is MiST?
The Microbial Signal Transduction database contains the signal transduction proteins for 1,538 complete and 1,537 draft bacterial and archaeal genomes. These are identified using various domain profiles that directly or indirectly implicate a particular protein in participating in signal transduction.
Latest news
30 December 2011, 10:02 AM
- Genome update
- Made a slight modification to the signal protein prediction algorithm. Originally, all domain predictions were considered regardless of any overlap with other domains. Now, when dealing with overlapping domain predictions, all overlaps that have a lower E-value by at least 3 orders of magnitude are ignored. This adjustment still provides for close matches, but helps rule out likely false positives by giving preference to significantly stronger domain predictions.
04 November 2011, 04:10 PM
- New version: 2.1 - overhauled the database schema and reloaded the database from scratch. Contains all available genomes available through September.
- Because the database has been rebuilt from scratch, all the previous MiST identifiers are no longer valid with MiST2.1. To help with the transition, the old database and identifiers are accessible at http://v2-0.mistdb.com. We apologize for any inconvenience this may cause.
- Massively improved search speeds! The Sphinx search engine now services all domain and description searches resulting in reasonable search times.
12 November 2010, 03:34 PM
- Moved to dedicated servers which should improve performance. There are still some optimization issues that need to be dealt with. In particular, domain searches are very slow and often do not complete.
Citing MiST
If you use information from the MiST database in your research, please cite:
- The MiST2 database: a comprehensive genomics resource on microbial signal transduction.
Luke E. Ulrich and Igor B. Zhulin
Nucleic Acids Research, 2010, doi:10.1093/nar/gkp940.
Definitions of microbial signal transduction implemented in MiST are described in:
- One-component systems dominate signal transduction in prokaryotes.
Luke E. Ulrich, Eugene V. Koonin, and Igor B. Zhulin
Trends in Microbiology, 2005, 13:52-56.
Genomic distribution of signal transduction proteins
| Genomes | One-component | Two-component | Chemotaxis * | ECF | ||
|---|---|---|---|---|---|---|
| Archaea | Complete | 118 | 8,577 | 1,726 | 731 | 0 |
| Archaea | Draft | 17 | 1,414 | 290 | 123 | 0 |
| Bacteria | Complete | 1,420 | 263,604 | 87,173 | 20,749 | 9,708 |
| Bacteria | Draft | 1,520 | 276,867 | 86,375 | 17,169 | 9,606 |
| Total | 3,075 | 550,462 | 175,564 | 38,772 | 19,314 |
Champion genomes:
- 1,428 signal transduction proteins: Streptomyces bingchenggensis BCW-1
- 1,054 one-component proteins: Streptomyces bingchenggensis BCW-1
- 366 two-component proteins: Ktedonobacter racemifer DSM 44963
- 129 chemotaxis proteins: Pseudomonas syringae pv. oryzae str. 1_6
- 118 ECF proteins: Plesiocystis pacifica SIR-1
- Largest genome (13.7 Mbp): Ktedonobacter racemifer DSM 44963
- Most genes (12,718 genes): Ktedonobacter racemifer DSM 44963
* Note: Chemotaxis is a specialized form of two-component systems (details)