What is MiST?
The Microbial Signal Transduction database contains the signal transduction proteins for 1,496 complete and 1,435 draft bacterial and archaeal genomes. These are identified using various domain profiles that directly or indirectly implicate a particular protein in participating in signal transduction.
Latest news
30 December 2011, 10:02 AM
- Genome update
- Made a slight modification to the signal protein prediction algorithm. Originally, all domain predictions were considered regardless of any overlap with other domains. Now, when dealing with overlapping domain predictions, all overlaps that have a lower E-value by at least 3 orders of magnitude are ignored. This adjustment still provides for close matches, but helps rule out likely false positives by giving preference to significantly stronger domain predictions.
04 November 2011, 04:10 PM
- New version: 2.1 - overhauled the database schema and reloaded the database from scratch. Contains all available genomes available through September.
- Because the database has been rebuilt from scratch, all the previous MiST identifiers are no longer valid with MiST2.1. To help with the transition, the old database and identifiers are accessible at http://v2-0.mistdb.com. We apologize for any inconvenience this may cause.
- Massively improved search speeds! The Sphinx search engine now services all domain and description searches resulting in reasonable search times.
12 November 2010, 03:34 PM
- Moved to dedicated servers which should improve performance. There are still some optimization issues that need to be dealt with. In particular, domain searches are very slow and often do not complete.
Why version 2.1? Where is the previous version?
In short, to facilitate keeping MiST updated and maintainable. This involved overhauling the database schema and data-loading scripts. Apart from predictions for many more genomes, the only major difference from the previous version is that all MiST ids have been changed. The previous version is still accessible at http://v2-0.mistdb.com and still works with the old identifiers. Please note, however, that MiST2.0 will no longer be updated. Feel free to contact us if you have any questions.
Citing MiST
If you use information from the MiST database in your research, please cite:
- The MiST2 database: a comprehensive genomics resource on microbial signal transduction.
Luke E. Ulrich and Igor B. Zhulin
Nucleic Acids Research, 2010, doi:10.1093/nar/gkp940.
Definitions of microbial signal transduction implemented in MiST are described in:
- One-component systems dominate signal transduction in prokaryotes.
Luke E. Ulrich, Eugene V. Koonin, and Igor B. Zhulin
Trends in Microbiology, 2005, 13:52-56.
Genomic distribution of signal transduction proteins
| Genomes | One-component | Two-component | Chemotaxis * | ECF | ||
|---|---|---|---|---|---|---|
| Archaea | Complete | 117 | 8,520 | 1,726 | 731 | 0 |
| Archaea | Draft | 14 | 1,232 | 280 | 114 | 0 |
| Bacteria | Complete | 1,379 | 255,082 | 84,160 | 20,148 | 9,163 |
| Bacteria | Draft | 1,421 | 256,573 | 80,286 | 15,875 | 8,586 |
| Total | 2,931 | 521,407 | 166,452 | 36,868 | 17,749 |
Champion genomes:
- 1,414 signal transduction proteins: Ktedonobacter racemifer DSM 44963
- 1,015 one-component proteins: Amycolatopsis mediterranei U32
- 366 two-component proteins: Ktedonobacter racemifer DSM 44963
- 129 chemotaxis proteins: Pseudomonas syringae pv. oryzae str. 1_6
- 118 ECF proteins: Plesiocystis pacifica SIR-1
- Largest genome (13.7 Mbp): Ktedonobacter racemifer DSM 44963
- Most genes (12,718 genes): Ktedonobacter racemifer DSM 44963
* Note: Chemotaxis is a specialized form of two-component systems (details)