Welcome to SIMAP.
SIMAP is a database containing the similarity space formed by nearly all amino-acid sequences from public databases and completely sequenced genomes.
You can find sequences and protein entries of interest by full-text search, which uses an index of protein IDs, accession numbers, descriptions, and the BioThesaurus.
Starting from a query sequence, you can find its nearest sequences in SIMAP. Because this search looks up parts of your query in a suffix array of all SIMAP sequences (generated by VMATCH), it runs much faster than BLAST.
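The idea of looking up parts of a query in a suffix array can be illustrated with a small sketch. This is not SIMAP's or VMATCH's implementation: the function names, the seed length `k`, the `$` separator, and the example sequences are all assumptions made for illustration, and the suffix array is built naively rather than with a production algorithm.

```python
from bisect import bisect_right
from collections import Counter


def build_suffix_array(text):
    # Naive construction: sort suffix start positions by the suffix text.
    # Fine for a toy example; real tools use far more efficient methods.
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text, sa, pattern):
    # Binary search for the block of suffixes that start with `pattern`.
    m = len(pattern)
    lo, hi = 0, len(sa)
    while lo < hi:  # first suffix whose m-prefix is >= pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo
    hi = len(sa)
    while lo < hi:  # first suffix whose m-prefix is > pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(sa[start:lo])


def rank_by_shared_seeds(query, sequences, k=4):
    # Concatenate all sequences with a separator that never occurs in a seed,
    # so a seed match cannot span two sequences.
    text = "$".join(sequences)
    starts, pos = [], 0
    for seq in sequences:
        starts.append(pos)
        pos += len(seq) + 1
    sa = build_suffix_array(text)
    counts = Counter()
    for i in range(len(query) - k + 1):
        seed = query[i:i + k]
        for p in find_occurrences(text, sa, seed):
            # Map the text position back to the index of its sequence.
            counts[bisect_right(starts, p) - 1] += 1
    return counts.most_common()


# Made-up example sequences, for illustration only.
db = ["MKTAYIAKQR", "GGGGSGGGGS", "MKTAYLAKQR"]
ranking = rank_by_shared_seeds("MKTAYIAKQR", db, k=4)
```

Each entry in `ranking` pairs a sequence index with the number of query seeds found in that sequence. Such counts only nominate candidates; a real search would still score the candidates with a proper alignment.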
News:
- 2009, November 12: New SIMAP Paper:
- Thomas Rattei, Patrick Tischler, Stefan Gotz, Marc-Andre Jehl, Jonathan Hoser, Roland Arnold, Ana Conesa, Hans-Werner Mewes. SIMAP--a comprehensive database of pre-calculated protein sequence similarities, domains, annotations and clusters. Nucleic Acids Research 2009; doi: 10.1093/nar/gkp949.
- 2009, October 12: SIMAP Submatrix Export Tool available:
- This tool provides web-browser-based export of parts of the SIMAP matrix. For a given set of sequences or proteins, the similarity scores are exported to a tab-delimited flat file. To use the tool, click "SIMAP Submatrix Export Tool" in the main navigation on the left side.
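A tab-delimited export of this kind can be read with a few lines of Python. The column layout of the real file is not described here, so this sketch assumes three columns (first ID, second ID, score); that layout, the function name `load_submatrix`, and the sample rows are hypothetical.

```python
import csv
import io
from collections import defaultdict


def load_submatrix(handle):
    # Assumed layout (hypothetical): id_a <TAB> id_b <TAB> score
    scores = defaultdict(dict)
    for row in csv.reader(handle, delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        id_a, id_b, score = row[0], row[1], float(row[2])
        scores[id_a][id_b] = score
    return scores


# Made-up rows standing in for an exported file.
sample = io.StringIO("# id_a\tid_b\tscore\nP1\tP2\t512\nP1\tP3\t87.5\nP2\tP3\t140\n")
matrix = load_submatrix(sample)
```

For a real export, pass an open file handle instead of the `io.StringIO` sample and adjust the column indices to match the actual file.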