SIMAP fulltext search
Databases
Your fulltext query:
Info
» Home
» Glossary
» FAQ
» Web-Service
» References
» Contact
Main
» SIMAP Sequence Search
» SIMAP Taxonomic Search
» Mapping of Protein Sets
» SIMAP Submatrix Export Tool

» SIMAP Feature Download
» SIMAP Hardware
mips home mail to webmaster
 

Welcome to SIMAP.

SIMAP is a database containing the similarity space formed by about all amino-acid sequences from public databases and completely sequenced genomes.

You may find sequences and protein entries of interest by fulltext search which uses an index of proteins IDs, accession numbers and descriptions, and the Biothesaurus.
Starting from your query sequence you may find the nearest sequences in SIMAP. By searching parts of your query in a suffix array of all SIMAP sequences (generated by VMATCH), this search runs much faster than BLAST.

News:

  • 2009, November 12: New SIMAP Paper:
    • Thomas Rattei, Patrick Tischler, Stefan Gotz, Marc-Andre Jehl, Jonathan Hoser, Roland Arnold, Ana Conesa, Hans-Werner Mewes. SIMAP--a comprehensive database of pre-calculated protein sequence similarities, domains, annotations and clusters. Nucleic Acids Research 2009; doi: 10.1093/nar/gkp949. Full Text
  • 2009, October 12: SIMAP Submatrix Export Tool available:
    • This tool utilizes the webbrowser based export of parts of the SIMAP matrix. For a given set of sequences or proteins the similarity scores are exported to a tab delimited flat file. You can use the tool by clicking on "SIMAP Submatrix Export Tool" in the main navigation on the left side.

SIMAP Release Statistics

Current Release Statistics
Release Date:2010-02-27
Number of Databases:3,436
Number of Proteins:60,622,735
Number of Sequences:29,601,293
Number of Residues:7,335,302,966
Number of processed Sequences:29,601,293
Number of Hits:835,183,185,210

SIMAP Usage Statistics

Usage Statistics
Sequence homologs queries today:8164
Sequence homologs queries last week:256020
Domain homologs queries today:1314
Domain Homologs queries last week:11277