Table 1 Overview of Protein Bioinformatics Databases

Category	DB Short Name	DB Name	Database Content	URLs	Ref.
Sequence databases	CCDS	The Consensus CDS protein set database	Protein-coding regions that are identically annotated on the human and mouse reference genome assemblies	http://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi	*(1)*
	DDBJ	DNA Data Bank of Japan	Annotated nucleotide sequence	http://www.ddbj.nig.ac.jp/	*(2)*
	ENA	European Nucleotide Archive	Raw sequencing data, sequence assembly information and functional annotation	http://www.ebi.ac.uk/ena	*(3)*
	GenBank	GenBank nucleotide sequence database	All publicly available DNA sequences	http://www.ncbi.nlm.nih.gov/genbank/	*(4)*
	RefSeq*	NCBI Reference Sequence Database	Comprehensive, integrated, non-redundant, well-annotated set of sequences, including genomic DNA, transcripts, and proteins	http://www.ncbi.nlm.nih.gov/refseq/	*(5)*
	UniGene	Database of computationally identifies transcripts from the same locus	Gene-oriented nucleotide sequence clusters	http://www.ncbi.nlm.nih.gov/unigene/	*(4)*
	UniProtKB*	Universal Protein Resource (UniProt)	Comprehensive resource for protein sequence and annotation data	http://www.uniprot.org/	*(6)*
2D gel databases	COMPLUYEAST-2DPAGE	2-DE database at Universidad Complutense de Madrid, Spain	Candida albicans two-dimensional polyacrilamide gel electrophoresis database	http://compluyeast2dpage.dacya.ucm.es/	*(7)*
	REPRODUCTION-2DPAGE	2-DE database at Nanjing Medical University, China	Two-dimensional polyacrylamide gel electrophoresis database	http://reprod.njmu.edu.cn/cgi-bin/2d/2d.cgi	*(8)*
	SWISS-2DPAGE	2-DE database at Swiss Institute of Bioinformatics, Switzerland	Data on proteins identified on various 2-D PAGE and SDS-PAGE reference maps	http://world-2dpage.expasy.org/swiss-2dpage/	*(9)*
	World-2DPAGE*	The World-2DPAGE database	2D-PAGE resources and tools	http://world-2dpage.expasy.org/repository/	*(10)*
3D structure databases	DisProt	Database of Protein Disorder	Information about proteins that lack fixed 3D structure in their putatively native states, either in their entirety or in part	http://www.disprot.org/	*(11)*
	MobiDB	Database of intrinsically disordered and mobile proteins	Intrinsically disordered and mobile proteins	http://mobidb.bio.unipd.it/	*(12)*
	ModBase	Database of Comparative Protein Structure Models	Annotated protein structure models	http://modbase.compbio.ucsf.edu/modbase-cgi/index.cgi	*(13)*
	PDBe*	Protein Data Bank at Europe	European resource for the collection, organization and dissemination of data on biological macromolecular structures	http://www.ebi.ac.uk/pdbe/	*(14)*
	PDBj*	Protein Data Bank at Japan	Centralized PDB archive of macromolecular structures and provides integrated tools	http://pdbj.org/	*(15)*
	PDBsum	Pictorial database of 3D structures in the Protein Data Bank	At-a-glance overview of the contents of each 3D structure deposited in the Protein Data Bank (PDB)	http://www.ebi.ac.uk/pdbsum/	*(16)*
	ProteinModelPortal	Protein Model Portal of the PSI-Nature Structural Biology Knowledgebase	PSI-Nature Structural Biology Knowledgebase	http://www.proteinmodelportal.org/	*(17)*
	RCSB-PDB*	Protein Data Bank at RCSB	Single worldwide repository of information about the 3D structures of large biological molecules, including proteins and nucleic acids	http://www.pdb.org/	*(18)*
	SMR	Database of annotated 3D protein structure models	Annotated three-dimensional comparative protein structure models generated by the fully automated homology-modelling pipeline SWISS-MODEL	http://swissmodel.expasy.org/repository/	*(19)*
Chemistry databases	BindingDB	The Binding Datbase	Measured binding affinities data for protein targets and small molecules	http://www.bindingdb.org/	*(20)*
	ChEMBL*	Database of bioactive drug-like small molecules	Bioactive drug-like small molecules	https://www.ebi.ac.uk/chembldb	*(21)*
	DrugBank	Drug and Drug Target Databse	Detailed drug (i.e. chemical, pharmacological and pharmaceutical) data with comprehensive drug target (i.e. sequence, structure, and pathway) information	http://www.drugbank.ca/	*(22)*
Enzyme and pathway databases	MetaCyc/BioCyc*	MetaCyc Database of Metabolic Pathways, BioCyc Collection of Pathway/Genome Databases	Full genome and predicted metabolic network of organism-specific Pathway/Genome Databases	http://www.biocyc.org/	*(23)*
	BRENDA*	BRaunschweig ENzyme DAtabase	Enzyme and enzyme-ligand information obtained from different sources	http://www.brenda-enzymes.org	*(24)*
	ENZYME	Enzyme nomenclature database	Information relative to the nomenclature of enzymes	http://enzyme.expasy.org/	*(25)*
	Reactome*	A knowledgebase of biological pathways and processes	Manually curated and peer-reviewed pathway database	http://www.reactome.org/	*(26)*
	SABIO-RK	SABIO-RK: Biochemical Reaction Kinetics Database	Biochemical reactions, their kinetic rate equations with parameters and experimental conditions	http://sabiork.h-its.org/	*(27)*
	SignaLink	A signaling pathway resource with multi-layered regulatory networks	Integrated resource to analyze signaling pathway cross-talks, transcription factors, miRNAs and regulatory enzymes	http://signalink.org/	*(28)*
	UniPathway	UniPathway: a resource for the exploration of metabolic pathways	Curated resource of enzyme-catalyzed and spontaneous chemical reactions	http://www.unipathway.org	*(29)*
Family and domain databases	Gene3D	Structural and Functional Annotation of Protein Families	Structural and Functional Annotation of Protein Families	http://gene3d.biochem.ucl.ac.uk/Gene3D/	*(30)*
	HAMAP	High-quality Automated and Manual Annotation of Proteins	Collection of manually curated family profiles for protein classification, and associated, manually created annotation rules that specify annotations that apply to family members	http://hamap.expasy.org/	*(31)*
	InterPro*	Integrated resource of protein families, domains and functional sites	Functional analysis of proteins by classifying them into families and predicting domains and important sites	http://www.ebi.ac.uk/interpro/	*(32)*
	PANTHER	The PANTHER Classification System	Classify proteins (and their genes) in order to facilitate high-throughput analysis	http://www.pantherdb.org/	*(33)*
	Pfam*	The Pfam protein families database	Protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs)	http://pfam.xfam.org/	*(34)*
	PIRSF*	A whole-protein classification database	Whole-protein classification database	http://pir.georgetown.edu/pirwww/dbinfo/pirsf.shtml	*(35)*
	PRINTS	Protein Motif fingerprint database	Protein fingerprints	http://www.bioinf.manchester.ac.uk/dbbrowser/PRINTS/	*(36)*
	ProDom	Protein domain families database	Protein domain families automatically generated from the UniProt Knowledge Database	http://prodom.prabi.fr/prodom/current/html/home.php	*(37)*
	PROSITE*	Database of protein domains, families and functional sites	Protein domains, families and functional sites as well as associated patterns and profiles to identify them	http://prosite.expasy.org/	*(38)*
	ProtoNet	Automatic hierarchical classification of proteins	Automatic hierarchical classification of protein sequences	http://www.protonet.cs.huji.ac.il/	*(39)*
	SMART	Simple Modular Architecture Research Tool	Identification and annotation of genetically mobile domains and the analysis of domain architectures	http://smart.embl.de/	*(40)*
	SUPFAM	Superfamily database of structural and functional annotation	Structural and functional annotation for all proteins and genomes	http://supfam.org	*(41)*
	TIGRFAMs	TIGRFAMs protein family database	Curated multiple sequence alignments, Hidden Markov Models (HMMs) for protein sequence classification, and associated information designed to support automated annotation of (mostly prokaryotic) proteins	http://www.jcvi.org/cgi-bin/tigrfams/index.cgi	*(42)*
Gene expression databases	Bgee	Database for Gene Expression Evolution	Comparison of expression patterns between animals	http://bgee.unil.ch	*(43)*
	CleanEx	Database of gene expression profiles	Gene expression data via unique approved gene symbols	http://cleanex.vital-it.ch/	*(44)*
	Genevisible	Search portal to normalized and curated expression data from Genevestigator	Curated expression data	http://genevisible.com/search	*(45)*
	ExpressionAtlas*	Database of Differential and Baseline Expression	Gene expression patterns under different biological conditions	http://www.ebi.ac.uk/gxa/home	*(46)*
Genome annotation databases	Ensembl*	Ensembl Eukaryotic genome annotation database	Annotations, querying tools and access methods for chordates and key model organisms	http://www.ensembl.org/	*(47)*
	EnsemblBacteria	Ensembl Bacteria genome annotation database	Genome sequences from bacteria and archaea have been annotated and deposited in the public archives of the members of the International Nucleotide Sequence Database Collaboration	http://bacteria.ensembl.org/	*(48)*
	EnsemblFungi	Ensembl Fungi genome annotation database	All fungal genomes whose sequence and annotation has been completed and submitted to the International Nucleotide Sequence Database Collaboration	http://fungi.ensembl.org/	*(48)*
	EnsemblMetazoa	Ensembl Metazoa genome annotation database	Genome sequences and annotations of all Ensembl Metazoa species	http://metazoa.ensembl.org/	*(48)*
	EnsemblPlants	Ensembl Plants genome annotation database	Genome sequences and annotations of all Ensembl Plants species	http://plants.ensembl.org/	*(48)*
	EnsemblProtists	Ensembl Protists genome annotation database	Genome sequences and annotations of all Ensembl Protists species	http://protists.ensembl.org/	*(48)*
	Entrez Gene*	Database of Genes of Genomes in the Reference Sequence Collection	Known and predicted genes defined by nucleo- tide sequence or map position	http://www.ncbi.nlm.nih.gov/gene	*(49)*
	KEGG	Kyoto Encyclopedia of Genes and Genomes	Biological interpretation of genome sequences and other high-throughput data	http://www.genome.jp/kegg/	*(50)*
	PATRIC	Bacterial Bioinformatics Resource Center	Bacterial infectious diseases via integration of vital pathogen information with rich data and analysis tools	http://patricbrc.org/	*(51)*
	UCSC*	UCSC Genome Bioinformatics	Reference sequence and working draft assemblies for a large collection of genomes	http://genome.ucsc.edu	*(52)*
	VectorBase	Bioinformatics Resource for Invertebrate Vectors of Human Pathogens	Invertebrate Vectors of Human Pathogens	http://www.vectorbase.org/	*(53)*
	WBParaSite	WormBase ParaSite	Parasitic worm genomics	http://parasite.wormbase.org	*(54)*
Organism specific databases	ArachnoServer	ArachnoServer: Spider toxin database	Information on the sequence, three-dimensional structure, and biological activity of protein toxins derived from spider venom	http://www.arachnoserver.org	*(55)*
	CGD	Candida Genome Database	Genomic sequence data and gene and protein information for Candida albicans and related species	http://www.candidagenome.org/	*(56)*
	ConoServer	ConoServer: Cone snail toxin database	Sequence and structures of conopeptides, which are peptides expressed by carnivorous marine cone snails	http://www.conoserver.org/	*(57)*
	CTD	Comparative Toxicogenomics Database	Genes and proteins responding to environmental toxic agents across diverse species	http://ctdbase.org/	*(58)*
	dictyBase	Central resource for Dictyostelid genomics	Model organism database for the social amoeba Dictyostelium discoideum	http://dictybase.org/	*(59)*
	EchoBASE	EchoBASE - an integrated post-genomic database for E. coli.	Information from post-genomic experiments using the model bacterium Escherichia coli K-12	http://www.york.ac.uk/res/thomas/	*(60)*
	EcoGene	Escherichia coli strain K12 genome database	Structural and functional annotation of Escherichia coli K-12	http://www.ecogene.org/	*(61)*
	euHCVdb	The European Hepatitis C Virus database	Collection of computer-annotated sequences based on Hepatitis C Virus reference genomes	https://euhcvdb.ibcp.fr/euHCVdb/	*(62)*
	EuPathDB	Eukaryotic Pathogen Database Resources	Eukaryotic pathogens of the genera Cryptosporidium, Giardia, Leishmania, Neospora, Plasmodium, Toxoplasma, Trichomonas and Trypanosoma	http://eupathdb.org/eupathdb/	*(63)*
	FlyBase*	A Database of Drosophila Genes & Genomes	Drosophila Genes and Genomes	http://flybase.org/	*(64)*
	GenAtlas	A database on genes, functions and related diseases	Scientific and clinical digests on human genes and diseases	http://genatlas.medecine.univ-paris5.fr/	*(65)*
	GeneCards	The Human Gene Database	All annotated and predicted human genes	http://www.genecards.org/	*(66)*
	GenoList	Integrated Environment for the Analysis of Microbial Genomes	Genome data from bacterial species	http://genodb.pasteur.fr/cgi-bin/WebObjects/GenoList	*(67)*
	Gramene	A comparative resource for plants	Data resource for comparative functional genomics in crops and model plant species	http://www.gramene.org/	*(68)*
	H-InvDB	H-Invitational Database	Curated annotations of human genes and transcripts	http://www.h-invitational.jp/	*(69)*
	HGNC	HUGO Gene Nomenclature Committee Database	Unique symbols and names for human loci, including protein coding genes, ncRNA genes and pseudogenes	http://www.genenames.org/	*(70)*
	HPA	The Human Protein Atlas	A large majority of all human protein-coding genes regarding the expression and localization of the corresponding proteins based on both RNA and protein data	http://www.proteinatlas.org/	*(71)*
	HUGE	A Database of Human Unidentified Gene-Encoded Large Proteins	Large human proteins of hitherto unidentified functions	http://www.kazusa.or.jp/huge/	*(72)*
	LegioList	Legionella pneumophila genome database	Genomic information from L. pneumophila	http://genolist.pasteur.fr/LegioList/	*(73)*
	Leproma	Mycobacterium leprae genome database	Broad view of the Mycobacterium leprae genome	http://mycobrowser.epfl.ch/leprosy.html	*(74)*
	MaizeGDB	Maize Genetics and genomics Database	Curated, community-oriented database and informatics service to researchers focused on the crop plant and model organism Zea mays ssp. mays	http://www.maizegdb.org/	*(75)*
	MGD*	Mouse Genome Database	Integrated genetic, genomic, and biological data for the laboratory mouse	http://www.informatics.jax.org/	*(76)*
	Micado	MICrobial Advanced Database Organization	Microbial genomes and functional analysis of Bacillus subtilis	http://genome.jouy.inra.fr/cgi-bin/micado/index.cgi	*(77)*
	OMIM	Online Mendelian Inheritance in Man	An Online Catalog of Human Genes and Genetic Disorders	http://www.omim.org/	*(78)*
	neXtProt*	Exploring the universe of human proteins	Human protein knowledge platform	http://www.nextprot.org/	*(79)*
	Orphanet	The portal for rare diseases and orphan drugs	Information on rare diseases and orphan drugs	http://www.orpha.net/consor/cgi-bin/home.php?Lng=GB	*(80)*
	PharmGKB	The Pharmacogenomics Knowledgebase	Curated knowledge about the impact of genetic variation on drug response for clinicians and researchers.	http://www.pharmgkb.org	*(81)*
	PomBase	The scientific resource for fission yeast	Fission yeast Schizosaccharomyces pombe	http://www.pombase.org/	*(82)*
	PseudoCAP	The Pseudomonas Genome Database	Internet-based and community-based genome annotation project for analysis of a genome of a free-living organism	http://www.pseudomonas.com/	*(83)*
	RGD	Rat Genome Database	Genetic, genomic, phenotype, and disease data generated from rat research	http://rgd.mcw.edu/	*(84)*
	Rouge	A Database of Rodent Unidentified Gene-Encoded Large Proteins	Information obtained from mouse homologues of the KIAA cDNAs (mKIAA cDNAs)	http://www.kazusa.or.jp/rouge/	*(72)*
	SGD	Saccharomyces Genome Database	Comprehensive integrated biological information for the budding yeast Saccharomyces cerevisiae	http://www.yeastgenome.org/	*(85)*
	TAIR	The Arabidopsis Information Resource	Genetic and molecular biology data for the model higher plant Arabidopsis thaliana	http://www.arabidopsis.org/	*(86)*
	TubercuList	Mycobacterium tuberculosis strain H37Rv genome database	Mycobacterium tuberculosis genome database	http://tuberculist.epfl.ch	*(87)*
	WormBase	C. elegans and related nematodes Genetics and Genomics Database	Model organism database for Caenorhabditis elegans and other related nematodes	http://www.wormbase.org/	*(54)*
	Xenbase	Xenopus laevis and tropicalis biology and genomics resource	Diverse and rich genomic, expression and functional data available from Xenopus research	http://www.xenbase.org/	*(88)*
	ZFIN	The Zebrafish Model Organism Database	Zebrafish Information Network genome database	http://zfin.org/	*(89)*
Phylogenomic databases	eggNOG	Database of orthologous groups and functional annotation	Non-supervised orthologous groups (NOGs) from complete genomes	http://eggnog.embl.de/	*(90)*
	HOGENOM	Database of Homologous Genes from Fully Sequenced Organisms	Complete Genome Homologous Genes Families	http://pbil.univ-lyon1.fr/databases/hogenom/home.php	*(91)*
	HOVERGEN	Homologous Vertebrate Genes Database	Homologous vertebrate genes, structured under ACNUC sequence database management system	http://pbil.univ-lyon1.fr/databases/hovergen.html	*(92)*
	InParanoid	Eukaryotic Ortholog Groups with inparalogs	Protein orthologs inferred by the InParanoid algorithm	http://inparanoid.sbc.su.se/	*(93)*
	KO	Kyoto Encyclopedia of Genes and Genomes Orthology	Manually defined ortholog groups (KO entries), which are categorized under the hierarchy of KEGG pathways and BRITE ontologies	http://www.genome.jp/kegg/	*(94)*
	OMA*	The OMA orthology database	Inference of orthologs among complete genomes	http://omabrowser.org/	*(95)*
	OrthoDB	Database of Orthologous Groups	Catalog of orthologous protein-coding genes	http://cegg.unige.ch/orthodb6	*(96)*
	PhylomeDB	Database for complete catalogs of gene phylogenies (phylomes)	Complete catalogs of gene phylogenies (phylomes)	http://phylomedb.org/	*(97)*
	TreeFam	Database of animal gene trees	Phylogenetic trees inferred from animal genomes	http://www.treefam.org	*(98)*
Polymorphism and mutation databases	BioMuta	Single-nucleotide variation and disease association database	Curated single-nucleotide variation (SNV) and disease association database where the variations are mapped to the genome/protein/gene	https://hive.biochemistry.gwu.edu/tools/biomuta/	*(99)*
	dbSNP*	Database of Short Genetic Variations	Simple genetic variations such as single-base nucleotide substitutions, small-scale multi-base deletions or insertions, and microsatellite repeats	http://www.ncbi.nlm.nih.gov/SNP/	*(4)*
	DMDM	Domain Mapping of Disease Mutations	Unique domain-level view where all human coding mutations are mapped on the protein domain	http://bioinf.umbc.edu/dmdm/	*(100)*
Protein-protein interaction databases	BioGRID	The Biological General Repository for Interaction Datasets	Genetic and protein interactions curated from the primary biomedical literature for all major model organism species and humans	http://thebiogrid.org	*(101)*
	DIP	Database of Interacting Proteins	Experimentally determined interactions between proteins	http://dip.doe-mbi.ucla.edu/	*(102)*
	IntAct*	IntAct Molecular Interaction Database	Open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions	http://www.ebi.ac.uk/intact/	*(103)*
	MINT	The Molecular INTeraction database	Experimentally verified protein-protein interactions mined from the scientific literature by expert curators	http://mint.bio.uniroma2.it/mint/	*(104)*
	STRING	Search Tool for the Retrieval of Interacting Genes/Proteins	Known and Predicted Protein-Protein Interactions	http://string-db.org	*(105)*
Proteomic databases	MaxQB	The MaxQuant DataBase	Large proteomics projects and allows joint analysis and comparison	http://maxqb.biochem.mpg.de/mxdb/	*(106)*
	PaxDb	Protein Abundance Across Organisms	Comprehensive absolute protein abundance database, which contains whole genome protein abundance information across organisms and tissues	http://pax-db.org	*(107)*
	PeptideAtlas*	PeptideAtlas	Peptides identified in a large set of tandem mass spectrometry proteomics experiments	http://www.peptideatlas.org	*(108)*
	PRIDE*	PRoteomics IDEntifications database	Centralized, standards compliant, public data repository for proteomics data	http://www.ebi.ac.uk/pride	*(109)*
	ProMEX	Protein Mass spectra EXtraction	Tryptic peptide fragmentation mass spectra derived from plants.	http://promex.pph.univie.ac.at/promex/	*(110)*
PTM databases	DEPOD*	The Human DEPhOsphorylation Database	Human active phosphatases, their experimentally verified protein and non-protein substrates and dephosphorylation site information, and pathways in which they are involved	http://www.koehnlab.de/depod/index.php	*(111)*
	iPTMnet*	Protein post-translational modifications (PTMs) in systems biology context	Protein phosphorylation information	http://research.bioinformatics.udel.edu/iptmnet/	*(112)*
	PhosPhAt*	The Arabidopsis Protein Phosphorylation Site Database	Information on Arabidopsis phosphorylation sites which were identified by mass spectrometry in large scale experiments by different research groups	http://phosphat.uni-hohenheim.de	*(113)*
	Phospho.ELM*	Database of S/T/Y phosphorylation sites	Experimentally verified phosphorylation sites in eukaryotic proteins	http://phospho.elm.eu.org	*(114)*
	PhosphoGrid*	Database of experimentally verified in vivo protein phosphorylation sites	Experimentally verified in vivo protein phosphorylation sites in the model eukaryotic organism Saccharomyces cerevisisae	http://www.phosphogrid.org	*(115)*
	PhosphoSitePlus*	Phosphorylation site database	Comprehensive information and tools for the study of protein post-translational modifications (PTMs)	http://www.phosphosite.org	*(116)*
	UniCarbKB*	Database of glycomics and glycobiology	Online information storage and search platform for glycomics and glycobiology research	http://www.unicarbkb.org/	*(117)*
Ontology	GO*	Gene Ontology	Consistent descriptions of gene products across databases	http://www.geneontology.org/	*(118)*
Ontology	PRO	Protein Ontology	Ontological representation of protein-related entities by explicitly defining them and showing the relationships between them	http://pir.georgetown.edu/pro/pro.shtml	*(119)*
Specialized protein databases	Allergome	Allergome: platform for allergen knowledge	Allergenic molecules and proteins reported as allergens	http://www.allergome.org/	*(120)*
	CAZy	Carbohydrate-Active enZYmes Database	Families of structurally-related catalytic and carbohydrate-binding modules (or functional domains) of enzymes that degrade, modify, or create glycosidic bonds	http://www.cazy.org/	*(121)*
	ESTHER	ESTerases and alpha/beta-Hydrolase Enzymes and Relatives database	Proteins or protein domains belonging to the superfamily of alpha/beta-hydrolases, exemplified by the cholinesterases	http://bioweb.ensam.inra.fr/ESTHER/general?what=index	*(122)*
	GPCRDB	Information system for G protein-coupled receptors (GPCRs)	Data on sequences, ligand binding constants and mutations on G protein-coupled receptors (GPCRs)	http://www.gpcr.org/7tm/	*(123)*
	IMGT	The International ImMunoGeneTics information system	Immunoglobulin (IG) and T cell receptor (TR) nucleotide sequences from human and other vertebrate species	http://www.imgt.org/	*(124)*
	MEROPS*	MEROPS protease database	Peptidases (also termed proteases, proteinases and proteolytic enzymes) and the proteins that inhibit them	http://merops.sanger.ac.uk/	*(125)*
	MoonProt	Moonlighting protein database	Information about the proteins that have been experimentally verified to be moonlighting proteins	http://www.moonlightingproteins.org/	*(126)*
	mycoCLAP	Characterized Lignocellulose-Active Proteins of Fungal Origin	Fungal and bacterial genes encoding lignocellulose-active proteins that have been biochemically characterized	https://mycoclap.fungalgenomics.ca/mycoCLAP/	*(127)*
	PeroxiBase	The peroxidases database	Enzymes able to catalyze reduction of hydrogen peroxide and oxidize various substrates	http://peroxibase.toulouse.inra.fr/	*(128)*
	REBASE	The Restriction Enzyme Database	Information about restriction enzymes and related proteins	http://rebase.neb.com/rebase/rebase.html	*(129)*
	TCDB	Transporter Classification Database	Functional and Phylogenetic Classification of Membrane Transport Proteins	http://www.tcdb.org/	*(130)*
Other (Miscellaneous) databases	ChiTaRS	Database of chimeric transcripts and RNA-seq data	Chimeric RNAs from humans, mice and fruit flies with confirmation from RNA-seq reads	http://chitars.bioinfo.cnio.es/	*(131)*
	EvolutionaryTrace	Database of relative evolutionary importance of amino acids within a protein sequence	Evolutionary Trace Analysis of a protein structure	http://mammoth.bcm.tmc.edu/ETserver.html	*(132)*
	GeneWiki*	Wiki portal for the annotation of gene and protein function	Applying community intelligence to the annotation of gene and protein function	http://en.wikipedia.org/wiki/Portal:Gene_Wiki	*(133)*
	GenomeRNAi	Database of phenotypes from RNA interference screens in Drosophila and Homo sapiens	Phenotypes from RNA interference (RNAi) screens in Drosophila and Homo sapiens	http://genomernai.dkfz.de/GenomeRNAi/	*(134)*
	PMAP-CutDB	Proteolytic event database	Proteolytic pathways	http://www.proteolysis.org/	*(135)*
	SOURCE	The Stanford Online Universal Resource for Clones and ESTs	Encapsulate the genetics and molecular biology of genes from the genomes of Homo sapiens, Mus musculus, Rattus norvegicus into easy to navigate GeneReports	http://smd.princeton.edu/cgi-bin/source/sourceSearch	*(136)*

* Covered in section 3 of the chapter.

References:

1. Farrell C. M., O'Leary N. A., Harte R. A., Loveland J. E., Wilming L. G., Wallin C., Diekhans M., Barrell D., Searle S. M., Aken B., Hiatt S. M., Frankish A., Suner M. M., Rajput B., Steward C. A., Brown G. R., Bennett R., Murphy M., Wu W., Kay M. P., Hart J., Rajan J., Weber J., Snow C., Riddick L. D., Hunt T., Webb D., Thomas M., Tamez P., Rangwala S. H., McGarvey K. M., Pujar S., Shkeda A., Mudge J. M., Gonzalez J. M., Gilbert J. G., Trevanion S. J., Baertsch R., Harrow J. L., Hubbard T., Ostell J. M., Haussler D., Pruitt K. D. (2014) Current status and new features of the Consensus Coding Sequence database. Nucleic Acids Res. 42, D865-D872.

2. Kodama Y., Mashima J., Kosuge T., Katayama T., Fujisawa T., Kaminuma E., Ogasawara O., Okubo K., Takagi T., Nakamura Y. (2015) The DDBJ Japanese Genotype-phenotype Archive for genetic and phenotypic human data. Nucleic Acids Res. 43, D18-D22.

3. Kulikova T., Akhtar R., Aldebert P., Althorpe N., Andersson M., Baldwin A., Bates K., Bhattacharyya S., Bower L., Browne P., Castro M., Cochrane G., Duggan K., Eberhardt R., Faruque N., Hoad G., Kanz C., Lee C., Leinonen R., Lin Q., Lombard V., Lopez R., Lorenc D., McWilliam H., Mukherjee G., Nardone F., Pastor M. P., Plaister S., Sobhany S., Stoehr P., Vaughan R., Wu D., Zhu W., Apweiler R. (2007) EMBL Nucleotide Sequence Database in 2006. Nucleic Acids Res. 35, D16-D20.

4. Agarwala R., Barrett T., Beck J., Benson D. A., Bollin C., Bolton E., Bourexis D., Brister J., Bryant S. H., Canese K., Clark K., DiCuccio M., Dondoshansky I., Federhen S., Feolo M., Funk K., Geer L. Y., Gorelenkov V., Hoeppner M., Holmes B., Johnson M., Khotomlianski V., Kimchi A., Kimelman M., Kitts P., Klimke W., Krasnov S., Kuznetsov A., Landrum M. J., Landsman D., Lee J. M., Lipman D. J., Lu Z., Madden T. L., Madej T., Marchler-Bauer A., Karsch-Mizrachi I., Murphy T., Orris R., Ostell J., O'Sullivan C., Panchenko A., Phan L., Preuss D., Pruitt K. D., Rubinstein W., Sayers E. W., Schneider V., Schuler G. D., Sherry S. T., Sirotkin K., Siyan K., Slotta D., Soboleva A., Soussov V., Starchenko G., Tatusova T. A., Trawick B. W., Vakatov D., Wang Y., Ward M., Wilbur W., Yaschenko E., Zbicz K. (2015) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 43, D6-D17.

5. Pruitt K. D., Tatusova T., Maglott D. R. (2006) NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 35, D61-D65.

6. The UniProt Consortium (2015) UniProt: a hub for protein information. Nucleic Acids Res. 43, D204-D212.

7. Pitarch A., Sánchez M., Nombela C., Gil C. (2003) Analysis of the Candida albicans proteome. II. Protein information technology on the Net (update 2002) J Chromatogr B Analyt Technol Biomed Life Sci. 787, 129-148.

8. Zhou T., Zhou Z. M., Guo X. J. (2013) Bioinformatics for spermatogenesis: annotation of male reproduction based on proteomics. Asian J Androl. 15, 594-602.

9. Hoogland C., Mostaguir K., Sanchez J. C., Hochstrasser D. F., Appel R. D. (2004) SWISS-2DPAGE, ten years later. Proteomics 4, 2352-2356.

10. Hoogland C., Mostaguir K., Appel R. D., Lisacek F. (2008) The World-2DPAGE Constellation to promote and publish gel-based proteomics data through the ExPASy server. J Proteomics. 71, 245-248.

11. Sickmeier M., Hamilton J. A., LeGall T., Vacic V., Cortese M. S., Tantos A., Szabo B., Tompa P., Chen J., Uversky V. N., Obradovic Z., Dunker A. K. (2007) DisProt: the Database of Disordered Proteins. Nucleic Acids Res. 35, D786-D793.

12. Potenza E., Di Domenico T., Walsh I., Tosatto S. C. (2014) MobiDB 2.0: an improved database of intrinsically disordered and mobile proteins. Nucleic Acids Res. 43, D315-D320.

13. Pieper U., Webb B. M., Dong G. Q., Schneidman-Duhovny D., Fan H., Kim S. J., Khuri N., Spill Y. G., Weinkam P., Hammel M., Tainer J. A., Nilges M., Sali A. (2014) ModBase, a database of annotated comparative protein structure models and associated resources. Nucleic Acids Res. 42, D336-D346.

14. Velankar S., van Ginkel G., Alhroub Y., Battle G. M., Berrisford J. M., Conroy M. J., Dana J. M., Gore S. P., Gutmanas A., Haslam P., Hendrickx P. M., Lagerstedt I., Mir S., Fernandez Montecelo M. A., Mukhopadhyay A., Oldfield T. J., Patwardhan A., Sanz-García E., Sen S., Slowley R. A., Wainwright M. E., Deshpande M. S., Iudin A., Sahni G., Salavert T. J., Hirshberg M., Mak L., Nadzirin N., Armstrong D. R., Clark A. R., Smart O. S., Korir P. K., Kleywegt G. J. (2015) PDBe: improved accessibility of macromolecular structure data from PDB and EMDB. Nucleic Acids Res. doi: 10.1093/nar/gkv1047.

15. Kinjo A. R., Suzuki H., Yamashita R., Ikegawa Y., Kudou T., Igarashi R., Kengaku Y., Cho H., Standley D. M., Nakagawa A., Nakamura H. (2012) Protein Data Bank Japan (PDBj): maintaining a structural data archive and resource description framework format. Nucleic Acids Res. 40, D453-D460.

16. de Beer T. A., Berka K., Thornton J. M., Laskowski R. A. (2014) PDBsum additions. Nucleic Acids Res. 42, D292-D296.

17. Haas J., Roth S., Arnold K., Kiefer F., Schmidt T., Bordoli L., Schwede T. (2013) The Protein Model Portal-a comprehensive resource for protein structure and model information. Database doi: 10.1093/database/bat031.

18. Berman H. M., Westbrook J., Feng Z., Gilliland G., Bhat T. N., Weissig H., Shindyalov I. N., Bourne P. E. (2000) The Protein Data Bank. Nucleic Acids Res. 28, 235-242.

19. Schwede T., Kopp J., Guex N. and Peitsch M. C. (2003) SWISS-MODEL: an automated protein homology-modeling server. Nucleic Acids Res. 31, 3381-3385.

20. Liu T., Lin Y., Wen X., Jorissen R. N., Gilson M. K. (2007) BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res. 35, D198-D201.

21. Bento A. P., Gaulton A., Hersey A., Bellis L. J., Chambers J., Davies M., Krüger F. A., Light Y., Mak L., McGlinchey S., Nowotka M., Papadatos G., Santos R., Overington J. P. (2014) The ChEMBL bioactivity database: an update. Nucleic Acids Res. 42, D1083-D1090.

22. Law V., Knox C., Djoumbou Y., Jewison T., Guo A. C., Liu Y., Maciejewski A., Arndt D., Wilson M., Neveu V., Tang A., Gabriel G., Ly C., Adamjee S., Dame Z. T., Han B., Zhou Y., Wishart D. S. (2014) DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res. 42, D1091-D1097.

23. Caspi R. Altman T. Billington R., Dreher K., Foerster H., Fulcher C. A., Holland T. A., Keseler I. M., Kothari A., Kubo A., Krummenacker M., Latendresse M., Mueller L. A., Ong Q., Paley S., Subhraveti P., Weaver D. S., Weerasinghe D., Zhang P., and Karp P. D. (2014) The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases. Nucleic Acids Res. 42, D459-D471.

24. Chang A., Schomburg I., Placzek S., Jeske L., Ulbrich M., Xiao M., Sensen C. W., Schomburg D. (2015) BRENDA in 2015: exciting developments in its 25th year of existence. Nucleic Acids Res. 43, D439-D446.

25. Bairoch A. (2000) The ENZYME database in 2000. Nucleic Acids Res. 28, 304-305.

26. Croft D., Mundo A. F., Haw R., Milacic M., Weiser J., Wu G., Caudy M., Garapati P., Gillespie M., Kamdar M. R., Jassal B., Jupe S., Matthews L., May B., Palatnik S., Rothfels K., Shamovsky V., Song H., Williams M., Birney E., Hermjakob H., Stein L., D'Eustachio P. (2014) The Reactome pathway knowledgebase. Nucleic Acids Res. 42, D472-D477.

27. Wittig U., Kania R., Golebiewski M., Rey M., Shi L., Jong L., Algaa E., Weidemann A., Sauer-Danzwith H., Mir S., Krebs O., Bittkowski M., Wetsch E., Rojas I., Müller W. (2012) SABIO-RK - database for biochemical reaction kinetics. Nucleic Acids Res. 40, D790-D796.

28. Fazekas D., Koltai M., Türei D., Módos D., Pálfy M., Dúl Z., Zsákai L., Szalay-Bekő M., Lenti K., Farkas I. J., Vellai T., Csermely P., Korcsmáros T. (2013) SignaLink 2 - a signaling pathway resource with multi-layered regulatory networks. BMC Syst Biol. 7, 7.

29. Morgat A., Coissac E., Coudert E., Axelsen K. B., Keller G., Bairoch A., Bridge A., Bougueleret L., Xenarios I., Viari A. (2012) UniPathway: a resource for the exploration and annotation of metabolic pathways. Nucleic Acids Res. 40, D761-D769.

30. Yeats C., Maibaum M., Marsden R., Dibley M., Lee D., Addou S., Orengo C. A. (2006) Gene3D: modelling protein structure, function and evolution. Nucleic Acids Res. 34, D281-D284.

31. Pedruzzi I., Rivoire C., Auchincloss A. H., Coudert E., Keller G., de Castro E., Baratin D., Cuche B. A., Bougueleret L., Poux S., Redaschi N., Xenarios I., Bridge A. HAMAP in 2015: updates to the protein family classification and annotation system. Nucleic Acids Res. 43, D1064-D1070.

32. Mitchell A., Chang H. Y., Daugherty L., Fraser M., Hunter S., Lopez R., McAnulla C., McMenamin C., Nuka G., Pesseat S., Sangrador-Vegas A., Scheremetjew M., Rato C., Yong S. Y., Bateman A., Punta M., Attwood T. K., Sigrist C. J., Redaschi N., Rivoire C., Xenarios I., Kahn D., Guyot D., Bork P., Letunic I., Gough J., Oates M., Haft D., Huang H., Natale D. A., Wu C. H., Orengo C., Sillitoe I., Mi H., Thomas P. D., Finn R. D. (2015) The InterPro protein families database: the classification resource after 15 years. Nucleic Acids Res. 43, D213-D221.

33. Mi H., Muruganujan A., Casagrande J. T., Thomas P. D. (2013) Large-scale gene function analysis with the PANTHER classification system. Nat Protoc. 8, 1551-1566.

34. Finn R. D., Bateman A., Clements J., Coggill P., Eberhardt R. Y., Eddy S. R., Heger A., Hetherington K., Holm L., Mistry J., Sonnhammer E. L., Tate J., Punta M. (2014) The Pfam protein families database. Nucleic Acids Res. 42, D222-D230.

35. Wu C. H., Nikolskaya A., Huang H., Yeh L. S., Natale D. A., Vinayaka C. R., Hu Z. Z., Mazumder R., Kumar S., Kourtesis P., Ledley R. S., Suzek B. E., Arminski L., Chen Y., Zhang J., Cardenas J. L., Chung S., Castro-Alvear J., Dinkov G., Barker W. C. (2004) PIRSF: family classification system at the Protein Information Resource. Nucleic Acids Res. 32, D112-D114.

36. Attwood T. K., Bradley P., Flower D. R., Gaulton A., Maudling N., Mitchell A., Moulton G., Nordle A., Paine K., Taylor P., Uddin A. and Zygouri, C. (2003) PRINTS and its automatic supplement, prePRINTS. Nucleic Acids Res. 31, 400-402.

37. Servant F., Bru C., Carrère S., Courcelle E., Gouzy J., Peyruc D., Kahn D. (2002) ProDom: Automated clustering of homologous domains. Briefings in Bioinformatics. 3, 246-251.

38. Sigrist C. J., de Castro E., Cerutti L., Cuche B. A., Hulo N., Bridge A., Bougueleret L., Xenarios I. (2013) New and continuing developments at PROSITE. Nucleic Acids Res. 41, D344-D347.

39. Rappoport N., Karsenty S., Stern A., Linial N., Linial M. (2011) ProtoNet 6.0: organizing 10 million protein sequences in a compact hierarchical family tree. Nucleic Acids Res. 40, D313-D320.

40. Letunic I., Doerks T., Bork P. (2015) SMART: recent updates, new developments and status in 2015. Nucleic Acids Res. 43, D257-D260.

41. Wilson D., Pethica R., Zhou Y., Talbot C., Vogel C., Madera M., Chothia C., Gough J. (2009) SUPERFAMILY - Comparative Genomics, Datamining and Sophisticated Visualisation. Nucleic Acids Res. 37, D380-D386.

42. Selengut J. D., Haft D. H., Davidsen T., Ganapathy A., Gwinn-Giglio M., Nelson W. C., Richter A. R., White O. (2007) TIGRFAMs and Genome Properties: tools for the assignment of molecular function and biological process in prokaryotic genomes. Nucleic Acids Res. 35, D260-D264.

43. Bastian F., Parmentier G., Roux J., Moretti S., Laudet V., Robinson-Rechavi M. (2008) Bgee: Integrating and Comparing Heterogeneous Transcriptome Data Among Species. Lecture Notes in Computer Science 5109, 124-131.

44. Praz V., Jagannathan V., Bucher P. (2004) CleanEx: a database of heterogeneous gene expression data based on a consistent gene nomenclature. Nucleic Acids Res. 32, D542-D547.

45. Grennan AK. (2006) Genevestigator. Facilitating web-based gene-expression analysis. Plant Physiol. 141, 1164-1166.

46. Petryszak R., Burdett T., Fiorelli B., Fonseca N. A., Gonzalez-Porta M., Hastings E., Huber W., Jupp S., Keays M., Kryvych N., McMurry J., Marioni J. C., Malone J., Megy K., Rustici G., Tang A. Y., Taubert J., Williams E., Mannion O., Parkinson H. E., Brazma A. (2014) Expression Atlas update-a database of gene and transcript expression from microarray- and sequencing-based functional genomics experiments. Nucleic Acids Res. 42, D926-D932.

47. Cunningham F., Amode M. R., Barrell D., Beal K., Billis K., Brent S., Carvalho-Silva D., Clapham P., Coates G., Fitzgerald S., Gil L., Girón C. G., Gordon L., Hourlier T., Hunt S. E., Janacek S. H., Johnson N., Juettemann T., Kähäri A. K., Keenan S., Martin F. J., Maurel T., McLaren W., Murphy D. N., Nag R., Overduin B., Parker A., Patricio M., Perry E., Pignatelli M., Riat H. S., Sheppard D., Taylor K., Thormann A., Vullo A., Wilder S. P., Zadissa A., Aken B. L., Birney E., Harrow J., Kinsella R., Muffato M., Ruffier M., Searle S. M., Spudich G., Trevanion S. J., Yates A., Zerbino D. R., Flicek P. (2015) Ensembl 2015. Nucleic Acids Res. 43, D662-D669.

48. Kersey P. J., Lawson D., Birney E., Derwent P. S., Haimel M., Herrero J., Keenan S., Kerhornou A., Koscielny G., Kähäri A., Kinsella R. J., Kulesha E., Maheswari U., Megy K., Nuhn M., Proctor G., Staines D., Valentin F., Vilella A. J., Yates A. (2010) Ensembl Genomes: extending Ensembl across the taxonomic space. Nucleic Acids Res. 38, D563-D569.

49. Maglott D., Ostell J., Pruitt K. D., Tatusova T. (2005) Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res. 33, D54-D58.

50. Kanehisa M., Sato Y., Kawashima M., Furumichi M., Tanabe M. (2015) KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. doi: 10.1093/nar/gkv1070

51. Wattam A. R., Abraham D., Dalay O., Disz T. L., Driscoll T., Gabbard J. L., Gillespie J. J., Gough R., Hix D., Kenyon R., Machi D., Mao C., Nordberg E. K., Olson R., Overbeek R., Pusch G. D., Shukla M., Schulman J., Stevens R. L., Sullivan D. E., Vonstein V., Warren A., Will R., Wilson M. J., Yoo H. S., Zhang C., Zhang Y., Sobral B. W. (2014) PATRIC, the bacterial bioinformatics database and analysis resource. Nucleic Acids Res. 42, D581-D591.

52. Rosenbloom K. R., Armstrong J., Barber G. P., Casper J., Clawson H., Diekhans M., Dreszer T. R., Fujita P. A., Guruvadoo L., Haeussler M., Harte R. A., Heitner S., Hickey G., Hinrichs A. S., Hubley R., Karolchik D., Learned K., Lee B. T., Li C. H., Miga K. H., Nguyen N., Paten B., Raney B. J., Smit A. F., Speir M. L., Zweig A. S., Haussler D., Kuhn R. M., Kent W. J. (2015) The UCSC Genome Browser database: 2015 update. Nucleic Acids Res. 43, D670-D681.

53. Lawson D., Arensburger P., Atkinson P., Besansky N. J., Bruggner R. V., Butler R., Campbell K. S., Christophides G. K., Christley S., Dialynas E., Emmert D., Hammond M., Hill C. A., Kennedy R. C., Lobo N. F., MacCallum M. R., Madey G., Megy K., Redmond S., Russo S., Severson D. W., Stinson E. O., Topalis P., Zdobnov E. M., Birney E., Gelbart W. M., Kafatos F. C., Louis C., Collins F. H. (2007) VectorBase: a home for invertebrate vectors of human pathogens. Nucleic Acids Res. 35, D503-D505.

54. Harris T. W., Baran J., Bieri T., Cabunoc A., Chan J., Chen W. J., Davis P., Done J., Grove C., Howe K., Kishore R., Lee R., Li Y., Muller H. M., Nakamura C., Ozersky P., Paulini M., Raciti D., Schindelman G., Tuli M. A., Van Auken K., Wang D., Wang X., Williams G., Wong J. D., Yook K., Schedl T., Hodgkin J., Berriman M., Kersey P., Spieth J., Stein L., Sternberg P. W. (2014) WormBase 2014: new views of curated biology. Nucleic Acids Res. 42, D789-D793.

55. Herzig V., Wood D. L., Newell F., Chaumeil P. A., Kaas Q., Binford G. J., Nicholson G. M., Gorse D., King G. F. (2011) ArachnoServer 2.0, an updated online resource for spider toxin sequences and structures. Nucleic Acids Res. 39, D653-D657.

56. Inglis D. O., Arnaud M. B., Binkley J., Shah P., Skrzypek M. S., Wymore F., Binkley G., Miyasato S. R., Simison M., Sherlock G. (2012). The Candida genome database incorporates multiple Candida species: multispecies search and analysis tools with curated gene and protein information for Candida albicans and Candida glabrata. Nucleic Acids Res. 40, D667-D674.

57. Kaas Q., Yu R., Jin A. H., Dutertre S. and Craik D. J. (2012) ConoServer: updated content, knowledge, and discovery tools in the conopeptide database. Nucleic Acids Res. 40, D325-D330.

58. Davis A. P., Grondin C. J., Lennon-Hopkins K., Saraceni-Richards C., Sciaky D., King B. L., Wiegers T. C., Mattingly C. J. (2015) The Comparative Toxicogenomics Database's 10th year anniversary: update 2015. Nucleic Acids Res. 43, D914-D920.

59. Basu S., Fey P., Pandit Y., Dodson R. J., Kibbe W. A., Chisholm R. L. (2013) DictyBase 2013: integrating multiple Dictyostelid species. Nucleic Acids Res. 41, D676-D683.

60. Misra R. V., Horler R. S., Reindl W., Goryanin I. I., Thomas G. H. (2005) EchoBASE: an integrated post-genomic database for Escherichia coli. Nucleic Acids Res. 33, D329-D333.

61. Zhou J., Rudd K. E. (2013) EcoGene 3.0. Nucleic Acids Res., 41, D613-D624.

62. Combet C., Garnier N., Charavay C., Grando D., Crisan D., Lopez J., Dehne-Garcia A., Geourjon C., Bettler E., Hulo C., Mercier PL., Bartenschlager R., Diepolder H., Moradpour D., Pawlotsky J.M., Rice C.M., Trepo C., Penin F. and Deléage G. (2007) euHCVdb: the European hepatitis C virus database. Nucleic Acids Res. 35, D363-D366.

63. Aurrecoechea C., Brestelli J., Brunk B. P., Fischer S., Gajria B., Gao X., Gingle A., Grant G., Harb O. S., Heiges M., Innamorato F., Iodice J., Kissinger J. C., Kraemer E. T., Li W., Miller J. A., Nayak V., Pennington C., Pinney D. F., Roos D. S., Ross C., Srinivasamoorthy G., Stoeckert C. J. Jr, Thibodeau R., Treatman C., Wang H. (2010) EuPathDB: a portal to eukaryotic pathogen databases. Nucleic Acids Res. 38, D415-D419.

64. dos Santos G., Schroeder A. J., Goodman J. L., Strelets V. B., Crosby M. A., Thurmond J., Emmert D. B., Gelbart W. M., FlyBase Consortium. (2015) FlyBase: introduction of the Drosophila melanogaster Release 6 reference genome assembly and large-scale migration of genome annotations. Nucleic Acids Res. 43, D690-D697.

65. Frézal J. (1998) Genatlas database, genes and development defects. C R Acad Sci III 321, 805-817.

66. Safran M., Solomon I., Shmueli O., Lapidot M., Shen-Orr S., Adato A., Ben-Dor U., Esterman N., Rosen N., Peter I., Olender T., Chalifa-Caspi V., Lancet D. (2002) GeneCards 2002: towards a complete, object-oriented, human gene compendium. Bioinformatics 18, 1542-1543.

67. Lechat P., Hummel L., Rousseau S., Moszer I. (2008) GenoList: an integrated environment for comparative analysis of microbial genomes. Nucleic Acids Res. 36, D469-D474.

68. Monaco M. K., Stein J., Naithani S., Wei S., Dharmawardhana P., Kumari S., Amarasinghe V., Youens-Clark K., Thomason J., Preece J., Pasternak S., Olson A., Jiao Y., Lu Z., Bolser D., Kerhornou A., Staines D., Walts B., Wu G., D'Eustachio P., Haw R., Croft D., Kersey P. J., Stein L., Jaiswal P., Ware D (2014). Gramene 2013: comparative plant genomics resources. Nucleic Acids Res. 42, D1193-D1199.

69. Yamasaki C., Murakami K., Takeda J., Sato Y., Noda A., Sakate R., Habara T., Nakaoka H., Todokoro F., Matsuya A., Imanishi T., Gojobori T. (2009) H-InvDB in 2009: extended database and data mining resources for human genes and transcripts. Nucleic Acids Res. 38, D626-D632.

70. Gray K. A., Daugherty L. C., Gordon S. M., Seal R. L., Wright M. W., Bruford E. A. (2013) Genenames.org: the HGNC resources in 2013. Nucleic Acids Res. 41, D545–D552.

71. Uhlén M., Björling E., Agaton C., Szigyarto C. A., Amini B., Andersen E., Andersson A. C., Angelidou P., Asplund A., Asplund C., Berglund L., Bergström K., Brumer H., Cerjan D., Ekström M., Elobeid A., Eriksson C., Fagerberg L., Falk R., Fall J., Forsberg M., Björklund M. G., Gumbel K., Halimi A., Hallin I., Hamsten C., Hansson M., Hedhammar M., Hercules G., Kampf C., Larsson K., Lindskog M., Lodewyckx W., Lund J., Lundeberg J., Magnusson K., Malm E., Nilsson P., Odling J., Oksvold P., Olsson I., Oster E., Ottosson J., Paavilainen L., Persson A., Rimini R., Rockberg J., Runeson M., Sivertsson A., Sköllermo A., Steen J., Stenvall M., Sterky F., Strömberg S., Sundberg M., Tegel H., Tourle S., Wahlund E., Waldén A., Wan J., Wernérus H., Westberg J., Wester K., Wrethagen U., Xu L. L., Hober S., Pontén F. (2005) A human protein atlas for normal and cancer tissues based on antibody proteomics. Mol Cell Proteomics. 4, 1920-1932.

72. Kikuno R., Nagase T., Nakayama M., Koga H., Okazaki N., Nakajima D., Ohara O. (2004) HUGE: a database for human KIAA proteins, a 2004 update integrating HUGEppi and ROUGE. Nucleic Acids Res. 32, D502-D504.

73. Moszer I., Glaser P. and Danchin A. (1995) SubtiList: a relational database for the Bacillus subtilis genome. Microbiology 141, 261-268.

74. Kapopoulou A., Lew J. M., Cole S. T. (2011) The MycoBrowser portal: a comprehensive and manually annotated resource for mycobacterial genomes. Tuberculosis (Edinb) 91, 8-13.

75. Andorf C. M., Cannon E. K., Portwood J. L., Gardiner J. M., Harper L. C., Schaeffer M. L., Braun B. L., Campbell D. A., Vinnakota A. G., Sribalusu V. V., Huerta M., Cho K. T., Wimalanathan K., Richter J. D., Mauch E. D., Rao B. S., Birkett S. M., Richter J. D., Sen T. Z., Lawrence C. J. (2015) MaizeGDB 2015: New tools, data, and interface for the maize model organism database. Nucleic Acids Res. doi: 10.1093/nar/gkv1007.

76. Eppig J. T., Blake J. A., Bult C. J., Kadin J. A., Richardson J. E., The Mouse Genome Database Group. (2015) The Mouse Genome Database (MGD): facilitating mouse as a model for human biology and disease. Nucleic Acids Res. 43, D726-D736.

77. Biaudet V., Samson F., Bessières P. (1997) Micado-a network-oriented database for microbial genomes. Comput Appl Biosci. 13, 431-438.

78. Hamosh A., Scott A. F., Amberger J. S., Bocchini C. A., McKusick V. A. (2005) Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 33, D514-D517.

79. Gaudet P., Argoud-Puy G., Cusin I., Duek P., Evalet O., Gateau A., Gleizes A., Pereira M., Zahn-Zabal M., Zwahlen C., Bairoch A., Lane L. (2013) neXtProt: organizing protein knowledge in the context of human proteome projects. J Proteome Res. 12, 293-298.

80. Aymé S., Schmidtke J. (2007). Networking for rare diseases: a necessity for Europe. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz 50, 1477–1483.

81. Thorn C. F., Klein T. E., Altman R. B. (2005) PharmGKB: the pharmacogenetics and pharmacogenomics knowledge base. Methods Mol Biol. 311, 179-191.

82. Wood V., Harris M. A., McDowall M. D., Rutherford K., Vaughan B. W., Staines D. M., Aslett M., Lock A., Bähler J., Kersey P. J., Oliver S. G. (2012) PomBase: a comprehensive online resource for fission yeast. Nucleic Acids Res. 40, D695-D699.

83. Winsor G. L., Lo R., Ho Sui S. J., Ung K. S., Huang S., Cheng D., Ching W. K., Hancock R. E., Brinkman F. S. (2005) Pseudomonas aeruginosa Genome Database and PseudoCAP: facilitating community-based, continually updated, genome annotation. Nucleic Acids Res. 33, D338-D343.

84. Shimoyama M., De Pons J., Hayman G. T., Laulederkind S. J., Liu W., Nigam R., Petri V., Smith J. R., Tutaj M., Wang S. J., Worthey E., Dwinell M., Jacob H. (2015) The Rat Genome Database 2015: genomic, phenotypic and environmental variations and disease. Nucleic Acids Res. 28, D743-D750.

85. Cherry J. M., Hong E. L., Amundsen C., Balakrishnan R., Binkley G., Chan E. T., Christie K. R., Costanzo M. C., Dwight S. S., Engel S. R., Fisk D. G., Hirschman J. E., Hitz B. C., Karra K., Krieger C. J., Miyasato S. R., Nash R. S., Park J., Skrzypek M. S., Simison M., Weng S., Wong E. D. (2012) Saccharomyces Genome Database: the genomics resource of budding yeast. Nucleic Acids Res. 40, D700-D705.

86. Lamesch P., Berardini T. Z., Li D., Swarbreck D., Wilks C., Sasidharan R., Muller R., Dreher K., Alexander D. L., Garcia-Hernandez M., Karthikeyan A. S., Lee C. H., Nelson W. D., Ploetz L., Singh S., Wensel A., Huala E. (2012) The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools. Nucleic Acids Res. 40, D1202-D1210.

87. Lew J. M., Kapopoulou A., Jones L. M., Cole S. T. (2011) TubercuList - 10 years after. Tuberculosis (Edinb). 1, 1-7.

88. Bowes J. B., Snyder K. A., Segerdell E., Gibb R., Jarabek C., Noumen E., Pollet N., Vize P. D. (2008) Xenbase: a Xenopus biology and genomics resource. Nucleic Acids Res. 36, D761-D767.

89. Howe D. G., Bradford Y. M., Conlin T., Eagle A. E., Fashena D., Frazer K., Knight J., Mani P., Martin R., Moxon S. A., Paddock H., Pich C., Ramachandran S., Ruef B. J., Ruzicka L., Schaper K., Shao X., Singer A., Sprunger B., Van Slyke C. E., Westerfield M. (2013). ZFIN, the Zebrafish Model Organism Database: increased support for mutants and transgenics. Nucleic Acids Res. 41, D854-D860.

90. Powell S., Forslund K., Szklarczyk D., Trachana K., Roth A., Huerta-Cepas J., Gabaldón T., Rattei T., Creevey C., Kuhn M., Jensen L. J., von Mering C. and Bork P. (2014) eggNOG v4.0: nested orthology inference across 3686 organisms. Nucleic Acids Res. 42, D231-D239.

91. Perrière G., Duret L., Gouy M. (2000) HOBACGEN: database system for comparative genomics in bacteria. Genome Res. 10, 379-385.

92. Duret L., Mouchiroud D., Gouy M. (1994) HOVERGEN: a database of homologous vertebrate genes. Nucleic Acids Res. 22, 2360-2365.

93. Sonnhammer E. L., Östlund G2. (2015) InParanoid 8: orthology analysis between 273 proteomes, mostly eukaryotic. Nucleic Acids Res. 43, D234-D239.

94. Kanehisa M., Goto S., Kawashima S., Okuno Y., Hattori M. (2004) The KEGG resource for deciphering the genome. Nucleic Acids Res. 32, D277-D280.

95. Altenhoff A. M., Škunca N., Glover N., Train C. M., Sueki A., Piližota I., Gori K., Tomiczek B., Müller S., Redestig H., Gonnet G. H., Dessimoz C. (2015) The OMA orthology database in 2015: function predictions, better plant support, synteny view and other improvements. Nucleic Acids Res. 43, D240-D249.

96. Waterhouse R. M., Tegenfeldt F., Li J., Zdobnov E. M., Kriventseva E. V. (2013) OrthoDB: a hierarchical catalog of animal, fungal and bacterial orthologs. Nucleic Acids Res. 41, D358-365.

97. Huerta-Cepas J., Capella-Gutiérrez S., Pryszcz L. P., Marcet-Houben M., Gabaldón T. (2014) PhylomeDB v4: zooming into the plurality of evolutionary histories of a genome. Nucleic Acids Res. 42, D897-D902.

98. Ruan J., Li H., Chen Z., Coghlan A., Coin L. J., Guo Y., Hériché J. K., Hu Y., Kristiansen K., Li R., Liu T., Moses A., Qin J., Vang S., Vilella A. J., Ureta-Vidal A., Bolund L., Wang J., Durbin R. (2008) TreeFam: 2008 Update. Nucleic Acids Res. 36, D735-D740.

99. Wu T. J., Shamsaddini A., Pan Y., Smith K., Crichton D. J., Simonyan V., Mazumder R. (2014) A framework for organizing cancer-related variations from existing databases, publications and NGS data using a High-performance Integrated Virtual Environment (HIVE). Database. bau022.

100. Peterson T.A., Adadey A., Santana-Cruz I., Sun, Y., Winder A., Kann M. G. (2010) DMDM: Domain Mapping of Disease Mutations. Bioinformatics 26, 2458-2459.

101. Chatr-Aryamontri A., Breitkreutz B. J., Oughtred R., Boucher L., Heinicke S., Chen D., Stark C., Breitkreutz A., Kolas N., O'Donnell L., Reguly T., Nixon J., Ramage L., Winter A., Sellam A., Chang C., Hirschman J., Theesfeld C., Rust J., Livstone M. S., Dolinski K., Tyers M. (2015) The BioGRID interaction database: 2015 update. Nucleic Acids Research. 43, D470-D478.

102. Salwinski L., Miller C. S., Smith A. J., Pettit F. K., Bowie J. U., Eisenberg D. (2004) The Database of Interacting Proteins: 2004 update. Nucleic Acids Res. 32, D449-D451.

103. Orchard S., Ammari M., Aranda B., Breuza L., Briganti L., Broackes-Carter F., Campbell N. H., Chavali G., Chen C., del-Toro N., Duesbury M., Dumousseau M., Galeota E., Hinz U., Iannuccelli M., Jagannathan S., Jimenez R., Khadake J., Lagreid A., Licata L., Lovering R. C., Meldal B., Melidoni A. N., Milagros M., Peluso D., Perfetto L., Porras P., Raghunath A., Ricard-Blum S., Roechert B., Stutz A., Tognolli M., van Roey K., Cesareni G., Hermjakob H. (2014) The MIntAct project--IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res. 42, D358-D363.

104. Licata L., Briganti L., Peluso D., Perfetto L., Iannuccelli M., Galeota E., Sacco F., Palma A., Nardozza A. P., Santonico E., Castagnoli L., Cesareni G. (2012) MINT, the molecular interaction database: 2012 update. Nucleic Acids Res. 40, D857-D861.

105. Szklarczyk D., Franceschini A., Wyder S., Forslund K., Heller D., Huerta-Cepas J., Simonovic M., Roth A., Santos A., Tsafou K. P., Kuhn M., Bork P., Jensen L. J., von Mering C. (2015) STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 43, D447-D452.

106. Schaab C., Geiger T., Stoehr G., Cox J., Mann M. (2012) Analysis of high accuracy, quantitative proteomics data in the MaxQB database. Mol Cell Proteomics. 11, M111.014068.

107. Wang M., Herrmann C. J., Simonovic M., Szklarczyk D., von Mering C. (2015) Version 4.0 of PaxDb: Protein abundance data, integrated across model organisms, tissues, and cell-lines. Proteomics. 15, 3163-3168.

108. Desiere F., Deutsch E. W., King N. L., Nesvizhskii A. I., Mallick P., Eng J., Chen S., Eddes J., Loevenich S. N., Aebersold R. (2006) The PeptideAtlas project. Nucleic Acids Res. 34, D655-D658.

109. Vizcaino J. A., Cote R. G., Csordas A., Dianes J. A., Fabregat A., Foster J. M., Griss J., Alpi E., Birim M., Contell J., O'Kelly G., Schoenegger A., Ovelleiro D., Perez-Riverol Y., Reisinger F., Rios D., Wang R., Hermjakob H. (2013) The Proteomics Identifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res. 41, D1063-D1069.

110. Wienkoop S., Staudinger C., Hoehenwarter W., Weckwerth W. and Egelhofer V. (2012) ProMEX - a mass spectral reference database for plant proteomics. Front. Plant Sci. 3, 125.

111. Duan G., Li X., Köhn M. (2015) The human DEPhOsphorylation database DEPOD: a 2015 update. Nucleic Acids Res. 43, D531-D535.

112. Ross K. E., Arighi C. N., Ren J., Huang H., Wu C. H. (2013) Construction of protein phosphorylation networks by data mining, text mining and ontology integration: analysis of the spindle checkpoint. Database (Oxford) bat038.iPTMnet

113. Durek P., Schmidt R., Heazlewood J. L., Jones A., Maclean D., Nagel A., Kersten B., Schulze W. X. (2010) PhosPhAt: the Arabidopsis thaliana phosphorylation site database. An update. Nucleic Acids Research 38, D828-D834.

114. Dinkel H., Chica C., Via A., Gould C. M., Jensen L. J., Gibson T. J., Diella F. (2011) Phospho.ELM: a database of phosphorylation sites--update 2011. Nucleic Acids Res. 39, D261-D27.

115. Sadowski I., Breitkreutz B. J., Stark C., Su T. C., Dahabieh M., Raithatha S., Bernhard W., Oughtred R., Dolinski K., Barreto K., Tyers M. (2013) The PhosphoGRID Saccharomyces cerevisiae protein phosphorylation site database: version 2.0 update. Database (Oxford). bat026.

116. Hornbeck P. V., Zhang B., Murray B., Kornhauser J. M., Latham V., Skrzypek E. (2014) PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res. 43, D512-D520.

117. Campbell M. P., Peterson R., Mariethoz J., Gasteiger E., Akune Y., Aoki-Kinoshita K. F., Lisacek F., Packer N. H. (2014) UniCarbKB: building a knowledge platform for glycoproteomics. Nucleic Acids Res. 42, D215-D221.

118. The Gene Ontology Consortium. (2015) Gene Ontology Consortium: going forward. Nucl Acids Res. 43, D1049-D1056.

119. Natale D. A., Arighi C. N., Blake J. A., Bult C. J., Christie K. R., Cowart J., D'Eustachio P., Diehl A. D., Drabkin H. J., Helfer O., Huang H., Masci A. M., Ren J., Roberts N. V., Ross K., Ruttenberg A., Shamovsky V., Smith B., Yerramalla M. S., Zhang J., AlJanahi A., Çelen I., Gan C., Lv M., Schuster-Lezell E., Wu C. H. (2014) Protein Ontology: a controlled structured network of protein entities. Nucleic Acids Res. 42, D415-D421.

120. Mari A., Rasi C., Palazzo P., Scala E. (2009) Allergen databases: current status and perspectives. Curr Allergy Asthma Rep. 9, 376-83.

121. Lombard V., Golaconda Ramulu H., Drula E., Coutinho P. M., Henrissat B. (2014) The Carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 42, D490-D495.

122. Lenfant N., Hotelier T., Velluet E., Bourne Y., Marchot P., Chatonnet A. (2013) ESTHER, the database of the alpha/beta-hydrolase fold superfamily of proteins: tools to explore diversity of functions. Nucleic Acids Res. 41, D423-D429.

123. Isberg V., Vroling B., van der Kant R., Li K., Vriend G., Gloriam D. (2014) GPCRDB: an information system for G protein-coupled receptors. Nucleic Acids Res. 42, D422-D425.

124. Giudicelli V., Duroux P., Ginestoux C., Folch G., Jabado-Michaloud J., Chaume D., Lefranc M. P. (2006) IMGT/LIGM-DB, the IMGT comprehensive database of immunoglobulin and T cell receptor nucleotide sequences. Nucleic Acids Res. 34, D781-D784.

125. Rawlings N. D., Waller M., Barrett A. J., Bateman A. (2014) MEROPS: the database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 42, D503-D509.

126. Jeffery C. J. (1999) Moonlighting proteins. Trends Biochem Sci. 24, 8-11.

127. Murphy C., Powlowski J., Wu M., Butler G., Tsang A. (2011) Curation of characterized glycoside hydrolases of fungal origin. Database doi: 10.1093/database/bar020

128. Fawal N., Li Q., Savelli B., Brette M., Passaia G., Fabre M., Mathé C., Dunand C. (2013) PeroxiBase: a database for large-scale evolutionary analysis of peroxidases. Nucleic Acids Res. 41, D441-414.

129. Roberts R.J., Vincze T., Posfai J., Macelis D. (2015) REBASE-a database for DNA restriction and modification: enzymes, genes and genomes. Nucleic Acids Res. 43, D298-D299.

130. Saier M. H., Reddy V. S., Tamang D. G., Vastermark A. (2014), The transporter classification database. Nucleic. Acids Res. 42, D251-D258.

131. Frenkel-Morgenstern M., Gorohovski A., Lacroix V., Rogers M., Ibanez K., Boullosa C., Andres Leon E., Ben-Hur A., Valencia A. (2013) ChiTaRS: a database of human, mouse and fruit fly chimeric transcripts and RNA-sequencing data. Nucleic Acids Res. 41, D142-D151.

132. Mihalek I., Res I., Lichtarge O. (2004). A Family of Evolution-Entropy Hybrid Methods for Ranking of Protein Residues by Importance. J. Mol. Bio. 336, 1265-1282.

133. Good B. M., Clarke E. L., de Alfaro L., Su A. I. (2012) The Gene Wiki in 2011: community intelligence applied to human gene annotation. Nucleic Acids Res. 40, D1255-D1261.

134. Schmidt E. E., Pelz O., Buhlmann S., Kerr G., Horn T., Boutros M. (2013) GenomeRNAi: a database for cell-based and in vivo RNAi phenotypes, 2013 update. Nucleic Acids Res. 41, D1021-1026.

135. Igarashi Y., Heureux E., Doctor K. S., Talwar P., Gramatikova S., Gramatikoff K., Zhang Y., Blinov M., Ibragimova S. S., Boyd S., Ratnikov B., Cieplak P., Godzik A., Smith J. W., Osterman A. L., Eroshkin A. M. (2009) PMAP: databases for analyzing proteolytic events and pathways. Nucleic Acids Res. 37, D611-D618.

136. Diehn M., Sherlock G., Binkley G., Jin H., Matese J. C., Hernandez-Boussard T., Rees C. A., Cherry J. M., Botstein D., Brown P. O., Alizadeh A. A. (2003) SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data. Nucleic Acids Res. 31, 219-23.