Table
1 Overview of Protein Bioinformatics Databases
Category |
DB Short Name |
DB Name |
Database Content |
URLs |
Ref. |
Sequence
databases |
CCDS |
The
Consensus CDS protein set database |
Protein-coding
regions that are identically annotated on the human and mouse reference
genome assemblies |
(1) |
|
DDBJ |
DNA
Data Bank of Japan |
Annotated
nucleotide sequence |
(2) |
||
ENA |
European
Nucleotide Archive |
Raw
sequencing data, sequence assembly information and functional annotation |
(3) |
||
GenBank |
GenBank
nucleotide sequence database |
All
publicly available DNA sequences |
(4) |
||
RefSeq* |
NCBI
Reference Sequence Database |
Comprehensive,
integrated, non-redundant, well-annotated set of sequences, including genomic
DNA, transcripts, and proteins |
(5) |
||
UniGene |
Database
of computationally identifies transcripts from the same locus |
Gene-oriented
nucleotide sequence clusters |
(4) |
||
UniProtKB* |
Universal
Protein Resource (UniProt) |
Comprehensive
resource for protein sequence and annotation data |
(6) |
||
2D gel
databases |
COMPLUYEAST-2DPAGE |
2-DE
database at Universidad Complutense de Madrid, Spain |
Candida
albicans two-dimensional polyacrilamide gel electrophoresis database |
(7) |
|
REPRODUCTION-2DPAGE |
2-DE
database at Nanjing Medical University, China |
Two-dimensional
polyacrylamide gel electrophoresis database |
(8) |
||
SWISS-2DPAGE |
2-DE
database at Swiss Institute of Bioinformatics, Switzerland |
Data
on proteins identified on various 2-D PAGE and SDS-PAGE reference maps |
(9) |
||
World-2DPAGE* |
The
World-2DPAGE database |
2D-PAGE
resources and tools |
(10) |
||
3D
structure databases |
DisProt |
Database
of Protein Disorder |
Information
about proteins that lack fixed 3D structure in their putatively native
states, either in their entirety or in part |
(11) |
|
MobiDB |
Database
of intrinsically disordered and mobile proteins |
Intrinsically
disordered and mobile proteins |
(12) |
||
ModBase |
Database
of Comparative Protein Structure Models |
Annotated
protein structure models |
(13) |
||
PDBe* |
Protein
Data Bank at Europe |
European
resource for the collection, organization and dissemination of data on
biological macromolecular structures |
(14) |
||
PDBj* |
Protein
Data Bank at Japan |
Centralized
PDB archive of macromolecular structures and provides integrated tools |
(15) |
||
PDBsum |
Pictorial
database of 3D structures in the Protein Data Bank |
At-a-glance
overview of the contents of each 3D structure deposited in the Protein Data
Bank (PDB) |
(16) |
||
ProteinModelPortal |
Protein
Model Portal of the PSI-Nature Structural Biology Knowledgebase |
PSI-Nature
Structural Biology Knowledgebase |
(17) |
||
RCSB-PDB* |
Protein
Data Bank at RCSB |
Single
worldwide repository of information about the 3D structures of large
biological molecules, including proteins and nucleic acids |
(18) |
||
SMR |
Database
of annotated 3D protein structure models |
Annotated
three-dimensional comparative protein structure models generated by the fully
automated homology-modelling pipeline SWISS-MODEL |
(19) |
||
Chemistry
databases |
BindingDB |
The
Binding Datbase |
Measured
binding affinities data for protein targets and small molecules |
(20) |
|
ChEMBL* |
Database
of bioactive drug-like small molecules |
Bioactive
drug-like small molecules |
(21) |
||
DrugBank |
Drug
and Drug Target Databse |
Detailed
drug (i.e. chemical, pharmacological and pharmaceutical) data with
comprehensive drug target (i.e. sequence, structure, and pathway) information |
(22) |
||
Enzyme
and pathway databases |
MetaCyc/BioCyc* |
MetaCyc
Database of Metabolic Pathways, BioCyc Collection of Pathway/Genome Databases |
Full
genome and predicted metabolic network of organism-specific Pathway/Genome
Databases |
(23) |
|
BRENDA* |
BRaunschweig
ENzyme DAtabase |
Enzyme
and enzyme-ligand information obtained from different sources |
(24) |
||
ENZYME |
Enzyme
nomenclature database |
Information
relative to the nomenclature of enzymes |
(25) |
||
Reactome* |
A
knowledgebase of biological pathways and processes |
Manually
curated and peer-reviewed pathway database |
(26) |
||
SABIO-RK |
SABIO-RK:
Biochemical Reaction Kinetics Database |
Biochemical
reactions, their kinetic rate equations with parameters and experimental
conditions |
(27) |
||
SignaLink |
A
signaling pathway resource with multi-layered regulatory networks |
Integrated
resource to analyze signaling pathway cross-talks, transcription factors,
miRNAs and regulatory enzymes |
(28) |
||
UniPathway |
UniPathway: a resource for the exploration of
metabolic pathways |
Curated resource of enzyme-catalyzed and
spontaneous chemical reactions |
(29) |
||
Family
and domain databases |
Gene3D |
Structural
and Functional Annotation of Protein Families |
Structural
and Functional Annotation of Protein Families |
(30) |
|
HAMAP |
High-quality
Automated and Manual Annotation of Proteins |
Collection
of manually curated family profiles for protein classification, and
associated, manually created annotation rules that specify annotations that
apply to family members |
(31) |
||
InterPro* |
Integrated
resource of protein families, domains and functional sites |
Functional
analysis of proteins by classifying them into families and predicting domains
and important sites |
(32) |
||
PANTHER |
The
PANTHER Classification System |
Classify
proteins (and their genes) in order to facilitate high-throughput analysis |
(33) |
||
Pfam* |
The
Pfam protein families database |
Protein
families, each represented by multiple sequence alignments and hidden Markov models
(HMMs) |
(34) |
||
PIRSF* |
A
whole-protein classification database |
Whole-protein
classification database |
(35) |
||
PRINTS |
Protein
Motif fingerprint database |
Protein
fingerprints |
(36) |
||
ProDom |
Protein
domain families database |
Protein
domain families automatically generated from the UniProt Knowledge Database |
(37) |
||
PROSITE* |
Database
of protein domains, families and functional sites |
Protein
domains, families and functional sites as well as associated patterns and
profiles to identify them |
(38) |
||
ProtoNet |
Automatic
hierarchical classification of proteins |
Automatic
hierarchical classification of protein sequences |
(39) |
||
SMART |
Simple
Modular Architecture Research Tool |
Identification
and annotation of genetically mobile domains and the analysis of domain
architectures |
(40) |
||
SUPFAM |
Superfamily
database of structural and functional annotation |
Structural
and functional annotation for all proteins and genomes |
(41) |
||
TIGRFAMs |
TIGRFAMs
protein family database |
Curated
multiple sequence alignments, Hidden Markov Models (HMMs) for protein
sequence classification, and associated information designed to support
automated annotation of (mostly prokaryotic) proteins |
(42) |
||
Gene
expression databases |
Bgee |
Database
for Gene Expression Evolution |
Comparison
of expression patterns between animals |
(43) |
|
CleanEx |
Database
of gene expression profiles |
Gene
expression data via unique approved gene symbols |
(44) |
||
Genevisible |
Search
portal to normalized and curated expression data from Genevestigator |
Curated
expression data |
(45) |
||
ExpressionAtlas* |
Database
of Differential and Baseline Expression |
Gene
expression patterns under different biological conditions |
(46) |
||
Genome
annotation databases |
Ensembl* |
Ensembl
Eukaryotic genome annotation database |
Annotations,
querying tools and access methods for chordates and key model organisms |
(47) |
|
EnsemblBacteria |
Ensembl
Bacteria genome annotation database |
Genome
sequences from bacteria and archaea have been annotated and deposited in the
public archives of the members of the International Nucleotide Sequence
Database Collaboration |
(48) |
||
EnsemblFungi |
Ensembl
Fungi genome annotation database |
All
fungal genomes whose sequence and annotation has been completed and submitted
to the International Nucleotide Sequence Database Collaboration |
(48) |
||
EnsemblMetazoa |
Ensembl
Metazoa genome annotation database |
Genome
sequences and annotations of all Ensembl Metazoa species |
(48) |
||
EnsemblPlants |
Ensembl
Plants genome annotation database |
Genome
sequences and annotations of all Ensembl Plants species |
(48) |
||
EnsemblProtists |
Ensembl
Protists genome annotation database |
Genome
sequences and annotations of all Ensembl Protists species |
(48) |
||
Entrez
Gene* |
Database
of Genes of Genomes in the Reference Sequence Collection |
Known
and predicted genes defined by nucleo- tide sequence or map position |
(49) |
||
KEGG |
Kyoto
Encyclopedia of Genes and Genomes |
Biological
interpretation of genome sequences and other high-throughput data |
(50) |
||
PATRIC |
Bacterial
Bioinformatics Resource Center |
Bacterial
infectious diseases via integration of vital pathogen information with rich data
and analysis tools |
(51) |
||
UCSC* |
UCSC
Genome Bioinformatics |
Reference
sequence and working draft assemblies for a large collection of genomes |
(52) |
||
VectorBase |
Bioinformatics
Resource for Invertebrate Vectors of Human Pathogens |
Invertebrate
Vectors of Human Pathogens |
(53) |
||
WBParaSite |
WormBase
ParaSite |
Parasitic
worm genomics |
(54) |
||
Organism
specific databases |
ArachnoServer |
ArachnoServer:
Spider toxin database |
Information
on the sequence, three-dimensional structure, and biological activity of
protein toxins derived from spider venom |
(55) |
|
CGD |
Candida
Genome Database |
Genomic
sequence data and gene and protein information for Candida albicans and related
species |
(56) |
||
ConoServer |
ConoServer:
Cone snail toxin database |
Sequence
and structures of conopeptides, which are peptides expressed by carnivorous
marine cone snails |
(57) |
||
CTD |
Comparative
Toxicogenomics Database |
Genes
and proteins responding to environmental toxic agents across diverse species |
(58) |
||
dictyBase |
Central
resource for Dictyostelid genomics |
Model
organism database for the social amoeba Dictyostelium
discoideum |
(59) |
||
EchoBASE |
EchoBASE
- an integrated post-genomic database for E. coli. |
Information
from post-genomic experiments using the model bacterium Escherichia coli K-12
|
(60) |
||
EcoGene |
Escherichia
coli strain K12 genome database |
Structural
and functional annotation of Escherichia coli K-12 |
(61) |
||
euHCVdb |
The
European Hepatitis C Virus database |
Collection
of computer-annotated sequences based on Hepatitis C Virus reference genomes |
(62) |
||
EuPathDB |
Eukaryotic
Pathogen Database Resources |
Eukaryotic
pathogens of the genera Cryptosporidium, Giardia, Leishmania, Neospora,
Plasmodium, Toxoplasma, Trichomonas and Trypanosoma |
(63) |
||
FlyBase* |
A Database of Drosophila Genes & Genomes |
Drosophila Genes
and Genomes |
(64) |
||
GenAtlas |
A
database on genes, functions and related diseases |
Scientific
and clinical digests on human genes and diseases |
(65) |
||
GeneCards |
The
Human Gene Database |
All
annotated and predicted human genes |
(66) |
||
GenoList |
Integrated
Environment for the Analysis of Microbial Genomes |
Genome
data from bacterial species |
(67) |
||
Gramene |
A
comparative resource for plants |
Data
resource for comparative functional genomics in crops and model plant species |
(68) |
||
H-InvDB |
H-Invitational
Database |
Curated
annotations of human genes and transcripts |
(69) |
||
HGNC |
HUGO
Gene Nomenclature Committee Database |
Unique
symbols and names for human loci, including protein coding genes, ncRNA genes
and pseudogenes |
(70) |
||
HPA |
The
Human Protein Atlas |
A
large majority of all human protein-coding genes regarding the expression and
localization of the corresponding proteins based on both RNA and protein data |
(71) |
||
HUGE |
A
Database of Human Unidentified Gene-Encoded Large Proteins |
Large
human proteins of hitherto unidentified functions |
(72) |
||
LegioList |
Legionella
pneumophila genome database |
Genomic
information from L. pneumophila |
(73) |
||
Leproma |
Mycobacterium
leprae genome database |
Broad
view of the Mycobacterium leprae genome |
(74) |
||
MaizeGDB |
Maize
Genetics and genomics Database |
Curated,
community-oriented database and informatics service to researchers focused on
the crop plant and model organism Zea mays ssp. mays |
(75) |
||
MGD* |
Mouse
Genome Database |
Integrated
genetic, genomic, and biological data for the laboratory mouse |
(76) |
||
Micado |
MICrobial
Advanced Database Organization |
Microbial
genomes and functional analysis of Bacillus subtilis |
(77) |
||
OMIM |
Online
Mendelian Inheritance in Man |
An
Online Catalog of Human Genes and Genetic Disorders |
(78) |
||
neXtProt* |
Exploring
the universe of human proteins |
Human
protein knowledge platform |
(79) |
||
Orphanet |
The
portal for rare diseases and orphan drugs |
Information
on rare diseases and orphan drugs |
(80) |
||
PharmGKB |
The
Pharmacogenomics Knowledgebase |
Curated
knowledge about the impact of genetic variation on drug response for
clinicians and researchers. |
(81) |
||
PomBase |
The
scientific resource for fission yeast |
Fission
yeast Schizosaccharomyces pombe |
(82) |
||
PseudoCAP |
The
Pseudomonas Genome Database |
Internet-based
and community-based genome annotation project for analysis of a genome of a
free-living organism |
(83) |
||
RGD |
Rat
Genome Database |
Genetic,
genomic, phenotype, and disease data generated from rat research |
(84) |
||
Rouge |
A
Database of Rodent Unidentified Gene-Encoded Large Proteins |
Information
obtained from mouse homologues of the KIAA cDNAs (mKIAA cDNAs) |
(72) |
||
SGD |
Saccharomyces
Genome Database |
Comprehensive
integrated biological information for the budding yeast Saccharomyces
cerevisiae |
(85) |
||
TAIR |
The
Arabidopsis Information Resource |
Genetic
and molecular biology data for the model higher plant Arabidopsis thaliana |
(86) |
||
TubercuList |
Mycobacterium tuberculosis strain H37Rv genome
database |
Mycobacterium
tuberculosis genome database |
(87) |
||
WormBase |
C. elegans and related nematodes Genetics and Genomics
Database |
Model organism
database for Caenorhabditis elegans and other related nematodes |
(54) |
||
Xenbase |
Xenopus
laevis and tropicalis biology and genomics resource |
Diverse
and rich genomic, expression and functional data available from Xenopus
research |
(88) |
||
ZFIN |
The
Zebrafish Model Organism Database |
Zebrafish
Information Network genome database |
(89) |
||
Phylogenomic
databases |
eggNOG |
Database
of orthologous groups and functional annotation |
Non-supervised
orthologous groups (NOGs) from complete genomes |
(90) |
|
HOGENOM |
Database
of Homologous Genes from Fully Sequenced Organisms |
Complete
Genome Homologous Genes Families |
(91) |
||
HOVERGEN |
Homologous
Vertebrate Genes Database |
Homologous
vertebrate genes, structured under ACNUC sequence database management system |
(92) |
||
InParanoid |
Eukaryotic
Ortholog Groups with inparalogs |
Protein
orthologs inferred by the InParanoid algorithm |
(93) |
||
KO |
Kyoto
Encyclopedia of Genes and Genomes Orthology |
Manually
defined ortholog groups (KO entries), which are categorized under the
hierarchy of KEGG pathways and BRITE ontologies |
(94) |
||
OMA* |
The
OMA orthology database |
Inference
of orthologs among complete genomes |
(95) |
||
OrthoDB |
Database
of Orthologous Groups |
Catalog
of orthologous protein-coding genes |
(96) |
||
PhylomeDB |
Database
for complete catalogs of gene phylogenies (phylomes) |
Complete
catalogs of gene phylogenies (phylomes) |
(97) |
||
TreeFam |
Database
of animal gene trees |
Phylogenetic
trees inferred from animal genomes |
(98) |
||
Polymorphism
and mutation databases |
BioMuta |
Single-nucleotide
variation and disease association database |
Curated
single-nucleotide variation (SNV) and disease association database where the
variations are mapped to the genome/protein/gene |
(99) |
|
dbSNP* |
Database
of Short Genetic Variations |
Simple
genetic variations such as single-base nucleotide substitutions, small-scale
multi-base deletions or insertions, and microsatellite repeats |
(4) |
||
DMDM |
Domain
Mapping of Disease Mutations |
Unique
domain-level view where all human coding mutations are mapped on the protein
domain |
(100) |
||
Protein-protein
interaction databases |
BioGRID |
The
Biological General Repository for Interaction Datasets |
Genetic
and protein interactions curated from the primary biomedical literature for
all major model organism species and humans |
(101) |
|
DIP |
Database
of Interacting Proteins |
Experimentally
determined interactions between proteins |
(102) |
||
IntAct* |
IntAct
Molecular Interaction Database |
Open-source,
open data molecular interaction database populated by data either curated
from the literature or from direct data depositions |
(103) |
||
MINT |
The
Molecular INTeraction database |
Experimentally
verified protein-protein interactions mined from the scientific literature by
expert curators |
(104) |
||
STRING |
Search
Tool for the Retrieval of Interacting Genes/Proteins |
Known
and Predicted Protein-Protein Interactions |
(105) |
||
Proteomic
databases |
MaxQB |
The
MaxQuant DataBase |
Large
proteomics projects and allows joint analysis and comparison |
(106) |
|
PaxDb |
Protein
Abundance Across Organisms |
Comprehensive
absolute protein abundance database, which contains whole genome protein
abundance information across organisms and tissues |
(107) |
||
PeptideAtlas* |
PeptideAtlas |
Peptides
identified in a large set of tandem mass spectrometry proteomics experiments |
(108) |
||
PRIDE* |
PRoteomics
IDEntifications database |
Centralized,
standards compliant, public data repository for proteomics data |
(109) |
||
ProMEX |
Protein
Mass spectra EXtraction |
Tryptic
peptide fragmentation mass spectra derived from plants. |
(110) |
||
PTM
databases |
DEPOD* |
The
Human DEPhOsphorylation Database |
Human
active phosphatases, their experimentally verified protein and non-protein
substrates and dephosphorylation site information, and pathways in which they
are involved |
(111) |
|
iPTMnet* |
Protein
post-translational modifications (PTMs) in systems biology context |
Protein
phosphorylation information |
(112) |
||
PhosPhAt* |
The
Arabidopsis Protein Phosphorylation Site Database |
Information
on Arabidopsis phosphorylation sites which were identified by mass
spectrometry in large scale experiments by different research groups |
(113) |
||
Phospho.ELM* |
Database
of S/T/Y phosphorylation sites |
Experimentally
verified phosphorylation sites in eukaryotic proteins |
(114) |
||
PhosphoGrid* |
Database
of experimentally verified in vivo protein phosphorylation sites |
Experimentally
verified in vivo protein phosphorylation sites in the model eukaryotic
organism Saccharomyces cerevisisae |
(115) |
||
PhosphoSitePlus* |
Phosphorylation
site database |
Comprehensive
information and tools for the study of protein post-translational
modifications (PTMs) |
(116) |
||
UniCarbKB* |
Database
of glycomics and glycobiology |
Online
information storage and search platform for glycomics and glycobiology
research |
(117) |
||
Ontology |
GO* |
Gene
Ontology |
Consistent
descriptions of gene products across databases |
(118) |
|
PRO |
Protein
Ontology |
Ontological
representation of protein-related entities by explicitly defining them and
showing the relationships between them |
(119) |
||
Specialized
protein databases |
Allergome |
Allergome:
platform for allergen knowledge |
Allergenic
molecules and proteins reported as allergens |
(120) |
|
CAZy |
Carbohydrate-Active
enZYmes Database |
Families
of structurally-related catalytic and carbohydrate-binding modules (or
functional domains) of enzymes that degrade, modify, or create glycosidic
bonds |
(121) |
||
ESTHER |
ESTerases
and alpha/beta-Hydrolase Enzymes and Relatives database |
Proteins
or protein domains belonging to the superfamily of alpha/beta-hydrolases,
exemplified by the cholinesterases |
(122) |
||
GPCRDB |
Information
system for G protein-coupled receptors (GPCRs) |
Data
on sequences, ligand binding constants and mutations on G protein-coupled
receptors (GPCRs) |
(123) |
||
IMGT |
The
International ImMunoGeneTics information system |
Immunoglobulin
(IG) and T cell receptor (TR) nucleotide sequences from human and other
vertebrate species |
(124) |
||
MEROPS* |
MEROPS
protease database |
Peptidases
(also termed proteases, proteinases and proteolytic enzymes) and the proteins
that inhibit them |
(125) |
||
MoonProt |
Moonlighting
protein database |
Information
about the proteins that have been experimentally verified to be moonlighting
proteins |
(126) |
||
mycoCLAP |
Characterized
Lignocellulose-Active Proteins of Fungal Origin |
Fungal
and bacterial genes encoding lignocellulose-active proteins that have been
biochemically characterized |
(127) |
||
PeroxiBase |
The
peroxidases database |
Enzymes
able to catalyze reduction of hydrogen peroxide and oxidize various
substrates |
(128) |
||
REBASE |
The
Restriction Enzyme Database |
Information
about restriction enzymes and related proteins |
(129) |
||
TCDB |
Transporter
Classification Database |
Functional
and Phylogenetic Classification of Membrane Transport Proteins |
(130) |
||
Other
(Miscellaneous) databases |
ChiTaRS |
Database
of chimeric transcripts and RNA-seq data |
Chimeric
RNAs from humans, mice and fruit flies with confirmation from RNA-seq reads |
(131) |
|
EvolutionaryTrace |
Database
of relative evolutionary importance of amino acids within a protein sequence |
Evolutionary
Trace Analysis of a protein structure |
(132) |
||
GeneWiki* |
Wiki
portal for the annotation of gene and protein function |
Applying
community intelligence to the annotation of gene and protein function |
(133) |
||
GenomeRNAi |
Database
of phenotypes from RNA interference screens in Drosophila and Homo sapiens |
Phenotypes
from RNA interference (RNAi) screens in Drosophila and Homo sapiens |
(134) |
||
PMAP-CutDB |
Proteolytic
event database |
Proteolytic
pathways |
(135) |
||
SOURCE |
The
Stanford Online Universal Resource for Clones and ESTs |
Encapsulate
the genetics and molecular biology of genes from the genomes of Homo sapiens,
Mus musculus, Rattus norvegicus into easy to navigate GeneReports |
(136) |
* Covered in section 3 of the chapter.
References:
1.
Farrell
C. M., O'Leary N. A., Harte R. A., Loveland J. E., Wilming L. G., Wallin C.,
Diekhans M., Barrell D., Searle S. M., Aken B., Hiatt S. M., Frankish A., Suner
M. M., Rajput B., Steward C. A., Brown G. R., Bennett R., Murphy M., Wu W., Kay
M. P., Hart J., Rajan J., Weber J., Snow C., Riddick L. D., Hunt T., Webb D.,
Thomas M., Tamez P., Rangwala S. H., McGarvey K. M., Pujar S., Shkeda A., Mudge
J. M., Gonzalez J. M., Gilbert J. G., Trevanion S. J., Baertsch R., Harrow J.
L., Hubbard T., Ostell J. M., Haussler D., Pruitt K. D. (2014) Current status
and new features of the Consensus Coding Sequence database. Nucleic Acids Res. 42, D865-D872.
2.
Kodama
Y., Mashima J., Kosuge T., Katayama T., Fujisawa T., Kaminuma E., Ogasawara O.,
Okubo K., Takagi T., Nakamura Y. (2015) The DDBJ Japanese Genotype-phenotype
Archive for genetic and phenotypic human data. Nucleic Acids Res. 43,
D18-D22.
3.
Kulikova
T., Akhtar R., Aldebert P., Althorpe N., Andersson M., Baldwin A., Bates K.,
Bhattacharyya S., Bower L., Browne P., Castro M., Cochrane G., Duggan K.,
Eberhardt R., Faruque N., Hoad G., Kanz C., Lee C., Leinonen R., Lin Q.,
Lombard V., Lopez R., Lorenc D., McWilliam H., Mukherjee G., Nardone F., Pastor
M. P., Plaister S., Sobhany S., Stoehr P., Vaughan R., Wu D., Zhu W., Apweiler
R. (2007) EMBL Nucleotide Sequence Database in 2006. Nucleic Acids Res. 35,
D16-D20.
4.
Agarwala
R., Barrett T., Beck J., Benson D. A., Bollin C., Bolton E., Bourexis D.,
Brister J., Bryant S. H., Canese K., Clark K., DiCuccio M., Dondoshansky I.,
Federhen S., Feolo M., Funk K., Geer L. Y., Gorelenkov V., Hoeppner M., Holmes
B., Johnson M., Khotomlianski V., Kimchi A., Kimelman M., Kitts P., Klimke W.,
Krasnov S., Kuznetsov A., Landrum M. J., Landsman D., Lee J. M., Lipman D. J.,
Lu Z., Madden T. L., Madej T., Marchler-Bauer A., Karsch-Mizrachi I., Murphy
T., Orris R., Ostell J., O'Sullivan C., Panchenko A., Phan L., Preuss D.,
Pruitt K. D., Rubinstein W., Sayers E. W., Schneider V., Schuler G. D., Sherry
S. T., Sirotkin K., Siyan K., Slotta D., Soboleva A., Soussov V., Starchenko
G., Tatusova T. A., Trawick B. W., Vakatov D., Wang Y., Ward M., Wilbur W.,
Yaschenko E., Zbicz K. (2015) Database resources of the National Center for
Biotechnology Information. Nucleic Acids
Res. 43, D6-D17.
5.
Pruitt
K. D., Tatusova T., Maglott D. R. (2006) NCBI reference sequences (RefSeq): a
curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 35, D61-D65.
6.
The
UniProt Consortium (2015) UniProt: a hub for protein information. Nucleic Acids Res. 43, D204-D212.
7.
Pitarch
A., S‡nchez M., Nombela C., Gil C. (2003) Analysis of the Candida albicans
proteome. II. Protein information technology on the Net (update 2002) J Chromatogr B Analyt Technol Biomed Life
Sci. 787, 129-148.
8.
Zhou
T., Zhou Z. M., Guo X. J. (2013) Bioinformatics for spermatogenesis: annotation
of male reproduction based on proteomics. Asian
J Androl. 15, 594-602.
9.
Hoogland
C., Mostaguir K., Sanchez J. C., Hochstrasser D. F., Appel R. D. (2004)
SWISS-2DPAGE, ten years later. Proteomics
4, 2352-2356.
10.
Hoogland
C., Mostaguir K., Appel R. D., Lisacek F. (2008) The World-2DPAGE Constellation
to promote and publish gel-based proteomics data through the ExPASy server. J Proteomics. 71, 245-248.
11.
Sickmeier
M., Hamilton J. A., LeGall T., Vacic V., Cortese M. S., Tantos A., Szabo B.,
Tompa P., Chen J., Uversky V. N., Obradovic Z., Dunker A. K. (2007) DisProt:
the Database of Disordered Proteins. Nucleic
Acids Res. 35, D786-D793.
12.
Potenza
E., Di Domenico T., Walsh I., Tosatto S. C. (2014) MobiDB 2.0:
an improved database of intrinsically disordered and mobile proteins. Nucleic Acids Res. 43, D315-D320.
13.
Pieper
U., Webb B. M., Dong G. Q., Schneidman-Duhovny D., Fan H., Kim S. J., Khuri N.,
Spill Y. G., Weinkam P., Hammel M., Tainer J. A., Nilges M., Sali A. (2014)
ModBase, a database of annotated comparative protein structure models and
associated resources. Nucleic Acids Res.
42, D336-D346.
14.
Velankar
S., van Ginkel G., Alhroub Y., Battle G. M., Berrisford J. M., Conroy M. J.,
Dana J. M., Gore S. P., Gutmanas A., Haslam P., Hendrickx P. M., Lagerstedt I.,
Mir S., Fernandez Montecelo M. A., Mukhopadhyay A., Oldfield T. J., Patwardhan
A., Sanz-Garc’a E., Sen S., Slowley R. A., Wainwright M. E., Deshpande M. S.,
Iudin A., Sahni G., Salavert T. J., Hirshberg M., Mak L., Nadzirin N.,
Armstrong D. R., Clark A. R., Smart O. S., Korir P. K., Kleywegt G. J. (2015)
PDBe: improved accessibility of macromolecular structure data from PDB and
EMDB. Nucleic Acids Res. doi:
10.1093/nar/gkv1047.
15.
Kinjo
A. R., Suzuki H., Yamashita R., Ikegawa Y., Kudou T., Igarashi R., Kengaku Y.,
Cho H., Standley D. M., Nakagawa A., Nakamura H. (2012) Protein Data Bank Japan
(PDBj): maintaining a structural data archive and resource description
framework format. Nucleic Acids Res. 40, D453-D460.
16.
de Beer T. A., Berka K., Thornton J. M.,
Laskowski R. A. (2014) PDBsum additions. Nucleic
Acids Res. 42, D292-D296.
17.
Haas
J., Roth S., Arnold K., Kiefer F., Schmidt T., Bordoli L., Schwede T. (2013)
The Protein Model Portal-a comprehensive resource for protein structure and
model information. Database doi:
10.1093/database/bat031.
18.
Berman
H. M., Westbrook J., Feng Z., Gilliland G., Bhat T. N., Weissig H., Shindyalov
I. N., Bourne P. E. (2000) The Protein Data Bank. Nucleic Acids Res. 28,
235-242.
19.
Schwede
T., Kopp J., Guex N. and Peitsch M. C. (2003) SWISS-MODEL: an automated protein
homology-modeling server. Nucleic Acids
Res. 31, 3381-3385.
20.
Liu
T., Lin Y., Wen X., Jorissen R. N., Gilson M. K. (2007) BindingDB: a
web-accessible database of experimentally determined protein-ligand binding
affinities. Nucleic Acids Res. 35, D198-D201.
21.
Bento
A. P., Gaulton A., Hersey A., Bellis L. J., Chambers J., Davies M., Krźger F.
A., Light Y., Mak L., McGlinchey S., Nowotka M., Papadatos G., Santos R., Overington
J. P. (2014) The ChEMBL bioactivity database: an update. Nucleic Acids Res. 42,
D1083-D1090.
22.
Law
V., Knox C., Djoumbou Y., Jewison T., Guo A. C., Liu Y., Maciejewski A., Arndt
D., Wilson M., Neveu V., Tang A., Gabriel G., Ly C., Adamjee S., Dame Z. T.,
Han B., Zhou Y., Wishart D. S. (2014) DrugBank 4.0: shedding new light on drug
metabolism. Nucleic Acids Res. 42, D1091-D1097.
23.
Caspi
R. Altman T. Billington R., Dreher K., Foerster H., Fulcher C. A., Holland T.
A., Keseler I. M., Kothari A., Kubo A., Krummenacker M., Latendresse M.,
Mueller L. A., Ong Q., Paley S., Subhraveti P., Weaver D. S., Weerasinghe D.,
Zhang P., and Karp P. D. (2014) The MetaCyc Database of metabolic pathways and
enzymes and the BioCyc collection of Pathway/Genome Databases. Nucleic Acids Res. 42, D459-D471.
24.
Chang
A., Schomburg I., Placzek S., Jeske L., Ulbrich M., Xiao M., Sensen C. W.,
Schomburg D. (2015) BRENDA in 2015: exciting developments in its 25th year of
existence. Nucleic Acids Res. 43,
D439-D446.
25.
Bairoch
A. (2000) The ENZYME database in 2000. Nucleic
Acids Res. 28, 304-305.
26.
Croft
D., Mundo A. F., Haw R., Milacic M., Weiser J., Wu G., Caudy M., Garapati P.,
Gillespie M., Kamdar M. R., Jassal B., Jupe S., Matthews L., May B., Palatnik
S., Rothfels K., Shamovsky V., Song H., Williams M., Birney E., Hermjakob H.,
Stein L., D'Eustachio P. (2014) The Reactome pathway knowledgebase. Nucleic Acids Res. 42, D472-D477.
27.
Wittig
U., Kania R., Golebiewski M., Rey M., Shi L., Jong L., Algaa E., Weidemann A.,
Sauer-Danzwith H., Mir S., Krebs O., Bittkowski M., Wetsch E., Rojas I., Mźller
W. (2012) SABIO-RK - database for biochemical reaction kinetics. Nucleic Acids Res. 40, D790-D796.
28.
Fazekas
D., Koltai M., Tźrei D., M—dos D., P‡lfy M., Dśl Z., Zs‡kai L.,
Szalay-Bekő M., Lenti K., Farkas I. J., Vellai T., Csermely P., Korcsm‡ros
T. (2013) SignaLink 2 - a signaling pathway resource with multi-layered
regulatory networks. BMC Syst Biol. 7, 7.
29.
Morgat
A., Coissac E., Coudert E., Axelsen K. B., Keller G., Bairoch A., Bridge A.,
Bougueleret L., Xenarios I., Viari A. (2012) UniPathway: a resource for the
exploration and annotation of metabolic pathways. Nucleic Acids Res. 40,
D761-D769.
30.
Yeats
C., Maibaum M., Marsden R., Dibley M., Lee D., Addou S., Orengo C. A. (2006)
Gene3D: modelling protein structure, function and evolution. Nucleic Acids Res. 34, D281-D284.
31.
Pedruzzi
I., Rivoire C., Auchincloss A. H., Coudert E., Keller G., de Castro E., Baratin
D., Cuche B. A., Bougueleret L., Poux S., Redaschi N., Xenarios I., Bridge A.
HAMAP in 2015: updates to the protein family classification and annotation
system. Nucleic Acids Res. 43, D1064-D1070.
32.
Mitchell
A., Chang H. Y., Daugherty L., Fraser M., Hunter S., Lopez R., McAnulla C.,
McMenamin C., Nuka G., Pesseat S., Sangrador-Vegas A., Scheremetjew M., Rato
C., Yong S. Y., Bateman A., Punta M., Attwood T. K., Sigrist C. J., Redaschi
N., Rivoire C., Xenarios I., Kahn D., Guyot D., Bork P., Letunic I., Gough J.,
Oates M., Haft D., Huang H., Natale D. A., Wu C. H., Orengo C., Sillitoe I., Mi
H., Thomas P. D., Finn R. D. (2015)
The InterPro protein families database: the classification resource after 15 years.
Nucleic Acids Res. 43, D213-D221.
33.
Mi
H., Muruganujan A., Casagrande J. T., Thomas P. D. (2013) Large-scale gene
function analysis with the PANTHER classification system. Nat Protoc. 8,
1551-1566.
34.
Finn
R. D., Bateman A., Clements J., Coggill P., Eberhardt R. Y., Eddy S. R., Heger
A., Hetherington K., Holm L., Mistry J., Sonnhammer E. L., Tate J., Punta M.
(2014) The Pfam protein families database. Nucleic
Acids Res. 42, D222-D230.
35.
Wu
C. H., Nikolskaya A., Huang H., Yeh L. S., Natale D. A., Vinayaka C. R., Hu Z.
Z., Mazumder R., Kumar S., Kourtesis P., Ledley R. S., Suzek B. E., Arminski
L., Chen Y., Zhang J., Cardenas J. L., Chung S., Castro-Alvear J., Dinkov G.,
Barker W. C. (2004) PIRSF: family classification system at the Protein
Information Resource. Nucleic Acids Res.
32, D112-D114.
36.
Attwood
T. K., Bradley P., Flower D. R., Gaulton A., Maudling N., Mitchell A., Moulton
G., Nordle A., Paine K., Taylor P., Uddin A. and Zygouri, C. (2003) PRINTS and
its automatic supplement, prePRINTS.
Nucleic Acids Res. 31,
400-402.
37.
Servant
F., Bru C., CarrŹre S., Courcelle E., Gouzy J., Peyruc D., Kahn D. (2002)
ProDom: Automated clustering of homologous domains. Briefings in Bioinformatics. 3,
246-251.
38.
Sigrist
C. J., de Castro E., Cerutti L., Cuche B. A., Hulo N., Bridge A., Bougueleret
L., Xenarios I. (2013) New and continuing developments at PROSITE. Nucleic Acids Res. 41, D344-D347.
39.
Rappoport
N., Karsenty S., Stern A., Linial N., Linial M. (2011) ProtoNet 6.0: organizing
10 million protein sequences in a compact hierarchical family tree. Nucleic Acids Res. 40, D313-D320.
40.
Letunic
I., Doerks T., Bork P. (2015) SMART: recent updates,
new developments and status in 2015. Nucleic
Acids Res. 43, D257-D260.
41.
Wilson
D., Pethica R., Zhou Y., Talbot C., Vogel C., Madera M., Chothia C., Gough J.
(2009) SUPERFAMILY - Comparative Genomics, Datamining and Sophisticated
Visualisation. Nucleic Acids Res. 37, D380-D386.
42.
Selengut
J. D., Haft D. H., Davidsen T., Ganapathy A., Gwinn-Giglio M., Nelson W. C.,
Richter A. R., White O. (2007) TIGRFAMs and Genome Properties: tools for the
assignment of molecular function and biological process in prokaryotic genomes.
Nucleic Acids Res. 35, D260-D264.
43.
Bastian
F., Parmentier G., Roux J., Moretti S., Laudet V.,
Robinson-Rechavi M. (2008) Bgee: Integrating and Comparing Heterogeneous
Transcriptome Data Among Species. Lecture
Notes in Computer Science 5109,
124-131.
44.
Praz
V., Jagannathan V., Bucher P. (2004) CleanEx: a database of heterogeneous gene
expression data based on a consistent gene nomenclature. Nucleic
Acids Res. 32, D542-D547.
45.
Grennan
AK. (2006) Genevestigator. Facilitating web-based gene-expression analysis. Plant Physiol. 141, 1164-1166.
46.
Petryszak
R., Burdett T., Fiorelli B., Fonseca N. A., Gonzalez-Porta M., Hastings E.,
Huber W., Jupp S., Keays M., Kryvych N., McMurry J., Marioni J. C., Malone J.,
Megy K., Rustici G., Tang A. Y., Taubert J., Williams E., Mannion O., Parkinson
H. E., Brazma A. (2014) Expression Atlas update-a database of gene and
transcript expression from microarray- and sequencing-based functional genomics
experiments. Nucleic Acids Res. 42, D926-D932.
47.
Cunningham
F., Amode M. R., Barrell D., Beal K., Billis K., Brent S., Carvalho-Silva D.,
Clapham P., Coates G., Fitzgerald S., Gil L., Gir—n C. G., Gordon L., Hourlier
T., Hunt S. E., Janacek S. H., Johnson N., Juettemann T., KŠhŠri A. K., Keenan
S., Martin F. J., Maurel T., McLaren W., Murphy D. N., Nag R., Overduin B.,
Parker A., Patricio M., Perry E., Pignatelli M., Riat H. S., Sheppard D.,
Taylor K., Thormann A., Vullo A., Wilder S. P., Zadissa A., Aken B. L., Birney
E., Harrow J., Kinsella R., Muffato M., Ruffier M., Searle S. M., Spudich G.,
Trevanion S. J., Yates A., Zerbino D. R., Flicek P. (2015) Ensembl 2015. Nucleic Acids Res. 43, D662-D669.
48.
Kersey
P. J., Lawson D., Birney E., Derwent P. S., Haimel M., Herrero J., Keenan S.,
Kerhornou A., Koscielny G., KŠhŠri A., Kinsella R. J., Kulesha E., Maheswari
U., Megy K., Nuhn M., Proctor G., Staines D., Valentin F., Vilella A. J., Yates
A. (2010) Ensembl Genomes: extending Ensembl across the taxonomic space. Nucleic Acids Res. 38, D563-D569.
49.
Maglott
D., Ostell J., Pruitt K. D., Tatusova T. (2005) Entrez Gene: gene-centered
information at NCBI. Nucleic Acids Res.
33, D54-D58.
50.
Kanehisa
M., Sato Y., Kawashima M., Furumichi M., Tanabe M. (2015) KEGG as a reference
resource for gene and protein annotation. Nucleic
Acids Res. doi: 10.1093/nar/gkv1070
51.
Wattam
A. R., Abraham D., Dalay O., Disz T. L., Driscoll T., Gabbard J. L., Gillespie
J. J., Gough R., Hix D., Kenyon R., Machi D., Mao C., Nordberg E. K., Olson R.,
Overbeek R., Pusch G. D., Shukla M., Schulman J., Stevens R. L., Sullivan D.
E., Vonstein V., Warren A., Will R., Wilson M. J., Yoo H. S., Zhang C., Zhang
Y., Sobral B. W. (2014) PATRIC, the bacterial bioinformatics database and
analysis resource. Nucleic Acids Res.
42, D581-D591.
52.
Rosenbloom
K. R., Armstrong J., Barber G. P., Casper J., Clawson H., Diekhans M., Dreszer
T. R., Fujita P. A., Guruvadoo L., Haeussler M., Harte R. A., Heitner S.,
Hickey G., Hinrichs A. S., Hubley R., Karolchik D., Learned K., Lee B. T., Li
C. H., Miga K. H., Nguyen N., Paten B., Raney B. J., Smit A. F., Speir M. L., Zweig
A. S., Haussler D., Kuhn R. M., Kent W. J. (2015) The UCSC Genome Browser
database: 2015 update. Nucleic Acids Res.
43, D670-D681.
53.
Lawson
D., Arensburger P., Atkinson P., Besansky N. J., Bruggner R. V., Butler R.,
Campbell K. S., Christophides G. K., Christley S., Dialynas E., Emmert D.,
Hammond M., Hill C. A., Kennedy R. C., Lobo N. F., MacCallum M. R., Madey G.,
Megy K., Redmond S., Russo S., Severson D. W., Stinson E. O., Topalis P.,
Zdobnov E. M., Birney E., Gelbart W. M., Kafatos F. C., Louis C., Collins F. H.
(2007) VectorBase: a home for invertebrate vectors of human pathogens. Nucleic Acids Res. 35, D503-D505.
54.
Harris
T. W., Baran J., Bieri T., Cabunoc A., Chan J., Chen W. J., Davis P., Done J.,
Grove C., Howe K., Kishore R., Lee R., Li Y., Muller H. M., Nakamura C.,
Ozersky P., Paulini M., Raciti D., Schindelman G., Tuli M. A., Van Auken K.,
Wang D., Wang X., Williams G., Wong J. D., Yook K., Schedl T., Hodgkin J.,
Berriman M., Kersey P., Spieth J., Stein L., Sternberg P. W. (2014) WormBase 2014:
new views of curated biology. Nucleic
Acids Res. 42, D789-D793.
55.
Herzig
V., Wood D. L., Newell F., Chaumeil P. A., Kaas Q., Binford G. J., Nicholson G.
M., Gorse D., King G. F. (2011) ArachnoServer 2.0, an updated online resource
for spider toxin sequences and structures. Nucleic
Acids Res. 39, D653-D657.
56.
Inglis
D. O., Arnaud M. B., Binkley J., Shah P., Skrzypek M. S., Wymore F., Binkley
G., Miyasato S. R., Simison M., Sherlock G. (2012). The Candida genome database
incorporates multiple Candida species: multispecies search and analysis tools
with curated gene and protein information for Candida albicans and Candida
glabrata. Nucleic Acids Res. 40, D667-D674.
57.
Kaas
Q., Yu R., Jin A. H., Dutertre S. and Craik D. J. (2012) ConoServer: updated
content, knowledge, and discovery tools in the conopeptide database. Nucleic Acids Res. 40, D325-D330.
58.
Davis
A. P., Grondin C. J., Lennon-Hopkins K., Saraceni-Richards C., Sciaky D., King
B. L., Wiegers T. C., Mattingly C. J. (2015) The Comparative Toxicogenomics
Database's 10th year anniversary: update 2015. Nucleic Acids Res. 43,
D914-D920.
59.
Basu
S., Fey P., Pandit Y., Dodson R. J., Kibbe W. A., Chisholm R. L. (2013)
DictyBase 2013: integrating multiple Dictyostelid species. Nucleic Acids Res. 41,
D676-D683.
60.
Misra
R. V., Horler R. S., Reindl W., Goryanin I. I., Thomas G. H. (2005) EchoBASE:
an integrated post-genomic database for Escherichia coli. Nucleic Acids Res. 33,
D329-D333.
61.
Zhou
J., Rudd K. E. (2013) EcoGene 3.0. Nucleic
Acids Res., 41, D613-D624.
62.
Combet
C., Garnier N., Charavay C., Grando D., Crisan D., Lopez J., Dehne-Garcia A.,
Geourjon C., Bettler E., Hulo C., Mercier PL., Bartenschlager R., Diepolder H.,
Moradpour D., Pawlotsky J.M., Rice C.M., Trepo C., Penin F. and DelŽage G.
(2007) euHCVdb: the European hepatitis C virus database. Nucleic Acids Res. 35,
D363-D366.
63.
Aurrecoechea
C., Brestelli J., Brunk B. P., Fischer S., Gajria B., Gao X., Gingle A., Grant
G., Harb O. S., Heiges M., Innamorato F., Iodice J., Kissinger J. C., Kraemer
E. T., Li W., Miller J. A., Nayak V., Pennington C., Pinney D. F., Roos D. S.,
Ross C., Srinivasamoorthy G., Stoeckert C. J. Jr, Thibodeau R., Treatman C.,
Wang H. (2010) EuPathDB: a portal to eukaryotic pathogen databases. Nucleic Acids Res. 38, D415-D419.
64.
dos Santos G., Schroeder A. J., Goodman J.
L., Strelets V. B., Crosby M. A., Thurmond J., Emmert D. B., Gelbart W. M.,
FlyBase Consortium. (2015) FlyBase: introduction of the Drosophila melanogaster
Release 6 reference genome assembly and large-scale migration of genome
annotations. Nucleic Acids Res. 43, D690-D697.
65.
FrŽzal
J. (1998) Genatlas database, genes and development defects. C R Acad Sci III 321, 805-817.
66.
Safran
M., Solomon I., Shmueli O., Lapidot M., Shen-Orr S., Adato A., Ben-Dor U.,
Esterman N., Rosen N., Peter I., Olender T., Chalifa-Caspi V., Lancet D. (2002)
GeneCards 2002: towards a complete, object-oriented, human gene compendium. Bioinformatics 18, 1542-1543.
67.
Lechat
P., Hummel L., Rousseau S., Moszer I. (2008) GenoList:
an integrated environment for comparative analysis of microbial genomes. Nucleic Acids Res. 36, D469-D474.
68.
Monaco
M. K., Stein J., Naithani S., Wei S., Dharmawardhana P., Kumari S., Amarasinghe
V., Youens-Clark K., Thomason J., Preece J., Pasternak S., Olson A., Jiao Y.,
Lu Z., Bolser D., Kerhornou A., Staines D., Walts B., Wu G., D'Eustachio P.,
Haw R., Croft D., Kersey P. J., Stein L., Jaiswal P., Ware D (2014). Gramene
2013: comparative plant genomics resources. Nucleic
Acids Res. 42, D1193-D1199.
69.
Yamasaki
C., Murakami K., Takeda J., Sato Y., Noda A., Sakate R., Habara T., Nakaoka H.,
Todokoro F., Matsuya A., Imanishi T., Gojobori T. (2009) H-InvDB in 2009:
extended database and data mining resources for human genes and transcripts. Nucleic Acids Res. 38, D626-D632.
70.
Gray
K. A., Daugherty L. C., Gordon S. M., Seal R. L., Wright M. W., Bruford E. A.
(2013) Genenames.org: the HGNC resources in 2013. Nucleic Acids Res. 41, D545–D552.
71.
UhlŽn
M., Bjšrling E., Agaton C., Szigyarto C. A., Amini B., Andersen E., Andersson
A. C., Angelidou P., Asplund A., Asplund C., Berglund L., Bergstršm K., Brumer
H., Cerjan D., Ekstršm M., Elobeid A., Eriksson C., Fagerberg L., Falk R., Fall
J., Forsberg M., Bjšrklund M. G., Gumbel K., Halimi A., Hallin I., Hamsten C.,
Hansson M., Hedhammar M., Hercules G., Kampf C., Larsson K., Lindskog M.,
Lodewyckx W., Lund J., Lundeberg J., Magnusson K., Malm E., Nilsson P., Odling
J., Oksvold P., Olsson I., Oster E., Ottosson J., Paavilainen L., Persson A.,
Rimini R., Rockberg J., Runeson M., Sivertsson A., Skšllermo A., Steen J., Stenvall
M., Sterky F., Stršmberg S., Sundberg M., Tegel H., Tourle S., Wahlund E.,
WaldŽn A., Wan J., WernŽrus H., Westberg J., Wester K., Wrethagen U., Xu L. L.,
Hober S., PontŽn F. (2005) A human protein atlas for normal and cancer tissues
based on antibody proteomics. Mol Cell
Proteomics. 4, 1920-1932.
72.
Kikuno
R., Nagase T., Nakayama M., Koga H., Okazaki N., Nakajima D., Ohara O. (2004)
HUGE: a database for human KIAA proteins, a 2004 update integrating HUGEppi and
ROUGE. Nucleic Acids Res. 32, D502-D504.
73.
Moszer
I., Glaser P. and Danchin A. (1995) SubtiList: a relational database for the
Bacillus subtilis genome. Microbiology 141, 261-268.
74.
Kapopoulou
A., Lew J. M., Cole S. T. (2011) The MycoBrowser portal: a comprehensive and
manually annotated resource for mycobacterial genomes. Tuberculosis
(Edinb) 91, 8-13.
75.
Andorf
C. M., Cannon E. K., Portwood J. L., Gardiner J. M., Harper L. C., Schaeffer M.
L., Braun B. L., Campbell D. A., Vinnakota A. G., Sribalusu V. V., Huerta M.,
Cho K. T., Wimalanathan K., Richter J. D., Mauch E. D., Rao B. S., Birkett S.
M., Richter J. D., Sen T. Z., Lawrence C. J. (2015) MaizeGDB 2015: New tools,
data, and interface for the maize model organism database. Nucleic Acids Res. doi: 10.1093/nar/gkv1007.
76.
Eppig
J. T., Blake J. A., Bult C. J., Kadin J. A., Richardson J. E., The Mouse Genome
Database Group. (2015) The Mouse Genome Database (MGD): facilitating mouse as a
model for human biology and disease. Nucleic
Acids Res. 43, D726-D736.
77.
Biaudet
V., Samson F., BessiŹres P. (1997) Micado-a network-oriented database for
microbial genomes. Comput Appl Biosci.
13, 431-438.
78.
Hamosh
A., Scott A. F., Amberger J. S., Bocchini C. A., McKusick V. A. (2005) Online
Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic
disorders. Nucleic Acids Res. 33, D514-D517.
79.
Gaudet
P., Argoud-Puy G., Cusin I., Duek P., Evalet O., Gateau A., Gleizes A., Pereira
M., Zahn-Zabal M., Zwahlen C., Bairoch A., Lane L. (2013) neXtProt: organizing
protein knowledge in the context of human proteome projects. J Proteome Res. 12, 293-298.
80.
AymŽ
S., Schmidtke J. (2007). Networking for rare diseases: a necessity for Europe. Bundesgesundheitsblatt Gesundheitsforschung
Gesundheitsschutz 50,
1477–1483.
81.
Thorn
C. F., Klein T. E., Altman R. B. (2005) PharmGKB: the pharmacogenetics and
pharmacogenomics knowledge base. Methods
Mol Biol. 311, 179-191.
82.
Wood
V., Harris M. A., McDowall M. D., Rutherford K., Vaughan B. W., Staines D. M.,
Aslett M., Lock A., BŠhler J., Kersey P. J., Oliver S. G. (2012) PomBase: a
comprehensive online resource for fission yeast. Nucleic Acids Res. 40,
D695-D699.
83.
Winsor
G. L., Lo R., Ho Sui S. J., Ung K. S., Huang S., Cheng D., Ching W. K., Hancock
R. E., Brinkman F. S. (2005) Pseudomonas aeruginosa Genome Database and
PseudoCAP: facilitating community-based, continually updated, genome
annotation. Nucleic Acids Res. 33, D338-D343.
84.
Shimoyama
M., De Pons J., Hayman G. T., Laulederkind S. J., Liu W., Nigam R., Petri V.,
Smith J. R., Tutaj M., Wang S. J., Worthey E., Dwinell M., Jacob H. (2015) The
Rat Genome Database 2015: genomic, phenotypic and environmental variations and
disease. Nucleic Acids Res. 28, D743-D750.
85.
Cherry
J. M., Hong E. L., Amundsen C., Balakrishnan R., Binkley G., Chan E. T.,
Christie K. R., Costanzo M. C., Dwight S. S., Engel S. R., Fisk D. G.,
Hirschman J. E., Hitz B. C., Karra K., Krieger C. J., Miyasato S. R., Nash R.
S., Park J., Skrzypek M. S., Simison M., Weng S., Wong E. D. (2012)
Saccharomyces Genome Database: the genomics resource of budding yeast. Nucleic Acids Res. 40, D700-D705.
86.
Lamesch
P., Berardini T. Z., Li D., Swarbreck D., Wilks C., Sasidharan R., Muller R.,
Dreher K., Alexander D. L., Garcia-Hernandez M., Karthikeyan A. S., Lee C. H.,
Nelson W. D., Ploetz L., Singh S., Wensel A., Huala E. (2012) The Arabidopsis
Information Resource (TAIR): improved gene annotation and new tools. Nucleic Acids Res. 40, D1202-D1210.
87.
Lew
J. M., Kapopoulou A., Jones L. M., Cole S. T. (2011) TubercuList - 10 years
after. Tuberculosis (Edinb). 1, 1-7.
88.
Bowes
J. B., Snyder K. A., Segerdell E., Gibb R., Jarabek C., Noumen E., Pollet N.,
Vize P. D. (2008) Xenbase: a Xenopus biology and genomics resource. Nucleic Acids Res. 36, D761-D767.
89.
Howe
D. G., Bradford Y. M., Conlin T., Eagle A. E., Fashena D., Frazer K., Knight
J., Mani P., Martin R., Moxon S. A., Paddock H., Pich C., Ramachandran S., Ruef
B. J., Ruzicka L., Schaper K., Shao X., Singer A., Sprunger B., Van Slyke C.
E., Westerfield M. (2013). ZFIN,
the Zebrafish Model Organism Database: increased support for mutants and
transgenics. Nucleic Acids Res. 41, D854-D860.
90.
Powell
S., Forslund K., Szklarczyk D., Trachana K., Roth A., Huerta-Cepas J., Gabald—n
T., Rattei T., Creevey C., Kuhn M., Jensen L. J., von Mering C. and Bork P.
(2014) eggNOG v4.0: nested orthology inference across 3686 organisms. Nucleic Acids Res. 42, D231-D239.
91.
PerriŹre
G., Duret L., Gouy M. (2000) HOBACGEN: database system for comparative genomics
in bacteria. Genome Res. 10, 379-385.
92.
Duret
L., Mouchiroud D., Gouy M. (1994) HOVERGEN: a database of homologous vertebrate
genes. Nucleic Acids Res. 22, 2360-2365.
93.
Sonnhammer
E. L., …stlund G2. (2015) InParanoid 8: orthology analysis between 273
proteomes, mostly eukaryotic. Nucleic
Acids Res. 43, D234-D239.
94.
Kanehisa
M., Goto S., Kawashima S., Okuno Y., Hattori M. (2004) The KEGG resource for deciphering
the genome. Nucleic Acids Res. 32, D277-D280.
95.
Altenhoff
A. M., Škunca N., Glover N., Train C. M., Sueki A., Piližota I., Gori
K., Tomiczek B., Mźller S., Redestig H., Gonnet G. H., Dessimoz C. (2015) The
OMA orthology database in 2015: function predictions, better plant support,
synteny view and other improvements. Nucleic
Acids Res. 43, D240-D249.
96.
Waterhouse
R. M., Tegenfeldt F., Li J., Zdobnov E. M., Kriventseva E. V. (2013) OrthoDB: a
hierarchical catalog of animal, fungal and bacterial orthologs. Nucleic Acids Res. 41, D358-365.
97.
Huerta-Cepas
J., Capella-GutiŽrrez S., Pryszcz L. P., Marcet-Houben M., Gabald—n T. (2014)
PhylomeDB v4: zooming into the plurality of evolutionary histories of a genome.
Nucleic Acids Res. 42, D897-D902.
98.
Ruan
J., Li H., Chen Z., Coghlan A., Coin L. J., Guo Y., HŽrichŽ J. K., Hu Y.,
Kristiansen K., Li R., Liu T., Moses A., Qin J., Vang S., Vilella A. J.,
Ureta-Vidal A., Bolund L., Wang J., Durbin R. (2008) TreeFam: 2008 Update. Nucleic Acids Res. 36, D735-D740.
99.
Wu
T. J., Shamsaddini A., Pan Y., Smith K., Crichton D. J., Simonyan V., Mazumder
R. (2014) A framework for organizing cancer-related variations from existing
databases, publications and NGS data using a High-performance Integrated
Virtual Environment (HIVE). Database.
bau022.
100. Peterson T.A., Adadey A., Santana-Cruz
I., Sun, Y., Winder A., Kann M. G. (2010) DMDM: Domain Mapping of Disease
Mutations. Bioinformatics 26, 2458-2459.
101. Chatr-Aryamontri A., Breitkreutz B. J.,
Oughtred R., Boucher L., Heinicke S., Chen D., Stark C., Breitkreutz A., Kolas
N., O'Donnell L., Reguly T., Nixon J., Ramage L., Winter A., Sellam A., Chang
C., Hirschman J., Theesfeld C., Rust J., Livstone M. S., Dolinski K., Tyers M.
(2015) The BioGRID interaction database: 2015 update. Nucleic Acids Research. 43,
D470-D478.
102. Salwinski L., Miller C. S., Smith A. J.,
Pettit F. K., Bowie J. U., Eisenberg D. (2004) The Database of Interacting
Proteins: 2004 update. Nucleic Acids Res.
32, D449-D451.
103. Orchard S., Ammari M., Aranda B., Breuza
L., Briganti L., Broackes-Carter F., Campbell N. H., Chavali G., Chen C.,
del-Toro N., Duesbury M., Dumousseau M., Galeota E., Hinz U., Iannuccelli M.,
Jagannathan S., Jimenez R., Khadake J., Lagreid A., Licata L., Lovering R. C.,
Meldal B., Melidoni A. N., Milagros M., Peluso D., Perfetto L., Porras P.,
Raghunath A., Ricard-Blum S., Roechert B., Stutz A., Tognolli M., van Roey K.,
Cesareni G., Hermjakob H. (2014) The MIntAct project--IntAct as a common
curation platform for 11 molecular interaction databases. Nucleic Acids Res. 42,
D358-D363.
104. Licata L., Briganti L., Peluso D.,
Perfetto L., Iannuccelli M., Galeota E., Sacco F., Palma A., Nardozza A. P.,
Santonico E., Castagnoli L., Cesareni G. (2012) MINT, the molecular interaction
database: 2012 update. Nucleic Acids Res.
40, D857-D861.
105. Szklarczyk D., Franceschini A., Wyder S.,
Forslund K., Heller D., Huerta-Cepas J., Simonovic M., Roth A., Santos A.,
Tsafou K. P., Kuhn M., Bork P., Jensen L. J., von Mering C. (2015) STRING v10:
protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 43, D447-D452.
106. Schaab C., Geiger T., Stoehr G., Cox J.,
Mann M. (2012) Analysis of high accuracy, quantitative proteomics data in the
MaxQB database. Mol Cell Proteomics. 11, M111.014068.
107. Wang M., Herrmann C. J., Simonovic M.,
Szklarczyk D., von Mering C. (2015) Version 4.0 of PaxDb: Protein abundance
data, integrated across model organisms, tissues, and cell-lines. Proteomics. 15, 3163-3168.
108. Desiere F., Deutsch E. W., King N. L.,
Nesvizhskii A. I., Mallick P., Eng J., Chen S., Eddes J., Loevenich S. N.,
Aebersold R. (2006) The PeptideAtlas project. Nucleic Acids Res. 34,
D655-D658.
109. Vizcaino J. A., Cote R. G., Csordas A., Dianes
J. A., Fabregat A., Foster J. M., Griss J., Alpi E., Birim M., Contell J.,
O'Kelly G., Schoenegger A., Ovelleiro D., Perez-Riverol Y., Reisinger F., Rios
D., Wang R., Hermjakob H. (2013) The Proteomics Identifications (PRIDE)
database and associated tools: status in 2013. Nucleic Acids Res. 41,
D1063-D1069.
110. Wienkoop S., Staudinger C., Hoehenwarter
W., Weckwerth W. and Egelhofer V. (2012) ProMEX - a mass spectral reference
database for plant proteomics. Front.
Plant Sci. 3, 125.
111. Duan G., Li X., Kšhn M. (2015) The human DEPhOsphorylation database DEPOD: a 2015 update. Nucleic Acids Res. 43, D531-D535.
112. Ross K. E., Arighi C. N., Ren J., Huang
H., Wu C. H. (2013) Construction of protein phosphorylation networks by data
mining, text mining and ontology integration: analysis of the spindle
checkpoint. Database (Oxford) bat038.iPTMnet
113. Durek P., Schmidt R., Heazlewood J. L.,
Jones A., Maclean D., Nagel A., Kersten B., Schulze W. X. (2010) PhosPhAt: the
Arabidopsis thaliana phosphorylation site database. An update. Nucleic Acids Research 38, D828-D834.
114. Dinkel H., Chica C., Via A., Gould C. M.,
Jensen L. J., Gibson T. J., Diella F. (2011) Phospho.ELM: a database of
phosphorylation sites--update 2011. Nucleic
Acids Res. 39, D261-D27.
115. Sadowski I., Breitkreutz B. J., Stark C.,
Su T. C., Dahabieh M., Raithatha S., Bernhard W., Oughtred R., Dolinski K.,
Barreto K., Tyers M. (2013) The PhosphoGRID Saccharomyces cerevisiae protein
phosphorylation site database: version 2.0 update. Database (Oxford). bat026.
116. Hornbeck P. V., Zhang B., Murray B.,
Kornhauser J. M., Latham V., Skrzypek E. (2014) PhosphoSitePlus, 2014:
mutations, PTMs and recalibrations. Nucleic
Acids Res. 43, D512-D520.
117. Campbell M. P., Peterson R., Mariethoz
J., Gasteiger E., Akune Y., Aoki-Kinoshita K. F., Lisacek F., Packer N. H.
(2014) UniCarbKB: building a knowledge platform for glycoproteomics. Nucleic Acids Res. 42, D215-D221.
118. The Gene Ontology Consortium. (2015) Gene
Ontology Consortium: going forward. Nucl
Acids Res. 43, D1049-D1056.
119. Natale D. A., Arighi C. N., Blake J. A.,
Bult C. J., Christie K. R., Cowart J., D'Eustachio P., Diehl A. D., Drabkin H.
J., Helfer O., Huang H., Masci A. M., Ren J., Roberts N. V., Ross K.,
Ruttenberg A., Shamovsky V., Smith B., Yerramalla M. S., Zhang J., AlJanahi A.,
‚elen I., Gan C., Lv M., Schuster-Lezell E., Wu C. H. (2014) Protein Ontology:
a controlled structured network of protein entities. Nucleic Acids Res. 42,
D415-D421.
120. Mari A., Rasi C., Palazzo
P., Scala E. (2009) Allergen databases: current status and perspectives.
Curr Allergy Asthma Rep. 9, 376-83.
121. Lombard V., Golaconda Ramulu H., Drula
E., Coutinho P. M., Henrissat B. (2014) The Carbohydrate-active enzymes
database (CAZy) in 2013. Nucleic Acids
Res. 42, D490-D495.
122. Lenfant N., Hotelier T., Velluet E.,
Bourne Y., Marchot P., Chatonnet A. (2013) ESTHER, the database of the
alpha/beta-hydrolase fold superfamily of proteins: tools to explore diversity
of functions. Nucleic Acids Res. 41, D423-D429.
123. Isberg V., Vroling B., van der Kant R.,
Li K., Vriend G., Gloriam D. (2014) GPCRDB: an information system for G
protein-coupled receptors. Nucleic Acids
Res. 42, D422-D425.
124. Giudicelli V., Duroux P., Ginestoux C.,
Folch G., Jabado-Michaloud J., Chaume D., Lefranc M. P. (2006) IMGT/LIGM-DB,
the IMGT comprehensive database of immunoglobulin and T cell receptor
nucleotide sequences. Nucleic Acids Res.
34, D781-D784.
125. Rawlings N. D., Waller M., Barrett A. J.,
Bateman A. (2014) MEROPS: the database of proteolytic enzymes, their substrates
and inhibitors. Nucleic Acids Res. 42, D503-D509.
126. Jeffery C. J. (1999) Moonlighting
proteins. Trends Biochem Sci. 24, 8-11.
127. Murphy C., Powlowski J., Wu M., Butler
G., Tsang A. (2011) Curation of characterized glycoside hydrolases of fungal
origin. Database doi:
10.1093/database/bar020
128. Fawal N., Li Q., Savelli B., Brette M.,
Passaia G., Fabre M., MathŽ C., Dunand C. (2013) PeroxiBase: a database for
large-scale evolutionary analysis of peroxidases. Nucleic Acids Res. 41,
D441-414.
129. Roberts R.J., Vincze T., Posfai J.,
Macelis D. (2015) REBASE-a database for DNA restriction and modification:
enzymes, genes and genomes. Nucleic Acids
Res. 43, D298-D299.
130. Saier M. H., Reddy V. S., Tamang D. G.,
Vastermark A. (2014), The transporter classification database. Nucleic. Acids Res. 42, D251-D258.
131. Frenkel-Morgenstern M., Gorohovski A.,
Lacroix V., Rogers M., Ibanez K., Boullosa C., Andres Leon E., Ben-Hur A.,
Valencia A. (2013) ChiTaRS: a database of human, mouse and fruit fly chimeric
transcripts and RNA-sequencing data. Nucleic
Acids Res. 41, D142-D151.
132. Mihalek I., Res I., Lichtarge O. (2004).
A Family of Evolution-Entropy Hybrid Methods for Ranking of Protein Residues by
Importance. J. Mol. Bio. 336, 1265-1282.
133. Good B. M., Clarke E. L., de Alfaro L.,
Su A. I. (2012) The Gene Wiki in 2011: community intelligence applied to human
gene annotation. Nucleic Acids Res. 40, D1255-D1261.
134. Schmidt E. E., Pelz O., Buhlmann S., Kerr
G., Horn T., Boutros M. (2013) GenomeRNAi: a database for cell-based and in
vivo RNAi phenotypes, 2013 update. Nucleic
Acids Res. 41, D1021-1026.
135. Igarashi Y., Heureux E., Doctor K. S.,
Talwar P., Gramatikova S., Gramatikoff K., Zhang Y., Blinov M., Ibragimova S.
S., Boyd S., Ratnikov B., Cieplak P., Godzik A., Smith J. W., Osterman A. L.,
Eroshkin A. M. (2009) PMAP: databases for analyzing proteolytic events and
pathways. Nucleic Acids Res. 37, D611-D618.
136. Diehn M., Sherlock G., Binkley G., Jin
H., Matese J. C., Hernandez-Boussard T., Rees C. A., Cherry J. M., Botstein D.,
Brown P. O., Alizadeh A. A. (2003) SOURCE: a unified genomic resource of
functional annotations, ontologies, and gene expression data. Nucleic Acids Res. 31, 219-23.