Search results
Pfam is a database of protein families that includes their annotations and multiple sequence alignments generated using hidden Markov models. [1] [2] [3] The latest version of Pfam, 37.0, was released in June 2024 and contains 21,979 families. [4]
The Protein Common Interface Database (ProtCID) is a database of similar protein-protein interfaces in crystal structures of homologous proteins. [1] [5] Its main goal is to identify and cluster homodimeric and heterodimeric interfaces observed in multiple crystal forms of homologous proteins. Such interfaces, especially of non-identical ...
The CDD from NCBI amalgamates data from several different sources: Protein FAMilies (PFAM), Simple Modular Architecture Research Tool (SMART), Cluster of Orthologous Genes (COGs), and NCBI's own curated sequences. The data in SMID is derived from the Protein Data Bank (PDB), a database of known protein crystal structures.
TIGRFAMs is a database of protein families designed to support manual and automated genome annotation. [1] [2] [3] Each entry includes a multiple sequence alignment and hidden Markov model (HMM) built from the alignment. Sequences that score above the defined cutoffs of a given TIGRFAMs HMM are assigned to that protein family and may be ...
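The cutoff-based assignment described in this snippet can be illustrated with a minimal Python sketch. It assumes that per-family scores have already been computed elsewhere (for example by an HMM search tool) and are available as plain numbers. The function name `assign_families`, the family labels `FAM_A` and `FAM_B`, and all scores and cutoffs are hypothetical values chosen for illustration, not real TIGRFAMs data, and treating "above the cutoff" as "at or above" is also an assumption.

```python
def assign_families(scores, cutoffs):
    """Map each sequence ID to the families whose cutoff its score meets.

    scores:  {sequence_id: {family: score}}  (hypothetical precomputed scores)
    cutoffs: {family: minimum score required for assignment}
    """
    assignments = {}
    for seq_id, family_scores in scores.items():
        # Keep a family only if a cutoff is defined and the score reaches it.
        hits = [
            family
            for family, score in sorted(family_scores.items())
            if family in cutoffs and score >= cutoffs[family]
        ]
        assignments[seq_id] = hits
    return assignments


# Hypothetical cutoffs and scores, for illustration only.
cutoffs = {"FAM_A": 50.0, "FAM_B": 120.0}
scores = {
    "seqX": {"FAM_A": 75.2, "FAM_B": 30.0},
    "seqY": {"FAM_A": 10.0, "FAM_B": 125.5},
    "seqZ": {"FAM_A": 49.9},
}
result = assign_families(scores, cutoffs)
```

Here `seqX` is assigned to `FAM_A` only, `seqY` to `FAM_B` only, and `seqZ` to no family because its single score falls just below the cutoff.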
MobiDB is a database annotating intrinsic disorder in proteins. PANTHER is a large collection of protein families that have been subdivided into functionally related subfamilies, using human expertise. These subfamilies model the divergence of specific functions within protein families, allowing more accurate association with function ...
The databases in the table below are selected from the databases listed in the Nucleic Acids Research (NAR) database issues and database collection and from the databases cross-referenced in the UniProtKB. Most of these databases are cross-referenced with UniProt / UniProtKB so that identifiers can be mapped to each other. [15] Proteins in human:
Several biological databases document protein superfamilies and protein folds, for example: Pfam - Protein families database of alignments and HMMs; PROSITE - Database of protein domains, families and functional sites; PIRSF - SuperFamily Classification System; PASS2 - Protein Alignment as Structural Superfamilies v2
Stockholm format is a multiple sequence alignment format used by Pfam, Rfam and Dfam to disseminate protein, RNA and DNA sequence alignments. [1] [2] [3] The alignment editors Ralee, [4] Belvu and Jalview support Stockholm format, as do the probabilistic database search tools Infernal and HMMER and the phylogenetic analysis tool Xrate.
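To make the format concrete, here is a minimal Python sketch of reading a single Stockholm alignment. It relies on the general layout of the format: a `# STOCKHOLM 1.0` header line, markup lines beginning with `#=`, sequence lines of the form `name  aligned_sequence` (possibly interleaved across several blocks), and a closing `//` line. The function name `parse_stockholm` and the sample alignment are hypothetical; only `#=GF` per-file annotations are collected, other markup is skipped, and a dedicated parser from an established bioinformatics library would be the safer choice for real data.

```python
def parse_stockholm(text):
    """Parse one Stockholm alignment into (sequences, gf_annotations)."""
    lines = text.splitlines()
    if not lines or not lines[0].startswith("# STOCKHOLM"):
        raise ValueError("missing '# STOCKHOLM' header line")

    sequences = {}  # sequence name -> aligned sequence (blocks concatenated)
    gf = {}         # per-file annotation tag -> list of values

    for raw in lines[1:]:
        line = raw.rstrip()
        if not line:
            continue              # blank lines separate interleaved blocks
        if line == "//":
            break                 # end-of-alignment marker
        if line.startswith("#=GF"):
            parts = line.split(None, 2)
            if len(parts) == 3:
                gf.setdefault(parts[1], []).append(parts[2])
            continue
        if line.startswith("#"):
            continue              # other markup (#=GS, #=GR, #=GC) is ignored here
        parts = line.split(None, 1)
        if len(parts) != 2:
            raise ValueError("malformed sequence line: %r" % line)
        name, chunk = parts
        sequences[name] = sequences.get(name, "") + chunk.strip()

    return sequences, gf


# Hypothetical two-block alignment, for illustration only.
sample = "\n".join([
    "# STOCKHOLM 1.0",
    "#=GF ID EXAMPLE",
    "seq1 ACDE-FG",
    "seq2 ACDEHFG",
    "#=GC SS_cons .......",
    "",
    "seq1 HIK",
    "seq2 H-K",
    "//",
])
seqs, gf = parse_stockholm(sample)
```

Because sequence chunks are concatenated by name, the two interleaved blocks in the sample are joined into one aligned string per sequence.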