The gene information section lists the gene name (HUGO Gene Nomenclature Committee (HGNC) name if available), any approved gene synonyms, Ensembl gene description, and the Entrez gene summary from the National Center for Biotechnology Information.
The chromosomal and cytoband location of the gene according to Ensembl is reported together with the Ensembl gene identifier and Ensembl database version.
The Entrez gene identifier for the gene is also given. If any of the protein products of
the gene is linked to a UniProt KB/SWISS-PROT entry, links to the UniProt and the
neXtProt databases for these proteins are displayed.
There is also a link to the Antibodypedia portal where validation data for antibodies produced by other suppliers
against this gene can be found.
Gene name
COL1A1 (HGNC Symbol)
Synonyms
OI4
Description
Collagen, type I, alpha 1 (HGNC Symbol)
Entrez gene summary
This gene encodes the pro-alpha1 chains of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon. Mutations in this gene are associated with osteogenesis imperfecta types I-IV, Ehlers-Danlos syndrome type VIIA, Ehlers-Danlos syndrome Classical type, Caffey Disease and idiopathic osteoporosis. Reciprocal translocations between chromosomes 17 and 22, where this gene and the gene for platelet-derived growth factor beta are located, are associated with a particular type of skin tumor called dermatofibrosarcoma protuberans, resulting from unregulated expression of the growth factor. Two transcripts, resulting from the use of alternate polyadenylation signals, have been identified for this gene. [provided by R. Dalgleish, Feb 2008]
The protein view displays protein features. The tabs at the top of the protein view section can be used to switch between the different splice variants encoded by this gene. The mouse over function displays additional data for the features in the protein view.
At the top of the protein view, the maximum percent sequence identity of the protein to all other proteins from other human genes is shown, using a sliding window of 10 aa residues
(HsID 10) or 50 aa residues (HsID 50) (read more).
If a signal peptide is predicted by a majority of the signal peptide predictors SPOCTOPUS,
SignalP 4.0, and
Phobius
(turquoise) and/or transmembrane regions (orange) are predicted by MDM, these are displayed.
Common (purple) and unique (grey) regions between different splice variants of the gene are also displayed
(read more), and at the bottom of the protein view is the protein scale.
The protein information section displays the alternative protein-coding transcripts (splice variants) encoded by this gene, according to the Ensembl database.
The ENSP identifier links to the Ensembl website protein summary, while the ENST identifier links to the Ensembl website transcript summary for the selected splice variant.
The data in the UniProt column can be expanded to show links to all matching
UniProt identifiers for this protein.
The protein classes to which this protein has been assigned are shown if expanding the data in the protein class column. Parent protein classes are in bold font and subclasses are listed under the parent class.
The Gene Ontology terms assigned to this protein are listed if expanding the Gene ontology column.
The length of the protein (amino acid residues according to Ensembl), molecular mass (kDalton), predicted signal peptide (according to a majority of the signal peptide predictors
SPOCTOPUS,
SignalP 4.0, and
Phobius) and the number of predicted transmembrane region(s) (according to
MDM) are also reported.
Predicted intracellular proteins Cancer-related genes COSMIC somatic mutations in cancer genes COSMIC Somatic Mutations COSMIC Translocations Protein evidence (Kim et al 2014) Protein evidence (Ezkurdia et al 2014)