Protein Domain : IPR006142

Type:  Domain Name:  Intein
Description:  Inteins, or protein introns, are parts of protein sequences that are post-translationally excised, their flanking regions (exteins) being spliced together to yield an additional protein product. This process is believed to be self-catalysed, apparently initiating at the C-terminal splice junction, where a conserved asparagine residue mediates the nucleophilic attack of the peptide bond between it and its neighbouring residue. Most inteins consist of two domains: one is involved in autocatalytic splicing, and the other is an endonuclease that is important in the spread of inteins. Inteins are between 134 and 608 amino acids long, and they are found in members of all three domains of life: eukaryotes, bacteria, and archaea, although most frequently in archaea. Inteins are found in proteins with diverse functions, including metabolic enzymes, DNA and RNA polymerases, proteases, ribonucleotide reductases, and the vacuolar-type ATPase. However, enzymes involved in DNA replication and repair appear to dominate. Inteins are found in conserved regions of conserved proteins and can be regarded as parasitic genetic elements. Inteins are difficult to identify from sequence data because they lie in the same reading frame as the spliced protein and they are characterised by only a few short conserved motifs: two of these are similar to the nonapeptide LAGLIDADG, which is diagnostic of certain homing endonucleases (mutation of one such motif causes loss of endonuclease activity, but not of the protein splicing function); another includes the C' splice site, mutations in which disable protein function. Short Name:  INTEIN
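
The description notes that inteins carry motifs similar to the nonapeptide LAGLIDADG and are between 134 and 608 amino acids long. The Python sketch below illustrates one simple way such a motif scan might be approached: it slides a nine-residue window along a protein sequence and reports windows that share a minimum number of identical positions with LAGLIDADG. The function names, the `min_matches` threshold of 6, and the example sequence are assumptions made for illustration; this is not the method used to build this entry, and a real search would rely on curated profiles rather than identity counting.

```python
# Illustrative sketch only: names, threshold and example sequence are assumptions.
MOTIF = "LAGLIDADG"  # nonapeptide named in the description


def find_motif_like(seq, motif=MOTIF, min_matches=6):
    """Return (start, window, matches) for each window resembling the motif.

    A window qualifies when at least `min_matches` positions are identical
    to the motif. Start positions are 0-based.
    """
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - len(motif) + 1):
        window = seq[i:i + len(motif)]
        matches = sum(a == b for a, b in zip(window, motif))
        if matches >= min_matches:
            hits.append((i, window, matches))
    return hits


def plausible_intein_length(n_residues, lo=134, hi=608):
    """Check a candidate length against the range quoted in the description."""
    return lo <= n_residues <= hi


# Hypothetical toy sequence containing an exact copy of the motif.
hits = find_motif_like("MKTLAGLIDADGSF")
```

Because the description says the intein motifs are only similar to LAGLIDADG, the threshold is deliberately below nine; lowering it further will increase false positives.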

0 Child Features

3 Contains

DB identifier Type Name
IPR004042 Domain Intein DOD homing endonuclease
IPR006141 PTM Intein N-terminal splicing region
IPR007869 Domain Homing endonuclease PI-Sce

1 Cross References

Identifier
PR00379

0 Found In

1 GO Annotation

GO Term Gene Name
GO:0016539 IPR006142

1 Ontology Annotations

GO Term Gene Name
GO:0016539 IPR006142

0 Parent Features

0 Proteins

3 Publications

PubMed ID
8165123
7756989
12142479