Search our database by keyword

- or -

Examples

  • Search this entire website. Enter identifiers, names or keywords for genes, pathways, authors, ontology terms, etc. (e.g. eve, embryo, zen, allele)
  • Use OR to search for either of two terms (e.g. fly OR drosophila) or quotation marks to search for phrases (e.g. "dna binding").
  • Boolean search syntax is supported: e.g. dros* for partial matches or fly AND NOT embryo to exclude a term

Search results 38601 to 38700 out of 38750 for *

Category restricted to ProteinDomain (x)

0.023s

Categories

Category: ProteinDomain
Type Details Score
Protein Domain
Name: Amino acid synthesis, putative
Type: Family
Description: This family of proteins is structurally similar to proteins with the Bacillus chorismate mutase-like (BCM-like) fold. This structure, combined with its genomic context, suggest that it has a role in amino acid synthesis [ ].
Protein Domain
Name: CSPG5, sulphate attachment domain
Type: Domain
Description: This entry represents the chondroitin sulphate attachment domain of Chondroitin sulfate proteoglycan 5 (CSPG5, also known as CALEB), which mediates dendritic tree and spine complexity [ ]. This domain contains several potential sites of chondroitin sulphate attachment, as well as potential sites of N-linked glycosylation [].
Protein Domain
Name: Oxygen-independent coproporphyrinogen-III oxidase-like protein HemZ
Type: Family
Description: Members of this radical SAM protein family are HemZ, a protein involved in coproporphyrinogen III decarboxylation. Alternative names for this enzyme (EC ) include coproporphyrinogen dehydrogenase and oxygen-independent coproporphyrinogen III oxidase. The family is related to, but distinct from HemN, and in Bacillus subtilis was shown to be connected to peroxide stress and catalase formation [ ].
Protein Domain
Name: Caenorhabditis elegans ly-6-related
Type: Family
Description: This family consists of several Caenorhabditis elegans specific ly-6-related HOT and ODR proteins. These proteins are involved in the olfactory system. Odr-2 mutants are known to be defective in the ability to chemotax to odorants that are recognised by the two AWC olfactory neurons. Odr-2 encodes a membrane-associated protein related to the Ly-6 superfamily of GPI-linked signalling proteins [ ].
Protein Domain
Name: Signal transduction histidine kinase, internal region
Type: Domain
Description: Two-component signal transduction systems enable bacteria to sense, respond, and adapt to a wide range of environments, stressors, and growth conditions [ ]. Some bacteria can contain up to as many as 200 two-component systems that need tight regulation to prevent unwanted cross-talk []. These pathways have been adapted to response to a wide variety of stimuli, including nutrients, cellular redox state, changes in osmolarity, quorum signals, antibiotics, and more []. Two-component systems are comprised of a sensor histidine kinase (HK) and its cognate response regulator (RR) []. The HK catalyses its own auto-phosphorylation followed by the transfer of the phosphoryl group to the receiver domain on RR; phosphorylation of the RR usually activates an attached output domain, which can then effect changes in cellular physiology, often by regulating gene expression. Some HK are bifunctional, catalysing both the phosphorylation and dephosphorylation of their cognate RR. The input stimuli can regulate either the kinase or phosphatase activity of the bifunctional HK.A variant of the two-component system is the phospho-relay system. Here a hybrid HK auto-phosphorylates and then transfers the phosphoryl group to an internal receiver domain, rather than to a separate RR protein. The phosphoryl group is then shuttled to histidine phosphotransferase (HPT) and subsequently to a terminal RR, which can evoke the desired response [ , ].Signal transducing histidine kinases are the key elements in two-component signal transduction systems, which control complex processes such as the initiation of development in microorganisms [ , ]. Examples of histidine kinases are EnvZ, which plays a central role in osmoregulation [], and CheA, which plays a central role in the chemotaxis system []. Histidine kinases usually have an N-terminal ligand-binding domain and a C-terminal kinase domain, but other domains may also be present. The kinase domain is responsible for the autophosphorylation of the histidine with ATP, the phosphotransfer from the kinase to an aspartate of the response regulator, and (with bifunctional enzymes) the phosphotransfer from aspartyl phosphate back to ADP or to water []. The kinase core has a unique fold, distinct from that of the Ser/Thr/Tyr kinase superfamily. HKs can be roughly divided into two classes: orthodox and hybrid kinases [ , ]. Most orthodox HKs, typified by the Escherichia coli EnvZ protein, function as periplasmic membrane receptors and have a signal peptide and transmembrane segment(s) that separate the protein into a periplasmic N-terminal sensing domain and a highly conserved cytoplasmic C-terminal kinase core. Members of this family, however, have an integral membrane sensor domain. Not all orthodox kinases are membrane bound, e.g., the nitrogen regulatory kinase NtrB (GlnL) is a soluble cytoplasmic HK []. Hybrid kinases contain multiple phosphodonor and phosphoacceptor sites and use multi-step phospho-relay schemes instead of promoting a single phosphoryl transfer. In addition to the sensor domain and kinase core, they contain a CheY-like receiver domain and a His-containing phosphotransfer (HPt) domain.This family represents a region within bacterial histidine kinase enzymes. Two-component signal transduction systems such as those mediated by histidine kinase are integral parts of bacterial cellular regulatory processes, and are used to regulate the expression of genes involved in virulence. Members of this family often contain and/or .
Protein Domain
Name: Spectinomycin biosynthesis
Type: Family
Description: Proteins in this family are involved in spectinomycin biosynthesis and are thought to play a role in generating TDP-actinospectose. Although the reaction is somewhat similar to that of DesII, these proteins are not detectably similar [ ].
Protein Domain
Name: Protein of unknown function DUF1176
Type: Family
Description: This family consists of several hypothetical bacterial proteins of around 340 residues in length. Members of this family contain six highly conserved cysteine residues. The function of this family is unknown.
Protein Domain
Name: Protein of unknown function DUF1177
Type: Family
Description: This family consists of several hypothetical archaeal and bacterial proteins of around 300 residues in length. The function of this family is unknown.
Protein Domain
Name: Protein of unknown function DUF1178
Type: Family
Description: This family consists of several hypothetical bacterial proteins of around 150 residues in length. The function of this family is unknown.
Protein Domain
Name: Sjoegren syndrome/scleroderma autoantigen 1
Type: Family
Description: This family consists of several Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27) sequences, also known as ZNRD2. It is thought that the potential association of anti-p27 with anti-centromere antibodies suggests that autoantigen p27 might play a role in mitosis [ ].
Protein Domain
Name: Uncharacterised protein family UPF0148
Type: Family
Description: The proteins in this entry are functionally uncharacterised. However, they show protein sequence similarity with Sjoegren syndrome/scleroderma autoantigen 1 protein ( ).
Protein Domain
Name: Protein of unknown function DUF1179
Type: Family
Description: This family consists of several hypothetical Caenorhabditis elegans proteins of around 106 residues in length. The function of the family is unknown.
Protein Domain
Name: Protein of unknown function DUF1180
Type: Family
Description: This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Protein Domain
Name: Conserved hypothetical protein CHP03815, CpaE-like
Type: Family
Description: Members of this protein family belong to the MinD/ParA family of P-loop NTPases, and in particular show homology to the CpaE family of pilus assembly proteins [ ]. Nearly all members are found, not only in a gene context consistent with pilus biogenesis or a pilus-like secretion apparatus, but also near a DEAD/DEAH-box helicase, suggesting an involvement in DNA transfer activity. The entry describes a clade restricted to the Actinobacteria.
Protein Domain
Name: Protein of unknown function DUF1181
Type: Family
Description: This family consists of several hypothetical proteins of around 120 residues in length which are found specifically in Trypanosoma brucei. The function of this family is unknown.
Protein Domain
Name: Flagellar regulator, FliZ
Type: Family
Description: FliZ is involved in the regulation of flagellar assembly [ ] and possibly also the down-regulation of the motile phenotype []. FliZ interacts with the flagellar translational activator FlhCD complex.
Protein Domain
Name: Flagellar motor stator, MotA
Type: Family
Description: The MotA protein, along with its partner MotB, comprises the stator complex of the bacterial flagellar motor. MotAB span the cytoplasmic membrane and undergo conformational changes powered by the translocation of protons. These conformational changes in turn are communicated to the rotor assembly, producing torque [ , ].This family represents MotA proteins found in proteobacteria.
Protein Domain
Name: Motility protein A-like
Type: Family
Description: This family represents a group of bacterial proteins that are required for the rotation of the flagellar motor, which probably forms a transmembrane proton channel used to energize the flagellar rotary motor. This entry includes MotA and related proteins, such as PomA and LafT [ , ]. These are integral membrane proteins that contains four transmembrane domains.
Protein Domain
Name: Beta-lysine N-acetyltransferase
Type: Family
Description: Members of this protein family are GNAT family acetyltransferases, based on a seed alignment in which every member is associated with a lysine 2,3-aminomutase family protein, usually as the adjacent gene. This family includes AblB, the enzyme beta-lysine acetyltransferase that completes the two-step synthesis of the osmolyte (compatible solute) N-epsilon-acetyl-beta-lysine; all members of the family may have this function [ ]. Note that N-epsilon-acetyl-beta-lysine has been observed only in methanogenic archaea (e.g. Methanosarcina) but that this model, paired with Lysine-2,3-aminomutase (), suggests a much broader distribution.
Protein Domain
Name: Flagellar assembly FliH, Bacilli
Type: Family
Description: This entry represents a bacillus clade of FliH proteins. Sequences in this entry are observed in flagellar operons in an analogous position relative to other flagellar operon genes.
Protein Domain
Name: F420-dependent oxidoreductase, Rv3093c
Type: Family
Description: This entry describes a small family of enzymes in the bacterial luciferase-like monooxygenase family, which includes F420-dependent enzymes such as N5,N10-methylenetetrahydromethanopterin reductase as well as FMN-dependent enzymes. All members of this family are from species that produce coenzyme F420; SIMBAL analysis suggests that members of this family bind F420 rather than FMN.
Protein Domain
Name: Protein of unknown function DUF3530
Type: Family
Description: This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 272 to 336 amino acids in length. These proteins are distantly related to alpa/beta hydrolases so they may act as enzymes.
Protein Domain
Name: 6-phosphogluconolactonase, YbhE-type
Type: Family
Description: 6-phosphogluconolactonases (6PGL) , which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE [ ] and the Pseudomonas aeruginosa DevB [ ] types. 6-phospho-D-glucono-1,5-lactone + H2O = 6-phospho-D-gluconate.This entry refers to the YbhE type of 6-phosphogluconolactonase.
Protein Domain
Name: Lactonase, 7-bladed beta propeller
Type: Family
Description: 6-phosphogluconolactonases (6PGL) , which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE [ ] and the Pseudomonas aeruginosa DevB [ ] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase and muconate cycloisomerase , which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed β-propeller fold [ ].The surface antigen Lp49 from Leptospira interrogans, the causative agent of leptospirosis, contains a C-terminal beta propellor domain that is also included in this entry. The structure of the Lp49 protein has been solved, and because it is an antigen is a potential drug target [ ].
Protein Domain
Name: Deoxycytidine triphosphate deaminase, bacterial
Type: Family
Description: This family consists of several bacterial 2'-deoxycytidine 5'-triphosphate deaminase (dCTP deaminase) proteins ( ).
Protein Domain
Name: dCTP deaminase
Type: Family
Description: Deoxycytidine triphosphate deaminase (dCTP deaminase) is an enzyme involved in nucleotide metabolism. It catalyses the formation of dUTP (deoxyuridine (5'-)triphosphate), which in turn is degraded by dUTPase (dUTP diphosphatase) to produce dUMP (deoxyuridine monophosphate). dUMP is the immediate precursor of thymidine nucleotides [ ].Members of this family include the Escherichia coli monofunctional deoxycytidine triphosphate deaminase (dCTP deaminase) [ ] and a Methanocaldococcus jannaschii (Methanococcus jannaschii) bifunctional dCTP deaminase ()/dUTP diphosphatase ( ) [ ].
Protein Domain
Name: Glucose-6-phosphate isomerase, prokaryote
Type: Domain
Description: This entry consists of several bacterial and archaeal glucose-6-phosphate isomerase (GPI) proteins ( ), which are involved in glycolysis and in gluconeogenesis and catalyse the conversion of D-glucose 6-phosphate to D-fructose 6-phosphate. The deduced amino acid sequence of the first archaeal PGI isolated from Pyrococcus furiosus revealed that it is not related to its eukaryotic and many of its bacterial counterparts. In contrast, this archaeal PGI shares similarity with the cupin superfamily that consists of a variety of proteins that are generally involved in sugar metabolism in both prokaryotes and eukaryotes [ ].
Protein Domain
Name: Peroxisome proliferator-activated receptor gamma, N-terminal
Type: Domain
Description: This domain is found in eukaryotes, and is typically between 42 and 65 amino acids in length. It is found N-terminal to and . There are two completely conserved residues (D and Y) that may be functionally important. Peroxisome proliferator-activated receptors (PPAR) are nuclear hormone receptors that control the expression of genes involved in lipid homeostasis in mammals[].
Protein Domain
Name: Nucleolar protein 19
Type: Family
Description: Nucleolar protein 19 (NOP19) plays an essential role in 40S ribosomal subunit biogenesis [ ].
Protein Domain
Name: Autographa californica nuclear polyhedrosis virus (AcMNPV), 30.6kDa
Type: Family
Description: This group of viral proteins has no known function.
Protein Domain
Name: 2-oxoisovalerate dehydrogenase, E1 alpha subunit, N-terminal domain
Type: Domain
Description: This domain is part of the N-terminal tail and β-sheet region of the bacterial 2-oxoisovalerate dehydrogenase E1 alpha subunit [ , ]. It is found N-terminal to the dehydrogenase component . There are two conserved sequence motifs: VPEP and RPG. The E1 subunit is a thiamin diphosphate (ThDP)-dependent 2-oxo acid dehydrogenase (decarboxylase) belonging to the pyruvate dehydrogenase multienzyme complex. This is the enzyme complex responsible for metabolism of pyruvate, 2-oxoglutarate, branched chain 2-oxo acids and acetoin. The E1 component is a heterotetramer of alpha2beta2. The homodimerised beta subunits are flanked by two alpha subunits in a 'vise' structure.
Protein Domain
Name: G protein-coupled receptor GPR1, C-terminal
Type: Domain
Description: GPR1 associate with GPA2 and acts as G protein-coupled receptor that senses glucose and controls filamentous growth [ , ]. This entry represents the conserved C-terminal domain.
Protein Domain
Name: Enterobacter phage Enc34, ssDNA-binding protein
Type: Family
Description: This entry includes protein 50 from Bacteriophage APSE-1 and ssDNA-binding protein from Enterobacter phage Enc34 ( ), which shows a globular shape with a mixed α/β architecture consisting mainly of β-strands that clearly resembles the oligosaccharide/oligonucleotide-binding (OB)-fold [ ]. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Protein Domain
Name: Protein of unknown function DUF6713
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in bacteria and archaea. Proteins in this family are typically between 117 and 141 amino acids in length.
Protein Domain
Name: Long-chain fatty acyl-CoA thioesterase FcoT-like
Type: Family
Description: Proteins in this family have a HotDog fold. This family was formerly known as DUF2662. The structure of FcoT from M. tuberculosis [ ] suggested a thioesterase function. Assays showed that this protein was a thioesterase with a preference for long chain fatty acyl groups []. The maximal Kcat was observed for palmitoyl-CoA, although longer and shorter molecules were also cleaved. In solution this protein forms a homo-hexameric complex.
Protein Domain
Name: Protein of unknown function DUF6712
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in bacteria and some viral sequences. Proteins in this family are typically between 270 and 340 amino acids in length.
Protein Domain
Name: Endoribonuclease antitoxin GhoS
Type: Family
Description: GhoS is part of the GhoT-GhoS type V toxin-antitoxin (TA) system. GhoT is inhibited by antitoxin GhoS, which specifically cleaves its mRNA [ , ].
Protein Domain
Name: Protein of unknown function DUF6711
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in bacteria and viruses. Proteins in this family are typically between 115 and 155 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF6710
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in bacteria. Proteins in this family are typically between 220 and 300 amino acids in length.
Protein Domain
Name: Lysis protein, bacteriophage MS2
Type: Family
Description: This entry represents putative lysis proteins from Bacteriophage MS2 and Bacteriophage BZ13.The bacteriophage MS2 lysin protein was identified as an overlapping cistron in the bacteriophage MS2 RNA [ ]. This proteins induces the formation of specific membrane adhesion sites between the inner and outer membranes, apparently leading to host cell lysis. Lysis may be performed via activation of host murein hydrolases [, ].
Protein Domain
Name: Protein of unknown function DUF6709
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in bacteria. Proteins in this family are typically between 340 and 435 amino acids in length.
Protein Domain
Name: Domain of unknown function DUF6708
Type: Domain
Description: This presumed domain is functionally uncharacterised. This domain is found in proteobacteria and is approximately 200 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF6707
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in bacteria. Proteins in this family are typically between 190 and 350 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF6706
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in bacteroidetes. Proteins in this family are approximately 100 amino acids in length.
Protein Domain
Name: Domain of unknown function DUF6705
Type: Domain
Description: This presumed domain is functionally uncharacterised. This domain is found in bacteroidetes, and is approximately 170-200 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF6704
Type: Family
Description: This family of presumed integral membrane proteins is functionally uncharacterised. This family of proteins is found in actinobacteria. Proteins in this family are typically between 76 and 89 amino acids in length.
Protein Domain
Name: Photosynthetic NDH subunit of subcomplex B 4, chloroplastic
Type: Family
Description: The chloroplast NADH dehydrogenase-like complex (NDH) was first discovered based on its similarity to complex I in respiratory electron transport. However, it seems to be a ferredoxin-dependent plastoquinone reductase rather than an NAD(P)H dehydrogenase. Furthermore, it also comprises a subcomplex on the stromal side of the thylakoid membrane and a lumenal subcomplex [ ]. The chloroplast NDH complex has been divided into four categories: membrane, lumen, A, and B subcomplexes. The subcomplex A and membrane subcomplex are conserved in cyanobacteria, but the subcomplex B and lumen subcomplex are specific to chloroplasts. PNSB4/NDF6 is a subunit of subcomplex B [].
Protein Domain
Name: Protein APEM9
Type: Family
Description: Several peroxisome biogenesis factors, termed peroxins (PEXs), are involved in the transport of peroxisomal proteins from the cytosol to peroxisomes. ABERRANT PEROXISOME MORPHOLOGY9 (APEM9) is involved in peroxisomal protein transport through its recruitment of the AAA ATPase complex PEX1-PEX6 to peroxisomal membranes via protein-protein interactions with PEX6 [ ]. APEM9 has a similar function in plants to mammalian Pex26 and yeast Pex15p, although the amino acid conservation among them is very low. APEM9 plays critical roles in peroxisome biogenesis and function, which is essential for jasmonic acid production and pollen maturation and germination [].
Protein Domain
Name: Protein MULTIPLE CHLOROPLAST DIVISION SITE 1
Type: Family
Description: Chloroplast division requires the formation of an FtsZ division ring, its positioning being regulated by the proteins MinD and MinE. MCD1 is a plant-specific protein that directly interacts with MinD and is required for MinD localization to regulate FtsZ ring formation [ ].
Protein Domain
Name: Succinate dehydrogenase subunit 7, mitochondrial
Type: Family
Description: The succinate dehydrogenase complex (complex II) is a highly conserved protein complex composed of the SDH1-SDH4 subunits in bacteria and in the mitochondria of animals and fungi, with up to four additional subunits in complex II of plants, termed SDH5-SDH8. SDH6 and SDH7 contribute to anchoring succinate dehydrogenase to the inner mitochondrial membrane [ ].This entry represents SDH7.
Protein Domain
Name: Succinate dehydrogenase subunit 6, mitochondrial
Type: Family
Description: The succinate dehydrogenase complex (complex II) is a highly conserved protein complex composed of the SDH1-SDH4 subunits in bacteria and in the mitochondria of animals and fungi, with up to four additional subunits in complex II of plants, termed SDH5-SDH8. SDH6 and SDH7 contribute to anchoring succinate dehydrogenase to the inner mitochondrial membrane [ ].This entry represents SDH6.
Protein Domain
Name: Outer envelope pore protein 21
Type: Family
Description: Oep21 has been identified in plants as an ATP-regulated anion-selective solute channel from the outer chloroplast membrane [ ]. Members of this family are also found in green algae.
Protein Domain
Name: Protein NIM1-INTERACTING 2
Type: Family
Description: NIMIN-2 is a member of a novel family of proteins from Arabidopsis (also consisting of NIMIN-1 and NIMIN-3) that interact with NPR1/NIM1, a key regulator of systemic acquired resistance in plants [ ].
Protein Domain
Name: Mediator of RNA polymerase II transcription subunit 30
Type: Family
Description: The large protein complex Mediator is a central coregulator of transcription in eukaryotes. It acts as a bridge to pass on the information from the transcription factors to the basal transcriptional unit at the core promoter, resulting in either transcriptional activation or repression. The plant Mediator subunits typically show very low homology to other species, plant Meds being closer to human than to yeast counterparts. The plant Mediator appears to be composed of one or more members of 34 subunits, as opposed to 25 and 30 members in yeast and metazoans, respectively [ ]. This entry represents subunit 30 of the plant complex.
Protein Domain
Name: Photosynthetic NDH subunit of subcomplex B 5, chloroplastic
Type: Family
Description: The chloroplast NADH dehydrogenase-like complex (NDH) was first discovered based on its similarity to complex I in respiratory electron transport. However, it seems to be a ferredoxin-dependent plastoquinone reductase rather than an NAD(P)H dehydrogenase. Furthermore, it also comprises a subcomplex on the stromal side of the thylakoid membrane and a lumenal subcomplex [ ]. The chloroplast NDH complex has been divided into four categories: membrane, lumen, A, and B subcomplexes. The subcomplex A and membrane subcomplex are conserved in cyanobacteria, but the subcomplex B and lumen subcomplex are specific to chloroplasts. PNSB5/ NDH18 is a subunit of subcomplex B [, ].
Protein Domain
Name: Protein of unknown function DUF2639
Type: Family
Description: This family is conserved in the Bacillaceae family. Several members are named as being YflJ, but the function is not known.
Protein Domain
Name: Microcystin synthetase, domain of unknown function
Type: Domain
Description: This domain of unknown function is found in cyanobacteria and is approximately 40 amino acids in length. It is found in association with , , , and and contains a conserved YAN motif. This region is found in microcystin synthetase, which is one of a number of proteins involved in microcystin toxin synthesis [ ]. Microcystins form a large family of small cyclic heptapeptides harbouring extensive modifications in amino acid residue composition and functional group chemistry. These peptide hepatotoxins contain a range of non-proteinogenic amino acids and unusual peptide bonds, and are typically N-methylated. They are synthesized on large enzyme complexes consisting of non-ribosomal peptide synthetases and polyketide synthases [].
Protein Domain
Name: T-box transcription factor 2/3, transcription activation domain
Type: Domain
Description: This region is found C-terminal to the T-box DNA binding domain in T-box transcription factors 2 and 3, which are highly homologous [ ]. This region may be a transcription activation domain (TAD) and contains two completely conserved residues (S and P) that may be functionally important []. T-box genes are highly conserved and encode transcription factors that are involved in morphogenesis and organogenesis of both invertebrates and vertebrates [, , ]. However, TBX2 is strongly re-expressed in adults which suggests a role in non-developmental processes [].
Protein Domain
Name: TraD coupling protein, N-terminal
Type: Domain
Description: This domain of unknown function is found in bacteria, and is typically between 96 and 107 amino acids in length. It is probably periplasmic and is found N-terminal to the DNA binding domain . TraD is a hexameric ring ATPase that is part of the bacterial F sex factor complex and involved in bacterial conjugation [ ], [], [].
Protein Domain
Name: Protein of unknown function DUF6703
Type: Family
Description: This family of putative integral membrane proteins is functionally uncharacterised. This family of proteins is found in actinobacteria. Proteins in this family are typically between 85 and 118 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF6804
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in bacteria. Proteins in this family are typically between 98 and 195 amino acids in length and adopt a helix-rich bundle structure and share a similar fold with a flagellin homologue, hook-associated protein.
Protein Domain
Name: Myotubularin-related 12-like C-terminal domain
Type: Domain
Description: This domain is found at the C terminus of myotubularin-related (MTMR) proteins MTMR10, MTMR11 and MTMR12; and is typically between 115 and 138 amino acids in length. Myotubularin-related protein 12 (also known as the phosphatidylinositol-3 phosphate 3-phosphatase adapter subunit, 3-PAP) is a catalytically inactive member of the myotubularin gene family, which plays a role as an adapter for the phosphatidylinositol 3-phosphate-3-phosphatase myotubularin [ ]. MTMR10 and MTMR11 are also catalytically inactive.
Protein Domain
Name: Protein of unknown function DUF6803
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in bacteria. Proteins in this family are typically between 162 and 222 amino acids in length. Structurally, proteins in this family are made of four-helix bundle.
Protein Domain
Name: Swinepox virus, C51
Type: Family
Description: This entry represents viral proteins with unknown function. The entry contains ORF C51 from Swinepox virus (strain Kasza).
Protein Domain
Name: Protein of unknown function DUF4784_N
Type: Domain
Description: This is a family of uncharacterised proteins from Bacteroidetes. This domain is found to the N-terminal of a WD40-like domain in DUF4784 family protein.
Protein Domain
Name: Protein of unknown function DUF2660
Type: Family
Description: This is a bacterial group of proteins with unknown function found in Rickettsiales.
Protein Domain
Name: Protein of unknown function DUF6806
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in proteobacteria. Proteins in this family are approximately 210 amino acids in length and their N-terminal share a similar structure with the Crispr-associated endoribonuclease Cas2.
Protein Domain
Name: Protein of unknown function DUF2659
Type: Family
Description: This bacterial family of proteins has no known function.
Protein Domain
Name: Domain of unknown function DUF6805
Type: Domain
Description: This presumed domain is functionally uncharacterised. This domain family is found in bacteria, and is approximately 130 amino acids in length. The family is found in association with , .
Protein Domain
Name: Domain of unknown function DUF6802
Type: Domain
Description: This presumed domain is functionally uncharacterised. This domain is found in actinobacteria and itr is typically between 101 and 135 amino acids in length.
Protein Domain
Name: Domain of unknown function DUF6801
Type: Domain
Description: This domain is found in a variety of bacterial extracellular proteins. It contains a β-barrel structure.
Protein Domain
Name: Domain of unknown function DUF6606
Type: Domain
Description: This entry represents a domain of unknown function found at the N-terminal of a variety of proteins that contain .
Protein Domain
Name: N,N-dimethylformamidase beta subunit-like, C-terminal
Type: Domain
Description: This entry represents a domain found at the C-terminal of bacterial proteins, including N,N-dimethylformamidase beta (DMFA2) subunit from Paracoccus [ ], the large subunit of dimethylformamidase (DMFase), an enzyme containing a Fe3 ion in the active site that catalyses the hydrolytic cleavage of the amide bond in N,N-dimethyl formamide. This domain is described as domain III, which adopts an α/β/α fold, with five β-strands that are parallel and sandwiched between α-helices. This domain is also found in some uncharacterised fungal sequences.
Protein Domain
Name: Spore photoproduct lyase, Bacilli type
Type: Family
Description: These enzymes are involved in the repair of UV radiation-induced DNA damage during spore germination [ , , ]. They can repair thymine dimer 5-thyminyl-5,6-dihydrothymine (known as spore photoproduct (SP)) by in situ monomerization of spore photoproduct to two thymines []. This family of enzymes uses the SAM as a cofactor (rather than in a stoichiometric manner) [, , ].This entry represents spore photoproduct lyases from Bacilli.
Protein Domain
Name: Spore photoproduct lyase
Type: Family
Description: Bacterial spores are remarkable in their resistance to chemical and physical stresses, including exposure to UV radiation. The unusual UV resistance of bacterial spores is a result of the unique photochemistry of spore DNA coupled with the efficient repair of accumulated damage. Exposure of bacterial spores to UV radiation results in the formation of a methylene-bridged thymine dimer, 5-thyminyl-5,6-dihydrothymine, a spore photoproduct, or SP, as the primary photoproduct. SP accumulates in UV-irradiated spores, however it is rapidly repaired upon germination, thus giving rise to the extraordinary UV resistance of bacterial spores. The repair of SP is catalyzed by the enzyme spore photoproduct lyase (SPL), and involves the direct reversal of SP to two thymines without base excision [ ].
Protein Domain
Name: Negative regulator of systemic acquired resistance SNI1
Type: Family
Description: SNI1 was isolated in a screen for negative regulators of NPR1, a transcriptional coactivator of plant pathogen defence gene PR1 []. SNl1 has been shown to cooperate with the transcription repressor CBNAC, which binds to cis-elements on the PR1 promoter, to suppress PR1 expression []. SNI1 also regulates DNA damage sensor proteins RAD17 and ATR, suggesting that DNA damage responses is an intrinsic component of the plant immune responses. SNI1 is a subunit of the Structural Maintenance of Chromosome (SMC) 5/6 complex involved in DNA damage responses [].
Protein Domain
Name: Protein PALE CRESS
Type: Family
Description: The PAC (PALE CRESS) protein is required for leaf and chloroplast development [ ]. PAC mutation arrests chloroplast development at an early stage, affecting the abundance and maturation of specific chloroplast-encoded transcripts. The PAC protein may be a nucleus-encoded factor that functions in plastid mRNA maturation and accumulation [].
Protein Domain
Name: Putative cell wall protein
Type: Family
Description: This family of plant proteins have not known function. It includes putative cell wall protein from Arabidopsis thaliana (At2g208700).
Protein Domain
Name: Type 2 DNA topoisomerase 6 subunit B-like, plants
Type: Family
Description: MTOPVIB (meiotic topoisomerase VIB-like) is a structural homologue of the archaeal topo VIB subunit [ ]. MTOPVIB forms a complex with the two Arabidopsis thaliana SPO11 orthologues (SPO11-1 and SPO11-2) and is essential for meiotic DSB formation []. The SPO11 protein catalyzes the formation of meiotic DNA double strand breaks (DSBs) and it is the eukaryotic homologue of the archaeal DNA topoisomerase VIA.
Protein Domain
Name: S-adenosyl-L-methionine-dependent tRNA 4-demethylwyosine synthase
Type: Family
Description: This entry respresents S-adenosyl-L-methionine-dependent tRNA 4-demethylwyosine synthase Tyw1 from eukaryotes and archaea. They are involved in the pathway wybutosine-tRNA(Phe) biosynthesis. It binds one [4Fe-4S] cluster and one [2Fe-2S]cluster. The [4Fe-4S] cluster is coordinated with three cysteines and an exchangeable S-adenosyl-L-methionine [, ]. The two carbon source required for this reaction has been identified as pyruvate. The exact role of the SAM molecule has yet to be determined. There appear to be two distinct sets of proteins with different multi-domain architectures, however they are sufficiently similar in the Radical SAM portion that they remain a single cluster [].Wybutosine is a hyper modified guanosine with a tricyclic base found at the 3'-position adjacent to the anticodon of eukaryotic phenylalanine tRNA. It is important for translational reading-frame maintenance [ ].
Protein Domain
Name: S-adenosyl-L-methionine-dependent tRNA 4-demethylwyosine synthase, archaea
Type: Family
Description: Members of this protein family are the archaeal protein TYW1 [ ], a radical SAM protein that catalyzes the second step in creating the wye-bases, wyosine and derivatives such as wybutosine, for tRNA base modification [].
Protein Domain
Name: Melanoplus sanguinipes entomopoxvirus (MsEPV), Orf214
Type: Family
Description: This entry is represented by a family of Orfs from Melanoplus sanguinipes entomopoxvirus (MsEPV), these include:Orf214, Orf215, Orf216, Orf217 and Orf62; they are a family of uncharacterised viral proteins.
Protein Domain
Name: Myb-related protein P/transcription factor Y1, C-terminal
Type: Domain
Description: This entry represents the C terminus of plant P/Y1 proteins. Members of this entry are transcriptional regulators of genes encoding enzymes for flavonoid biosynthesis [ , ]. P protein plays a role in the pathway leading to the production of a red phlobaphene pigment [], and P proteins are homologous to the DNA-binding domain of myb-like transcription factors []. This domain is associated with domain.
Protein Domain
Name: Spore photoproduct lyase, Clostridia type
Type: Family
Description: These enzymes are involved in the repair of UV radiation-induced DNA damage during spore germination. They can repair thymine dimer 5-thyminyl-5,6-dihydrothymine (known as spore photoproduct (SP)) by in situ monomerization of spore photoproduct to two thymines. This family of enzymes uses the SAM as a cofactor (rather than in a stoichiometric manner). This set of SPL enzymes is distinguished from the other SPL family because its fourth conserved Cys residue occurs before the CxxxCxxC motif, unlike in the other SPL family, in which is occurs after the CxxxCxxC motif [].This entry represents spore photoproduct lyases mostly from Clostridia.
Protein Domain
Name: Protein of unknown function DUF1202
Type: Family
Description: This family consists of several hypothetical bacterial proteins of around 335 residues in length. Members of this family are found exclusively in Escherichia coli and Salmonella species and are often referred to as YggM proteins. The function of this family is unknown.
Protein Domain
Name: Betacoronavirus, lineage A, NSP1
Type: Family
Description: This entry represents the N-terminal region of the betacoronavirus polyprotein, which contains non-structural protein 1 (Nsp1) from betacoronavirus lineage A. This protein is important for viral replication and pathogenesis [ , ]. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling.
Protein Domain
Name: Protein of unknown function DUF1203
Type: Family
Description: This family consists of several hypothetical bacterial proteins of around 155 residues in length. Family members are present in Rhizobium, Agrobacterium and Streptomyces species.
Protein Domain
Name: Transcription regulator HTH, AraC, N-terminal
Type: Domain
Description: This entry represents the N terminus of bacterial ARAC-type transcriptional regulators. In Escherichia coli these regulate the L-arabinose operon through sensing the presence of arabinose, and when the sugar is present, transmitting this information from the arabinose-binding domains to the protein s DNA-binding domains [ ]. This family might represent the N-terminal arm of the protein, which binds to the C-terminal DNA binding domains to hold them in a state where the protein prefers to loop and remain non-activating [].
Protein Domain
Name: DNA replication/recombination mediator RecO, N-terminal
Type: Domain
Description: This entry represents the N-terminal domain of the bacterial recombination protein O (RecO). This domain of RecO adopts a typical OB-fold DNA-binding structure; it contains two three-stranded beta sheets with strand B1 common between both sheets [ ]. The RecO OB-fold is unusual in that it interacts primarily with the DNA backbone and is capable of binding ss and dsDNA []. It is found in association with .
Protein Domain
Name: Bacteriophage DNA replication protein Gp16.7
Type: Family
Description: The early-expressed gene 16.7 is conserved in bacteriophage phi-29 and related phages. It encodes a membrane protein, GP16.7, consisting of an N-terminal transmembrane domain and a C-terminal DNA-binding and dimerisation domain. GP16.7 plays an important role in organising membrane-associated bacteriophage DNA replication [ , ]. The C-terminal domain has a similar secondary structure similar to homeodomains, but forms a fundamentally different tertiary structure consisting of a six-helical dimeric fold [ ]. Multimerisation of this dimer leads to efficient DNA binding.
Protein Domain
Name: Domain of unknown function DUF6604
Type: Domain
Description: This entry represents a putative domain found at the N-terminal of a wide range of fungal proteins of unknown function.
Protein Domain
Name: Protein of unknown function DUF2623
Type: Family
Description: This entry represents a group of proteins predominantly found in the Enterobacteriaceae family. Several members are named as YghW. The function is not known.
Protein Domain
Name: Domain of unknown function DUF6603
Type: Domain
Description: This entry represents a large potential domain found in uncharacterised fungal and bacterial proteins.
Protein Domain
Name: Domain of unknown function DUF1206
Type: Domain
Description: This region consists of two a pair of transmembrane helices and occurs three times in each of the family member proteins.
Protein Domain
Name: Protein of unknown function DUF2887
Type: Family
Description: This bacterial group of proteins has no known function.
Protein Domain
Name: Domain of unknown function DUF6602
Type: Domain
Description: This entry represents a bacterial domain of unknown function. This domain is sometimes associated with a domain. This could suggest this domain has a role in a bacterial cognate of a Ubiquitin-like pathway.
Protein Domain
Name: Bladder cancer-related BC10
Type: Family
Description: This family consists of a series of short proteins of around 90 residues in length. The human protein (or BC10) has been implicated in bladder cancer where the transcription of the gene coding for this protein is nearly completely abolished in highly invasive transitional cell carcinomas (TCCs) [ ]. The protein is a small globular protein containing two transmembrane helices, and it is a multiply edited transcript. All the editing sites are found in either the 5'-UTR or the N-terminal section of the protein, which is predicted to be outside the membrane. The three coding edits are all non-synonymous and predicted to encode exposed residues []. The function of this family is unknown.
Protein Domain
Name: Protein YfgG
Type: Family
Description: This entry represents Protein YfgG from Escherichia coli (strain K12) and similar proteins from Gammaproteobacteria. YfgG may be related to the resistance to the iron analogue gallium nitrate (Ga(NO3)3), and to the tolerance to nickel and cobalt stress [ , ]. However, the exact function is still unknown.
Protein Domain
Name: Protein of unknown function DUF6601
Type: Family
Description: This entry represents a family found in fungi. The function of this region is unknown. It appears to contain a pair of C-terminal transmembrane helices.
Protein Domain
Name: Protein of unknown function DUF1207
Type: Family
Description: This family consists of a number of hypothetical bacterial proteins of around 410 residues in length, which seem to be specific to Chlamydia species. The function of this family is unknown.
USDA
InterMine logo
The Legume Information System (LIS) is a research project of the USDA-ARS:Corn Insects and Crop Genetics Research in Ames, IA.
LegumeMine || ArachisMine | CicerMine | GlycineMine | LensMine | LupinusMine | PhaseolusMine | VignaMine | MedicagoMine
InterMine © 2002 - 2022 Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, United Kingdom