Search our database by keyword

- or -

Examples

  • Search this entire website. Enter identifiers, names or keywords for genes, pathways, authors, ontology terms, etc. (e.g. eve, embryo, zen, allele)
  • Use OR to search for either of two terms (e.g. fly OR drosophila) or quotation marks to search for phrases (e.g. "dna binding").
  • Boolean search syntax is supported: e.g. dros* for partial matches or fly AND NOT embryo to exclude a term

Search results 8601 to 8700 out of 30763 for seed protein

Category restricted to ProteinDomain (x)

0.029s

Categories

Category: ProteinDomain
Type Details Score
Protein Domain
Name: Antiactivator protein ExsD, domain 2
Type: Homologous_superfamily
Description: The antiactivator protein ExsD represses the transcriptional activator ExsA. ExsA activates expression of type III secretion system genes. ExsD inhibits the DNA-binding and self-association properties of ExsA [ ]. Repression of ExsA by ExsD is relieved by the secretion chaperone ExsC [, ].This superfamily represents the domain 2 of ExsD, which is consists of five helices and two beta sheets.
Protein Domain
Name: Antiactivator protein ExsD, domain 1
Type: Homologous_superfamily
Description: The antiactivator protein ExsD represses the transcriptional activator ExsA. ExsA activates expression of type III secretion system genes. ExsD inhibits the DNA-binding and self-association properties of ExsA [ ]. Repression of ExsA by ExsD is relieved by the secretion chaperone ExsC [, ].This superfamily represents the domain 1 of ExsD, located at the N terminus of the protein, which is exclusively helical.
Protein Domain
Name: Protein of unknown function DUF6183
Type: Family
Description: This entry represents a member of a biosynthetic gene cluster (BGC). This BGC (BGC0001726) is described by MIBiG as an example of the NRP (non-ribosomal peptide) and polyketide biosynthetic classes. It includes a member from the pactamides biosynthetic gene cluster from Streptomyces pactum []. This family appears to be predominantly found in Actinobacteria.
Protein Domain
Name: Protein of unknown function DUF6184
Type: Family
Description: This entry represents a member of a biosynthetic gene cluster (BGC). This BGC (BGC0000990) is described by MIBiG as an example of the polyketide biosynthetic class. It includes a member from the epothilone biosynthetic gene cluster from Sorangium cellulosum [ ]. This family appears to be predominantly found in Proteobacteria.
Protein Domain
Name: Protein of unknown function DUF3360
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 489 to 517 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF6182
Type: Family
Description: This entry represents a member of a biosynthetic gene cluster (BGC). This BGC (BGC0001774) is described by MIBiG as an example of the following biosynthetic class, other (unspecified). It includes a member from the spicamycin biosynthetic gene cluster from Streptomyces fimbriatus [ ]. This family appears to be predominantly found in Actinobacteria.
Protein Domain
Name: Protein of unknown function DUF3363
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 323 to 658 amino acids in length.
Protein Domain
Name: Bacteriodetes cell division protein FtsL-like
Type: Family
Description: This family of proteins share similarity with cell division protein FtsL, suggesting they may be cell division proteins. Proteins in this family are primarily found in Bacteroidetes.
Protein Domain
Name: Protein of unknown function DUF6180
Type: Family
Description: This entry represents a member of a biosynthetic gene cluster (BGC). This BGC (BGC0000385) is described by MIBiG as an example of the NRP (non-ribosomal peptide) biosynthetic class. It includes a member from the lysobactin biosynthetic gene cluster from Lysobacter sp. ATCC 53042 [ ].
Protein Domain
Name: Protein of unknown function DUF6181
Type: Family
Description: This entry represents a member of a biosynthetic gene cluster (BGC). This BGC (BGC0000703) is described by MIBiG as an example of the saccharide biosynthetic class. It includes a member from the kanamycin biosynthetic gene cluster from Streptomyces kanamyceticus [ ].
Protein Domain
Name: Protein of unknown function DUF6178
Type: Family
Description: This entry represents a member of a biosynthetic gene cluster (BGC). This BGC (BGC0001091) is described by MIBiG as an example of the NRP (non-ribosomal peptide) and polyketide biosynthetic classes. It includes a member from the corallopyronin biosynthetic gene cluster from Corallococcus coralloides [ ]. This family appears to be predominantly found in proteobacteria.
Protein Domain
Name: Protein of unknown function DUF3368
Type: Family
Description: This domain is functionally uncharacterised. This domain is found in bacteria and archaea. This presumed domain is about 50 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF6179
Type: Family
Description: This entry represents a member of a biosynthetic gene cluster (BGC). This BGC (BGC0001520) is described by MIBiG as an example of the polyketide biosynthetic class. It includes a member from the aurantinin B biosynthetic gene cluster from Bacillus subtilis [ ]. This family appears to be predominantly found in Firmicutes.
Protein Domain
Name: Protein of unknown function (DUF5137)
Type: Family
Description: This is a family of uncharacterised yeast proteins.
Protein Domain
Name: Ribosomal protein L10-like domain superfamily
Type: Homologous_superfamily
Description: Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [ , ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [ , ].This superfamily represents the N-terminal RNA-binding domain of the ribosomal protein L10 found in the large subunit (50S) of the bacterial ribosome. The entry contains homologues such as archaebacterial acidic ribosomal protein P0 homologue (L10E) and the eukaryotic 60S ribosomal protein P0 (L10E). The RNA-binding domain and the adjacent domain are thought to exist in archaeal L10 and eukaryotic P0 proteins only. Structurally, this domain contains three α-helices on each side of the five-stranded anti-parallel β-sheet.The large ribosomal subunit contains a highly flexible and functionally important lateral protuberance called the 'stalk' (L7/L12 stalk in Bacteria, L12 stalk in Archaea, and P1/P2 stalk in Eukarya). The stalk is involved in the formation of the so-called 'GTPase-associated site' and plays a key role in the interaction of ribosome with translation factors and in the control of translation accuracy [ ]. The L7/12 stalk is formed by two or three copies of the L7/L12 dimer bound to the C-terminal tail of protein L10 []. The N-terminal domain of L10 binds to a segment of domain II of 23S rRNA near the binding site for ribosomal protein L11 [].
Protein Domain
Name: Glyoxalase domain-containing protein 4, C-terminal
Type: Domain
Description: Glyoxalase domain-containing protein 4 (GLOD4) contains the vicinal oxygen chelate (VOC) domain. The function of GLOD4 is not clear. The C. elegans member in this entry, known as glyoxalase 1 or CeGly, has been shown to be involved in the MG modifications of mitochondrial proteins and mitochondrial ROS production [ ]. This entry represents the C-terminal domain of GLOD4.
Protein Domain
Name: Protein of unknown function DUF3404
Type: Family
Description: This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 260 amino acids in length. This domain is found associated with , .
Protein Domain
Name: Protein of unknown function DUF3405
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 636 to 810 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3408
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 128 to 160 amino acids in length.
Protein Domain
Name: Carbohydrate-binding protein CttA, X module
Type: Domain
Description: This entry represents a putative carbohydrate-binding domain conserved in Ruminococcus, which sits N-terminal to a dockerin repeat; the protein may be a component of the Ruminococcus cellulosome system. This X module does not share similarities with other known X modules from cellulolytic bacteria and may have a structural role [ ].
Protein Domain
Name: Protein of unknown function DUF3419
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 398 to 802 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3389
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are about 80 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3391
Type: Domain
Description: This functionally uncharacterised, N-terminal domain is found in a number of bacterial proteins, including Cyclic di-GMP phosphodiesterase PA4108. It is often found associated with the HD domain .
Protein Domain
Name: Protein of unknown function DUF3392
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are about 110 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3394
Type: Family
Description: This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 190 amino acids in length. This domain is found associated with .
Protein Domain
Name: Protein of unknown function DUF3396
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria and viruses. Proteins in this family are typically between 302 to 382 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3400
Type: Family
Description: This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 50 amino acids in length. This domain is found associated with , , .
Protein Domain
Name: Protein of unknown function DUF3370
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 452 to 532 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3373
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3375
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 479 to 499 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3379
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 234 to 251 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3383
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria and viruses. Proteins in this family are typically between 356 to 501 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3386
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are about 220 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3472
Type: Family
Description: This presumed domain is functionally uncharacterised. This domain is found in bacteria, eukaryotes and viruses. This domain is typically between 174 to 190 amino acids in length. This domain has a single completely conserved residue G that may be functionally important.
Protein Domain
Name: Tetratricopeptide repeat protein 7, N-terminal
Type: Domain
Description: This is the N-terminal domain of TTC7, a protein that forms the regulatory subunit of the PI4KIIIalpha complex (also known as PI4KA). This complex is composed of PI4KIIIalpha, TTC7 and FAM126A and catalyses the first step in plasma membrane phosphoinositide synthesis. This TTC7 N-terminal domain contains basic residues suitable for interaction with the acidic inner leaflet of the plasma membrane and is required for localising the active site near its substrate. TTC7 acts as a bridge between PI4KA and EFR3B-FAM126A, via direct interactions. ERF3B is not part of the complex but contributes to its recruitment to the membrane [ , ].
Protein Domain
Name: Protein angel homolog 2, N-terminal
Type: Domain
Description: This entry represents the N-terminal unstructured domain of protein angel homolog 2 (ANGEL2) [ ]. This protein is a member of the CCR4 nocturin family of deadenylases and has 2',3'-cyclic phosphatase activity involved in RNA processing and modification [].
Protein Domain
Name: Protein of unknown function DUF3455
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 174 to 251 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3460
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are about 70 amino acids in length. This protein has a conserved WDK sequence motif.
Protein Domain
Name: Protein of unknown function DUF3465
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 131 to 151 amino acids in length. This protein has a conserved HWTH sequence motif.
Protein Domain
Name: Protein of unknown function DUF3467
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria, archaea and viruses. Proteins in this family are typically between 101 to 118 amino acids in length.
Protein Domain
Name: DNA replication terminus site-binding protein
Type: Family
Description: This entry contains several bacterial DNA replication terminus site-binding proteins (also known as Tus or Ter-binding proteins). They are required for the termination of DNA replication and function by binding to DNA replication terminator sequences, thus preventing the passage of replication forks [ ]. The termination efficiency is affected by the affinity of a particular protein for the terminator sequence.
Protein Domain
Name: Protein of unknown function DUF3433
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 784 to 1240 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3435
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 435 to 791 amino acids in length. This family is related to suggesting it may be an integrase enzyme.
Protein Domain
Name: Protein of unknown function DUF3440
Type: Family
Description: This presumed domain is functionally uncharacterised. This domain is found in bacteria. This domain is typically between 53 to 190 amino acids in length. This domain is found associated with . This domain has a conserved KND sequence motif.
Protein Domain
Name: Protein of unknown function DUF3443
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 400 to 434 amino acids in length. This protein has two conserved sequence motifs: NPV and DNNG.
Protein Domain
Name: Protein of unknown function DUF3422
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria, archaea and eukaryotes. Proteins in this family are typically between 426 to 444 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3425
Type: Family
Description: This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is typically between 120 to 143 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3426
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 262 to 463 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3427
Type: Family
Description: This presumed domain is functionally uncharacterised. This domain is found in bacteria and archaea. This domain is typically between 243 to 275 amino acids in length. This domain is found associated with , .
Protein Domain
Name: Protein of unknown function DUF3429
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 147 to 245 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF3431
Type: Family
Description: This family of proteins are functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 291 to 390 amino acids in length. This protein has a conserved NLRC sequence motif.
Protein Domain
Name: Protein of unknown function DUF4680
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 65 and 178 amino acids in length. There are two conserved sequence motifs: VISRM and ENE.
Protein Domain
Name: Protein of unknown function DUF4679
Type: Family
Description: Proteins in this family are typically between 213 and 412 amino acids in length and are found in chordates. They are also known as proline-rich protein 30.
Protein Domain
Name: Protein of unknown function DUF4678
Type: Family
Description: This family of proteins includes the uncharacterized testis-expressed protein 44 (in humans the protein is a product of C2orf57). Proteins in the family are found in chordates and are typically between 318 and 395 amino acids in length.
Protein Domain
Name: Zinc finger C4H2 domain-containing protein
Type: Family
Description: Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [ , , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Protein Domain
Name: Protein of unknown function DUF4681
Type: Family
Description: This family of proteins is found in chordates and includes human uncharacterised protein C12orf54. Proteins in this family are typically between 101 and 127 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4677
Type: Family
Description: This uncharacterised family is also known as coiled-coil domain-containing protein 184 (CCDC184).
Protein Domain
Name: Protein of unknown function DUF4670
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.
Protein Domain
Name: CRISPR-associated protein Cas7, subtype I-B/I-C
Type: Family
Description: The CRISPR-Cas system is a prokaryotic defense mechanism against foreign genetic elements. The key elements of this defense system are the Cas proteins and the CRISPR RNA. Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes [ ]. CRISPRs appear to provide acquired resistance against mobile genetic elements (viruses, transposable elements and conjugative plasmids). CRISPR clusters contain sequences complementary to antecedent mobile elements and target invading nucleic acids. CRISPR clusters are transcribed and processed into CRISPR RNA (crRNA).The defense reaction is divided into three stages. In the adaptation stage, the invader DNA is cleaved, and a piece of it is selected to be integrated as a new spacer into the CRISPR locus, where it is stored as an identity tag for future attacks by this invader. During the second stage (the expression stage), the CRISPR RNA (pre-crRNA) is transcribed and subsequently processed into the mature crRNAs. In the third stage (the interference stage), Cas proteins, together with crRNAs, identify and degrade the invader [ , , ].The CRISPR-Cas systems have been sorted into three major classes. In CRISPR-Cas types I and III, the mature crRNA is generally generated by a member of the Cas6 protein family. Whereas in system III the Cas6 protein acts alone, in some class I systems it is part of a complex of Cas proteins known as Cascade (CRISPR-associated complex for antiviral defense). The Cas6 protein is an endoribonuclease necessary for crRNA production whereas the additional Cas proteins that form the Cascade complex are needed for crRNA stability [ ]. This entry represents Cas7 from the I-B (Csh2) and I-C (Csd2) subtypes. The Cascade I-B complex requires Cas5, Cas6, and Cas7 for maintaining a stable crRNA population [ ].
Protein Domain
Name: Batten's disease protein Cln3, subgroup
Type: Family
Description: Batten's disease, the juvenile variant of neuronal ceroid lipofuscionosis (NCL), is a recessively inherited disorder affecting children of 5-10years of age. The disease is characterised by progressive loss of vision, seizures and psychomotor disturbances. Biochemically, the disease ischaracterised by lysosomal accumulation of hydrophobic material, mainly ATP synthase subunit C, largely in the brain but also in other tissues. The disease is fatal within a decade [].Mutations in the CLN3 gene are believed to cause Batten's disease [ ]. TheCLN3 gene, with a predicted 438-residue product, maps to chromosome p16p12.1. The gene contains at least 15 exons spanning 15kb and is highly conserved in mammals [ ]. A 1.02kb deletion in the CLN3 gene, occurring in either one or both alleles, is found in 85% of Batten disease chromosomes causing a frameshift generating a predicted translated product of 181 amino acid residues [, ]. 22 other mutations, including deletions, insertions and point mutations, have beenreported. It has been suggested that such mutations result in severely truncated CLN3 proteins, or affect its structure/conformation [, ].CLN3 proteins, which are believed to associate in complexes, are heavily glycosylated lysosomal membrane proteins [], containing complex Asn-linkedoligosaccharides [ ]. Extensive glycosylation is important for the stabilityof these lysosomal proteins in the highly hydrolytic lysosomal lumen. Lysosomal sequestration of active lysosomal enzymes, transport of degraded moleculesfrom the lysosomes, and fusion and fission between lysosomes and other organelles. The CLN3 protein is a 43kDa, highly hydrophobic, multi-transmembrane (TM),phosphorylated protein [ ]. Hydrophobicity analysis predicts 6-9 TMsegments, suggesting that CLN3 is a TM protein that may function as a chaperone or signal transducer. The majority of putative phosphorylationsites are found in the N-terminal domain, encompassing 150 residues [ ].Phosphorylation is believed to be important for membrane compartment interaction, in the formation of functional complexes, and in regulation and interactions with other proteins [ ].CLN3 contains several motifs that may undergo lipid post-translational modifications (PTMs). PTMs contribute to targeting and anchoring of modifiedproteins to distinct biological membranes [ ]. There are three general classes of lipid modification: N-terminal myristoylation, C-terminal prenylation, and palmitoylation of cysteine residues. Such modifications are believed to be a common form of PTM occurring in 0.5% of all cellular proteins, including brain tissue []. The C terminus of the CLN3 containsvarious lipid modification sites: C435, target for prenylation; G419, target for myristoylation; and C414, target for palmitoylation [].Prenylation results in protein hydrophobicity, influences interaction with upstream regulatory proteins and downstream effectors, facilitates protein-protein interaction (multisubunit assembly) and promotes anchoring tomembrane lipids. The prenylation motif, Cys-A-A-X, is highly conserved within CLN3 protein sequences of different species [].Species with known CLN3 protein homologues include: Homo sapiens, Canis familiaris, Mus musculus, Saccharomyces cerevisiae and Drosophila melanogaster.
Protein Domain
Name: Protein of unknown function DUF4672
Type: Family
Description: This family of proteins is found in chordates. Proteins in this family are typically between 165 and 199 amino acids in length.
Protein Domain
Name: Mitochondrial biogenesis regulation protein 1
Type: Family
Description: The function of Mbr1 is not clear. It has a paralogue, Mbr3, also known as Isf1. They are serine-rich, hydrophilic proteins. Overexpression of Mbr1 and Mbr3 can suppress the growth defect of hap2, hap3 and hap4 mutants. They may be involved in mitochondrial functions and stress response [ , ].
Protein Domain
Name: Prostate androgen-regulated mucin-like protein 1
Type: Family
Description: Human PARM-1 is a mucin-like, androgen-regulated transmembrane protein present in most tissues, with high levels in the heart, kidney and placenta [ ]. It is induced and expressed in prostate after castration [] and may have a role in cell proliferation and immortalisation in prostate cancer [, ]. PARM-1 also contributes to ovulation and/or luteal function by acting as a novel regulator of progesterone metabolism [].
Protein Domain
Name: Outer spore wall protein 5
Type: Family
Description: The outermost cape of the fungal spore wall is made up of a polymer that contains cross-linked amino acid dityrosine, which is important for the stress resistance of the spore [ ]. The OSW family of proteins have been implicated in assembly of this protective dityrosine coat. This entry represents Osw5, which is a predicted integral membrane protein [ ]. The Osw5 null mutant spores show an enhanced spore wall permeability and vulnerability to beta glucanase digestion [].
Protein Domain
Name: Phage tail tube protein 1
Type: Family
Description: This is a family of phage tail tube proteins from virus [ ].
Protein Domain
Name: Mitochondrial coiled-coil domain protein 1
Type: Family
Description: This is a family of uncharacterised proteins known as mitochondrial coiled-coil domain protein 1 [ ].
Protein Domain
Name: Protein of unknown function DUF4688
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 331 and 596 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF5100
Type: Family
Description: This is a family of microsporidia-specific proteins of unknown function.
Protein Domain
Name: Protein of unknown function DUF5101
Type: Family
Description: This is a family of short microsporidia-specific proteins of unknown function.
Protein Domain
Name: Tight junction-associated protein 1 domain
Type: Domain
Description: This domain is found in members of the Tight junction-associated protein 1 (Tjap1) family, also known as Pilt family, a family of eukaryotic tight junction-proteins [ ]. Pilt is a component of Tight junctions (TJs) rather than Adhesin junctions (AJs). TJs function as a barrier preventing solutes and water from passing freely through the paracellular pathway. TJs consist of transmembrane proteins (claudin, occludin, and JAM) plus many peripheral membrane proteins and cell polarity molecules.Pilt is a novel peripheral membrane protein which is incorporated into TJs after TJ strands are formed, thereby the name Pilt stands for 'protein incorporated later into TJs'. Pilt binds to the guanylate-kinase region of disk large homologue/synapse-associated protein 97 (hDlg/SAP97) [ ].
Protein Domain
Name: Protein of unknown function DUF4689
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 202 and 224 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4687
Type: Family
Description: This family includes uncharacterised protein C11orf71. Proteins in this family are typically between 76 and 140 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4684
Type: Family
Description: This family of proteins is found in chordates. Proteins in this family are typically between 531 and 1277 amino acids in length.
Protein Domain
Name: Spermatogenesis-associated protein 1, C-terminal domain
Type: Domain
Description: The spermatogenesis associated 1 (SPATA1) protein is thought to be involved in spermatogenesis and is a potential marker for male fertility []. This entry represents the C-terminal domain of SPATA1.
Protein Domain
Name: Protein of unknown function DUF4686
Type: Family
Description: This family includes proteins described as coiled-coil domain-containing protein 30. Proteins in this family are typically between 498 and 775 amino acids in length. There is a conserved DLK sequence motif.
Protein Domain
Name: Protein of unknown function DUF697
Type: Family
Description: Proteins in this entry have no known function.
Protein Domain
Name: Protein of unknown function DUF1093
Type: Family
Description: These are a family of small (about 115 amino acids) uncharacterised proteins with N-terminal signal sequences, found predominantly in Gram-positive organisms. Most genomes that have any members of this family have at least two members. This entry includes YxeA from Bacillus subtilis, which is transcriptionally regulated by YxdJ [ ].
Protein Domain
Name: Histidine-rich carboxyl terminus protein 1
Type: Family
Description: The function of this family of proteins is not known.
Protein Domain
Name: Protein of unknown function DUF5098
Type: Family
Description: This entry represents a group of microsporidia-specific proteins with no known function. There is a very characteristic NPW sequence motif at the very C terminus.
Protein Domain
Name: Protein of unknown function DUF5099
Type: Family
Description: This is a family of microsporidia-specific sequences with unknown function.
Protein Domain
Name: Protein of unknown function DUF1993
Type: Family
Description: This family of proteins are functionally uncharacterised.
Protein Domain
Name: Putative autolysin regulatory protein ArpU-like
Type: Family
Description: This entry represents a group of proteins that includes Putative autolysin regulatory protein ArpU. ArpU was originally described as a regulator of cellular muramidase-2 of Enterococcus hirae [ ], but appears to have been cloned from a prophage. This family appears related to the RinA family of bacteriophage transcriptional activators and to some sporulation-specific sigma factors, and it is proposed to be a phage transcriptional activator family. Members of this protein family are found in tailed bacteriophages (Caudovirales) and in Bacilli [].
Protein Domain
Name: Protein of unknown function DUF5088
Type: Family
Description: This is a family of microsporidial sequences of unknown function.
Protein Domain
Name: DNA damage-induced apoptosis suppressor protein
Type: Family
Description: This entry includes the DNA damage-induced apoptosis suppressor protein (DDIAS), also known as Noxin (Nitric oxide inducible), which is an anti-apoptotic protein involved in DNA repair and cell survival. This protein is induced by a wide range of stressors but it has been shown that it acts when there is DNA damage, suppressing cell death in order to prevent from DNA lesions [ , ]. This protein is highly expressed in cancer tissues and cells, being a potential therapeutic target in cancer. Recently, it has been demonstrated that DDIAS knockdown caused growth inhibition of lung cancer cells. As it is constitutively activated in malignant cancers, this confirms its utility as therapeutic target [].
Protein Domain
Name: Protein of unknown function DUF2018
Type: Family
Description: This entry represents proteins of unknown function. The crystal structure of HP0242, a hypothetical protein from Helicobacter pylori (Campylobacter pylori) has been determined. It reveals an acid-adaptive protein possibly of physiological significance when H. pylori colonises the human stomach. The protein adopts a unique four α-helical triangular conformations. The biologically active form is thought to be a tetramer. The gene exist in an operon along with 6 other genes where the gene products appear to be related to iron storage and haem biosynthesis [ ].
Protein Domain
Name: Protein of unknown function DUF5089
Type: Family
Description: This is a family of largely microsporidial-specific proteins of unknown function. There is distant homology to synaptosomal-associated 25 family proteins [ ].
Protein Domain
Name: Protein of unknown function DUF5087
Type: Family
Description: This is a family of microsporidial sequences of unknown function.
Protein Domain
Name: Protein of unknown function YopT
Type: Domain
Description: The function of the YopT protein is not known, though it has been structurally characterised. The protein is a homo-dimer, with each monomer consisting of one alpha helix and three beta strands.
Protein Domain
Name: Protein of unknown function DUF5090
Type: Family
Description: This is a microsporidial-specific family of proteins with unknown function. The family is likely to have four transmembrane domains.
Protein Domain
Name: Protein of unknown function DUF5092
Type: Family
Description: This entry represent a group of largely microsporidial sequences with unknown function. There is one transmembrane domain towards the C terminus.
Protein Domain
Name: Protein of unknown function DUF5093
Type: Family
Description: This entry represents a group of microsporidial sequences that may be distantly related to Ribosomal RNA-processing protein 7 ( ).
Protein Domain
Name: Protein of unknown function DUF4702
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 346 and 637 amino acids in length.
Protein Domain
Name: Interactor of HORMAD1 protein 1
Type: Family
Description: Interactor of HORMAD1 protein 1 (IHO1, previously known as coiled-coil domain-containing protein 36 or DUF4700) is required for DNA double-strand breaks (DSBs) formation in unsynapsed regions during meiotic recombination. It is thought to function, in collaboration with SPO11-auxiliary proteins MEI4 and REC114, through the formation of DSB-promoting recombinosomes on chromatin at the onset of meiosis [ ].
Protein Domain
Name: Protein of unknown function DUF4699
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 303 and 319 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF5091
Type: Family
Description: This is a group of microsporidial-specific proteins with unknown function.
Protein Domain
Name: Protein of unknown function DUF4698
Type: Family
Description: This family of proteins is found in eukaryotes. They are also known as coiled-coil domain-containing protein 60. Proteins in this family are typically between 464 and 550 amino acids in length.
Protein Domain
Name: Armadillo-like helical domain-containing protein 4
Type: Family
Description: The function of ARMH4 is not clear.
Protein Domain
Name: Protein of unknown function DUF2004
Type: Family
Description: This is a family of proteins with unknown function. The structure of one of the proteins in this family has revealed a novel α-β fold [ ].
Protein Domain
Name: Protein of unknown function DUF4695
Type: Family
Description: This uncharacterised family has a conserved RFKTQP sequence motif.
Protein Domain
Name: Protein of unknown function DUF4694
Type: Family
Description: This uncharacterised family includes human protein TNT or C16orf82.
USDA
InterMine logo
The Legume Information System (LIS) is a research project of the USDA-ARS:Corn Insects and Crop Genetics Research in Ames, IA.
LegumeMine || ArachisMine | CicerMine | GlycineMine | LensMine | LupinusMine | PhaseolusMine | VignaMine | MedicagoMine
InterMine © 2002 - 2022 Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, United Kingdom