Search our database by keyword

- or -

Examples

  • Search this entire website. Enter identifiers, names or keywords for genes, pathways, authors, ontology terms, etc. (e.g. eve, embryo, zen, allele)
  • Use OR to search for either of two terms (e.g. fly OR drosophila) or quotation marks to search for phrases (e.g. "dna binding").
  • Boolean search syntax is supported: e.g. dros* for partial matches or fly AND NOT embryo to exclude a term

Search results 7501 to 7600 out of 30763 for seed protein

Category restricted to ProteinDomain (x)

0.051s

Categories

Category: ProteinDomain
Type Details Score
Protein Domain
Name: Ribosomal protein L19e, domain 2
Type: Homologous_superfamily
Description: Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [ , ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [ , ].Ribosomal protein L19 from eukaryotes and L19e from archaea [ ] form part of the large ribosomal subunit, whose structure has been determined in a number of eukaryotic and archaeal species []. L19/L19e is a multi-helical protein consisting of two different 3-helical domains connected by a long, partly helical linker. This superfamily represents an α-helical linker domain that assumes an up-and-down bundle topology from archaeal ribosomal protein L19e.
Protein Domain
Name: Protein of unknown function DUF4576
Type: Family
Description: This family of uncharacterised proteins is found in eukaryotes.
Protein Domain
Name: Protein of unknown function DUF4636
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 196 and 244 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4634
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 98 and 133 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4556
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in vertebrates and includes human protein C1orf127.
Protein Domain
Name: Ral GTPase-activating protein subunit beta
Type: Family
Description: Ral GTPase-activating protein subunit beta (RALGAPB) is a non-catalytic subunit of the heterodimeric RalGAP1 and RalGAP2 complexes which act as GTPase activators for the Ras-like small GTPases RALA and RALB [ ].
Protein Domain
Name: Repeat of uncharacterized protein PH0542
Type: Repeat
Description: Repeat found in PH0542 showing some sequence similarity to HEAT repeat.
Protein Domain
Name: Protein of unknown function (DUF4436)
Type: Family
Description: This is a family of membrane and transmembrane proteins. The function is not known.
Protein Domain
Name: Protein of unknown function DUF5557
Type: Family
Description: This is a family of unknown function found in primates.
Protein Domain
Name: RNA polymerase II-associated protein RPAP1/Rba50
Type: Family
Description: This entry represents a group of RNA polymerase II-associated proteins, including RPAP1 from animals and Rba50 from yeasts. RPAP1 interacts with the RPB3 and RPB11 subunits of the RNA Pol II complex and plays an important role in mediator-driven transcription and cell identity [ ].
Protein Domain
Name: Protein of unknown function DUF5535
Type: Family
Description: This is a family of unknown function found in mammals.
Protein Domain
Name: Recombination protein RecR, conserved site
Type: Conserved_site
Description: The bacterial protein RecR is an important regulator in the RecFOR homologous recombination pathway during DNA repair [ , , , ]. It acts with RecF and RecO forming a complex that facilitates the loading of RecA onto ssDNA [, ]. RecR is a zinc metalloprotein consisting of a N-terminal helix-hairpin-helix (HhH) motif, a middle region containing a zinc finger motif and a Toprim domain, and a C-terminal domain comprising a divergent Walker B motif and a C-terminal helix [, ]. This conserved site represents the C4-type zinc finger which contains four strictly conserved cysteine residues that coordinates a zinc ion. Mutations in this domain affects bacterial survival suggesting that it plays an important role, likely in DNA binding [ ].
Protein Domain
Name: Protein of unknown function DUF4606
Type: Family
Description: This family is found in eukaryotes.
Protein Domain
Name: Protein of unknown function DUF4609
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 70 and 139 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4246
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4099
Type: Family
Description: This family of uncharacterised proteins was found by clustering human gut metagenomic sequences [ ].
Protein Domain
Name: Protein of unknown function DUF4247
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4248
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4251
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4252
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4249
Type: Family
Description: This family of proteins is functionally uncharacterised. There are two completely conserved residues (C and G) that may be functionally important.
Protein Domain
Name: Protein of unknown function DUF4250
Type: Family
Description: This family of proteins is functionally uncharacterised. There are two completely conserved residues (N and R) that may be functionally important.
Protein Domain
Name: MAP kinase-activating death domain protein
Type: Family
Description: MAP kinase-activating death domain protein (MADD) regulates cell proliferation, survival and death through alternative mRNA splicing. Different isoforms have different effects: isoform 5 increases cell proliferation whereas isoform 2 decreases it; isoform 1 is susceptible to inducing apoptosis, isoform 5 is resistant [ ]. MADD activates RAB3A, RAB3C and RAB3D by converting them from GDP-bound inactive forms to GTP-bound active forms []. MADD is a component of the TNFRSF1A signaling complex, linking the type 1 tumor necrosis factor receptor TNFRSF1A with MAP kinase activation []. MADD contains a tripartite DENN domain. In the nematode Caenorhabditis elegans, MADD is also known as regulator of presynaptic activity aex-3 [].
Protein Domain
Name: Protein of unknown function DUF4633
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 94 and 123 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4238
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4239
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4495
Type: Family
Description: This family is found in eukaryotes, and is typically between 322 and 336 amino acids in length. There are two conserved sequence motifs: QMW and DLW. Proteins in this family vary in length from 793 to 1184 amino acids.
Protein Domain
Name: Protein of unknown function DUF5559
Type: Family
Description: This is a family of unknown function found in primates.
Protein Domain
Name: Protein Largen/Inhibitory synaptic factor 1
Type: Family
Description: This entry includes protein Largen and INSYN1 (inhibitory synaptic factor 1). Largen regulates cell size control [ ]. INSYN1 is part of the complex that mediates postsynaptic inhibition [].
Protein Domain
Name: Protein of unknown function DUF4241
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4244
Type: Family
Description: This family of proteins is functionally uncharacterised. There is a conserved EYA sequence motif.
Protein Domain
Name: Protein of unknown function DUF4245
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Rho-associated protein kinase 1, HR1
Type: Domain
Description: Rho-associated protein kinases (ROCKs) were originally identified as small GTPase Rho effectors. Later, ROCKs were found actively phosphorylating many actin-binding proteins and intermediate filament proteins to modulate their functions [ ]. Two ROCK isoforms have been identified:ROCK1 (ROKb, p160ROCK) and ROCK2. As major downstream effectors of the small GTPase RhoA, they regulate cellular contraction, motility, morphology, polarity, cell division, and gene expression [ , , ].This entry represents the HR1 domain found in Rho-associated protein kinase 1 (ROCK1), which is essential for the formation of stress fibres [ ]. ROCK1 contains an N-terminal extension, a catalytic kinase domain, and a long C-terminal extension, which contains a Rho-binding HR1 domain and a pleckstrin homology (PH) domain. ROCK1 is auto-inhibited by HR1 and PH domains interacting with the catalytic domain. HR1 domains are anti-parallel coiled-coil (ACC) domains that bind small GTPases from the Rho family [].
Protein Domain
Name: Protein kinase C-like, HR1 domain
Type: Domain
Description: This entry represents the second HR1 domain found in fungal PKC-like proteins including Pkc1p from Saccharomyces cerevisiae, and Pck1p and Pck2p from Schizosaccharomyces pombe. The yeast PKC-like proteins play a critical role in regulating cell wall biosynthesis and maintaining cell wall integrity [ ]. They contain two HR1 domains, C2 and C1 domains, and a kinase domain. HR1 domains are anti-parallel coiled-coil (ACC) domains that bind small GTPases from the Rho family []. The HR1 domains of Pck1p and Pck2p interact with GTP-bound Rho1p and Rho2p [].
Protein Domain
Name: Intraflagellar transport protein 52 homolog
Type: Family
Description: This entry includes mammalian intraflagellar transport protein 52 homolog (IFT52) and its homologues from bacteria. IFT52 is a component of the IFT complex B complex required for the movement of components required for the assembly, maintenance and function of primary cilia. Transport occurs in both directions [ ]. Mutations in the IFT52 gene can lead to short-rib thoracic dysplasia 16 with or without polydactyly [].
Protein Domain
Name: CRISPR-associated protein Cas5, Tneap type
Type: Family
Description: The CRISPR-Cas system is a prokaryotic defense mechanism against foreign genetic elements. The key elements of this defense system are the Cas proteins and the CRISPR RNA. Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes [ ]. CRISPRs appear to provide acquired resistance against mobile genetic elements (viruses, transposable elements and conjugative plasmids). CRISPR clusters contain sequences complementary to antecedent mobile elements and target invading nucleic acids. CRISPR clusters are transcribed and processed into CRISPR RNA (crRNA).The defense reaction is divided into three stages. In the adaptation stage, the invader DNA is cleaved, and a piece of it is selected to be integrated as a new spacer into the CRISPR locus, where it is stored as an identity tag for future attacks by this invader. During the second stage (the expression stage), the CRISPR RNA (pre-crRNA) is transcribed and subsequently processed into the mature crRNAs. In the third stage (the interference stage), Cas proteins, together with crRNAs, identify and degrade the invader [ , , ].The CRISPR-Cas systems have been sorted into three major classes. In CRISPR-Cas types I and III, the mature crRNA is generally generated by a member of the Cas6 protein family. Whereas in system III the Cas6 protein acts alone, in some class I systems it is part of a complex of Cas proteins known as Cascade (CRISPR-associated complex for antiviral defense). The Cas6 protein is an endoribonuclease necessary for crRNA production whereas the additional Cas proteins that form the Cascade complex are needed for crRNA stability []. This entry represents the Cas5 family of proteins (Tneap-type), which includes TM1800 from Thermotoga maritima [ ]. Cas5 is found within a cluster of three cas genes associated with CRISPR structures in many bacterial species, named cas1B, cas5 and cas6 [].,
Protein Domain
Name: Immune mapped protein 2, N-terminal
Type: Domain
Description: This domain is conserved across both IMP1 and IMP2 families. It is suggested that the globular domain likely contributes to a shared function, hence it is termed 'IMP1-like domain' [ ].
Protein Domain
Name: Protein of unknown function DUF4227
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4446
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in bacteria. Proteins in this family are typically between 165 and 176 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4231
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4232
Type: Family
Description: This is a family of uncharacterised proteins, many of which are thought to be lipoproteins.
Protein Domain
Name: Protein of unknown function DUF4229
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4230
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4235
Type: Family
Description: This family of integral membrane proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4233
Type: Family
Description: This family consists of functionally uncharacterised integral membrane proteins.
Protein Domain
Name: Protein of unknown function DUF5639
Type: Domain
Description: This is a domain of unknown function which is mainly found in Deinococcus-Thermus.
Protein Domain
Name: Protein of unknown function DUF4164
Type: Family
Description: This protein family of unknown function contains several conserved LE/LD sequence pairs.
Protein Domain
Name: Protein of unknown function DUF4489
Type: Family
Description: This family of uncharacterised proteins is found mainly in Clostridium.
Protein Domain
Name: Protein of unknown function DUF5571
Type: Family
Description: This entry includes a group of vertebrate proteins, including C12orf50 from humans.
Protein Domain
Name: Protein of unknown function DUF4621
Type: Family
Description: This family of proteins is functionally uncharacterised. It is found in bacteria (Bacteroidales). Proteins in this family are approximately 350 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4223
Type: Family
Description: This family of proteins is functionally uncharacterised. Proteins in this family are approximately 60 amino acids in length and are predicted to be to be lipoproteins.
Protein Domain
Name: Protein of unknown function DUF4221
Type: Family
Description: This protein family contains highly conserved asparagine and cysteine residues. The function is not known.
Protein Domain
Name: Protein of unknown function DUF4655
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 533 and 570 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4222
Type: Family
Description: These short proteins are largely found in enteric bacteria and are likely to be of phage origin. Their molecular function is unknown.
Protein Domain
Name: Protein of unknown function DUF4644
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 143 and 191 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF5527
Type: Family
Description: This family of uncharacterized proteins is found in vertebrates.
Protein Domain
Name: Protein of unknown function DUF4298
Type: Family
Description: This family of bacterial proteins includes sHIP ( ) from Streptococcus pyogenes. This protein is though to play an important role in the virulence of the bacteria. This protein shows a unique tetrameric organisation composed of four helix-loop-helix motifs [ ].
Protein Domain
Name: Protein of unknown function DUF4299
Type: Family
Description: This family is functionally uncharacterised. It contains two conserved sequence motifs (RGF and DAY), and two completely conserved residues (P and D) that may be functionally important.
Protein Domain
Name: Protein of unknown function DUF4300
Type: Family
Description: This family of lipoproteins is functionally uncharacterised. They are typically between 281 and 303 amino acids in length and contain two conserved sequence motifs: NCR and PYQ.
Protein Domain
Name: Type III secretion protein HpaP
Type: Family
Description: This entry represents proteins encoded by genes which are always found in type III secretion operons, although their function in the processes of secretion and virulence is unclear [ ]. Hpa stands for Hrp-associated gene, where Hrp stands for hypersensitivity response and virulence.
Protein Domain
Name: Type III secretion protein HrpB4
Type: Family
Description: This entry represents proteins encoded by genes which are found in type III secretion operons in a narrow range of species including Xanthomonas, Burkholderia and Ralstonia.
Protein Domain
Name: Type III secretion protein HrpB7
Type: Family
Description: This entry represents proteins encoded by genes which are found in type III secretion operons in a narrow range of species including Xanthomonas, Burkholderia and Ralstonia.
Protein Domain
Name: Type III secretion protein HrpB2
Type: Family
Description: This entry represents proteins encoded by genes which are found in type III secretion operons in a narrow group of species including Xanthomonas, Burkholderia and Ralstonia.
Protein Domain
Name: Protein of unknown function DUF4362
Type: Family
Description: This family is functionally uncharacterised. There is a conserved IRIV sequence motif.
Protein Domain
Name: Protein of unknown function DUF4363
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4364
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4295
Type: Family
Description: This family of functionally uncharacterised proteins is found in bacteria. Proteins in this family are approximately 50 amino acids in length. There are two completely conserved residues (K and Y) that may be functionally important.
Protein Domain
Name: Protein of unknown function DUF4265
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4269
Type: Family
Description: This family of proteins is functionally uncharacterised. There is a conserved KTE sequence motif.
Protein Domain
Name: Protein of unknown function DUF4270
Type: Family
Description: This family of putative lipoproteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4267
Type: Family
Description: This family of putative integral membrane proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4271
Type: Family
Description: This family of putative integral membrane proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4272
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4254
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4258
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4259
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4256
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4257)
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4262
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4260
Type: Family
Description: This family of integral membrane proteins is functionally uncharacterised. There is a conserved GLK sequence motif.
Protein Domain
Name: CRISPR-associated protein DxTHG, conserved site
Type: Conserved_site
Description: The CRISPR-Cas system is a prokaryotic defense mechanism against foreign genetic elements. The key elements of this defense system are the Cas proteins and the CRISPR RNA. Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes [ ]. CRISPRs appear to provide acquired resistance against mobile genetic elements (viruses, transposable elements and conjugative plasmids). CRISPR clusters contain sequences complementary to antecedent mobile elements and target invading nucleic acids. CRISPR clusters are transcribed and processed into CRISPR RNA (crRNA).The defense reaction is divided into three stages. In the adaptation stage, the invader DNA is cleaved, and a piece of it is selected to be integrated as a new spacer into the CRISPR locus, where it is stored as an identity tag for future attacks by this invader. During the second stage (the expression stage), the CRISPR RNA (pre-crRNA) is transcribed and subsequently processed into the mature crRNAs. In the third stage (the interference stage), Cas proteins, together with crRNAs, identify and degrade the invader [ , , ].The CRISPR-Cas systems have been sorted into three major classes. In CRISPR-Cas types I and III, the mature crRNA is generally generated by a member of the Cas6 protein family. Whereas in system III the Cas6 protein acts alone, in some class I systems it is part of a complex of Cas proteins known as Cascade (CRISPR-associated complex for antiviral defense). The Cas6 protein is an endoribonuclease necessary for crRNA production whereas the additional Cas proteins that form the Cascade complex are needed for crRNA stability [ ]. This entry represents a conserved site found in two otherwise substantially different families of Cas proteins [ ]. This site includes the motif [VIL]-D-x-[ST]-H-[GS].
Protein Domain
Name: Polyamine-modulated factor 1-binding protein 1
Type: Family
Description: PMF-1 binding protein, also known as Stap (for sperm-tail-associated protein), is located mainly at the tail of elongated spermatids in the testis of adult fertile mice. It may affect fertility [].
Protein Domain
Name: Bud22/Serum response factor-binding protein 1
Type: Family
Description: Budding yeast Bud22 is required for rRNA maturation and ribosomal subunit biogenesis [ , ]. Serum response factor-binding protein 1 (SRFB1) is Bud22 homologue in animals. SRFB1 may be involved in regulating transcriptional activation of cardiac genes during the aging process and may play a role in biosynthesis and/or processing of SLC2A4 in adipose cells [, ].
Protein Domain
Name: DALR anticodon-binding domain-containing protein 3
Type: Family
Description: The function of DALRD3 is not known; it has been associated to estrogen receptor alpha-positive tumours [ ].
Protein Domain
Name: Pentatricopeptide repeat domain-containing protein 3
Type: Family
Description: Pentatricopeptide repeat (PPR) domain proteins are involved in mitochondrial gene expression [ ]. PPR domain protein 3 (PTCD3) is a RNA-binding protein that associates with the mitochondrial small ribosomal subunit and regulates translation of mitochondrial encoded proteins [, ].
Protein Domain
Name: Outer dense fibre protein 1
Type: Family
Description: ODF1 is a component of the outer dense fibres (ODF), which are cytoskeletal structures specifically found in the sperm tails of vertebrates. ODF1 has been assigned to the heat shock protein family based on its overall structural features and especially on its conserved alpha-crystallin domain, and it is also known as HSPB10 [ ]. ODF1 is essential for tight linkage of sperm head to tail and is essential for male fertility [].
Protein Domain
Name: Protein of unknown function DUF4127
Type: Family
Description: This protein family is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4128
Type: Family
Description: This family of proteins is functionally uncharacterised. The structure of family member reveals a tail terminator protein gpU fold, which suggests that the protein could have a bacteriophage origin.
Protein Domain
Name: Protein of unknown function DUF4302
Type: Family
Description: This family of proteins is functionally uncharacterised. They contain two completely conserved residues (R and L) that may be functionally important.
Protein Domain
Name: Protein of unknown function DUF4287
Type: Family
Description: The function of these proteins is not known.
Protein Domain
Name: Upstream-binding protein 1, SAM domain
Type: Domain
Description: Ubp1 (also known as LBP-1) is involved in transcriptional regulation from early development to terminal differentiation. It regulates alpha-globin in erythroid cells and P450scc (the cholesterol side-chain cleavage enzyme, cytochrome) in human placenta [ ]. Human Ubp1 is known to be induced by HIV type I infection in lymphocytes; it represses HIV transcription by preventing the binding of TFIID to the virus promoter []. Ubp1 might also be part of a network controlling blood pressure []. Ubp1 contains an N-terminal DNA-binding CP2 domain and a central SAM domain. Ubp1 protein isoforms are able to form dimers via their SAM domain [ ].
Protein Domain
Name: Tumour protein p63, SAM domain
Type: Domain
Description: The transcription factor p63 shares protein sequence similarity with the tumour suppressor p53 [ ]. It plays an important part in maintaining the progenitor basal cell populations in stratified epithelium []. It is also involved in heart development and human reproduction []. Two p63 isoforms, the "trans-activating"isoforms and "truncated"isoforms, have been identified in humans. These isoforms may have different roles in skin, heart and germ line [ ].This entry represents the SAM domain of tumour protein p63. SAM domain is a putative protein-protein interaction domain. Mutations in the SAM domain of p63 are found in AEC (ankyloblepharonectodermal dysplasia-clefting) syndrome patients [ ].
Protein Domain
Name: Tumour protein p73, SAM domain
Type: Domain
Description: Tumour protein p73 shares protein sequence similarity with the tumour suppressor p53 [ ]. It is involved in the apoptotic response to DNA damage []. Human p73 has two isoforms, TA-isoforms that act as tumour-suppressors and DN-isoforms that behave as proto-oncogenes []. p73 also plays a role in neuronal development and neurodegeneration [].This entry represents the SAM domain of tumour protein p73. The SAM domain of p73 has been shown to bind membrane lipids, and structural rearrangements in SAM are necessary to accomplish the binding [ ].
Protein Domain
Name: Actin cytoskeleton-regulatory complex protein End3
Type: Family
Description: Endocytosis is accomplished through the sequential recruitment at endocytic sites of proteins that drive cargo sorting, membrane invagination and vesicle release [ ]. End3 forms a complex with Pan1 and Sla1 []. This complex plays important roles in organisation of the actin cytoskeleton and endocytosis [, ]. This complex is also required for normal cell wall morphogenesis in yeast []. The subunits of the Pan1 complex are homologous to mammalian intersectin.
Protein Domain
Name: EcoEI R protein C-terminal domain
Type: Domain
Description: There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [ , ], as summarised below:Type I enzymes ( ) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase ( ) activities. Type II enzymes ( ) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes ( ) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase ( ). Type IV enzymes target methylated DNA.This entry represents the C-terminal domain found in both the R subunit of type I enzymes and the Res subunit of type III enzymes. The type I enzymes include EcoEI, which recognises 5'-GAGN(7)ATGC-3; the R protein (HsdR) is required for both nuclease and ATPase activity [ , ].
Protein Domain
Name: Proline and serine-rich protein 3
Type: Family
Description: The function of proline and serine-rich protein 3 (PROSER3) is not known.
Protein Domain
Name: Protein of unknown function DUF4358
Type: Family
Description: The function of these proteins is not known.
Protein Domain
Name: Protein of unknown function DUF4360
Type: Family
Description: This family of proteins is functionally uncharacterised. There is a conserved GCP sequence motif near the N terminus.
Protein Domain
Name: Nuclear pore complex protein Nup133-like
Type: Family
Description: This entry includes the nuclear pore complex proteins Nup133, Nup132, Nup131 and Nup15 (from Caenorhabditis elegans). Nup133 is a component of the Nup160 subcomplex, which consists of Nup160, Nup133, Nup107 and Nup96, and is important for mRNA export from the nucleus [ ]. Nup160 subcomplexes are localized to both sides of the NPC during interphase. Nup133 also localizes to kinetochores []. Nup131 and Nup132 are components of the npc107-120 complex in Schizosaccharomyces pombe. This complex consists of nup85, nup107, nup120, nup131, nup132 and seh1 and is also required for mRNA export [].
Protein Domain
Name: Protein of unknown function DUF4288
Type: Family
Description: The function of these proteins is not known.
USDA
InterMine logo
The Legume Information System (LIS) is a research project of the USDA-ARS:Corn Insects and Crop Genetics Research in Ames, IA.
LegumeMine || ArachisMine | CicerMine | GlycineMine | LensMine | LupinusMine | PhaseolusMine | VignaMine | MedicagoMine
InterMine © 2002 - 2022 Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, United Kingdom