Search our database by keyword

- or -

Examples

  • Search this entire website. Enter identifiers, names or keywords for genes, pathways, authors, ontology terms, etc. (e.g. eve, embryo, zen, allele)
  • Use OR to search for either of two terms (e.g. fly OR drosophila) or quotation marks to search for phrases (e.g. "dna binding").
  • Boolean search syntax is supported: e.g. dros* for partial matches or fly AND NOT embryo to exclude a term

Search results 7301 to 7400 out of 30763 for seed protein

Category restricted to ProteinDomain (x)

0.039s

Categories

Category: ProteinDomain
Type Details Score
Protein Domain
Name: Hydroxymethyl pyrimidine (HMP)/thiamine binding protein
Type: Family
Description: This group represents putative hydroxymethyl pyrimidine (HMP)/thiamine binding protein YkoF. It is part of the ABC transporter complex YkoCDEF that could transport HMP and/or thiamine. YkoF binds thiamine via its HMP moiety [ ].
Protein Domain
Name: Protein of unknown function DUF4641
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 201 and 519 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4522
Type: Family
Description: This family of mammalian proteins is functionally uncharacterised. In humans, this protein is known as C4orf36.
Protein Domain
Name: Protein of unknown function DUF4625
Type: Family
Description: Members of this family contain a likely bacterial Ig-like fold, suggesting it may be a family of lipoproteins.
Protein Domain
Name: Protein of unknown function DUF4465
Type: Family
Description: This entry represents a large family of uncharacterised proteins mostly from human gut bacteroides, but also some environmental and water bacteria (Planctomycetes), as well as metagenomic samples. Most proteins from this family are secreted or located on the outer surface, and may participate in cell-cell interactions or cell-nutrient interactions. This function is supported by a solved structure of a Bacteroides ovatus homologue, which adopts a galactose binding (jelly-roll) beta barrel structure.
Protein Domain
Name: Uncharacterised conserved protein UCP006363, ACT-type
Type: Family
Description: This group represents a predicted ligand-binding protein with an N-terminal ACT domain, AF1403 type. The ACT domain generally has a regulatory role that is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration.
Protein Domain
Name: Protein of unknown function DUF4653
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 93 and 229 amino acids in length.
Protein Domain
Name: Ribosomal protein S14, type Z
Type: Family
Description: Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [ , ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [ , ].S14 is one of the proteins from the small ribosomal subunit. In Escherichia coli, S14 is known to be required for the assembly of 30S particlesand may also be responsible for determining the conformation of 16S rRNA at the A site. It belongs to a family of ribosomal proteins [] thatinclude bacterial, algal and plant chloroplast S14, yeast mitochondrial MRP2, cyanelle S14, archaebacteria Methanococcus vannielii S14, as well as yeast mitochondrial MRP2, yeast YS29A/B, and mammalian S29. In bacteria the gene coding for ribosomal protein S14 has been subject to horizontal gene transfer, subsequent differential gene loss and insertions/deletions near the N terminus. Different groups have been described according to this [ ]. This family represents the type Z, or zinc-binding S14 proteins.
Protein Domain
Name: Protein of unknown function DUF4562
Type: Family
Description: This family of proteins is found in eukaryotes and is functionally uncharacterised. Members of the family contain a conserved HRYQNPW sequence motif. This family includes the human protein C4orf45 ( ).
Protein Domain
Name: Protein of unknown function DUF4642
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 115 and 196 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4635
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 120 and 154 amino acids in length. There are two conserved sequence motifs: LEQ and DLE.
Protein Domain
Name: Protein of unknown function DUF3945
Type: Domain
Description: This entry represents a C-terminal repeated domain found in a family of uncharacterised proteins identified by clustering human gut metagenomic sequences [ ].
Protein Domain
Name: Protein of unknown function DUF4552
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in vertebrates. Proteins in this family are typically between 425 and 649 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF5575
Type: Family
Description: This is a family of unknown function predominantly found in chordates.
Protein Domain
Name: Protein of unknown function DUF4170
Type: Family
Description: The function of these proteins is not known.
Protein Domain
Name: Protein of unknown function DUF4169
Type: Family
Description: The function of these proteins is not known.
Protein Domain
Name: Protein of unknown function DUF4501
Type: Family
Description: This family of proteins is found in eukaryotes. The exact function of the family members remains unknown, but they are thought to be a single-pass membrane proteins. This family contains many highly conserved cysteine residues.
Protein Domain
Name: Protein of unknown function DUF4464
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family have a conserved YID sequence motif.
Protein Domain
Name: Protein of unknown function DUF3956
Type: Family
Description: This is a family of uncharacterised bacterial proteins.
Protein Domain
Name: Protein of unknown function DUF4555
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in metazoa.This family includes the human protein C7orf31.
Protein Domain
Name: Non-structural protein NSP7 superfamily, coronavirus
Type: Homologous_superfamily
Description: Non-structural protein NSP7 has been implicated in viral RNA replication and is predominantly α-helical in structure. Its central core is an N-terminal helical bundle (HB), with helices HB1, HB2 and HB3, forming a triple-stranded antiparallel coiled coil with a right-handed superhelical pitch. It is part of the RNA-dependent RNA polymerase (RdRp) heterotetramer which consists of one NSP7, two NSP8 molecules and the catalytic NSP12, defined as the minimal core component for mediating coronavirus RNA synthesis [ , , , , , ]. NSP7 and NSP8 forms a complex that adopts a hollow cylinder-like structure []. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase []. NSP7 and NSP8 play a role in the stabilisation of NSP12 regions involved in RNA binding, and are essential for a highly active NSP12 polymerase complex [, , , ].
Protein Domain
Name: Armadillo-like helical domain-containing protein 3-like
Type: Family
Description: This entry includes ARMD3 and related proteins. Their functions are not known.
Protein Domain
Name: Protein of unknown function DUF4543
Type: Family
Description: This family of proteins is found in eukaryotes. The human member of this family is C17orf67.
Protein Domain
Name: Protein of unknown function DUF4603
Type: Family
Description: This protein family is found in eukaryotes and has unknown function.
Protein Domain
Name: Protein of unknown function DUF4551
Type: Family
Description: This metazoan family of proteins is functionally uncharacterised. This family includes human protein C12orf56.
Protein Domain
Name: Small integral membrane protein 15
Type: Family
Description: This family of eukaryotic proteins is functionally uncharacterised. There is a conserved LSWKL sequence motif. This family includes human protein C5orf43. Family members are annotated as small integral membrane protein 15.
Protein Domain
Name: Protein of unknown function DUF4547
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 144 and 206 amino acids in length. The human member of this family is C3orf43, also annotated as single-pass membrane and coiled-coil domain-containing protein 1.
Protein Domain
Name: Protein of unknown function DUF4649
Type: Family
Description: This family of Firmicute proteins has members that are annotated as ribose-phosphate pyrophosphokinase; however there is no evidence for this attribution. Member proteins are all shorter than 100 residues in length.
Protein Domain
Name: Intraflagellar transport protein 122/121 homolog
Type: Family
Description: Intraflagellar transport protein 122 homologue (Ift122) and Intraflagellar transport protein 121 homologue (Ift121, also known as WD repeat-containing protein 35) are a multiple WD40-repeat containing proteins, required for ciliogenesis [ , ]. They are components of the IFT complex A (IFT-A), a complex required for retrograde ciliary transport and entry into cilia of G protein-coupled receptors (GPCRs) [, , ]. Primary cilia are assembled and maintained by evolutionarily conserved intraflagellar transport (IFT) proteins that are involved in the coordinated movement of macromolecular cargo from the basal body to the cilium tip and back. Mice with mutant Ift122 lack cilia and have multiple developmental defects [] and the Sonic Hedgehog (Shh) signaling pathway is disrupted [].
Protein Domain
Name: Protein of unknown function DUF4623
Type: Family
Description: This family of proteins is found in bacteria. Proteins in this family are approximately 470 amino acids in length. There are two conserved sequence motifs: HLL and RYL.
Protein Domain
Name: Sensor protein KdpD, transmembrane domain
Type: Domain
Description: This domain is found in sensor protein KdpD. It is likely to be a transmembrane domain involved in ligand sensing.KdpD is part of the two-component regulatory system KdpD/KdpE involved in the regulation of the kdp operon, a three-component K+ transport system. KdpD may function as a membrane-associated protein kinase that phosphorylates KdpE in response to environmental signals [ ].
Protein Domain
Name: Protein of unknown function DUF4534
Type: Family
Description: This family of functionally uncharacterised proteins is found in mammals. It includes human transmembrane TMEM217 protein. Proteins in this family are typically between 170 and 190 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF5631
Type: Domain
Description: This is an alpha helical domain found at the C-terminal region of the hypothetical protein Rv3899c from Mycobacterium tuberculosis which is conserved across mycobacteria [ ].
Protein Domain
Name: Protein of unknown function DUF4597
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 63 and 76 amino acids in length. There is a conserved TPPTPT sequence motif.
Protein Domain
Name: Protein of unknown function DUF4152
Type: Domain
Description: This family of archaeal proteins is functionally uncharacterised. The structure of the family member from Pyrococcus furiosus has been solved. It shows an RNaseH like fold that conserves critical catalytic residues [ ]. This suggests that these proteins may cleave nucleic acid.
Protein Domain
Name: Protein of unknown function DUF4630
Type: Family
Description: This family of proteins is found in eukaryotes.
Protein Domain
Name: Steroid receptor-associated and regulated protein
Type: Family
Description: Steroid receptor-associated and regulated protein is also known as ER-related factor, ERRF, or C1orf64. It is thought to regulate the transcriptional function of androgen and estrogen receptors [ ]. Proteins in this family are typically between 145 and 169 amino acids in length, and have a conserved IDC sequence motif.
Protein Domain
Name: Integrin-linked protein kinase, pseudokinase domain
Type: Domain
Description: The pseudokinase domain shows similarity to protein kinases but lacks crucial residues for catalytic activity. Integrin linked kinase (ILK) contains N-terminal ankyrin repeats, a Pleckstrin Homology (PH) domain, and a C-terminal pseudokinase domain. It is a component of the IPP (ILK/PINCH/Parvin) complex that couples beta integrins to the actin cytoskeleton, and plays important roles in cell adhesion, spreading, invasion, and migration [ ]. ILK was initially thought to be an active kinase despite the lack of key conserved residues because of in vitro studies showing that it can phosphorylate certain protein substrates. However, in vivo experiments in Caenorhabditis elegans, Drosophila melanogaster, and mice (ILK-null and knock-in) proved that ILK is not an active kinase []. In addition to actin cytoskeleton regulation, ILK also influences the microtubule network and mitotic spindle orientation [, ]. The pseudokinase domain of ILK binds several adaptor proteins including the parvins and paxillin [, ].
Protein Domain
Name: Protein of unknown function DUF4628
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 152 and 673 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4573
Type: Family
Description: This entry represents a group of uncharacterised proteins, including glutamate-rich protein 5.
Protein Domain
Name: Protein of unknown function DUF4492
Type: Family
Description: This family of proteins is found in bacteria. Proteins in this family are approximately 80 amino acids in length. Their function is unknown.
Protein Domain
Name: Protein of unknown function DUF4043
Type: Family
Description: This family of proteins is functionally uncharacterised. There is a single completely conserved residue G that may be functionally important. Proteins in this entry include major structural protein ORF14 from Helicobacter pylori bacteriophage KHP30 [] and ORF017 from Pseudomonas phage KPP25 [].
Protein Domain
Name: Cag pathogenicity island protein Cag12
Type: Family
Description: This is a Proteobacterial family of Cag pathogenicity island proteins [ ].
Protein Domain
Name: 54S ribosomal protein L8, C-terminal
Type: Domain
Description: This is the C-terminal domain of mitochondrial 54S ribosomal protein L8 [ ].
Protein Domain
Name: Protein of unknown function DUF4199
Type: Family
Description: The function of these proteins is not known.
Protein Domain
Name: Protein of unknown function DUF4202
Type: Family
Description: The function of these proteins is unknown, although some are incorrectly annotated as glutamyl tRNA synthetases. There are two conserved sequence motifs: LED and KMS.
Protein Domain
Name: PilN biogenesis protein dimerization domain
Type: Domain
Description: This domain is found in PilN type IV pilus biogenesis protein present in Thermus thermophiles. PilN is an integral inner membrane protein needed for the formation of type IV pilus. This domain forms a dimer which is mediated by symmetric contacts between residues in alpha-1, beta-1, beta-3 and alpha-3 [ ].
Protein Domain
Name: Non-structural protein NSP8 superfamily, coronavirus
Type: Homologous_superfamily
Description: Viral non-structural protein NSP8 is part of the RNA-dependent RNA polymerase (RdRp) complex and forms a heterotetramer consisting of one molecule of NSP7, two copies of NSP8 and one of NSP12 [ ]. NSP8 and NSP7 adopts a hollow cylinder-like structure [, ] in which the dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RdRp [, ]. NSP7 and NSP8 are co-factors for the catalytic NSP12 that play a role in the stabilisation of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex []. It has been demonstrated that NSP8 from human coronavirus 229E acts as an oligo(U)-templated polyadenylyltransferase but also has robust (mono/oligo) adenylate transferase activities []. NSP8 has N-terminal and C-terminal D/ExD/E conserved motifs. The N-terminal motif is critical for RNA polymerase activity as these residues are part of the Mg2-binding active site []. NSP8 has a 'golf club'-like structure composed of a long α-helix N-terminal 'shaft' subdomain and an α/β C-terminal 'head' subdomain consisting of three α-helices and seven β-strands ( ). The seven β-strands form an open-barrel with two antiparallel β-sheets packed orthogonally. More than half the residues in the C-terminal domain are hydrophobic, and the whole domain forms a tight hydrophobic core [ , , ].Together with NSP9, NSP8 suppresses protein integration into the cell membrane, thus, disrupting host immune defenses [ ].The core structure of NSP8 has an α-β(2)-α-β(4)-α-β fold with bifurcated barrel-like β-sheet.
Protein Domain
Name: Protein of unknown function DUF4197
Type: Family
Description: The function of these proteins is not known.
Protein Domain
Name: Protein of unknown function DUF4193
Type: Family
Description: These functionally uncharacterised proteins contains four conserved cysteines and a conserved histidine, including a CXXXXH motif.
Protein Domain
Name: Lantibiotic immunity protein Spa1, C-terminal
Type: Domain
Description: This is the C-terminal domain found in SpaI from Bacillus subtilis. SpaI is an immunity lipoprotein that protects the Gram-positive bacteria against their own lantibiotics, in this case subtilin. SpaI together with the ABC transporter SpaFEG protects the membrane from subtilin insertion [ ].
Protein Domain
Name: Protein of unknown function DUF4645
Type: Family
Description: This family of proteins is found in eukaryotes. Proteins in this family are typically between 200 and 298 amino acids in length.
Protein Domain
Name: Protein of unknown function DUF4491
Type: Family
Description: This family of proteins is found in bacteria. There is a conserved EYY sequence motif.
Protein Domain
Name: Protein of unknown function DUF4176
Type: Family
Description: The function of these proteins is not known.
Protein Domain
Name: Protein of unknown function DUF5554
Type: Family
Description: This is a family of unknown function found in primates.
Protein Domain
Name: Protein of unknown function DUF4177
Type: Family
Description: The function of these proteins is not known.
Protein Domain
Name: Protein of unknown function DUF5553
Type: Family
Description: This is a family of unknown function found in primates.
Protein Domain
Name: Protein of unknown function DUF4620
Type: Family
Description: This entry represents a protein family of unknown function from primates.
Protein Domain
Name: Protein of unknown function DUF4184
Type: Family
Description: These proteins of unknown function contain several highly conserved histidines.
Protein Domain
Name: Protein of unknown function DUF4559
Type: Family
Description: This family of proteins is functionally uncharacterised. This family of proteins is found in eukaryotes and includes human protein CXorf38.
Protein Domain
Name: EF-hand calcium-binding domain-containing protein 10
Type: Family
Description: The function of EF-hand calcium-binding domain-containing protein 10 (EFC10) is not known.
Protein Domain
Name: DNA terminal protein Gp3 superfamily
Type: Homologous_superfamily
Description: The DNA terminal protein Gp3, found in a number of Bacillus phages, is linked to the 5' ends of both strands of the genome through a phosphodiester bond between the β-hydroxyl group of a serine residue and the 5'-phosphate of the terminal deoxyadenylate. This protein is essential for DNA replication and is involved in the priming of DNA elongation [ ].Gp3 has a multihelical structure arranged as an open array. It contains a C-terminal four-helical bundle with a left-handed superhelix.
Protein Domain
Name: Protein of unknown function DUF4153
Type: Family
Description: This family consists of putative membrane proteins.
Protein Domain
Name: Type III secretion system protein
Type: Family
Description: This is a family of bacterial putative type III secretion apparatus proteins associated with the locus of enterocyte effacement (LEE).
Protein Domain
Name: Protein of unknown function DUF4156
Type: Family
Description: The function of this family is unknown but some members are annotated as putative lipoprotein outer membrane proteins.
Protein Domain
Name: Protein of unknown function DUF4144
Type: Family
Description: The function of these proteins is not known.
Protein Domain
Name: Myelin and lymphocyte (MAL) protein
Type: Family
Description: Myelin is a product of myelinating cells: Schawnn cells in the peripheral nervous system (PNS) and oligodendrocytes in the central nervous system (CNS). The processes of these myelinating cells wrap around axon segments to form a sheath. This sheath possesses insulating properties, allowing rapid propagation of action potentials (the electrical signal propagated by nerve cells) between the myelinating cell intervening nodes. Without myelin insulation, axons would need a 10-30-fold increase in diameter to achieve comparable conduction velocities. At the onset of myelination, during development, large amounts of myelin-specific lipids and proteins are synthesised and transported to the developing myelin sheath. The major protein component of PNS and CNS myelin differs between the 2 nervous systems. However, some of the minor protein components, including the 4 transmembrane (TM)-domain-containing myelin and lymphocyte protein (MAL), are found in both systems. Outside the nervous system, MAL is also found in T-cells and some epithelial cells (e.g., kidney, stomach and thyroid) [ ]. Glycosphingolipids are enriched in both epithelial cells and myelin, and are believed to decrease the permeability of lipid membranes to small molecules and increase the ability for membrane curvature. MAL co-purifies with glycosphingolipids in detergent-insoluble domains, suggesting a possible interaction. In Madin Darby Canine Kidney (MDCK) cells, MAL is found mainly in transport vesicles, and recent studies suggest that MAL is required for efficient vesicular transport of proteins across apical cell membranes []. In myelinating cells, MAL appears to play a similar role, interacting with glycosphingolipids to decrease membrane permeability; however, here this property most likely manifests itself as an improved insulating ability of the myelin []. In lymphocytes, however, MAL appears to act as a TM-linker protein in T-cell signal transduction, linking the cell surface glycosyl phosphatidylinositol anchored protein CD59 to the intracellular tyrosine kinase Lck [].
Protein Domain
Name: Protein of unknown function DUF4044
Type: Family
Description: This family of proteins is functionally uncharacterised. There is a single completely conserved residue M that may be functionally important.
Protein Domain
Name: Protein of unknown function DUF4075
Type: Family
Description: This family of proteins is functionally uncharacterised. Some members are annotated as putative mature parasite-infected erythrocyte surface antigen protein from Bacillus spp.
Protein Domain
Name: WSSV envelope protein Vp28 superfamily
Type: Homologous_superfamily
Description: This superfamily of proteins is found in viruses. Proteins in this superfamily are approximately 210 amino acids in length. There is a conserved NNT sequence motif. These proteins are structural envelope proteins in viruses. This is the beta barrel C-terminal domain. There is a protruding N-terminal domain which completes the proteins. Three of four envelope proteins in Shrimp white spot syndrome virus (WSSV) share sequence homology with each other and are present in this superfamily - VP24, VP26 and VP28. VP19 is the other major envelope protein but shares no sequence homology with the other proteins. These proteins are essential for entry into cells of the crustacean host. Structurally, the WSSV envelope protein consists of a β-sandwich composed of nine strands in two sheets with a greek-key topology.
Protein Domain
Name: ABC transporter substrate-binding protein PnrA-like
Type: Domain
Description: Proteins containing this domain were originally annotated as basic membrane lipoproteins [ ]. However, several proteins containing this domain were later predicted as ABC transporter substrate-binding proteins, such as PnrA (also known as TmpC or TP0319) and RfuA (also known as Tpn38 or TP0298) from Treponema pallidum. PnrA transports purine nucleosides [], while RfuA transports riboflavin []. Proteins containing this domain also include Med from Bacillus subtilis. Med was annotated as a transcriptional activator protein that regulates comK []. This domain can also found at the N terminus of glutamate receptor-like proteins from Dictyostelium (slime mold) [].
Protein Domain
Name: Segregation and condensation protein A
Type: Family
Description: This family represents ScpA, which along with ScpB ( ) interacts with SMC in vivo forming a complex that is required for chromosome condensation and segregation [ , ]. The SMC-Scp complex appears to be similar to the MukB-MukE-Muk-F complex in Escherichia coli [], where MukB () is the homologue of SMC. ScpA and ScpB have little sequence similarity to MukE ( ) or MukF ( ), they are predicted to be structurally similar, being predominantly α-helical with coiled coil regions. In general scpA and scpB form an operon in most bacterial genomes. Flanking genes are highly variable suggesting that the operon has moved throughout evolution. Bacteria containing an smc gene also contain scpA or scpB but not necessarily both. An exception is found in Deinococcus radiodurans, which contains scpB but neither smc nor scpA. In the archaea the gene order SMC-ScpA is conserved in nearly all species, as is the very short distance between the two genes, indicating co-transcription of the both in different archaeal genera and arguing that interaction of the gene products is not confined to the homologues in Bacillus subtilis. It would seem probable that, in light of all the studies, SMC, ScpA and ScpB proteins or homologues act together in chromosome condensation and segregation in all prokaryotes [ ].
Protein Domain
Name: Protein of unknown function DUF166
Type: Family
Description: This family had been implicated to catalyse the synthesis of thymidine monophosphate (dTMP) from deoxyuridine monophosphate (dUMP). The physiological co-substrate has not yet been identified [ ]. Previous designation of this famliy as being thymidylate synthase from one paper, has been suggested to be erroneous []. The proteins are uncharacterised.
Protein Domain
Name: Protein of unknown function DUF169
Type: Family
Description: This entry describes proteins of unknown function.
Protein Domain
Name: Protein of unknown function UPF0178
Type: Family
Description: This entry describes proteins of unknown function.
Protein Domain
Name: Protein of unknown function DUF5583
Type: Family
Description: This is a family of unknown function found in chordata.
Protein Domain
Name: Protein of unknown function DUF4012
Type: Family
Description: This is a family of uncharacterised proteins found in archaea and bacteria.
Protein Domain
Name: Protein of unknown function DUF4026
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4025
Type: Family
Description: This family of proteins is functionally uncharacterised. There is a conserved EGT sequence motif.
Protein Domain
Name: Protein of unknown function UCP004961
Type: Family
Description: This family of proteins are related to a large superfamily of metalloenzymes [ ]. It shares protein sequence similarity with TatD, which is an DNase that is also part of the twin-arginine translocation (Tat) system that transports large folded proteins containing a characteristic twin-arginine motif in their signal peptide across membranes in Escherichia coli [, ].
Protein Domain
Name: Protein of unknown function DUF4028
Type: Family
Description: This family of proteins is functionally uncharacterised. There are two conserved sequence motifs: IVKI and YVKKWF.
Protein Domain
Name: Protein of unknown function DUF4027
Type: Family
Description: This family of proteins is functionally uncharacterised. There is a conserved CLGGF sequence motif.
Protein Domain
Name: Protein of unknown function DUF4029
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4030
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF5520
Type: Family
Description: This is a family of unknown function proteins found in mammals.
Protein Domain
Name: Cysteine-rich hydrophobic domain-containing protein 1/2
Type: Family
Description: The function of cysteine-rich hydrophobic domain-containing protein 1/2 (CHIC1/2) is not clear. CHIC1, also known as brain X-linked gene (BRX), is preferentially expressed in the brain and is a potential candidate for one of the X-linked mental retardation syndromes mapped within the pericentromeric region of the human X chromosome [ ]. A chromosomal aberration involving CHIC2 is found in a form of acute myeloid leukemia; the mutation prevents palmitoylation and the protein does not associate with the membrane [].
Protein Domain
Name: Nonsense-mediated mRNA decay protein 3
Type: Family
Description: This entry includes nonsense-mediated mRNA decay protein 3 (Upf3) and its homologues. In budding yeasts, Upf3 is involved in the nonsense-mediated mRNA decay (NMD) pathway [ ]. Human Upf3a and Upf3b are components of the exon-junction complex (EJC), which promotes nonsense-mediated mRNA decay (NMD) and regulate translation efficiency [].
Protein Domain
Name: Meiotic expression up-regulated protein 6
Type: Family
Description: The function of meiotic expression up-regulated protein 6 (Meu6) is not known. meu6 contains a PH (pleckstrin homology) domain. Homologues are known from fungi.
Protein Domain
Name: Protein of unknown function DUF5538
Type: Family
Description: This family of proteins with unknown function can be found in mammals.
Protein Domain
Name: Protein of unknown function DUF4087
Type: Family
Description: This family of proteins is functionally uncharacterised. There is a conserved RCGW sequence motif.
Protein Domain
Name: Protein of unknown function DUF4088
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4083
Type: Family
Description: This protein family is functionally uncharacterised. Some members are putatively annotated as being MutT/Nudix. However, the characteristic Nudix motif of GX(5)EX(7)REUXEE is absent.
Protein Domain
Name: Protein of unknown function DUF4085
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4090
Type: Family
Description: This family of proteins is found in cyanobacteria and is functionally uncharacterised. Proteins in this family are approximately 90 amino acids in length. A member of this family, the ssr1528 gene was found to be regulated by an sRNA NsiR4 [ ]. In addition, the expression of that gene was repressed under conditions of nitrogen depletion, suggesting it may have a role related to cellular nitrogen status [].
Protein Domain
Name: Protein of unknown function DUF5580
Type: Family
Description: This family of proteins with unknown function can be found in metazoa.
Protein Domain
Name: Intermembrane lipid transfer protein VPS13B
Type: Family
Description: Intermembrane lipid transfer protein VPS13B (also known as Vacuolar protein sorting-associated protein 13B) may be involved in protein sorting in post Golgi membrane traffic. Mutations of the VPS13B gene cause Cohen syndrome (COH1), a rare autosomal recessive disorder characterised by obesity, hypotonia, intellectual deficit, characteristic craniofacial dysmorphism and abnormalities of the hands and feet [ , ].
Protein Domain
Name: Protein of unknown function DUF4056
Type: Family
Description: This family of proteins is functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF4059
Type: Family
Description: This family of proteins is functionally uncharacterised. There is a conserved DKT sequence motif.
Protein Domain
Name: Protein of unknown function DUF4060
Type: Family
Description: This family of proteins is functionally uncharacterised. There are two conserved sequence motifs: VEVV and SYVAT.
Protein Domain
Name: Protein of unknown function DUF4058
Type: Family
Description: This family of proteins is functionally uncharacterised.
USDA
InterMine logo
The Legume Information System (LIS) is a research project of the USDA-ARS:Corn Insects and Crop Genetics Research in Ames, IA.
LegumeMine || ArachisMine | CicerMine | GlycineMine | LensMine | LupinusMine | PhaseolusMine | VignaMine | MedicagoMine
InterMine © 2002 - 2022 Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, United Kingdom