Search our database by keyword

- or -

Examples

  • Search this entire website. Enter identifiers, names or keywords for genes, pathways, authors, ontology terms, etc. (e.g. eve, embryo, zen, allele)
  • Use OR to search for either of two terms (e.g. fly OR drosophila) or quotation marks to search for phrases (e.g. "dna binding").
  • Boolean search syntax is supported: e.g. dros* for partial matches or fly AND NOT embryo to exclude a term

Search results 9101 to 9200 out of 30763 for seed protein

Category restricted to ProteinDomain (x)

0.545s

Categories

Category: ProteinDomain
Type Details Score
Protein Domain
Name: Folic acid synthesis protein FOL1
Type: Family
Description: FOL1 catalyses three sequential steps of the tetrahydrofolate biosynthetic pathway and is therefore the central enzyme of this pathway [ ].
Protein Domain
Name: Phage tail assembly chaperone protein
Type: Family
Description: This entry represents a family of general phage tail assembly chaperone proteins from double-stranded DNA viruses with no RNA stage, many of which are unclassified.
Protein Domain
Name: Systemic acquired resistance protein SAR
Type: Family
Description: Members of this family are found in Solanaceae spp. plants, a taxonomic group (family) that includes pepper and tobacco plant species. Synthesis of these proteins is induced by Tobacco mosaic virus and salicylic acid []; indeed theyare thought to be involved in the development of systemic acquired resistance (SAR) after an initial hypersensitive response to microbial infection [, ]. SAR is characterised by long-lasting resistance to infection by a wide range ofpathogens, extending to plant tissues distant from the initial infection site [ ].
Protein Domain
Name: Protein of unknown function DUF5854
Type: Family
Description: This is a family of uncharacterised proteins of unknown function predominantly found in Mimiviridae.
Protein Domain
Name: Protein of unknown function DUF5565
Type: Family
Description: This is a family of unknown function found in bacteria and eukaryotes.
Protein Domain
Name: Protein of unknown function DUF5856
Type: Family
Description: This is a family of uncharacterised proteins of unknown function predominantly found in viruses. This family was also found in Bacteria.
Protein Domain
Name: Protein of unknown function DUF5834
Type: Family
Description: This entry represents an uncharacterised protein associated with the thiocillin biosynthetic gene cluster.
Protein Domain
Name: Protein of unknown function DUF5831
Type: Family
Description: Proteins in this family are found mostly in Bamfordvirae.
Protein Domain
Name: Protein of unknown function DUF5832
Type: Family
Description: This entry contains proteins of unknown function mostly found in virus.
Protein Domain
Name: Protein of unknown function DUF5830
Type: Family
Description: This is a family of proteins predominantly found in Halobacteria. Some members includes the MarR family transcriptional regulator.
Protein Domain
Name: Protein of unknown function DUF4762
Type: Family
Description: This family of proteins is found in bacteria. Proteins in this family are approximately 70 amino acids in length. There is a conserved TTC sequence motif.
Protein Domain
Name: Protein of unknown function DUF4761
Type: Family
Description: This family of proteins is found in bacteria. Proteins in this family are approximately 110 amino acids in length.
Protein Domain
Name: Copper resistance protein ScsC, N-terminal
Type: Domain
Description: This is the N-terminal domain found in copper resistance protein ScsS present in Proteus mirabilis. ScsC is a powerful disulfide isomerase that is able to refold and reactivate the scrambled disulfide form of the model substrate RNase A. The protein has a thioredoxin 4 domain but, unlike other characterised proteins in this family, it is trimeric. The N-terminal domain is responsible for trimerization of ScsC which is needed for isomerase activity [ ].
Protein Domain
Name: Protein of unknown function DUF5828
Type: Family
Description: This is a family of proteins of unknown function predominantly found in Halobacteria.
Protein Domain
Name: Protein of unknown function DUF5829
Type: Family
Description: This is a group of uncharacterised bacterial proteins.
Protein Domain
Name: Protein of unknown function DUF5827
Type: Family
Description: This is a family of proteins of unknown function predominantly found in Halobacteria.
Protein Domain
Name: Influenza C virus M2 protein
Type: Family
Description: This family represents the matrix protein, M2, of Influenza C virus. The M1 protein is the product of a spliced mRNA (see ). Small quantities of the unspliced mRNA are found in the cell additionally encoding the M2 protein.
Protein Domain
Name: Protein of unknown function DUF5826
Type: Family
Description: Proteins in this family are mostly found in prasinoviruses and are likely to be membrane proteins with two transmembrane regions.
Protein Domain
Name: Protein of unknown function DUF5825
Type: Family
Description: This is a group of uncharacterised bacterial proteins.
Protein Domain
Name: Protein of unknown function DUF5822
Type: Family
Description: This is a family of proteins of unknown function predominantly found in Halobacteria. This family includes some members which are thought to be peptidoglycan binding proteins.
Protein Domain
Name: Protein of unknown function DUF5823
Type: Family
Description: This is a group of uncharacterised bacterial proteins.
Protein Domain
Name: Protein of unknown function DUF5068
Type: Family
Description: This family is expressed by Firmicutes. The function is not known.
Protein Domain
Name: Tyrosine protein kinase, EGF/ERB/XmrK receptor
Type: Family
Description: Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []:Serine/threonine-protein kinasesTyrosine-protein kinasesDual specificity protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins)Protein kinase function is evolutionarily conserved from Escherichia coli to human [ ]. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases [].Tyrosine-protein kinases can transfer a phosphate group from ATP to a tyrosine residue in a protein. These enzymes can be divided into two main groups [ ]:Receptor tyrosine kinases (RTK), which are transmembrane proteins involved in signal transduction; they play key roles in growth, differentiation, metabolism, adhesion, motility, death and oncogenesis [ ]. RTKs are composed of 3 domains: an extracellular domain (binds ligand), a transmembrane (TM) domain, and an intracellular catalytic domain (phosphorylates substrate). The TM domain plays an important role in the dimerisation process necessary for signal transduction []. Cytoplasmic / non-receptor tyrosine kinases, which act as regulatory proteins, playing key roles in cell differentiation, motility, proliferation, and survival. For example, the Src-family of protein-tyrosine kinases [ ].This group represents a group of known and predicted receptor-type tyrosine-protein kinases, including the EGF and ERB receptors, and the melanoma-inducing oncogene product XmrK.
Protein Domain
Name: Protein of unknown function DUF4763
Type: Family
Description: This family of proteins is functionally uncharacterised. Proteins in this family are typically between 237 and 332 amino acids in length. There are two completely conserved residues (C and R) that may be functionally important.
Protein Domain
Name: Influenza C virus M1 protein
Type: Family
Description: This family represents the matrix protein, M1, of Influenza C virus. The M1 protein is the product of a spliced mRNA. Small quantities of the unspliced mRNA are found in the cell additionally encoding the M2 protein (see ).
Protein Domain
Name: Protein of unknown function DUF5821
Type: Family
Description: Proteins in this family are predominantly found in Halobacteria.
Protein Domain
Name: Protein of unknown function DUF5819
Type: Family
Description: This is a group of uncharacterised bacterial proteins.
Protein Domain
Name: Protein of unknown function DUF5820
Type: Family
Description: Proteins in this family are predominantly found in Halobacteria.
Protein Domain
Name: Protein of unknown function DUF5818
Type: Family
Description: This is a group of uncharacterised bacterial proteins.
Protein Domain
Name: Putative RNA polymerase (calciviral)/capsid protein
Type: Family
Description: Caliciviruses are a small round-structured virus group defined by RNA-dependent RNA polymerase and capsid diversity.
Protein Domain
Name: Protein of unknown function DUF5815
Type: Family
Description: Proteins in this family are predominantly found in Halobacteria.
Protein Domain
Name: Protein of unknown function DUF5816
Type: Family
Description: This is a family of proteins predominantly found in Halobacteria. They are thought to be GNAT (Gcn5-related N-acetyltransferases) family acetyltransferases.
Protein Domain
Name: Protein of unknown function DUF5813
Type: Family
Description: Proteins in this family are predominantly found in Halobacteria.
Protein Domain
Name: Protein of unknown function DUF5814
Type: Family
Description: This is a family of proteins that are thought to have helicase activity, predominantly found in Halobacteria and Methanomicrobia.
Protein Domain
Name: Protein of unknown function DUF5812
Type: Family
Description: Proteins in this family are predominantly found in Halobacteria.
Protein Domain
Name: Protein of unknown function DUF4760
Type: Family
Description: This family of proteins is found in bacteria, archaea and viruses. Proteins in this family are typically between 147 and 190 amino acids in length. There is a single completely conserved residue R that may be functionally important.
Protein Domain
Name: TRAF3-interacting protein 1, C-terminal domain
Type: Domain
Description: TRAF3IP1, also known as MIP-T3, which interacts with both microtubules and TRAF3 (tumour necrosis factor receptor-associated factor 3), is conserved from worms to humans. The N-terminal region is the microtubule binding domain and is well-conserved; the C-terminal 100 residues, also well-conserved, constitute the coiled-coil region which binds to TRAF3. The central region of the protein is rich in lysine and glutamic acid and carries KKE motifs which may also be necessary for tubulin-binding, but this region is the least well-conserved [ ].
Protein Domain
Name: Protein of unknown function DUF4306
Type: Family
Description: This family includes the Bacillus subtilis YjdJ protein , which is functionally uncharacterised. This is not a homologue of Escherichia coli YjdJ, which belongs to . The proteins in this family are found in bacteria and they are functionally uncharacterised.
Protein Domain
Name: Protein of unknown function DUF721/UPF0232
Type: Family
Description: This family contains several proteins of unknown function. Proteins in this family include DciA from Pseudomonas aeruginosa. DciA is a replicative helicase operator essential for bacterial replication initiation [ ].
Protein Domain
Name: Protein of unknown function DUF1481
Type: Family
Description: This family consists of several hypothetical bacterial proteins of around 230 residues in length. Members of this family are often referred to as YjaH and are found in the Orders Vibrionales and Enterobacteriales. The function of this family is unknown.
Protein Domain
Name: Competence protein ComEA, helix-hairpin-helix domain
Type: Domain
Description: Competence is the ability of a cell to take up exogenous DNA from its environment, resulting in transformation. It is widespread among bacteria and is probably an important mechanism for the horizontal transfer of genes. DNA usually becomes available by the death and lysis of other cells. Competent bacteria use components of extracellular filaments called type 4 pili to create pores in their membranes and pull DNA through the pores into the cytoplasm. This process, including the development of competence and the expression of the uptake machinery, is regulated in response to cell-cell signalling and/or nutritional conditions [ ].The development of genetic competence in Bacillus subtilis is a highly regulated adaptive response to stationary-phase stress. For competence to develop, the transcriptional regulator, ComK, must be activated. ComK is required for the expression of genes encoding proteins that function in DNA uptake. In log-phase cultures, ComK is inactive in a complex with MecA and ClpC. The comS gene is induced in response to high culture cell density and nutritional stress and its product functions to release active ComK from the complex. ComK then stimulates the transcription initiation of its own gene as well as that of the late competence operons [ ].This domain is found in competence protein ComEA and closely related proteins from a number of species that exhibit competence for transformation by exongenous DNA, including Streptococcus pneumoniae, Bacillus subtilis, Neisseria meningitidis, and Haemophilus influenzae. This domain represents a region of two tandem copies of a helix-hairpin-helix domain, each about 30 residues in length. Limited sequence similarity can be found among some members of this family N-terminal to this domain.
Protein Domain
Name: Adaptive response protein AidB, N-terminal
Type: Domain
Description: This is the N-terminal domain of adaptive response protein AidB present in E. coli. AidB is upregulated in response to small doses of DNA-methylating agents initiates a response that mitigates the mutagenic and cytotoxic effects of DNA methylation. Tetramer formation is thought to be carried out by the N-terminal domain [].
Protein Domain
Name: Alkylated DNA repair protein AlkB
Type: Family
Description: AlkB proteins are dioxygenases that repair alkylation damage to DNA and RNA [ ]. The Escherichia coli alkB gene product protects against cell killing by S(N)2-alkylating agents through DNA repair by a novel direct reversal DNA repair mechanism: the oxidative demethylation of N1-methyladenine or N3-methylcytosine DNA lesions. This reaction occurs on both single- and double-stranded DNA, and requires AlkB-bound non-heme Fe(2+), O(2) and alpha-ketogluterate to oxidize the offending methyl group. This is followed by the release of succinate, CO(2) and formaldehyde, and the restoration of undamaged A or C in DNA [].Nine mammalian AlkB homologues exist (ALKBH1-8, FTO), but only a subset functions as DNA/RNA repair enzymes [ ], and only ALKBH1 is included in this entry. In humans, ALKBH1 (ABH1) is responsible for the conversion of 5-methylcytosine to 5-formylcytosine (f5C) in position 34 in mitochondrial tRNAMet [].
Protein Domain
Name: Glucose-6-phosphate dehydrogenase assembly protein OpcA
Type: Family
Description: OpcA protein may play a role in the functional assembly of glucose-6-phosphate dehydrogenase [ ]. The opcA gene is found immediately downstream of zwf, the glucose-6-phosphate dehydrogenase (G6PDH) gene, in a number of species, including Mycobacterium tuberculosis, Streptomyces coelicolor, Nostoc punctiforme (strain ATCC 29133/PCC 73102), and Synechococcus sp. (strain PCC 7942). In the latter, disruption of opcA was shown to block assembly of G6PDH into active oligomeric forms.
Protein Domain
Name: Protein of unknown function DUF1007
Type: Family
Description: This is a family of conserved bacterial proteins with unknown function.
Protein Domain
Name: CRISPR-associated protein Cas7, subtype I-B/Tneap
Type: Family
Description: The CRISPR-Cas system is a prokaryotic defense mechanism against foreign genetic elements. The key elements of this defense system are the Cas proteins and the CRISPR RNA. Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes [ ]. CRISPRs appear to provide acquired resistance against mobile genetic elements (viruses, transposable elements and conjugative plasmids). CRISPR clusters contain sequences complementary to antecedent mobile elements and target invading nucleic acids. CRISPR clusters are transcribed and processed into CRISPR RNA (crRNA).The defense reaction is divided into three stages. In the adaptation stage, the invader DNA is cleaved, and a piece of it is selected to be integrated as a new spacer into the CRISPR locus, where it is stored as an identity tag for future attacks by this invader. During the second stage (the expression stage), the CRISPR RNA (pre-crRNA) is transcribed and subsequently processed into the mature crRNAs. In the third stage (the interference stage), Cas proteins, together with crRNAs, identify and degrade the invader [ , , ].The CRISPR-Cas systems have been sorted into three major classes. In CRISPR-Cas types I and III, the mature crRNA is generally generated by a member of the Cas6 protein family. Whereas in system III the Cas6 protein acts alone, in some class I systems it is part of a complex of Cas proteins known as Cascade (CRISPR-associated complex for antiviral defense). The Cas6 protein is an endoribonuclease necessary for crRNA production whereas the additional Cas proteins that form the Cascade complex are needed for crRNA stability [ ]. This entry represents a family of DevR (Cas7)-type Cas proteins that includes DevR from Myxococcus xanthus. DevR is a key regulator of development. DevR mutants are incapable of fruiting body development [ ]. The expression of DevR appears to be regulated through a number of means, including both location and autorepression.
Protein Domain
Name: Protein of unknown function UCP012335
Type: Family
Description: There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. Members of this entry are mainly found in proteobacteria.
Protein Domain
Name: DNA repair protein Rad4, subgroup
Type: Family
Description: Mutations in the nucleotide excision repair (NER) pathway can cause the xeroderma pigmentosum skin cancer predisposition syndrome. NER lesions are limited to one DNA strand, but otherwise they are chemically and structurally diverse, being caused by a wide variety of genotoxic chemicals and ultraviolet radiation. The xeroderma pigmentosum C (XPC) protein has a central role in initiating global-genome NER by recognising the lesion and recruiting downstream factors. In NER in eukaryotes, DNA is incised on both sides of the lesion, resulting in the removal of a fragment ~25-30 nucleotides long. This is followed by repair synthesis and ligation. This reaction, in yeast, requires the damage binding factors Rad14, RPA, and the Rad4-Rad23 complex, the transcription factor TFIIH which contains the two DNA helicases Rad3 and Rad25, essential for creating a bubble structure, and the two endonucleases, the Rad1-Rad10 complex and Rad2, which incise the damaged DNA strand on the 5'- and 3'-side of the lesion, respectively [ ].The crystal structure of the yeast XPC orthologue Rad4 bound to DNA containing a cyclobutane pyrimidine dimer lesion has been determined. The structure shows that Rad4 inserts a β-hairpin through the DNA duplex, causing the two damaged base pairs to flip out of the double helix. The expelled nucleotides of the undamaged strand are recognised by Rad4, whereas the two cyclobutane pyrimidine dimer-linked nucleotides become disordered. This indicates that the lesions recognised by Rad4/XPC thermodynamically destabilise the double helix in a manner that facilitates the flipping-out of two base pairs []. Homologues of all the above mentioned yeast genes, except for RAD7, RAD16, and MMS19, have been identified in humans, and mutations in these human genes affect NER in a similar fashion as they do in yeast, with the exception of XPC, the human counterpart of yeast RAD4. Deletion of RAD4 causes the same high levelof UV sensitivity as do mutations in the other class 1 genes, and rad4 mutants are completely defective in incision. By contrast, XPC is required for the repair of nontranscribed regions of the genome but not for the repair of the transcribed DNA strand.
Protein Domain
Name: DNA repair protein Rad18, fungi
Type: Family
Description: During DNA replication, lesion bypass is an important cellular response to unrepaired damage in the genome. In the yeast Saccharomyces cerevisiae (Baker's yeast), Rad6 and Rad18 are required for both the error-free and error-prone lesion bypass mechanisms. The Rad18 gene encodes a RING-finger protein with single-stranded DNA binding activity that interacts with theubiquitin-conjugating enzyme Rad6. This entry represents the Rad8 from fungi.
Protein Domain
Name: T-cell leukemia translocation-altered gene protein
Type: Family
Description: This entry represents the T-cell leukemia translocation-altered gene protein. It may be required for cellular fusion during osteoclastogenesis [ ].
Protein Domain
Name: DNA single-strand annealing protein RecT
Type: Family
Description: All proteins in this family for which functions are known bind single-stranded DNA and are involved in the the pairing of homologous DNA. RecT from Escherichia coli is a homotetramer which binds to single-stranded DNA and promotes the renaturation of complementary single-stranded DNA, and also plays a role in recombination. It is able to promote the annealing of complementary single DNA strands and can catalyze the formation of joint molecules [].
Protein Domain
Name: Spore coat assembly protein CotJB
Type: Family
Description: This group represents a spore coat peptide assembly protein CotJB.
Protein Domain
Name: Protein of unknown function DUF5576
Type: Family
Description: This is a family of unknown function found in Hominidae.
Protein Domain
Name: Viral coat protein P2, N-terminal
Type: Domain
Description: P2 (30.2kDa) is the major outer-coat protein of the marine lipid-containing bacteriophage PM2. Each sub-unit of P2 is composed of two beta barrel jelly rolls, disposed normal to the surface of the capsid, which lend pseudo-6-fold symmetry to the molecules, facilitating their close packing within the capsid. There is a Ca2+ ion located between the two beta barrels of P2 that helps PM2 molecular organizations stabilization. This entry represents the N-terminal jelly roll domain of P2 [ ].
Protein Domain
Name: Protein of unknown function DUF5570
Type: Family
Description: This is a family of unknown function found in chordata. Family members contain a transmembrane region in the C-terminal and have been shown to be localized to the Golgi apparatus [ ].
Protein Domain
Name: Malonyl CoA-acyl carrier protein transacylase
Type: Family
Description: Malonyl CoA-acyl carrier protein transacylases transfer the malonyl moiety from coenzyme A to acyl-carrier protein. These include proteins involved in fatty acid biosynthesis (FabD), antibiotic biosynthesis (PksC) and the McdH subunit of malonate decarboxylase [ , , ].
Protein Domain
Name: Peptidase A31, hydrogenase expression/formation protein
Type: Family
Description: This family of peptidases consists of hydrogenase expression/formation proteins, such as HoxM from Alcaligenes eutrophus and HybD from E. coli. They belong to the MEROPS peptidase family A31.Nickel/iron hydrogenases are synthesized as two subunits, with the larger subunit containing the metallo centre. Once nickel has bound, the large subunit is activated by processing at a conserved site near the C terminus [ ]. The HycI endopeptidase releases a 32-residue peptide by cleavage of an Arg+Met bond []. A 15-residue peptide is released from hydrogenase 2 by cleavage of a His+Met bond by the HybD endopeptidase [ ].The tertiary structure of HybD has been solved, and shows a twisted beta sheet with three alpha helices on one side and four on the other [ ]. The molecule binds a single cadmium ion, which led to the peptidase being incorrectly identified as a metallopeptidase. It is possible that the metal binding site interacts with the nickel in the hydrogenase substrate []. The fold is similar to that of members of family A25, and both families A25 and A31 are included in the same clan, AE.
Protein Domain
Name: Sodium/glutamate symport carrier protein GltS
Type: Family
Description: This entry represents a membrane protein, GltS, which is a glutamate carrier of an Na+-dependent, binding protein-independent, and glutamate-specific transport system [ , ].
Protein Domain
Name: Cytochrome B pre-mRNA-processing protein 2
Type: Family
Description: Cytochrome B pre-mRNA-processing protein 2 (Cbp2) is required for splicing of the group I intron bI5 of the COB pre-mRNA [ , ]. It can also stimulate the splicing of the omega intron of the precursor of large ribosomal RNA [].
Protein Domain
Name: Uncharacterised conserved protein UCP004699, CBS/ParB-like
Type: Family
Description: This group represents an uncharacterised protein with CBS and ParB-like domains.
Protein Domain
Name: Uncharacterised conserved protein UCP005063, CBS-type
Type: Family
Description: This group represents an uncharacterised protein with CBS domain pair, MJ1232 type.
Protein Domain
Name: Uncharacterised conserved protein UCP004509, Rem
Type: Family
Description: This group represents a Rem protein.
Protein Domain
Name: Prepilin peptidase-dependent protein B, predicted
Type: Family
Description: This group represents a predicted prepilin peptidase-dependent protein B.
Protein Domain
Name: Lipopolysaccharide assembly protein A domain
Type: Domain
Description: This entry represents a domain found in lipopolysaccharide assembly protein A (LapA). LabA and LapB function together in the assembly of lipopolysaccharide (LPS) [ ]. This domain is also found in some uncharacterised proteins, such as Rv3760 from Mycobacterium tuberculosis.
Protein Domain
Name: Protein of unknown function DUF2589
Type: Family
Description: This family of proteins has no known function.
Protein Domain
Name: Protein of unknown function DUF5545
Type: Family
Description: This is a family of unknown function found in primates.
Protein Domain
Name: Uncharacterised protein family UPF0288, methanogenesis
Type: Family
Description: Members of this protein family are only found in archaeal methanogens. The functions of proteins in this family are unknown, but their role is likely one essential to methanogenesis.
Protein Domain
Name: Hemolysin coregulated protein (Hcp) TssD
Type: Family
Description: This entry represents a family of proteins including Hemolysin coregulated protein, Hcp (also known as TssD).Type VI secretion systems (T6SSs) are multiprotein complexes requiring numerous core proteins (Tss proteins) that function as a toxin delivery systems. The core Tss proteins include cytoplasmic, transmembrane, and outer membrane components. The toxin delivery apparatus includes a needle or tube structure which functions as a puncturing device. This is comprised of a phage-like complex, similar to the T4 contractile bacteriophage tail, which is thought to be anchored to the membrane by a trans-envelope complex. The inner tube is formed of Hcp (TssD), assembled as stacked hexamers [ ]. Hcp proteins have also been reported to function as chaperones, by binding to and stabilizing effectors [].
Protein Domain
Name: Protein of unknown function DUF5588
Type: Family
Description: This is a family of unknown function found in animals.
Protein Domain
Name: Uncharacterised conserved protein UCP005648, calcium-binding
Type: Family
Description: The structure of the Methanobacterium thermoautotrophicum protein encoded by MTH1880 demonstrates the typical alpha + beta fold found in many proteins with different functions [ ]. The molecular surface of the protein reveals a small, highly acidic pocket. MTH1880 protein contains a novel motif for calcium-specific binding, but does not show a calcium-induced conformational change typical of calcium sensor proteins. It may function as a calcium buffering protein [].
Protein Domain
Name: Protein of unknown function DUF749
Type: Family
Description: This is a family of unknown function found in archaebacterial proteins. The family has been solved via structural genomics techniques and comprises of segregated helical and anti-parallel beta sheet regions. It is a putative metal-binding protein.
Protein Domain
Name: Protein of unknown function DUF5552
Type: Family
Description: This is a family of unknown function found in primates.
Protein Domain
Name: Ribosomal protein L36e domain superfamily
Type: Homologous_superfamily
Description: Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [ , ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [ , ].A number of eukaryotic ribosomal proteins can be grouped on the basis of sequence similarities. The L36E ribosomal family consists of mammalian, Caenorhabditis elegans and Drosophila L36, Candida albicans L39, and yeast YL39 ribosomal proteins [ ].
Protein Domain
Name: Non-structural protein NSP3A domain-like superfamily
Type: Homologous_superfamily
Description: The multi-domain non-structural protein NSP3 is the largest protein encoded by the coronavirus (CoV) genome, with an average molecular mass of about 200 kD. While some of the domains differ between CoV genera, eight domains of NSP3 exist in all known CoVs: the ubiquitin-like domain 1 (Ubl1), the Glu-rich acidic domain (also called "hypervariable region"), a macrodomain (also named "X domain"), the ubiquitin-like domain 2 (Ubl2), the papain-like protease 2 (PL2pro), the NSP3 ectodomain (3Ecto, also called "zinc-finger domain"), as well as the domains Y1 and CoV-Y of unknown functions. There are also two transmembrane regions, TM1 and TM2, which exist in all CoVs [ ].The ubiquitin-like domain 1 (Ubl1) and the Glu-rich acidic region are located at the N-terminal of NSP3. These two regions together are also named "NSP3A". In addition to the four β-strands and two α-helices that are common to ubiquitin-like folds, the globular domain of NSP3A contains two short helices [ ]. NSP3A is well conserved within different SARS-CoV sequences but exhibits low sequence identity (less than 35%) to other CoV NSP3 proteins []. It is an essential component of the replication/transcription complex and is involved in multiple biological processes, such as proteolysis and RNA processing [].This entry represents NSP3A that has been shown to bind ssRNA. The overall structure of the SARS-CoV Ubl1 domain is similar to human ubiquitin (Ub) and that of each of the two ubiquitin-like domains of human or mouse interferon-stimulated gene 15 (hISG15 or mISG15). It is also similar to the Ras-interacting domain (RID) of RalGDS [ , ].
Protein Domain
Name: G protein pathway suppressor 2
Type: Family
Description: G protein pathway suppressor 2 is involved in G protein-mitogen-activated protein kinase (MAPK) signaling cascades. When overexpressed in mammalian cells, this protein could potently suppress a RAS- and MAPK-mediated signal and interfere with JNK activity, suggesting that the function of this protein may be signal repression [ ]. This protein is a component of the N-CoR-HDAC3 [] and alsothe SMRT corepressor complexes [ ].
Protein Domain
Name: Outer dense fibre protein 2-related
Type: Family
Description: This family consists of outer dense fibre protein 2 and related proteins. Outer dense fibre protein 2 (Odf2), also known as cenexin, seems to be a major component of sperm tail outer dense fibres (ODFs). ODFs are filamentous structures located on the outside of the axoneme in the midpiece and principal piece of the mammalian sperm tail and may help to maintain the passive elastic structures and elastic recoil of the sperm tail. Odf2 may have a modulating influence on sperm motility. It functions as a general scaffold protein that is specifically localized at the distal/subdistal appendages of mother centrioles [, ].
Protein Domain
Name: Kinase associated protein B superfamily
Type: Homologous_superfamily
Description: Kinase associated protein B (KapB) is one of the major histidine kinases that provide phosphate input in the phosphorelay to produce SpoOA approximately P, the key transcription factor controlling the initiation of sporulation in Bacillus subtilis [ ]. It forms an anti-parallel beta sheet with an extending alpha helical region.
Protein Domain
Name: Activating transcription factor 7-interacting protein
Type: Family
Description: Activating transcription factor 7-interacting protein (MCAF) family consists of MCAF1 and MCAF2 [ ]. MCAF1 (also known as MBD1-containing chromatin-associated factor 1) is a recruiter that couples transcriptional factors to general transcription apparatus and thereby modulates transcription regulation and chromatin formation. It can both act as an activator or a repressor depending on the context [ , ]. MCAF2 might be involved in the transcription modulation during gonad development [ ].
Protein Domain
Name: Mitochondrial ribosomal protein MRP51, fungi
Type: Family
Description: The function of mitochondrial ribosomal small-subunit protein MRP51 is not entirely clear, but deletion of the MRP51 gene completely blocks mitochondrial gene expression [ ].
Protein Domain
Name: Membrane transport protein MmpL family
Type: Family
Description: This entry represents the MmpL family of membrane transport proteins, which may be involved in lipid transport [ ]. Mycobacterium tuberculosis MMPL10 is required for the biosynthesis of polyacyltrehalose (PAT) and the transport of diacyltrehalose (DAT) and possibly PAT to the cell surface [], while MMPL4 is a part of an export system required for biosynthesis and secretion of siderophores [].
Protein Domain
Name: Cytochrome oxidase maturation protein cbb3-type
Type: Family
Description: Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation [ ].Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase.
Protein Domain
Name: Protein-export membrane protein SecD/SecF-like, archaeal-type
Type: Family
Description: Characterised members of the RND superfamily all probably catalyze substrate efflux via an H antiport mechanism. These proteins are found ubiquitously in bacteria, archaea and eukaryotes. They fall into seven phylogenetic families; one of them is the SecDF protein-secretion accessory protein (SecDF) family, which is found in Gram-negative and Gram-positive bacteria, as well as archaea [ ].This entry consists of uncharacterised putative transporters of the SecDF family, mainly from archaea.
Protein Domain
Name: Arginine exporter protein ArgO, enterobacteria
Type: Family
Description: The arginine exporter protein ArgO is involved in the export of arginine and is important to control the intracellular level of arginine and the correct balance between arginine and lysine [ ].
Protein Domain
Name: Uncharacterised conserved protein UCP019240, SpoVT/AbrB-related
Type: Family
Description: This entry represents a small family of archaeal proteins, including SpoVT-AbrB domain-containing protein from Pyrococcus horikoshii (PHS018, ), which consists of six-stranded Greek-key barrel fold and two α-helices, with a very similar overall appearance to the double-psi and swapped-hairpin β-barrel [ ]. Members of this group are predicted regulators of stationary/sporulation gene expression with an SpoVT/AbrB like domain.
Protein Domain
Name: Protein of unknown function DUF447
Type: Family
Description: This entry contains archaeal and bacterial proteins of unknown function.
Protein Domain
Name: Uncharacterised protein family UPF0285, methanogenesis
Type: Family
Description: This group represents uncharacterised conserved proteins. They share distant sequence similarity with members of . The exact function is unknown, but likely is linked to methanogenesis or a process closely connected to it.
Protein Domain
Name: Conjugative transfer relaxase protein TraI
Type: Family
Description: This entry represents TraI, which is a component of the relaxosome complex. In the process of conjugative plasmid transfer the relaxosome binds to the plasmid at the oriT (origin of transfer) site. The relaxase protein TraI mediates the single-strand nicking and ATP-dependent unwinding (relaxation, helicase activity) of the plasmid molecule. These two activities reside in separate domains of the protein [ , ]. TraI contains four domains: a trans-esterase domain that executes the nicking and covalent attachment of the T-strand to the relaxase, a vestigial helicase domain (carrying the 2B/2B-like sub-domain) that operates as an ssDNA-binding domain, an active 5' to 3' helicase domain, and a C-terminal domain that functions as a recruitment platform for relaxosome components.
Protein Domain
Name: Calcium-binding and spermatid-specific protein 1
Type: Family
Description: Calcium-binding and spermatid-specific protein, also known as casein-like phosphoprotein, is involved in spermatogenesis [ , ].
Protein Domain
Name: Protein of unknown function DUF2283
Type: Family
Description: Members of this family of hypothetical proteins have no known function.
Protein Domain
Name: Ribosomal protein L31 type A
Type: Family
Description: Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [ , ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [ , ].Ribosomal protein L31 is one of the proteins from the large ribosomal subunit. L31 is a protein of 66 to 97 amino-acid residues which has only been found so far in bacteria and in some plant and algal chloroplasts.This entry represents 50S ribosomal protein L31 from the type A subfamily.
Protein Domain
Name: Bluetongue virus nonstructural protein NS2
Type: Family
Description: This family includes NS2 proteins from other members of the Orbivirus genus. NS2 is a non-specific single-stranded RNA-binding protein that forms large homomultimers and accumulates in viral inclusion bodies of infected cells. Three RNA-binding regions have been identified in Bluetongue virus 17 ( ) at residues 2-11, 153-166 and 274-286 [ ]. NS2 multimers also possess nucleotidyl phosphatase activity []. The precise function of NS2 is not known, but it may be involved in the transport and condensation of viral mRNAs [].
Protein Domain
Name: Protein of unknown function DUF2490
Type: Family
Description: This entry represents a bacterial family of uncharacterised proteins.
Protein Domain
Name: Protein of unknown function DUF2487
Type: Family
Description: This entry represents proteins with unknown function that appears to be restricted to Bacillus sp.
Protein Domain
Name: Protein of unknown function DUF4198
Type: Family
Description: This family, which includes Haemophilus influenzae protein HI_1624, is functionally uncharacterised. Some family members were previously annotated as prokaryotic transmembrane nickel uptake transporters, NikM. This has been shown to be incorrect, as the proteins matched do not exhibit transmembrane properties, nor do they contain conserved residues associated with NikM proteins [ ].
Protein Domain
Name: Biofilm-dependent modulation protein Bdm, putative
Type: Family
Description: This entry represents a group of tightly conserved proteins from Enterobacteriaceae which are annotated as being biofilm-dependent modulation protein homologues. This entry includes Bdm, whose expression is reduced in biofilms and is repressed by a high salt concentration [ ].
Protein Domain
Name: Protein of unknown function DUF2491
Type: Family
Description: This entry represents a family of bacterial uncharacterised proteins, including YjfK ( ) from Escherichia coli.
Protein Domain
Name: Protein of unknown function DUF596
Type: Family
Description: This family contains several uncharacterised proteins.
Protein Domain
Name: Rhodanese-like domain-containing protein 9, chloroplastic
Type: Family
Description: This entry includes STR9 and related proteins from plants. The function of STR9 is not clear.
Protein Domain
Name: Protein of unknown function DUF2509
Type: Family
Description: This entry represents a conserved protein found in Proteobacteria. The function is not known but many of the members are annotated as protein YgdB.
Protein Domain
Name: Protein of unknown function DUF2513
Type: Family
Description: The function of this family is not known.
USDA
InterMine logo
The Legume Information System (LIS) is a research project of the USDA-ARS:Corn Insects and Crop Genetics Research in Ames, IA.
LegumeMine || ArachisMine | CicerMine | GlycineMine | LensMine | LupinusMine | PhaseolusMine | VignaMine | MedicagoMine
InterMine © 2002 - 2022 Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, United Kingdom