Search our database by keyword

Examples

  • Search this entire website. Enter identifiers, names or keywords for genes, pathways, authors, ontology terms, etc. (e.g. eve, embryo, zen, allele)
  • Use OR to search for either of two terms (e.g. fly OR drosophila) or quotation marks to search for phrases (e.g. "dna binding").
  • Boolean search syntax is supported: e.g. dros* for partial matches, or fly AND NOT embryo to exclude a term.

Search results 1501 to 1600 out of 30763 for seed protein

Category restricted to ProteinDomain

Category: ProteinDomain
Type Details Score
Protein Domain
Name: Soil-associated protein DUF3738
Type: Family
Description: Bacterial reference strains encoding members of this protein family are all isolated from soil. These include 39 members from Solibacter usitatus Ellin6076, 27 from Acidobacterium sp. MP5ACTX8 (both Acidobacteria), and four from Pedosphaera parvula Ellin514 (Verrucomicrobia). The family is well-diversified, with few pairs showing greater than 50% pairwise identity. A few members are fused to Peptidase_M56 domains or to Sigma70_r2 domains, or have a duplication of this domain.
Protein Domain
Name: Hercynine metabolism protein
Type: Family
Description: Hercynine is the betaine (trimethylated amino group) form of histidine. It is the precursor of ergothioneine, a thiourea derivative containing a sulfur atom in the imidazole ring. This protein occurs in a conserved four-gene cyanobacterial cassette along with an EgtD, the methyltransferase that converts histidine to hercynine as in ergothioneine biosynthesis, an EgtB homologue that is likely to attach some thiol (e.g. gamma-glutamyl-cysteine) through its sulfur to the hercynine imidazole ring, and a small protein of unknown function. Members are distantly related to phage shock protein A (PspA).
Protein Domain
Name: Oxidoreductase-SelD-related fusion protein
Type: Family
Description: Some selenium donor proteins (selenide, water dikinase, product of the selD gene) are fusion proteins with an N-terminal extension described by . Members of this family have a C-terminal region similar to SelD, fused to an N-terminal region similar to, but outside the scope of .
Protein Domain
Name: D-alanyl carrier protein
Type: Family
Description: This entry represents D-alanyl carrier protein DltC, which is part of the operon for incorporation of D-Ala residues into lipoteichoic acids (LTAs), which requires the activity of four gene products (DltA to DltD). DltA is a cytoplasmic D-alanine-D-alanyl carrier protein ligase that catalyses the D-alanylation of the D-alanyl carrier protein DltC (or Dcp); DltB is a transmembrane protein thought to be involved in the efflux of activated D-alanine to the site of acylation; and DltD is thought to be a membrane-associated protein that may have multifunctional activities (hydrolysis of mischarged DltC, facilitation of D-alanine ligation to DltC and D-alanylation of LTAs). These proteins are involved in the biosynthesis of D-alanyl-lipoteichoic acid in bacteria. They are major wall and membrane components of most Gram-positive bacteria. Streptococcus pneumoniae is one of the few species of low-G+C Gram-positive bacteria that do not contain D-alanine in teichoic acids. D-alanine residues are thought to protect this human pathogen against the actions of cationic antimicrobial peptides. Mammals use antimicrobial peptides to protect themselves from invasive infectious agents such as Group A streptococcus (GAS), which causes necrotising fasciitis and toxic shock syndrome. Teichoic acids and D-alanylation of teichoic acids are required for colony spreading in Staphylococcus aureus.
Protein Domain
Name: Pentraxin-related protein PTX3
Type: Family
Description: Pentraxin-related protein PTX3 is a long pentraxin that provides defence against infectious agents and plays several functions in tissue repair and regulation of cancer-related inflammation. Pentraxins are a family of evolutionarily conserved molecules with a regulatory role in inflammation. They share the "pentraxin signature" (His-x-Cys-x-Ser/Thr-Trp-x-Ser, where x is any amino acid) and can be divided into a short and a long arm based on their structural organization. Short pentraxins, such as CRP and SAP, feature a peculiar quaternary structure with five or ten identical protomer subunits arranged into symmetric pentamers, and are secreted proteins produced by hepatocytes in response to IL-6. Long pentraxins, on the other hand, display an unrelated amino-terminal region coupled to a C-terminal pentraxin domain. PTX3 is the prototype of the long pentraxin subfamily produced by myeloid and stromal cells, but not by hepatocytes, in response to primary pro-inflammatory cytokines or microbial moieties.
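The pentraxin signature quoted above (His-x-Cys-x-Ser/Thr-Trp-x-Ser) can be written as a regular expression over one-letter amino acid codes. The following is a minimal sketch, not part of the database entry: the function name `find_pentraxin_signature` and the toy sequence are hypothetical, chosen only for illustration.

```python
import re

# Signature from the description: His-x-Cys-x-Ser/Thr-Trp-x-Ser.
# In one-letter codes: H, any, C, any, S or T, W, any, S.
# "." stands for x (any residue).
PENTRAXIN_SIGNATURE = re.compile(r"H.C.[ST]W.S")


def find_pentraxin_signature(sequence):
    """Return the 0-based start positions of every signature match."""
    return [m.start() for m in PENTRAXIN_SIGNATURE.finditer(sequence.upper())]


# Hypothetical toy sequence, not a real protein.
toy_sequence = "AAHACASWGSAA"
print(find_pentraxin_signature(toy_sequence))  # [2]
```

A match only shows that the eight-residue signature is present; it does not by itself distinguish short from long pentraxins.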
Protein Domain
Name: Uncharacterized protein KIAA1958
Type: Family
Description: The function of KIAA1958 is not clear.
Protein Domain
Name: Dyslexia-associated protein KIAA0319-like
Type: Family
Description: Several dyslexia-associated proteins have been identified: ROBO1, KIAA0319, KIAA0319L, S100B, DOCK4, FMR1, DIP2A, GTF2I, DYX1C, DCDC2, SLIT2, HMGB1 and VAPA. This entry includes KIAA0319 and KIAA0319-like (KIAA0319L) proteins. KIAA0319 is required for neuronal migration during the formation of the cerebral neocortex. KIAA0319L has a possible role in axon guidance through interaction with nogo receptor 1.
Protein Domain
Name: Mid-cell-anchored protein Z
Type: Family
Description: In Streptococcus pneumoniae, mid-cell-anchored protein Z (MapZ) forms ring structures at the cell equator and moves apart as the cell elongates, therefore behaving as a permanent beacon of division sites. During cell division, a new MapZ ring marks the future cell division site and positions tubulin-like protein FtsZ rings through direct interactions. MapZ is conserved amongst Streptococcaceae and most other Lactobacillales.
Protein Domain
Name: DNA-binding 7kDa protein
Type: Family
Description: This family contains members of the hyperthermophilic archaebacterium 7kDa DNA-binding/endoribonuclease P2 family. There are five 7kDa DNA-binding proteins, 7a-7e, found as monomers in the cell. Protein 7e shows the tightest DNA-binding ability.
Protein Domain
Name: Proline-rich protein 3
Type: Family
Description: The function of Proline-rich protein 3 (PRR3) is not clear.
Protein Domain
Name: MAP3K7 C-terminal-like protein
Type: Family
Description: The function of MAP3K7 C-terminal-like (Map3k7cl) is not clear.
Protein Domain
Name: Antitermination protein Q
Type: Family
Description: This entry consists of antitermination proteins found in bacteriophages, such as protein Q from phage lambda, and some bacterial homologues. Protein Q positively regulates expression of the phage late gene operon by binding to the bacterial host RNA polymerase (RNAP) and modifying it. This protein binds a specific DNA Q-binding element (QBE) and interacts with RNAP. The modified RNAP can read without pausing and through transcription terminators preceding late genes. It participates in the lysis-lysogeny decision by activating the expression of the late lytic genes. The structure of Q from bacteriophage 21 revealed that it forms a torus that narrows and extends the RNAP RNA-exit channel and extrudes the linked RNA, preventing the formation of pause and terminator hairpins.
Protein Domain
Name: Integral membrane protein
Type: Family
Description: Members of this protein family are short (about 85-residue), low-complexity sequences of unknown function, with a highly hydrophobic N-terminal region of about 40 amino acids followed by a charged (Asp, Glu, Lys, and Arg-rich), sometimes repetitive C-terminal region. Members occur exclusively among the Mollicutes (Mycoplasma, Mesoplasma, Acholeplasma, Spiroplasma, Entomoplasma). The gene neighbourhood of this protein is not conserved.
Protein Domain
Name: Antimicrobial protein MiAMP1
Type: Family
Description: MiAMP1 is a highly basic protein from the nut kernel of Macadamia integrifolia (Macadamia nut), which inhibits the growth of several microbial plant pathogens in vitro while having no effect on mammalian or plant cells. It consists of eight β-strands which are arranged in two Greek key motifs. These Greek key motifs then associate to form a Greek key β-barrel.
Protein Domain
Name: HaoA-associated protein HaoB
Type: Family
Description: Members of this family occur as the HaoB protein, encoded next to the homotrimeric HaoA, a hydroxylamine oxidoreductase protein with eight heme groups. The haoB gene is cotranscribed with haoA, but its function is unknown. It appears all species with this enzyme are nitrifying bacteria.
Protein Domain
Name: DnaD family protein
Type: Family
Description: This entry represents a group of Mollicutes proteins homologous to the N-terminal region of the Bacillus subtilis DnaD protein. Their function is not clear.
Protein Domain
Name: Uncharacterized protein KIAA1143-like
Type: Family
Description: This entry includes human KIAA1143 and related proteins. Their function is not clear.
Protein Domain
Name: Uncharacterized protein KIAA2026
Type: Family
Description: This entry represents a group of uncharacterised proteins from animals, including KIAA2026 from humans.
Protein Domain
Name: DNA replication/checkpoint protein
Type: Family
Description: Genome duplication is precisely regulated by cyclin-dependent kinases (CDKs), which bring about the onset of S phase by activating replication origins and then prevent relicensing of origins until mitosis is completed. The optimum sequence motif for CDK phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found to have at least 11 potential phosphorylation sites. Drc1 is required for DNA synthesis and S-M replication checkpoint control. Drc1 associates with Cdc2 and is phosphorylated at the onset of S phase when Cdc2 is activated. Thus Cdc2 promotes DNA replication by phosphorylating Drc1 and regulating its association with Cut5. Sld2 and Sld3 represent the minimal set of S-CDK substrates required for DNA replication. This entry also includes ATP-dependent DNA helicase Q4, which may be involved in chromosome segregation and has been associated with various diseases.
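The optimum CDK phosphorylation motif quoted above (S/T-P-K/R-K/R) can be scanned for with a short regular expression. This is only a sketch under stated assumptions: the function name `find_cdk_optimum_sites` and the toy sequence are hypothetical, and the pattern covers only the full optimum motif given in the description, so it may not reproduce the count of potential sites reported for Drc1-Sld2.

```python
import re

# Optimum motif from the description: S/T-P-K/R-K/R,
# written with one-letter amino acid codes.
CDK_OPTIMUM_MOTIF = re.compile(r"[ST]P[KR][KR]")


def find_cdk_optimum_sites(sequence):
    """Return 0-based positions of the S/T residue in each motif match."""
    return [m.start() for m in CDK_OPTIMUM_MOTIF.finditer(sequence.upper())]


# Hypothetical toy sequence, not a real Drc1 or Sld2 sequence.
toy_sequence = "MSPKKAATPRKGG"
print(find_cdk_optimum_sites(toy_sequence))  # [1, 7]
```

Because the second motif position must be proline, two matches cannot overlap, so a plain non-overlapping scan is sufficient here.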
Protein Domain
Name: Aspartate-rich protein 1-like
Type: Family
Description: This entry includes aspartate-rich protein 1 (DRICH1) and uncharacterised protein C22orf42 from humans. Their function is not clear.
Protein Domain
Name: Testis-expressed protein 11
Type: Family
Description: Testis-expressed protein 11 (TEX11), also known as ZIP4H, is a regulator of crossing-over during meiosis. It is involved in initiation and/or maintenance of chromosome synapsis and formation of crossovers. Mutations in the TEX11 gene cause Spermatogenic failure, X-linked, 2 (SPGFX2), an infertility disorder caused by spermatogenesis defects.
Protein Domain
Name: Transmembrane protein 25
Type: Family
Description: The function of Transmembrane protein 25 (TMEM25) is not clear. It contains a C-2 type immunoglobulin domain homologous to Hemicentin (Fibulin-6, FIBL6), Titin (TTN), Sialoadhesin (SN) and Nephrin (NEPHS1).
Protein Domain
Name: Nucleolar protein 11
Type: Family
Description: NOL11 is a nucleolar protein and a component of the human ribosomal small subunit (SSU) processome. It is required for the early stages of ribosome biogenesis in humans. It interacts with the C-terminal region of the known t-UTP/UTPA subcomplex member, hUTP4/Cirhin, the protein mutated in North American Indian childhood cirrhosis (NAIC). In Xenopus, it is required for optimal rDNA transcription and craniofacial development.
Protein Domain
Name: Transmembrane protein 266
Type: Family
Description: Transmembrane protein 266 (TMEM266), also known as HVRP1 (Hv1 related protein 1), is a voltage sensor regulated by extracellular Zn2+. It may have an important function in the nervous system.
Protein Domain
Name: Ribosomal protein L15
Type: Family
Description: This entry represents the ribosomal protein family L15P. This family includes proteins L15 from bacteria, L28 and L10 from yeast, and L27A from mammals. Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid of elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome to which they belong: the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surface-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waals contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure.
While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome.
Protein Domain
Name: Ribosome-binding protein 1
Type: Family
Description: Ribosome-binding protein 1 (RRBP1), also called ES/130 or p180, is an endoplasmic reticulum membrane protein that is critical for ribosome binding and for the transportation and secretion of nascent proteins. RRBP1 plays a critical role in terminal differentiation of secretory cells and tissues. RRBP1 is highly expressed in several cancers.
Protein Domain
Name: UPF0547 protein C16orf87-like
Type: Family
Description: This entry represents a group of uncharacterised proteins from animals, including C16orf87 from humans.
Protein Domain
Name: Protein RALF-like 27
Type: Family
Description: The plant RAPID ALKALINIZATION FACTOR (RALF) family consists of extracellular peptides that serve as extracellular signals. This entry represents RALF-like 27, whose exact function is not known.
Protein Domain
Name: Transmembrane protein 198
Type: Family
Description: TMEM198 is a membrane scaffold protein that promotes LRP6 phosphorylation and Wnt signaling activation.
Protein Domain
Name: Helix-loop-helix protein TAL-like
Type: Family
Description: This entry includes highly related proteins with a Myc-type, basic helix-loop-helix (bHLH) domain, including T-cell acute lymphocytic leukemia proteins 1 and 2 (TAL1 and TAL2), helix-loop-helix proteins 1 and 2 (NHLH1/HEN1 and NHLH2/HEN2), and protein lyl-1 (LYL1). The bHLH domain is required for protein dimerization and DNA binding. TAL1 may be a positive regulator of erythroid differentiation. The TAL1 and TAL2 genes are altered in T-cell acute lymphoblastic leukemia. The NHLH1 and NHLH2 genes are expressed in the developing nervous system, especially cell lines derived from neuroblastoma, PNET, and small cell lung cancer, and the proteins may have regulatory functions in the developing nervous system. LYL1 has a role in primitive erythropoiesis. A chromosomal aberration involving the LYL1 gene may be a cause of a form of T-cell acute lymphoblastic leukemia.
Protein Domain
Name: RNA-binding protein KhpB
Type: Family
Description: This entry represents the RNA-binding protein KhpB, also known as RNA-binding protein Jag/EloR, an RNA chaperone that forms a complex with KhpA and binds to cellular RNA, controlling its expression. It plays a role in peptidoglycan (PG) homeostasis and cell length regulation, cell division and maintenance of cell shape. Bacillus subtilis KhpB/Jag is associated with SpoIIIJ, a gene necessary for the third stage of sporulation. These proteins consist of an N-terminal Jag-domain of unknown function and two RNA-binding domains, a type II KH domain (KH-II) and R3H, at the C-terminal end.
Protein Domain
Name: Flagellar protein FlgA
Type: Family
Description: FlgA is essential for flagellar biosynthesis and motility, and plays a significant role in biofilm formation in Campylobacter. It functions as a flagellar chaperone in P-ring assembly.
Protein Domain
Name: Spermatogenesis-defective protein 39
Type: Family
Description: Spermatogenesis-defective protein 39 (spe-39) from the nematode Caenorhabditis elegans is involved in endosomal maturation and intracellular membrane reorganization during spermatogenesis. Also included in this entry is vacuolar protein sorting-associated protein 16B (Vps16B; also known as protein full-of-bacteria) from the fruit fly Drosophila melanogaster. Vps16B is required for phagosome maturation and the innate immune response to bacteria.
Protein Domain
Name: Spermatogenesis-associated protein 1
Type: Family
Description: The spermatogenesis associated 1 (SPATA1) protein is thought to be involved in spermatogenesis and is a potential marker for male fertility.
Protein Domain
Name: Transmembrane protein 201
Type: Family
Description: TMEM201, also known as SAMP1, is a RanGTP binding transmembrane protein found in the inner nuclear membrane. It is functionally associated with the LINC complex protein Sun1 and proteins of the A-type lamina network. In Caenorhabditis elegans, it plays a role in nuclear migration.
Protein Domain
Name: Sporulation-specific protein 22/ZIP4
Type: Family
Description: Spo22/Zip4 is a meiosis-specific protein that promotes chromosome synapsis and regulates crossover distribution in budding yeast. This family includes the homologue from plants.
Protein Domain
Name: Autophagy-related protein 11
Type: Family
Description: Autophagy-related protein 11 (ATG11; also known as cytoplasm to vacuole targeting protein 9) is required for transport of proteins from the cytosol to the vacuole, including whole mitochondria for autophagic digestion (mitophagy) in the autophagosome during starvation. It also acts as the scaffold that recruits ATG proteins to the pre-autophagosome. The orthologue in mammals is known as RB1-inducible coiled-coil protein 1 (RB1CC1); the orthologue in Schizosaccharomyces pombe is Taz1-interacting factor 1 (taf1).
Protein Domain
Name: Major vault protein
Type: Family
Description: The major vault protein is the major polypeptide component of a large cellular ribonuclear protein complex found in the cytoplasm of eukaryotic cells (known as vaults). Several roles for vaults have been proposed. Vault proteins have been associated with development of multi-drug resistance. They have also been implicated in the regulation of several cellular processes including transport mechanisms, signal transmission and immune responses.
Protein Domain
Name: Centromere protein H
Type: Family
Description: Chromosome segregation in eukaryotes requires the kinetochore, a multi-protein structure that assembles on centromeric DNA, and which acts to link chromosomes to spindle microtubules. Kinetochore structure and composition is highly conserved among vertebrates. The inner kinetochore is essential for kinetochore assembly, and is involved in chromosome segregation via regulation of the spindle. Inner kinetochore components include the multi-subunit CENP-H/I complex, which may function, in part, in directing centromere protein A (CENP-A) deposition to centromeres, where CENP-A is a centromere-specific histone H3 variant required for the organisation of centromeric chromatin during interphase. The CENP-H/I complex contains three functional classes of proteins:
  • CENP-H class (includes CENP-H, -I, -K, -L)
  • CENP-M class (includes CENP-M)
  • CENP-O class (includes CENP-O, -P, -Q, -R, -50)
CENP-H is required for the localisation of CENP-C, but not CENP-A, to the centromere. However, it may be involved in the incorporation of newly synthesised CENP-A into centromeres via its interaction with the CENP-A/CENP-HI complex. CENP-H contains a coiled-coil structure and a nuclear localisation signal. CENP-H is specifically and constitutively localised in kinetochores throughout the cell cycle, and may play a role in kinetochore organisation and function throughout the cell cycle. Studies show that CENP-H may be associated with certain human cancers.
Protein Domain
Name: Transmembrane protein 180
Type: Family
Description: The function of TMEM180 is not clear.
Protein Domain
Name: Uncharacterized protein C11orf91-like
Type: Family
Description: This family of uncharacterized proteins includes human protein C11orf91.
Protein Domain
Name: Intraflagellar transport-associated protein
Type: Family
Description: IFTAP (C11ORF74) interacts with the IFT-A complex and is accumulated at the distal tip in the absence of an IFT-A subunit IFT139. In IFTAP-knockout (KO) cells, the BBSome components cannot enter cilia. However, IFTAP-KO mice demonstrated no obvious anatomical abnormalities associated with ciliary dysfunctions.
Protein Domain
Name: Uncharacterized protein C14orf28-like
Type: Family
Description: C14orf28, also known as dopamine receptor-interacting protein 1 (DRIP1), has been shown to inhibit apoptosis in colorectal cancer and contribute to oncogenicity. Its precise function is not known.
Protein Domain
Name: Uncharacterized protein C13orf46-like
Type: Family
Description: This family of uncharacterized proteins includes human protein C13orf46.
Protein Domain
Name: Uncharacterized protein C9orf153-like
Type: Family
Description: This family of uncharacterized proteins from mammals includes human protein C9orf153.
Protein Domain
Name: Deubiquitinating protein VCPIP1
Type: Family
Description: VCPIP1, also known as VCIP135, is a deubiquitinating enzyme that functions in p97/p47-mediated Golgi reassembly. It hydrolyzes 'Lys-11'- and 'Lys-48'-linked polyubiquitin chains. It has also been found to dictate the duration of botulinum neurotoxin type A intoxication. VCPIP1 can also deubiquitinate SPRTN (a metalloprotease that cleaves DNA-protein crosslinks) and promotes its chromatin relocalization. VCPIP1 is a member of peptidase family C64.
Protein Domain
Name: Exopolysaccharide synthesis protein
Type: Domain
Description: This entry describes a family of proteins whose members include tyrosine-protein kinases that are involved in extracellular polysaccharide colanic acid synthesis.
Protein Domain
Name: T6S-associated protein TagF
Type: Family
Description: This entry includes the T6S-associated protein, TagF, a post-translational repressor of the hemolysin co-regulated secretion island I (HSI-I)-encoded type VI secretion system (H1-T6SS), found in Pseudomonas aeruginosa. The type VI secretion system (T6SS) is found broadly among the Proteobacteria and is implicated in diverse processes including host-cell interactions, biofilm formation and gene regulation. Furthermore, studies indicate that the T6SS plays a critical role in interbacterial interactions. TagF has been shown to regulate the activity of the H1-T6SS in a manner that does not require phosphorylated-Fha1, PpkA, or other Tag proteins. TagF forms a homodimer with nearly identical monomers composed of alpha and beta elements assembled as a three-layer sandwich (α-β-α).
Protein Domain
Name: Transmembrane protein 168
Type: Family
Description: This is a family of uncharacterised transmembrane proteins.
Protein Domain
Name: Ras-related protein Rab14
Type: Family
Description: Ras-related protein Rab14 is a small GTPase that mediates endocytic recycling and functions in phagosome maturation. Rab14 and its exchange factor FAM116 regulate the specific endocytic transport of ADAM10 and thereby N-cadherin shedding and cell motility. Rab14 regulates claudin-2 trafficking, which is required for epithelial morphogenesis. It regulates apical targeting in polarised epithelial cells. Rab14 also regulates the interaction of phagosomes with early endocytic compartments and the maturation of macrophage phagosomes containing the fungal pathogen Candida albicans. Drosophila Rab14 mediates phagocytosis in the immune response to Staphylococcus aureus.
Protein Domain
Name: Penicillin-binding protein 2
Type: Family
Description: This entry represents penicillin-binding protein 2 (PBP-2, also known as peptidoglycan D,D-transpeptidase MrdA), a protein whose gene (designated either pbpA or mrdA) is generally found next to the gene for RodA, a protein required for the rod (bacillus) shape in many bacteria. PBP-2 acts as a transpeptidase for cell elongation, hence it is involved in formation of the rod shape.
Protein Domain
Name: Centrosomal protein Spd-2/CEP192
Type: Family
Description: This entry includes Spd2 (spindle-defective protein 2) from Caenorhabditis elegans, CEP192 (centrosomal protein of 192kDa) from humans, and homologues from metazoa and fungi. Spd2 is required both for centrosome duplication and maturation. CEP192 is required for mitotic centrosome and spindle assembly. They are required for pericentriolar material (PCM) recruitment.
Protein Domain
Name: Transmembrane protein 179B
Type: Family
Description: This family of transmembrane proteins is functionally uncharacterised.
Protein Domain
Name: Delta-retroviral matrix protein
Type: Domain
Description: Retroviral matrix proteins (or major core proteins) are components of envelope-associated capsids, which line the inner surface of virus envelopes and are associated with viral membranes. Matrix proteins are produced as part of Gag precursor polyproteins. During viral maturation, the Gag polyprotein is cleaved into major structural proteins by the viral protease, yielding the matrix (MA), capsid (CA), nucleocapsid (NC), and some smaller peptides. Gag-derived proteins govern the entire assembly and release of the virus particles, with matrix proteins playing key roles in Gag stability, capsid assembly, transport and budding. Although matrix proteins from different retroviruses appear to perform similar functions and can have similar structural folds which predominantly consist of four closely packed α-helices that are interconnected through loops, their primary sequences can be very different. This entry represents matrix proteins from delta-retroviruses such as Human T-lymphotropic virus 1 and Human T-cell leukemia virus 2 (HTLV-2), both members of the human oncovirus subclass of retroviruses.
Protein Domain
Name: Storkhead-box protein 1/2
Type: Family
Description: Storkhead-box proteins are winged-helix transcription factors. STOX1 and STOX2 have been linked to pre-eclampsia. STOX has been shown to regulate nitroso-redox balance and mitochondrial homeostasis.
Protein Domain
Name: Uncharacterized protein C16orf45-like
Type: Family
Description: This entry includes human C16orf45 and related proteins. Their function is not clear.
Protein Domain
Name: Uncharacterized protein T25E4.2-like
Type: Family
Description: This entry represents a group of proteins from nematoda, including T25E4.2 from C. elegans.
Protein Domain
Name: Uncharacterized protein C19orf44-like
Type: Family
Description: This entry includes C19orf44 and related proteins. Their function is not clear.
Protein Domain
Name: Uncharacterized protein C10orf67-like
Type: Family
Description: This entry represents a group of animal proteins, including C10orf67 from human.
Protein Domain
Name: Recombination protein 107
Type: Family
Description: Rec107 forms part of a complex (Rec107-Mei4-Rec114) that is required for meiotic double strand DNA break formation. Rec107 increases in abundance and is phosphorylated during the prophase phase of cell division. Rec107 is not required for mitosis and mitotic DNA repair mechanisms.
Protein Domain
Name: Large coat protein
Type: Family
Description: The virus capsid is composed of 60 icosahedral units, each of which is composed of one copy of each of the two coat proteins. This family contains the large coat protein (LCP) of the Comoviridae viral family.
Protein Domain
Name: Homeobox protein MNX1/Ceh-12
Type: Family
Description: This entry represents a group of homeobox proteins, including Motor neuron and pancreas homeobox protein 1 (MNX1) from mammals and Ceh-12 from C. elegans. Mutations in the MNX1 gene cause Currarino syndrome (CURRAS), a triad composed of partial sacral agenesis with intact first sacral vertebra ("sickle-shaped sacrum"), presacral mass, and anorectal malformations.
Protein Domain
Name: Autophagy-related protein 29
Type: Family
Description: Autophagy-related (Atg) proteins play a role in autophagy. Atg29 forms a complex with Atg17 and Atg31. When autophagy is initiated, the Atg17-Atg31-Atg29 complex is first targeted to the phagophore assembly site (PAS) and then recruits other Atg proteins. The complex is recruited to the PAS by Atg11. Atg29 contains an N-terminal functional domain, whereas the C terminus plays a regulatory role. Phosphorylation of the C-terminal domain of Atg29 is required for its interaction with Atg11 and proper PAS localization.
Protein Domain
Name: RPM1-interacting protein 4/NOI4
Type: Family
Description: RIN4 is an essential regulator of plant defense pathways. It functions with the plasma membrane H(+)-ATPase to regulate stomatal apertures, inhibiting the entry of bacterial pathogens during infection. It is required for RPM1-mediated Pseudomonas syringae resistance in Arabidopsis. Phosphorylated RIN4 activates the receptor RPM1 to mediate the plant defense. Arabidopsis RIN4 contains two nitrate-induced (NOI) domains and is a member of the larger NOI family. This entry also includes NOI4 (At5g55850), whose function is not known.
Protein Domain
Name: Nucleolar protein 10/Enp2
Type: Family
Description: Proteins in this entry contain WD40 repeats. Nucleolar protein 10 (NOL10) is found in the nucleolus of vertebrates and contains seven WD40 repeats but is otherwise uncharacterized. Ribosome biogenesis protein Enp2 (Enp2) from Saccharomyces cerevisiae contains five WD40 repeats and is a component of 90S pre-ribosomes. Homologues are also known from plants.
Protein Domain
Name: Rab-like protein 6
Type: Family
Description: Rab-like protein 6 (RABL6 or RBEL1) is one of several proteins identified as binding partners of the ARF tumour suppressor, which protects against cancer through protein-protein interactions, interacting with CDKN2A. RABL6 binds GTP and is found either in the cytoplasm or the nucleus depending on the isoform; isoform 1 is O-glycosylated and cytoplasmic.
Protein Domain
Name: Uncharacterized protein At4g14450-like
Type: Family
Description: This family consists of uncharacterized plant proteins of unknown function.
Protein Domain
Name: Capsid protein VP4
Type: Family
Description: The virus capsid is composed of 60 icosahedral units of a combination of VP4, VP3, VP2 and VP1. Four different translation initiation sites of the Densovirus capsid protein mRNA give rise to these four viral proteins, VP1 to VP4. This family represents VP4.
Protein Domain
Name: Replication terminator protein
Type: Family
Description: The bacterial replication terminator protein (RTP) plays a role in the termination of DNA replication by impeding replication fork movement. Two RTP dimers bind to the two inverted repeat regions at the termination site.
Protein Domain
Name: Coronavirus protein 7
Type: Family
Description: This is a family of proteins from Coronavirus, which may function in the formation of membrane-bound replication complexes or in viral assembly.
Protein Domain
Name: Bacteriochlorophyll A protein
Type: Family
Description: Bacteriochlorophyll A (or FMO) protein is involved in the energy transfer system of photosynthetic bacteria, such as green sulphur bacteria. Bacteriochlorophyll A acts as a light-harvesting complex that directs light energy from the chlorosomes attached to the cell membrane to the reaction centre. The protein forms a homotrimer, with each monomer unit containing seven molecules of bacteriochlorophyll A.
Protein Domain
Name: Transmembrane protein 181
Type: Family
Description: TMEM181 mutants are resistant to CDT bacterial toxins. TMEM181 is a cell-surface protein, and CDT may bind to it to enter the cell through endocytosis.
Protein Domain
Name: Uncharacterized protein At1g65710-like
Type: Family
Description: At1g65710 is the representative of this family of uncharacterized plant proteins.
Protein Domain
Name: PAM2-containing protein CID1/CID2
Type: Family
Description: PAM2-containing protein CID1, also known as protein EARLY RESPONSIVE TO DEHYDRATION 15 (ERD15) or polyadenylate-binding protein-interacting protein 1, is a negative regulator of abscisic acid (ABA) responses. ABA plays an important role in plant germination and development, and accumulates in response to different stresses. ERD15 is involved in stress tolerance in plants, including resistance to drought. This family also includes the related protein CID2, whose function is not known.
Protein Domain
Name: Pre-hexon-linking protein IIIa
Type: Family
Description: The major capsid protein of the adenovirus strain is also known as a hexon. This entry represents protein IIIa, which is a hexon-associated protein that is likely to participate in vertex stabilisation and genome packaging. It stabilises vertices by tethering the penton bases to neighbouring peripentonal hexons, and lashes peripentonal hexons to the neighbouring hexons through its interaction with hexon-linking protein. During virus assembly, it seems to play a role in packaging of viral DNA via its interaction with packaging protein 3.
Protein Domain
Name: Uncharacterized protein At5g23160-like
Type: Family
Description: This family of uncharacterized plant proteins includes Arabidopsis thaliana At5g23160.
Protein Domain
Name: Uncharacterized protein DDB_G0275255-like
Type: Family
Description: This is a family of unknown function. No member has yet been characterized.
Protein Domain
Name: Nucleocapsid N protein
Type: Family
Description: The nucleoprotein of the ssRNA negative-strand Nairovirus is an internal part of the virus particle.
Protein Domain
Name: Ubiquitin-like protein FUBI
Type: Domain
Description: Fubi is a ubiquitin-like protein encoded by the fau gene, which has an N-terminal ubiquitin-like domain (also referred to as FUBI) fused to the ribosomal protein S30. Fubi is thought to be a tumour suppressor protein and the FUBI domain may act as a substitute or an inhibitor of ubiquitin or one of ubiquitin's close relatives UCRP, FAT10, and Nedd8.
Protein Domain
Name: Replication origin-binding protein
Type: Domain
Description: This entry represents replication origin binding protein. It functions as a docking protein to recruit essential components of the viral replication machinery to viral DNA origins. In the presence of the major DNA-binding protein, it opens dsDNA, which leads to a conformational change in the origin that facilitates DNA unwinding and subsequent replication.
Protein Domain
Name: Rhabdovirus non-virion protein
Type: Family
Description: Infectious hematopoietic necrosis virus (IHNV) is a member of the family Rhabdoviridae. The non-virion protein (NV) is coded for by one of the six genes of the IHNV genome, but is absent in vesiculovirus-like rhabdovirus.
Protein Domain
Name: Uncharacterized protein F10E9.3-like
Type: Family
Description: This small family consists of uncharacterized proteins from Caenorhabditis.
Protein Domain
Name: Uncharacterized protein C11orf97-like
Type: Family
Description: The function of this family of uncharacterized proteins is not known.
Protein Domain
Name: Uncharacterized protein C20orf141-like
Type: Family
Description: This family of uncharacterized proteins is present in mammals.
Protein Domain
Name: Uncharacterized protein C05B5.4-like
Type: Family
Description: This family of uncharacterized proteins from nematodes includes C05B5.4 and R10E12.2 from Caenorhabditis elegans.
Protein Domain
Name: Uncharacterized protein At1g76660-like
Type: Family
Description: At1g76660 is the representative of this family of uncharacterized proteins from plants.
Protein Domain
Name: Transmembrane protein 109
Type: Family
Description: TMEM109, also known as Mg23, is an endoplasmic reticulum protein that facilitates DNA damage-induced apoptosis. It plays a protective role against UVC by accumulating alphaBC in the close vicinity of the ER.
Protein Domain
Name: Transmembrane protein 207
Type: Family
Description: The function of TMEM207 is not known. Homologues are known only from vertebrates.
Protein Domain
Name: Protein masquerade, clip-domain
Type: Domain
Description: The clip domain is a structural/regulatory unit in many arthropod serine proteases. The clip domain superfamily also includes serine protease homologs (SPHs). This entry describes clip domains in the SPHs (CLIP subfamily A), which belong to group-3. SPHs usually carry between 1 and 5 clip domains. One of the most prominent family members is masquerade (mas). Deletion in Drosophila models leads to defects in somatic muscle attachment and in the formation of the nervous system during embryogenesis.
Protein Domain
Name: Uncharacterized protein CXorf65-like
Type: Family
Description: The function of this group of proteins is not known. Proteins in this entry include CXorf65 and C22orf15 from humans.
Protein Domain
Name: Protein FAM184A/B, N-terminal
Type: Domain
Description: This entry represents the N terminus of protein FAM184A/B. The function of FAM184A/B is not known. This domain can also be found in protein tag-278 from C. elegans.
Protein Domain
Name: Nodulin-related protein 1/2
Type: Family
Description: Nodulin-related protein 1 (NRP1) may play a role in the negative-feedback regulation of the abscisic acid (ABA) synthesis pathway. Overexpression of NRP1 enhanced susceptibility to heat stress and was accompanied by decreased accumulation of ABA after heat treatment. The function of NRP2 is not clear.
Protein Domain
Name: Uncharacterized protein C2orf78-like
Type: Family
Description: This entry represents a group of vertebrate proteins, including C2orf78 from humans. Their function is not clear.
Protein Domain
Name: Transmembrane protein DDB_G0292058-like
Type: Family
Description: This entry represents a group of transmembrane proteins, including DDB_G0292058 from Dictyostelium discoideum. Their function is not known.
Protein Domain
Name: Transmembrane protein 81
Type: Family
Description: TMEM81 is a type I transmembrane protein with a large, N-terminal lumenal domain with an immunoglobulin-like fold, and a short cytoplasmic tail. The function of TMEM81 is not clear. Homologues are found only in chordates.
Protein Domain
Name: Uncharacterized protein At4g26450-like
Type: Family
Description: This family of uncharacterized proteins from plants includes At4g26450 from Arabidopsis thaliana.
Protein Domain
Name: Uncharacterized protein Os04g0629400-like
Type: Family
Description: This uncharacterized family of proteins from plants includes Os04g0629400 from Oryza sativa (rice).
Protein Domain
Name: Arabinogalactan protein 3/12/13/14/21
Type: Family
Description: Arabinogalactan proteins (AGPs) are a family of extracellular glycoproteins implicated in plant growth and development. This entry includes AGP1/3 from rice and AGP12, AGP13, AGP14 and AGP21 from Arabidopsis.
Protein Domain
Name: Exosporium protein C
Type: Family
Description: Cysteine-rich protein CsxC is a component of the exosporium of Clostridium. The exosporium is the sac-like outermost layer of spores of these species, and is likely to contribute to adhesion, dissemination, and virulence.
Protein Domain
Name: Uncharacterized protein T19C3.2-like
Type: Family
Description: This family of uncharacterized proteins is found in Nematoda.
The Legume Information System (LIS) is a research project of the USDA-ARS:Corn Insects and Crop Genetics Research in Ames, IA.
InterMine © 2002 - 2022 Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, United Kingdom