EPCIP (gene)
{{Short description|Protein-coding gene in the species Homo sapiens}}
{{cs1 config|name-list-style=vanc}}
{{Infobox_gene}}
File:Predicted Structure of C21orf62 Gene.png
Exosomal polycystin-1-interacting protein is a protein that, in humans, is encoded by the EPCIP gene.{{Cite web|url=https://www.ncbi.nlm.nih.gov/gene/56245|title=EPCIP exosomal polycystin 1 interacting protein [ Homo sapiens (human) ]|website=www.ncbi.nlm.nih.gov|access-date=2024-05-15}} EPCIP is found on human chromosome 21, and it is thought to be expressed in tissues of the brain and reproductive organs.{{Cite web|url=https://www.ncbi.nlm.nih.gov/geoprofiles|title=Home - GEO Profiles - NCBI|website=www.ncbi.nlm.nih.gov|access-date=2017-05-07}} Additionally, EPCIP is highly expressed in ovarian surface epithelial cells during normal regulation, but is not expressed in cancerous ovarian surface epithelial cells.
Gene
Common aliases of EPCIP are C21orf62, C21orf120, PRED81, and B37. EPCIP is located on chromosome 21 in humans, and is specifically at the q22.11 position.{{Cite web|url=https://www.genecards.org/cgi-bin/carddisp.pl?gene=C21orf62#localization|title=C21orf62 Gene - GeneCards {{!}} CU062 Protein {{!}} CU062 Antibody|last=Database|first=GeneCards Human Gene|website=www.genecards.org|access-date=2017-05-07}} The EPCIP gene is 4132 base pairs in length and contains five exons.
mRNA
The mRNA sequence of EPCIP in humans has one known isoform. This isoform is called uncharacterized protein C21orf62 isoform X1. This isoform is 458 base pairs, or 104 amino acids, in length, and it is significantly shorter than the most observed sequence of EPCIP in humans. In addition to having an isoform, EPCIP also has splice variants. All splice variants encode the same gene, but the differences in splice variant sequences occur in the 5' untranslated region of the mRNA sequence.
Protein
= General protein characteristics =
The EPCIP protein in humans has a sequence that is 219 amino acids in length. The primary sequence of EPCIP in humans has a molecular weight of 24.9 kDa and an isoelectric point of 8.{{Cite journal|last=Kramer|first=Jack|date=1990|title=AASTATS|url=http://workbench.sdsc.edu/|journal=Biology Workbench}}{{Cite journal|last=Toldo|first=Luca|title=PI Isoelectric Point Determination Program|url=http://workbench.sdsc.edu/|journal=Biology Workbench}} When it's cleavable signal peptide, which spans amino acids 1-19, is removed, it has a molecular weight of 22.8 kDa and an isoelectric point of 7.8.{{Cite web|url=http://www.genscript.com/tools/psort|title=PSORT II server - GenScript|website=www.genscript.com|access-date=2017-05-07}}{{Cite web|url=http://terminus.unige.ch|title=TERMINUS - Welcome to terminus|last=Charpilloz|first=Jean-Luc Falcone & Christophe|website=terminus.unige.ch|language=en|access-date=2017-05-07}}
= Protein composition =
EPCIP in humans has higher cysteine and lower valine concentrations than expected compared to other human proteins. This trend, as showed in Table 1, is the same for other mammals. It does not, however, occur in taxa other than mammalia.
class="wikitable"
|+Table 1.{{Cite journal|last=Brendel|first=Volker|date=1992|title=Statistical Analysis of PS|url=http://seqtool.sdsc.edu/|journal=Biology Workbench|access-date=2017-02-06|archive-url=https://web.archive.org/web/20030811031200/http://seqtool.sdsc.edu/|archive-date=2003-08-11|url-status=dead}} Unusual amino acid concentrations of EPCIP in humans and orthologs. !Genus and Species !Common Name !Organism Clade !% Cysteine !Amino Acid Concentration of Cysteine Compared to Expected !% Valine !Amino Acid Concentration of Valine Compared to Expected !Other Amino Acids with High or Low Concentration Compared to Expected |
Homo sapiens
|Human |Mammalia |4.6% |High |3.2% |Low | - |
Mus musculus
|House Mouse |Mammalia |4.3% |High |3.5% |Low |Glutamic Acid (1.7%, low) |
Canis lupus familiaris
|Dog |Mammalia |4.1% |High |2.7% |Low |Leucine (14.2%, high) |
Physeter catodon
|Sperm Whale |Mammalia |4.6% |High |4.1% |Expected |Serine (11.9%, high) |
Gallus gallus
|Chicken |Aves |3.1% |Expected |6.7% |Expected |Alanine (2.2%, low) Glycine (3.1%, low) Proline (1.8%, low) Phenylalanine (7.1%, high) Serine (12.4%, high) Threonine (9.8%) |
Chelonia mydas
|Green Sea Turtle |3.6% |Expected |5.8% |Expected |Alanine (1.8%, low) Serine (11.2%, high) |
= Protein structure =
The protein structure of EPCIP in humans consists of a combination of alpha helices and beta sheets.{{Cite journal|last=Pearson|first=William R.|date=September 1998|title=CHOFAS Analysis|url=http://seqtool.sdsc.edu/|journal=Biology Workbench|access-date=2017-02-06|archive-url=https://web.archive.org/web/20030811031200/http://seqtool.sdsc.edu/|archive-date=2003-08-11|url-status=dead}}{{Cite journal|last=Pappas|first=Georgios J. Jr.|date=1974–1996|title=PELE: Protein Structure Prediction |url=http://seqtool.sdsc.edu/ |journal=Biology Workbench|access-date=2017-02-06|archive-url=https://web.archive.org/web/20030811031200/http://seqtool.sdsc.edu/|archive-date=2003-08-11|url-status=dead}} Figure 1 shows a predicted structure of the protein.
= Post-translational modifications =
EPCIP has a myristoylation site from amino acid 26–31.{{Cite web|url=http://myhits.isb-sib.ch/cgi-bin/motif_scan|title=Motif Scan|website=myhits.isb-sib.ch|language=en|access-date=2017-05-07}} It has a sumoylation site from amino acid 132–135.{{Cite web|url=http://sumosp.biocuckoo.org/online.php|title=GPS-SUMO 2.0 Online Service|last=The Cucko Workgroup|date=May 1, 2017|website=sumosp.biocuckoo.org/online.php|access-date=May 5, 2017|archive-date=February 17, 2019|archive-url=https://web.archive.org/web/20190217125200/http://sumosp.biocuckoo.org/online.php|url-status=dead}} Additionally, it has a nuclear export signal from amino acid 98-104.{{cite journal |vauthors = la Cour T, Kiemer L, Mølgaard A, Gupta R, Skriver K, Brunak S |title = Analysis and prediction of leucine-rich nuclear export signals |journal = Protein Eng. Des. Sel. |volume = 17 |issue = 6 |pages = 527–36 |year = 2004 |pmid = 15314210 |doi = 10.1093/protein/gzh062 |doi-access = free }}
Expression
= Tissue expression =
EPCIP is expressed in human tissues of the brain and reproductive organs.
= Expression level =
EPCIP in humans is moderately expressed in the brain, kidneys, pancreas, prostate, testes, and ovaries.{{Cite web|url=https://www.ncbi.nlm.nih.gov/unigene|title=Home - UniGene - NCBI|website=www.ncbi.nlm.nih.gov|access-date=2017-05-07}}{{Cite web|url=http://www.proteinatlas.org|title=The Human Protein Atlas|website=www.proteinatlas.org|access-date=2017-05-07}}
= Regulation of expression =
EPCIP is expressed during blastocyst, fetus, and adult states of human development. It is overexpressed during some tumor states, including pancreatic, gastrointestinal, germ cell, and glioma tumors.
Function
Interacting proteins
EPCIP is thought to potentially interact with nine other proteins.{{Cite web|url=http://string-db.org/|title=STRING: functional protein association networks|website=string-db.org|access-date=2017-05-07}} These interactions are shown in Table 2, and they were found through text mining.
class="wikitable"
|+Table 2. Proteins with Evidence of Interaction with EPCIP !Protein Full Name !Protein Name Symbol |
BCL2 Interacting Protein Like
|BNIPL |May function as a bridge molecule that promotes cell death. |
Thymosin Beta 4, X-linked Pseudogene 4
|TMSB4XP4 |Potentially influences actin polymerization. |
Synovial Sarcoma X Family Member 4
|SSX4 |May function as a repressor of transcription, and can be useful targets in cancer vaccine-based immunotherapy. |
Crystallin Beta A2
|CRYBA2 |A major protein in vertebrate eyes that maintains lens transparency and reflective index. |
Oral Cancer Overexpressed 1
|ORAOV1 |A gene that is frequently overexpressed in esophageal squamous cell cancer. |
Oligodendrocyte Transcription Factor 1
|OLIG1 |May be expressed during the time from process extension through membrane maintenance in oligodendrocytes. |
PAX3 and PAX7 Binding Protein 1
|GCFC1 (PAXBP1) |The encoded protein potentially binds to GC-rich DNA sequences. It is suggested that this gene is involved in the regulation of transcription. |
Relaxin/Insulin Like Family Peptide Receptor 1 and 2
|RXFP1 and RXFP2 |Encoded protein is a receptor for the protein hormone relaxin that influences sperm motility and pregnancy. |
Clinical significance
EPCIP over or under expression is linked to some types of cancerous cells and tumors.
Homology
= Paralogs =
There are no known paralogs of EPCIP in humans at this time.
= Orthologs =
There are currently 193 organisms that are known to be orthologs of EPCIP. The orthologs of EPCIP are deuterostome animals in the clade Chordata. Table 3 shows a range of EPCIP orthologs, their NCBI accession numbers, sequence lengths, and sequence identity to the EPCIP human protein. At this time, EPCIP is not known to have any protostome or invertebrate orthologs.
File:C21orf62 Gene Evolution in Humans.png= Evolution rate =
EPCIP has an evolution rate that is faster than cytochrome C and fibrinogen. Figure 2 shows the rate of evolution of the EPCIP gene over the past 473 million years.
External links
- {{UCSC gene info|C21orf62}}
References
{{reflist}}