C1orf21

{{Short description|Protein-coding gene in the species Homo sapiens}}

{{Infobox_gene}}

Uncharacterized protein C1orf21, also known as Proliferation-Inducing Protein 13, is a protein that in humans is encoded by the C1orf21 gene.{{cite journal | vauthors = Sood R, Bonner TI, Makalowska I, Stephan DA, Robbins CM, Connors TD, Morgenbesser SD, Su K, Faruque MU, Pinkett H, Graham C, Baxevanis AD, Klinger KW, Landes GM, Trent JM, Carpten JD | title = Cloning and characterization of 13 novel transcripts and the human RGS8 gene from the 1q25 region encompassing the hereditary prostate cancer (HPC1) locus | journal = Genomics | volume = 73 | issue = 2 | pages = 211–222 |date=Apr 2001 | pmid = 11318611 | doi = 10.1006/geno.2001.6500 | url = https://zenodo.org/record/1229806 }}{{cite web | title = Entrez Gene: C1orf21 chromosome 1 open reading frame 21| url = https://www.ncbi.nlm.nih.gov/sites/entrez?Db=gene&Cmd=ShowDetailView&TermToSearch=81563}} C1orf21 is an intracellular protein that flows between the nucleus and the cytoplasm in the cell. It has been linked with cell growth and reproduction and there has been strong links with various types of cancers.{{Cite web|url=https://www.proteinatlas.org/ENSG00000116667-C1orf21/pathology|title=Expression of C1orf21 in cancer - Summary - The Human Protein Atlas|website=www.proteinatlas.org|access-date=2019-08-08}} There are no paralogs for this gene, however, many conserved orthologs have been found in all invertebrates.{{Cite web|url=https://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastp&PAGE_TYPE=BlastSearch&LINK_LOC=blasthome|title=Protein BLAST: search protein databases using a protein query|website=blast.ncbi.nlm.nih.gov|access-date=2019-02-28}} C1orf21 has low to moderate level of expression in most tissues in humans, however, it has the most expression in the skin, lung and prostate.

Gene

= Locus =

C1orf198 is a protein-encoding gene found on the reverse strand of chromosome 1 at the locus 1q25.3.{{Cite web|url=https://www.genecards.org/cgi-bin/carddisp.pl?gene=C1orf21|website=www.genecards.org|access-date=2019-08-08|title=C1orf21 Gene - GeneCards | CA021 Protein | CA021 Antibody}}

= Gene neighborhood =

C1orf21 is located on the long arm of chromosome 1. It is found at position 5q23.1.

alt=Cytogenic band: 5q23.1

Cytogenic band: 1q25.3

= Size =

Chromosome one is one of the longest chromosomes, in which C1orf21 spans from 184,385,826 to 184,390,390 bases, resulting with mRNA transcript that is 10,278 nucleotides long with 4 exons. The protein is 121 amino acids long, containing a domain of unknown function known as DUF4612.

= Expression =

NCBI gene and RNA-Seq revealed that C1orf21 is expressed in all tissues at a low to moderate level, however, it is mostly expressed in the skin, brain and prostate.

Gene level regulation

= Promoter =

There was over 7 promoters that were predicted, but the true promoter was 1111 base pairs long known as .{{Cite web|url=https://www.genomatix.de/|title=Genomatix - NGS Data Analysis & Personalized Medicine|website=www.genomatix.de|access-date=2019-08-08|archive-date=2001-02-24|archive-url=https://web.archive.org/web/20010224072831/http://www.genomatix.de/|url-status=dead}}

= Transcription factor binding sites =

Many transcription factor (TF) binding sites have been predicted through Genomatix. Some important binding cites include MYRE, MARs, and Bright.

MYRE is a myelin regulatory factor. Myelin is produced in the central nervous system and plays a large role in axons. MARs is a special AT-rich sequence-binding protein 1, predominantly expressed in thymocytes, binds to matrix attachment regions. Bright helps with B cell regulator of IgH transcription.

Protein

= Subcellular location =

It was predicted that the location of C1orf21 is in the nucleus with 62.2% certainty. The mitochondria was predicted at 17.4%: mitochondrial, while the cytoskeleton, and vascular system at 4.3%.{{Cite web|url=https://psort.hgc.jp/form2.html|title=PSORT II Prediction|website=psort.hgc.jp|access-date=2019-08-01}}

= Structure =

C1orf21 protein is 121 amino acids long with a molecular weight of 18,7 kDa with an isoelectric point of 5.08. It is believed that the protein interacts with the nuclear membrane and contains an unknown domain known as DUF4612. For the secondary and tertiary structure it is predicted that there are many alpha helices in the structure, with the rest of the protein having a disordered structure.{{Cite web|url=http://dis.embl.de/cgiDict.py|title=DisEMBL 1.5 - Predictors of intrinsic protein disorder|website=dis.embl.de|access-date=2019-08-01}}

File:C1orf21 PHYRE.png


class="wikitable"

|File:I-TASSER C1orf21.jpg

Protein level regulation

  • O-glycosylation sites: Serine 5, Threonine 11, Serine 66, Serine 68, and Serine 69.{{Cite web|url=http://www.cbs.dtu.dk/cgi-bin/webface2.fcgi?jobid=5D473C000000137BA05A8D5A&wait=20|title=NetOGlyc 4.0 Server - prediction results|website=www.cbs.dtu.dk|access-date=2019-08-04}}
  • Palmitioyaltion site: Cysteine 3
  • Phosphorylation: Serine 34, Serine 44, Serine 66, Serine 69, Serine 75, Serine 95, Serine 115, Serine 121 {{Cite web|url=http://www.cbs.dtu.dk/cgi-bin/webface2.fcgi?jobid=5D473FB8000004DDF9E77DB6&wait=20|title=NetPhos 3.1 Server - prediction results|website=www.cbs.dtu.dk|access-date=2019-08-04}}
  • Sumoylation site: Lysine 46 and Lysine 106 {{Cite web|url=http://sumosp.biocuckoo.org/showResult.php|title=GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interaction Motifs|website=sumosp.biocuckoo.org|access-date=2019-08-04|archive-date=2018-05-06|archive-url=https://web.archive.org/web/20180506035609/http://sumosp.biocuckoo.org/showResult.php|url-status=dead}}
  • Tyrosine sulfation site: Tyrosine 113

Interacting proteins

{| class="wikitable"

|

= Protein =

|

= Function =

|-

|Calcineurin-binding protein cabin-1 (Cabin1)

|Required for replication-independent chromatin assembly

|-

|Centrosomal protein of 162 kDa (CEP162)

|Required to promote assembly of the transition zone in primary cilia.

|-

|CD97 antigen

|Receptor potentially involved in both adhesion and signaling processes early after leukocyte activation.

|-

|Chromosome 11 open reading frame 57 (C11orf57)

|Unknown

|-

|Chromosome 5 open reading frame 51 (C5orf51)

|Unknown

|-

|Homeobox protein Nkx-2.8; (NKX2-8)

|NKL subclass homeoboxes and pseudogenes

|-

|NACHT, LRR and PYD domains-containing protein 13 (NLPR13)

|Involved in inflammation

|-

|Semaphorin-3C (SEMA3C)

|Binds to plexin family members and plays an important role in the regulation of developmental processes

|-

|Zinc finger protein 19 (ZNF19)

|transcriptional regulation

|}

Homology

= Paralogs =

File:C1orf21 Tree.png

There are no isoforms or paralogs of C1orf21 that are known.

= Orthologs =

C1orf21 is found in most classes of vertebrates and some invertebrates. The most distant ortholog of C1orf21 is Acropora digitifera, which diverged an estimated 824 million years ago.{{Cite web|url=http://timetree.org/|title=TimeTree :: The Timescale of Life|website=timetree.org|access-date=2019-07-01}} There is no traces of the C1orf21 gene in organisms that are traced beyond invertebrates, such as fungi, plants, protists, or single celled organisms.{{Cite web|url=https://blast.ncbi.nlm.nih.gov/Blast.cgi|title=BLAST: Basic Local Alignment Search Tool|website=blast.ncbi.nlm.nih.gov|access-date=2019-08-01}}

= Homologous domains =

The domain of unknown function 4612 (DUF4612) was highly conserved in most orthologs.

class="wikitable"

!Species

!Common name

!Taxonomic group

!DOD

(MYA)

!Accession number

!Sequence length (aa)

!Identity

!Similarity

Homo sapiens

|Human

|Primates

|0

|NP_110433

|121

|100

|100

Pan troglodytes

|Chimpanzee

|Primates

|7

|NP_001229539

|121

|100

|100

Gorilla gorilla gorilla

|Gorilla

|Primates

|9

|XP_018883443

|121

|100

|100

Macaca mulatta

|Rhesus macaque

|Primates

|30

|NP_001247792

|121

|100

|100

Cercocebus atys

|Sooty mangabey

|Primates

|30

|XP_011903171

|121

|100

|100

Ursus maritimus

|Polar bear

|Carnivora

|96

|XP_008695366

|121

|97

|99

Pogona vitticeps

|Central bearded dragon

|Amphioxiformes

|312

|XP_020650764

|121

|94

|97

Gallus gallus

|Red junglefowl

|Galliformes

|312

|XP_422292

|121

|93

|98

Haliaeetus leucocephalus

|Bald eagle

|Accipitriformes

|312

|XP_010578992

|121

|93

|98

Fulmarus glacialis

|Northern fulmar

|Procellariiformes

|312

|KFV96345

|90

|93

|98

Ophiophagus hannah

|King cobra

|Squamata

|312

|ETE66728

|121

|91

|96

Xenopus tropicalis

|Western clawed frog

|Anura

|352

|NP_001072652

|121

|77

|85

Nothobranchius furzeri

|Turquoise killifish

|Cyprinodontiformes

|435

|XP_015827000

|116

|61

|73

Echeneis naucrates

|Live sharksucker

|Perciformes

|435

|XP_029355762

|116

|61

|73

Haplochromis burtoni

|Burton's mouthbrooder

|Cichliformes

|435

|XP_005932528

|116

|61

|73

Anabas testudineus

|Blue perch

|Anabantiformes

|435

|XP_026201702

|116

|47

|60

Callorhinchus milii

|Australian ghostshark

|Chimaeriformes

|473

|XP_007893787

|135

|69

|79

Rhincodon typus

|Whale Shark

|Orectolobiformes

|473

|XP_020373635

|91

|68

|82

Branchiostoma belcheri

|Belcher's lancelet

|Amphioxiformes

|684

|XP_019640980

|114

|33

|56

Acropora digitifera

|Stony coral pulp

|Scleractinia

|824

|XP_015747227

|140

|55

|65

Function

C1orf21 is most likely involved in the growth of cells, especially in the nucleus where replication of DNA occurs.

Clinical significance

Even though there is not a lot known about C1orf21, there have been some links with diseases. In many studies it has been found that there are links with cancer. Since C1orf21 is associated with cell proliferation, in another study by Sooda et al. there was an interest in the transcript map of the HPC1 locus, to help them identify the susceptibility genes involved in prostate cancer and jaw tumor.  It was seen that overall there are several studies where C1orf21 has been studied on role it plays in cancer for different body areas among many other genes. It was also found that there is a large correlation with affects on keratinocytes since C1orf21 plays a role in ZNF750 silencing.

References

{{reflist}}

Further reading

{{refbegin | 2}}

  • {{cite journal | vauthors=Gregory SG, Barlow KF, McLay KE |title=The DNA sequence and biological annotation of human chromosome 1. |journal=Nature |volume=441 |issue= 7091 |pages= 315–321 |year= 2006 |pmid= 16710414 |doi= 10.1038/nature04727 |display-authors=etal|bibcode=2006Natur.441..315G |doi-access=free }}
  • {{cite journal | vauthors=Oh JH, Yang JO, Hahn Y |title=Transcriptome analysis of human gastric cancer. |journal=Mamm. Genome |volume=16 |issue= 12 |pages= 942–954 |year= 2006 |pmid= 16341674 |doi= 10.1007/s00335-005-0075-2 |s2cid=69278 |display-authors=etal}}
  • {{cite journal | vauthors=Gerhard DS, Wagner L, Feingold EA |title=The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). |journal=Genome Res. |volume=14 |issue= 10B |pages= 2121–2127 |year= 2004 |pmid= 15489334 |doi= 10.1101/gr.2596504 | pmc=528928 |display-authors=etal}}
  • {{cite journal | vauthors=Ota T, Suzuki Y, Nishikawa T |title=Complete sequencing and characterization of 21,243 full-length human cDNAs. |journal=Nat. Genet. |volume=36 |issue= 1 |pages= 40–45 |year= 2004 |pmid= 14702039 |doi= 10.1038/ng1285 |display-authors=etal|doi-access=free }}
  • {{cite journal | vauthors=Strausberg RL, Feingold EA, Grouse LH |title=Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. |journal=Proc. Natl. Acad. Sci. U.S.A. |volume=99 |issue= 26 |pages= 16899–16903 |year= 2003 |pmid= 12477932 |doi= 10.1073/pnas.242603899 | pmc=139241 |display-authors=etal |bibcode=2002PNAS...9916899M|doi-access=free }}
  • {{cite journal | vauthors=Yu W, Andersson B, Worley KC |title=Large-scale concatenation cDNA sequencing. |journal=Genome Res. |volume=7 |issue= 4 |pages= 353–358 |year= 1997 |pmid= 9110174 |doi= 10.1101/gr.7.4.353| pmc=139146 |display-authors=etal}}
  • {{cite journal | vauthors=Andersson B, Wentland MA, Ricafrente JY |title=A "double adaptor" method for improved shotgun library construction. |journal=Anal. Biochem. |volume=236 |issue= 1 |pages= 107–113 |year= 1996 |pmid= 8619474 |doi= 10.1006/abio.1996.0138 |display-authors=etal}}

{{refend}}

Category:Human proteins

Category:Genes on human chromosome 5