C11ORF97

{{Short description|Protein which in humans is encoded by the C11ORF97 gene}}

{{cs1 config|name-list-style=vanc}}

{{Infobox gene}}

C11ORF97, or Chromosome 11 Open Reading Frame 97, is a protein which in humans is encoded by the C11ORF97 gene.{{Cite web |title=uncharacterized protein C11orf97 [Homo sapiens]|url=https://www.ncbi.nlm.nih.gov/protein/NP_001177391.1 | work = National Center for Bioinformatics (NCBI) }} It is hypothesized to localize to the cytoplasm, and plays a role in the ciliary basal body.{{Cite web | title = C11orf97 |url=https://www.nextprot.org/entry/NX_A0A1B0GVM6/ |access-date=2023-12-07 |work = neXtProt | publisher = SIB Swiss Institute of Bioinformatics }} Based on its protein interactions, it is thought to have a role in Lemierre's Syndrome and Hepatic Coma.{{Cite web |title=DISEASES - MORN2 |url=https://diseases.jensenlab.org/Entity?documents=10&type1=9606&id1=ENSP00000386563&type2=-26&id2=DOID:11337 |access-date=2023-12-07 |website=diseases.jensenlab.org}}

Gene

Human C11ORF97 gene is 19,663 basepairs long including all introns, spanning from position 94,512,461 to 94,532,123.{{Cite web |title=C11orf97 chromosome 11 open reading frame 97 [Homo sapiens (human)] - Gene |url=https://www.ncbi.nlm.nih.gov/gene/643037#genomic-context |access-date=2023-12-07 | work = National Center for Bioinformatics (NCBI) | publisher = U.S. National Library of Medicine }} It is found on the long arm of chromosome 11 at 11q21, with a plus strand orientation. Human C11ORF97 has only one known variant.

mRNA transcript

= Expression =

Human C11ORF97 expression is seen in many tissues, however, it is mainly seen in the lungs and the brain.{{cite journal | vauthors = Fagerberg L, Hallström BM, Oksvold P, Kampf C, Djureinovic D, Odeberg J, Habuka M, Tahmasebpoor S, Danielsson A, Edlund K, Asplund A, Sjöstedt E, Lundberg E, Szigyarto CA, Skogs M, Takanen JO, Berling H, Tegel H, Mulder J, Nilsson P, Schwenk JM, Lindskog C, Danielsson F, Mardinoglu A, Sivertsson A, von Feilitzen K, Forsberg M, Zwahlen M, Olsson I, Navani S, Huss M, Nielsen J, Ponten F, Uhlén M | display-authors = 6 | title = Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics | journal = Molecular & Cellular Proteomics | volume = 13 | issue = 2 | pages = 397–406 | date = February 2014 | pmid = 24309898 | pmc = 3916642 | doi = 10.1074/mcp.m113.035600 | doi-access = free }}{{cite journal | vauthors = Duff MO, Olson S, Wei X, Garrett SC, Osman A, Bolisetty M, Plocik A, Celniker SE, Graveley BR | display-authors = 6 | title = Genome-wide identification of zero nucleotide recursive splicing in Drosophila | journal = Nature | volume = 521 | issue = 7552 | pages = 376–379 | date = May 2015 | pmid = 25970244 | pmc = 4529404 | doi = 10.1038/nature14475 | bibcode = 2015Natur.521..376D }} According to Human Protein Atlas, the consensus dataset for RNA tissue specificity on C11ORF97 shows six different parts of the brain having some of the highest expression, stating that the enriched groups are brain, choroid plexus, fallopian tube, and lung.{{Cite web |title=Tissue expression of C11orf97 - Summary | work = The Human Protein Atlas |url=https://www.proteinatlas.org/ENSG00000257057-C11orf97/tissue#rna_expression |access-date=2023-12-07 }} According to The Human Protein Atlas, human C11ORF97 RNA tissue specificity was highest in different parts of the brain, as well as the lungs. Although it was high in testes, this is most likely not significant to this gene.{{Cite web |title=Tissue expression of C11orf97 - Summary | work = The Human Protein Atlas |url=https://www.proteinatlas.org/ENSG00000257057-C11orf97/tissue |access-date=2023-12-16 }}

Protein

= Features =

The one variant of C11ORF97 produced in humans is 126 amino acids in length, and a predicted weight of 13.9 kDa.{{Cite web |title= CK097_HUMAN | work = UniProt | id = A0A1B0GVM6 |url=https://www.uniprot.org/uniprotkb/A0A1B0GVM6/entry |access-date=2023-12-07 }} It has an isoelectric point of pH 9.87.{{Cite web |title=Protein Isoelectric Point |url=https://www.bioinformatics.org/sms2/protein_iep.html |access-date=2023-12-07 |website=www.bioinformatics.org}} It has no transmembrane regions, and no domains of unknown functions. The amino acid composition from SAPS tool shows that there is enriched G and R, and highly lessened S, T, D, and F amino acids.{{Cite web |title=SAPS < Sequence Statistics < EMBL-EBI |url=https://www.ebi.ac.uk/Tools/seqstats/saps/ |access-date=2023-12-07 |website=www.ebi.ac.uk}}File:Tertiart itasser Image.png

= Subcellular localization =

Human C11ORF97 expected localization is in the cytoplasm, with a score of 0.5188, according to DeepLoc.{{Cite web |title=DeepLoc 2.0 | publisher = DTU Health Tech - Bioinformatic Services |url=https://services.healthtech.dtu.dk/services/DeepLoc-2.0/ |access-date=2023-12-07 | language=en}} The following tools produced no results when searching for C11ORF97 localization: NetNES, SignalP, TatP, or Human Protein Atlas. A nuclear localization signal, as well as a nuclear export signal was found, suggesting that C11ORF97 most likely has a role in the nucleus, and is then exported to the cytoplasm.

File:AlphaFold figure of C11ORF97..png

= Structure =

File:Image tertiary ncbi.png

The tertiary structure was viewed through AlphaFold, i-TASSER and annotated with NCBI's iCN3D tool.{{Cite web |title=I-TASSER results |url=https://zhanggroup.org/I-TASSER/output/S757480/ |access-date=2023-12-07 |website=zhanggroup.org}}{{Cite web |title=iCn3D: Web-based 3D Structure Viewer |url=https://www.ncbi.nlm.nih.gov/Structure/icn3d/full.html |access-date=2023-12-07 | work = National Center for Bioinformatics (NCBI) | publisher = U.S. National Library of Medicine }} These results are shown in the figures to the right, all of them include similar, or near identical features—two alpha helices and no beta sheets.

The C-scores for the 5 i-TASSER models, in order, are -3.59, -4.88, -5.00, -4.47, and -5.00. Thus, the first structure in this figure has the most confidence compared to the other four predicted structures.

Function

= Protein-protein interactions =

There were only a couple protein-protein interactions found for human C11ORF97, with a medium or higher confidence threshold.{{Cite web |title=C11orf97 protein (human) | work = STRING interaction network |url=https://string-db.org/cgi/network?taskId=bkwIuHnJwRWD&sessionId=brMAXvGfmQjU |access-date=2023-12-16 }}

class="wikitable"

! Name

! Full Name

! Score

! Identification

! Description

MORN2

|MORN repeat-containing protein 2

|0.693

|Textmining

|Predicted to be involved in cell differentiation and spermatogenesis. Associated with Lemierre’s Syndrome and Hepatic Coma.

CRACR2A

|Calcium release activated channel regulator 2A

|0.583

|Textmining

|Enables GTPase activity and calcium ion binding. Involved in activation of store-operated calcium channel activity and store-operated calcium entry.

Post-translational modifications

File:Use this for schematic for c11orf97.png

File:Unrooted Phylogenetic Tree of human c11orf97.png

There are many post-translational modifications found in Human C11ORF97, many of which are conserved in orthologs. There are many phosphorylation sites, as well as a SUMOinteraction and sumoylation site, and others.{{Cite web |title=GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interacting Motifs |url=https://sumo.biocuckoo.cn/index.php |access-date=2023-12-16 |website=sumo.biocuckoo.cn}} These types post-translational modifications have various functions, and can play a role in cell growth and proliferation. A more detailed description is seen in the illustration to the right.

Homology and evolution

Human C11ORF97 protein is found in vertebrates and invertebrates. It is found in the following vertebrates: mammals, birds, reptiles, amphibians, and fish. Human C11ORF97 seems to have first appeared in invertebrates 686 million years ago. All of the comparisons are seen in Table 2 below. An unrooted phylogenetic tree is also provided, showing the predicted likelihood of how the orthologs for this gene are related.{{Cite web |title=Phylogeny fr |url=https://www.phylogeny.fr/}} Multiple sequence alignments for strict and distant orthologs are also provided as figures. The codes for the 3 letter abbreviations are the same between figures.

File:Strict orthologs msa for c11orf97.png

File:Annotated msa for c11orf97 distant.png

class="wikitable style=width 30em"

|+Table 2. Orthologs of Human C11ORF97. Compares orthologs from different groups. Sorted by date of divergence within groups, and then by sequence similarity.

|group

|genus, species

|common name

|taxonomic group

|date of divergence (MYA)

|accession number

|sequence length (aa)

|sequence identity (%)

|sequence similarity (%)

Mammals

|Homo sapiens

|human

|Primates

|0

|[https://www.ncbi.nlm.nih.gov/protein/NP_001177391.1 NP_001177391.1]

|126

|100

|100

|Mus musculus

|mouse

|Rodentia

|87

|[https://www.ncbi.nlm.nih.gov/protein/NP_083582.2 NP_083582.2]

|121

|71.4

|77

|Ailuropoda melanoleuca

|giant panda

|Carnivora

|94

|[https://www.ncbi.nlm.nih.gov/protein/XP_019648185.2 XP_019648185.2]

|127

|84.3

|88.2

|Miniopterus natalensis

|natal long-fingered bat

|Chiroptera

|94

|[https://www.ncbi.nlm.nih.gov/protein/XP_016062178.1 XP_016062178.1]

|97

|47.2

|50

Aves

|Apteryx rowi

|okarito brown kiwi

|Apterygiformes

|319

|[https://www.ncbi.nlm.nih.gov/protein/XP_025927683 XP_025927683]

|125

|42.8

|54.5

|Tinamus guttatus

|white-throated tinamou

|Tinamiformes

|319

|[https://www.ncbi.nlm.nih.gov/protein/XP_010210901.1 XP_010210901.1]

|120

|34.2

|49

|Apus apus

|common swift

|Apodiformes

|319

|[https://www.ncbi.nlm.nih.gov/protein/XP_051500140 XP_051500140]

|121

|33.8

|45.8

|Gallus gallus

|chicken

|Galliformes

|319

|[https://www.ncbi.nlm.nih.gov/protein/XP_040517768.1 XP_040517768.1]

|264

|20.1

|28.6

Reptiles

|Terrapene carolina triunguis

|three-toed box turtle

|Testudines

|319

|[https://www.ncbi.nlm.nih.gov/protein/XP_024064440.1 XP_024064440.1]

|118

|46.8

|54.7

|Podarcis raffonei

|aeolian wall lizard

|Squamata

|319

|[https://www.ncbi.nlm.nih.gov/protein/XP_053241613.1 XP_053241613.1]

|121

|42.7

|52.4

|Crotalus tigris

|tiger rattlesnake

|Squamata

|319

|[https://www.ncbi.nlm.nih.gov/protein/XP_039205026.1 XP_039205026.1]

|133

|40

|53.8

|Caretta caretta

|loggerhead-turtle

|Testudines

|319

|[https://www.ncbi.nlm.nih.gov/protein/XP_048699259.1 XP_048699259.1]

|130

|38.8

|46.2

Amphibians

|Rhinatrema bivittatum

|two-lined caecilian

|Gymnophiona

|352

|[https://www.ncbi.nlm.nih.gov/protein/XP_029458218.1 XP_029458218.1]

|149

|26.6

|36.2

|Geotrypetes seraphini

|gaboon caecilian

|Gymnophiona

|352

|[https://www.ncbi.nlm.nih.gov/protein/XP_033805672.1 XP_033805672.1]

|185

|21.8

|30.1

|Hyla sarda

|sardinian tree frog

|Anura

|352

|[https://www.ncbi.nlm.nih.gov/protein/XP_056417471.1 XP_056417471.1]

|184

|19

|27.5

|Xenopus laevis

|african clawed frog

|Anura

|352

|[https://www.ncbi.nlm.nih.gov/protein/OCT93259.1 OCT93259.1]

|167

|18

|28.6

Fish

|Leucoraja erinacea

|little skate

|Rajiformes

|462

|[https://www.ncbi.nlm.nih.gov/protein/XP_055493278.1 XP_055493278.1]

|144

|19.8

|30.5

|Pristis pectinata

|small-tooth sawfish

|Pristiformes

|462

|[https://www.ncbi.nlm.nih.gov/protein/XP_051882720.1 XP_051882720.1]

|77

|19.5

|35.9

Invertebrates

|Haliotis rufescens

|red abalone

|Vetigastropoda

|686

|[https://www.ncbi.nlm.nih.gov/protein/XP_046353937.1 XP_046353937.1]

|139

|16.7

|26.9

|Gigantopelta aegis

|deep sea snail

|Neomphalina

|686

|[https://www.ncbi.nlm.nih.gov/protein/XP_041350579.1 XP_041350579.1]

|154

|15.8

|23

= Paralogs =

There were no paralogs found for human C11ORF97 protein.

Clinical significance/pathology

Based on its protein interactions, it is predicted to have a role in Lemierre's syndrome and hepatic coma. C11ORF97 was also found to be closely related to ciliary movement, seen through multiple published papers.{{cite journal | vauthors = Stauber M, Boldt K, Wrede C, Weidemann M, Kellner M, Schuster-Gossler K, Kühnel MP, Hegermann J, Ueffing M, Gossler A | display-authors = 6 | title = 1700012B09Rik, a FOXJ1 effector gene active in ciliated tissues of the mouse but not essential for motile ciliogenesis | journal = Developmental Biology | volume = 429 | issue = 1 | pages = 186–199 | date = September 2017 | pmid = 28666954 | doi = 10.1016/j.ydbio.2017.06.027 }}{{Cite journal |title=Role of the Primary Cilium in the Crosstalk Between Obesity and Cancer |url=https://cdr.lib.unc.edu/concern/honors_theses/qb98mr445 |access-date=2023-12-16 |website=cdr.lib.unc.edu |date=2022 |doi=10.17615/gqs5-kz52 | vauthors = Merlino J }} C11ORF97 was also mentioned in a published paper that dealt with the respiratory illnesses, COVID-19.{{cite journal | vauthors = Vastrad B, Vastrad C, Tengli A | title = Bioinformatics analyses of significant genes, related pathways, and candidate diagnostic biomarkers and molecular targets in SARS-CoV-2/COVID-19 | journal = Gene Reports | volume = 21 | pages = 100956 | date = December 2020 | pmid = 33553808 | pmc = 7854084 | doi = 10.1016/j.genrep.2020.100956 }}

References

{{Reflist}}