LENG9

{{Short description|Mammalian protein found in Homo sapiens}}

Leukocyte Receptor Cluster Member 9 (LENG 9) is an uncharacterized protein encoded by the LENG9 gene. In humans, LENG9 is predicted to play a role in fertility and reproductive disorders associated with female endometrium structures.

Gene

= Location =

File:Geneneighborhood.png

LENG9 is located at 19q13.42 on chromosome 19, spanning the sense strand (-) from 54,461,796 bp to 54,463,711 bp.{{Cite web|url=https://www.ncbi.nlm.nih.gov/gene/94059|title=LENG9 leukocyte receptor cluster member 9 [Homo sapiens (human)] - Gene - NCBI|website=www.ncbi.nlm.nih.gov|access-date=2017-02-19}} The LENG9 gene is 1,930 base pairs in length and contains one exon.

= Gene Neighborhood =

Genes LENG8-AS1 and CDC42EP5 neighbor LENG9 on chromosome 19.{{Cite web|url=https://www.genecards.org/cgi-bin/carddisp.pl?gene=LENG9|title=LENG9 Gene - GeneCards {{!}} LENG9 Protein {{!}} LENG9 Antibody|last=Database|first=GeneCards Human Gene|website=www.genecards.org|access-date=2017-04-30}} CDC42EP5 extends over the same region of LENG9 while LENG8-AS1 is located to the left of both genes. TTYH1 and LENG8 are also found in the same gene neighborhood but are located on the opposite strand.

= Expression =

File:GEO Profile LENG9.png

LENG9 is highly expressed (75-100%) in skeletal muscles and part of fetal liver tissues while ubiquitous expression of LENG9 is moderate (50-75%) in all other tissues observed.{{Cite web|url=https://www.ncbi.nlm.nih.gov/UniGene/clust.cgi?UGID=2059987&TAXID=9606&SEARCH=LENG9|title=Leukocyte receptor cluster (LRC) member 9 (LENG9)|website=www.ncbi.nlm.nih.gov|access-date=2017-05-06}} Human expression of LENG9 is observed in the cervix, lung, and placenta of adults.{{Cite web|url=https://www.ncbi.nlm.nih.gov/UniGene/ESTProfileViewer.cgi?uglist=Hs.590976|title=EST Profile - Hs.590976|last=Group|first=Schuler|website=www.ncbi.nlm.nih.gov|access-date=2017-05-06}} The gene is also expressed in disease states including lung tumors and primitive neuroectodermal tumors, usually found in children or young adults. However, LENG9 is not expressed during the juvenile stage of development.

= Promoter =

The promoter region is predicted to be 1101 base pairs in length.{{Cite web|url=https://www.genomatix.de/cgi-bin/eldorado/eldorado.pl?s=18613d6fac398cb4a4c3bc3483dd9fa5;SHOW_ANNOTATION=LENG9;ELDORADO_VERSION=E32R1605|title=Genomatix: Annotation & Analysis|website=www.genomatix.de|language=en-US|access-date=2017-04-30}} The transcriptional start site found in this region is located 119 bp upstream of the start codon as well as an in-frame stop codon at 1087 bp to 1089 bp.

mRNA Transcript

= Splice Variants =

In humans, LENG9 has two mRNA unspliced transcript variants. Variant (1) is the longest and most conserved transcript of the gene and is made up of one exon that is composed of 1,919 bp.

Protein

= General Properties =

LENG9 is 501 amino acids in length, with a predicted molecular weight of 53.2 kDa.{{Cite web|url=http://seqtool.sdsc.edu/CGI/BW.cgi#!|title=SAPS|last=|first=|date=|website=Biology Workbench|access-date=May 6, 2017}}{{dead link|date=December 2017 |bot=InternetArchiveBot |fix-attempted=yes }} The isoelectric point of LENG9 protein is predicted to be 7.7.{{Cite web|url=http://seqtool.sdsc.edu/CGI/BW.cgi#!|title=PI|last=|first=|date=|website=|access-date=May 6, 2017}}{{dead link|date=December 2017 |bot=InternetArchiveBot |fix-attempted=yes }} No known transmembrane sequences were found for LENG9.{{Cite web|url=http://www.cbs.dtu.dk/services/TargetP/|title=TargetP 1.1 Server|website=www.cbs.dtu.dk|language=en|access-date=2017-04-30}}

= Composition =

File:LENG9 Tertiary Structure.pngAnalysis of the LENG9 protein was performed against the "human" database, which indicated a higher frequency of alanine and proline amino acids than of that of a normal human protein. Inversely, an abnormally lower frequency of aspartate, isoleucine, methionine, asparagine, serine, and tyrosine amino acids were detected.

= Structure =

The secondary structure of LENG9 is predicted to be composed of alpha-helices and beta-sheets throughout the sequence.{{Cite web|url=http://www.sbg.bio.ic.ac.uk/phyre2/phyre2_output/ff7c5777bdd34589/summary.html|title=Phyre 2 Results for Undefined|website=www.sbg.bio.ic.ac.uk|access-date=2017-04-30}}{{Dead link|date=February 2020 |bot=InternetArchiveBot |fix-attempted=yes }}{{Cite web|url=http://seqtool.sdsc.edu/CGI/BW.cgi#!|title=CHOFAS|last=|first=|date=|website=Biology Workbench|access-date=May 6, 2017}}{{dead link|date=December 2017 |bot=InternetArchiveBot |fix-attempted=yes }} The tertiary structure of LENG9 is displayed in the image to the right.

= Sub-cellular localization =

A strong signal peptide detected in the mitochondrion region (0.788) suggests that the LENG9 protein localizes in the mitochondrial matrix. However, further analysis among other mammal orthologs predicted sub-cellular localization in the cytoplasm and nuclear localization for Danio rerio.{{Cite web|url=https://psort.hgc.jp/form2.html|title=PSORT II Prediction|website=psort.hgc.jp|access-date=2017-05-06}}

= Post Translational Modifications =

== Phosphorylation ==

LENG9 contains numerous phosphorylation sites distributed in the protein sequence, as show in the diagram to the right. These sites include casein kinase 2 (CK2), cAMP-dependent protein kinase (PKA), protein kinase C (PKC), ataxia telangiectasia mutated kinase (ATM), cyclin-dependent kinase 5 (CDK5), and casein kinase 1 (CK1).{{Cite web|url=http://www.cbs.dtu.dk/services/NetPhos/|title=NetPhos 3.1 Server|website=www.cbs.dtu.dk|language=en|access-date=2017-05-06}}

== N-terminal Acetylation ==

There is one predicted N-terminal acetylation site found in the protein LENG9 at the serine amino acid position 3.{{Cite web|url=http://www.cbs.dtu.dk/services/NetAcet/|title=NetAcet 1.0 Server|website=www.cbs.dtu.dk|language=en|access-date=2017-05-06}}

== Sumoylation ==

There are two predicted sumoylation sites within LENG9 at position 82 and 452 on lysine residues.{{Cite web | url=http://www.abgent.com/tools/SUMOplot | title=SUMOplot™ Analysis Program | Abgent}}

== C-Terminal GPI Anchor Modification ==

A C-terminal GPI modification site was found on the glycine residue at position 486.{{Cite web|url=http://mendel.imp.ac.at/gpi/cgi-bin/gpi_pred.cgi|title=GPI Prediction Server|website=mendel.imp.ac.at|access-date=2017-05-06}}

= Domains and Motifs =

There are 3 conserved domains in LENG9. The metal-ion binding zinc finger domain ZnF_C3H1{{Cite web|url=http://smart.embl.de/smart/do_annotation.pl?DOMAIN=SM00356|title=SMART: ZnF_C3H1 domain annotation|website=smart.embl.de|language=en|access-date=2017-05-06}} is found from amino acid 46 to 61. LENG9 also has a domain of unknown function belonging to the domain family DUF504, that spans from amino acid 109 to 160. The last conserved domain spans from amino acid 320 to 500 and is known as AKAP7 2'5' RNA ligase-like domain (AKAP7_NLS).{{Cite web|url=https://www.ncbi.nlm.nih.gov/protein/NP_945339.2|title=leukocyte receptor cluster member 9 isoform 1 [Homo sapiens] - Protein - NCBI|website=www.ncbi.nlm.nih.gov|access-date=2017-05-06}}

Conserved WD40 repeats are found in LENG9, spanning from amino acid 98 to 159. This motif is characterized by beta-propeller structures in the tertiary structure.

File:LENG9 Motif-Domain.png

= Protein Interactions =

LENG9 is found to interact with the C9ORF41 protein that encodes methyltransferase, involved in converting carnosine to anserine present in skeletal muscle. Another interactor is CDC5L,{{Cite journal|last=Huttlin|first=Edward L.|last2=Ting|first2=Lily|last3=Bruckner|first3=Raphael J.|last4=Gebreab|first4=Fana|last5=Gygi|first5=Melanie P.|last6=Szpyt|first6=John|last7=Tam|first7=Stanley|last8=Zarraga|first8=Gabriela|last9=Colby|first9=Greg|date=2015-07-16|title=The BioPlex Network: A Systematic Exploration of the Human Interactome|journal=Cell|volume=162|issue=2|pages=425–440|doi=10.1016/j.cell.2015.06.043|issn=1097-4172|pmc=4617211|pmid=26186194}} which is a positive regulator for the cell cycle for phase G2 to M transition.{{Cite journal|last=Llères|first=David|last2=Denegri|first2=Marco|last3=Biggiogera|first3=Marco|last4=Ajuh|first4=Paul|last5=Lamond|first5=Angus I.|date=2010-06-01|title=Direct interaction between hnRNP-M and CDC5L/PLRG1 proteins affects alternative splice site choice|journal=EMBO Reports|volume=11|issue=6|pages=445–451|doi=10.1038/embor.2010.64|issn=1469-3178|pmc=2892320|pmid=20467437}} FOXS1 is another interacting protein that functions as a transcriptional repressor to suppress promoters such as FASLG, FOXO3, and FOXO4.{{Cite journal|last=Li|first=Xu|last2=Wang|first2=Wenqi|last3=Wang|first3=Jiadong|last4=Malovannaya|first4=Anna|last5=Xi|first5=Yuanxin|last6=Li|first6=Wei|last7=Guerra|first7=Rudy|last8=Hawke|first8=David H.|last9=Qin|first9=Jun|date=2015-01-21|title=Proteomic analyses reveal distinct chromatin-associated and soluble transcription factor complexes|journal=Molecular Systems Biology|volume=11|issue=1|pages=775|doi=10.15252/msb.20145504|issn=1744-4292|pmc=4332150|pmid=25609649}}

Clinical Significance

= Disease Association =

The LENG9 gene was found to be up-regulated and expressed in endothelial endometrium (hEECs) tissues during various stages of the menstrual/reproductive cycle when transfected with the RNA gene, miR-30d.{{Cite journal|last=Moreno-Moya|first=Juan Manuel|last2=Vilella|first2=Felipe|last3=Martínez|first3=Sebastián|last4=Pellicer|first4=Antonio|last5=Simón|first5=Carlos|date=2014-06-01|title=The transcriptomic and proteomic effects of ectopic overexpression of miR-30d in human endometrial epithelial cells|journal=MHR: Basic Science of Reproductive Medicine|language=en|volume=20|issue=6|pages=550–566|doi=10.1093/molehr/gau010|pmid=24489115|issn=1360-9947|doi-access=free}}{{Cite web|url=https://www.genecards.org/cgi-bin/carddisp.pl?gene=MIR30D|title=MIR30D Gene - GeneCards {{!}} MIR30D RNA Gene|last=Database|first=GeneCards Human Gene|website=www.genecards.org|access-date=2017-05-07}} As ectopic over-expression of miR-30d in hEECs is observed to affect cancer-associated genes, LENG9 is predicted to play a role in reproductive and endocrine system disorders.

= Fertility =

Analysis of endometrial receptivity using miRNA of receptive and prereceptive endometrium from fertile women also indicated significant up-regulation of miR-30d.{{Cite journal|last=Altmäe|first=Signe|last2=Martinez-Conejero|first2=Jose A.|last3=Esteban|first3=Francisco J.|last4=Ruiz-Alonso|first4=Maria|last5=Stavreus-Evers|first5=Anneli|last6=Horcajadas|first6=Jose A.|last7=Salumets|first7=Andres|date=2012-08-17|title=MicroRNAs miR-30b, miR-30d, and miR-494 Regulate Human Endometrial Receptivity|journal=Reproductive Sciences|language=en|volume=20|issue=3|pages=308–317|doi=10.1177/1933719112453507|pmc=4077381|pmid=22902743}} Consequently, the induced expression of LENG9 from miR-30d transfection suggests a possible relationship between the LENG9 gene and female fertility functions.

Homology

= Evolution =

Comparison of the LENG9 protein was conducted against fibrinogen and cytochrome C to observe the rate of evolutionary change. The relatively fast rate of change in LENG9 compared to that of other proteins suggests that the gene is adaptive for vital cell structures and functions.

= Paralogs =

There are no known paralogs for LENG9.

= Orthologs =

LENG9 is highly conserved in mammals and bony fish such as the zebrafish.{{Cite web|url=http://seqtool.sdsc.edu/CGI/BW.cgi#!|title=CLUSTALW|first=|date=|website=Biology Workbench|access-date=May 6, 2017}}{{dead link|date=December 2017 |bot=InternetArchiveBot |fix-attempted=yes }} It is also conserved a in few reptiles and amphibians.{{Cite web|url=https://genome.ucsc.edu/cgi-bin/hgBlat|title=Human BLAT Search|website=genome.ucsc.edu|access-date=2017-05-07}} The gene is not present in invertebrates, birds, bacteria, or fungi{{Cite web|url=https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins|title=Protein BLAST: search protein databases using a protein query|website=blast.ncbi.nlm.nih.gov|language=en|access-date=2017-05-06}}

class="wikitable"

!Genus and Species

!Common Name

!Order

!Divergence

from Human

Lineage (MYA)

!NCBI Accession

Number

!Sequence Length

(bp)

!Percent Identity

to Human

!Percent Similarity to Human

Homo sapiens

|Human

| rowspan="4" |Primates

|0

|NP_945339.2

|501

|100%

|100%

Pan troglodytes

|Chimpanzee

|6.65

|XP_003316725.1

|479

|94.4%

|94.4%

Gorilla gorilla gorilla

|Gorilla

|9.06

|XP_018870227.1

|458

|89.4%

|89.8%

Macaca mulatta

|Rhesus Macaque

|29.44

|XP_014980381.1

|476

|80.7%

|83.5%

Ictidomys tridecemlineatus

|Thirteen-Lined Squirrel

| rowspan="2" |Rodentia

|90

|XP_005341645.1

|490

|62.1%

|68.2%

Peromyscus maniculatus bairdii

|Deer Mouse

|90

|XP_006995933.1

|488

|56.6%

|63.5%

Ceratotherium simum simum

|Southern White Rhinoceros

| rowspan="2" |Perissodactyla

|96

|XP_014649760.1

|528

|58.3%

|64.1%

Equus caballus

|Horse

|96

|XP_001488865.2

|506

|58.0%

|63.5%

Odobenus rosmarus divergens

|Walrus

| rowspan="2" |Carnivora

|94

|XP_004415925.1

|527

|59.3%

|62.5%

Panthera pardus

|Leopard

|96

|XP_019292295.1

|540

|55.1%

|61.0%

Bos taurus

|Cattle

| rowspan="2" |Artiodactyla

|96

|XP_005192875.1

|535

|53.6%

|58.7%

Ovis aries

|Sheep

|96

|XP_014955819.1

|669

|41.1%

|47.1%

Alligator mississippiensis

|American Alligator

|Crocodilla

|312

|XP_014459626.1

|452

|39.0%

|46.5%

Thamnophis sirtalis

|Common Garter Snake

|Squamata

|312

|XP_013921249.1

|505

|32.6%

|45.5%

Xenopus laevis

|African Clawed Frog

| rowspan="2" |Anura

|352

|XP_018080040.1

|431

|30.9%

|40.6%

Nanorana parkeri

|High Himalaya Frog

|352

|XP_018430267.1

|401

|29.4%

|39.3%

Lates calcarifer

|Barramundi

|Perciformes

|435

|XP_018538114.1

|565

|31.3%

|38.2%

Nothobranchius furzeri

|Tortoise Killfish

| rowspan="3" |Cyprinodontiformes

|435

|XP_015805834.1

|537

|30.3%

|39.4%

Danio Rerio

|Zebrafish

|435

|XP_005157957.1

|542

|27.9%

|37.5%

Poecilia formosa

|Amazon Molly

|435

|XP_007550061.1

|569

|26.7%

|37.5%

References