MFSD6L

{{Short description|Protein-coding gene in the species Homo sapiens}}

{{#invoke:Infobox_gene|getTemplateData|QID=Q18052517}}

File:Tertiary structure of MFSD6L protein.png

Major facilitator superfamily domain containing 6 like (MFSD6L) is a protein encoded by the MFSD6L gene in humans.{{Cite web|title=MFSD6L Gene|url=https://www.genecards.org/cgi-bin/carddisp.pl?gene=MFSD6L|access-date=2021-10-01|website=www.genecards.org}} The MFSD6L protein is a transmembrane protein that is part of the major facilitator superfamily (MFS) that uses chemiosmotic gradients to facilitate the transport of small solutes across cell membranes.

Gene

File:Human chromosome 17, MFSD6L on 17p13.1 ideogram.jpg

In the human genome, the MFSD6L gene is located on chromosome 17 (17p13.1). The DNA sequence encoding the polypeptide encompasses 2,256 bases, starting from 8,797,110 bp to 8,799,365 bp.{{Cite web|title=MFSD6L major facilitator superfamily domain containing 6 like [Homo sapiens (human)] - Gene - NCBI|url=https://www.ncbi.nlm.nih.gov/gene/162387|access-date=2021-10-24|website=www.ncbi.nlm.nih.gov}} Additionally, the gene sequence resides on the minus strand.

The MFSD6L gene has one alias called FLJ35773.

The encoding DNA sequence results in only one exon in the translated mRNA sequence.

The tumor suppressor gene TP53 was also found within the gene neighborhood of MFSD6L at 17p13.1.{{Cite web|date=|title=TP53 Gene|url=https://www.genecards.org/cgi-bin/carddisp.pl?gene=TP53|access-date=2021-12-15|website=www.genecards.org}}

mRNA Transcript

The MFSD6L gene was not found to have other isoforms due to the presence of only one exon in the MFSD6L encoding sequence.

Protein

The MFSD6L protein has a precursor molecular weight of approximately 64 kDa, consisting of 586 amino acids. After post-translational modifications, such as glycosylation, the mature MFSD6L protein's molecular weight increases to 72 kDa. Of the amino acids consisting the MFSD6L protein, leucine was found to have increased levels compared to most other human proteins. This increase in leucine is also present in the MFSD6L protein of the house mouse and chimpanzee.{{Cite web|title=SAPS < Sequence Statistics < EMBL-EBI|url=https://www.ebi.ac.uk/Tools/seqstats/saps/|access-date=2021-12-14|website=www.ebi.ac.uk}} The protein also has an isoelectric point of 8.87 pI.{{Cite web|title=ExPASy - Compute pI/Mw tool|url=https://web.expasy.org/compute_pi/|access-date=2021-12-14|website=web.expasy.org}}

File:MFSD6L Predicted Tertiary structure.jpg

The peptide sequence contains 11 transmembrane regions that cross the plasma membrane. Additionally, there are also two MFS regions starting at the 28th and 368th encoding amino acids.{{Cite journal|date=2021-06-27|title=Homo sapiens major facilitator superfamily domain containing 6 like (MFSD6L), mRNA|url=http://www.ncbi.nlm.nih.gov/nuccore/NM_152599.4|language=en-US}}

For the secondary structure of the MFSD6L protein, there are 16 predicted alpha helices and 3 predicted beta sheets.{{Cite web|title=PredictProtein - Protein Sequence Analysis, Prediction of Structural and Functional Features|url=https://predictprotein.org/|access-date=2021-12-14|website=predictprotein.org}} The large amount of alpha helices within the structure of MFSD6L can be attributed to the protein being a transmembrane solute transporter since alpha helices are usually the part of the protein's structure that is positioned within the cell membrane.

Within the tertiary structure, there was a disulfide bond predicted between the two cysteines at the 29th and 311th amino acids.{{Cite web|title=DiANNA|url=http://clavius.bc.edu/~clotelab/DiANNA/|access-date=2021-12-14|website=clavius.bc.edu|archive-date=2022-07-24|archive-url=https://web.archive.org/web/20220724010112/http://clavius.bc.edu/~clotelab/DiANNA/|url-status=dead}}

Expression and regulation

= Gene level =

File:MFSD6L Tissue Array.png

There was only one promoter region, spanning 1,107 bp, found for MFSD6L using the Genomatix Gene2Promoter database.{{Cite web|title=Genomatix Software Suite|url=https://www.genomatix.de/solutions/genomatix-software-suite.html|website=Genomatix|access-date=2021-12-14|archive-date=2012-01-14|archive-url=https://web.archive.org/web/20120114124429/http://www.genomatix.de/solutions/genomatix-software-suite.html|url-status=dead}} For the part of the promoter region closest to the start of the 5' UTR of the MFSD6L gene, there were several transcription factor binding sites found. A transcription factor binding site of note was the site for the p53 tumor suppressor protein.

The MFSD6L gene was found to be highly expressed in the pancreas, salivary glands, and the thyroid.{{Cite web|title=GDS3113 / 224438|url=https://www.ncbi.nlm.nih.gov/geo/tools/profileGraph.cgi?ID=GDS3113:224438|access-date=2021-12-15|website=www.ncbi.nlm.nih.gov}}

Inspection of in-situ hybridization expression of MFSD6L gene shows that the gene is particularly expressed within glandular cells within their respective tissues.

Expression of the MFSD6L was found to be upregulated as a result of glucose starvation.{{Cite web|last=Weldai|first=Lydia|date=2018-04-16|title=Do Major Facilitator Superfamily Domain Containing Proteins Respond to Glucose Starvation?|url=https://www.diva-portal.org/smash/record.jsf?pid=diva2%3A1198183&dswid=-996|website=Digitala Vetenskapliga Arkivet}}

File:MFSD6L Expression in Colorectal Tissue.jpg

= Transcript level =

File:MFSD6L mRNA Predicted 5' UTR secondary structure.jpg

Since there is only one exon and no introns within the MFSD6L gene, There is no splicing performed on the MFSD6L mRNA. Translation of the MFSD6L protein initiates at the end of the 5' UTR, which is the first 245 nucleotides of the MFSD6L mRNA. There are conserved stem-loop regions across mammalian orthologs, which infer possible miRNA binding sites.

= Protein level =

The subcellular localization of the MFSD6L protein is predicted to be within the cell membrane via DeepLoc tool.{{Cite web|title=Services|url=https://services.healthtech.dtu.dk/|access-date=2021-12-16|website=www.healthtech.dtu.dk|language=en}} This is supported by it being a solute symporter similar to MFS proteins. The first 28 amino acids of the translated MFSD6L protein contains the signal peptide.{{Cite web|title=Protter - interactive protein feature visualization|url=https://wlab.ethz.ch/protter/start/|access-date=2021-12-16|website=wlab.ethz.ch}}

Additionally, n-glycosylation sites were predicted at the 110th, 129th, and 224th amino acids of the protein sequence. A serine phosphorylation site at the 429th amino acid was also predicted and verified by presence within other mammalian orthologs.{{Cite web|title=PhosphoSitePlus|url=https://www.phosphosite.org/homeAction.action|access-date=2021-12-16|website=www.phosphosite.org}}

Evolution

= Paralogs =

The MFSD6 protein was found to be the only paralog to the human MFSD6L protein.

= Orthologs =

Through BLAST sequence analysis, the MFSD6L protein was found to have orthologs in a many mammalian species, especially among primates, and flying foxes.{{Cite web|title=BLAST: Basic Local Alignment Search Tool|url=https://blast.ncbi.nlm.nih.gov/Blast.cgi|access-date=2021-12-17|website=blast.ncbi.nlm.nih.gov}} There were some orthologs found in the Reptilia and Amphibia classes, albeit not as great in number as in the Mammalia. Among fish, there were significantly more orthologs found amongst ray-finned fishes than cartilaginous fishes. Additionally, the jaw-less fish, the sea lamprey, was also found to be an ortholog.

There were also multiple orthologs found amongst invertebrates, such as echinoderms and mollusks.

No significant orthologs of the MFSD6L protein were found amongst insects; however there were orthologs found in the bacteria. Specifically, the Anaerolinea genus, which contains thermophilic bacteria were found to have orthologs with the human protein due to its regions of MFS being identical to MFS regions found in the human protein. The following table shows some examples of orthologs of the human MFSD6L.

class="wikitable"

! Genus and Species

! Common Name

! Taxonomic Group

! Median Time since Divergence (MYA)

! Accession Number (from NCBI)

! Sequence
Length (aa)

! Sequence
Identity (%)

! Sequence
Similarity (%)

Homo sapiens

|Human

|Primates

|0

|NP_689812.3

|586

|100%

|100%

Mus musculus

|House Mouse

|Rodentia

|89

|NP_666116.1

|586

|68%

|77%

Monodon monoceros

|Narwhal

|Artiodactyla

|94

|XP_029068193.1

|595

|73%

|81%

Molossus molossus

|Velvety Free-tailed Bat

|Chiroptera

|94

|XP_036125248.1

|600

|72%

|79%

Dermochelys coriacea

|Leatherback Sea Turtle

|Testudines

|318

|XP_038228264.1

|653

|45%

|59%

Terrapene carolina triunguis

|Three-toed Box Turtle

|Testudines

|318

|XP_024071382.1

|654

|44%

|57%

Patagioenas fasciata monilis

|Band-tailed Pigeon

|Columbiformes

|318

|OPJ90083.1

|655

|43%

|55%

Dromaius novaehollandiae

|Emu

|Casuariiformes

|318

|XP_025976398.1

|628

|42%

|55%

Microcaecilia unicolor

|Tiny Cayenne Caecilian

|Gymnophiona

|351.7

|XP_030063970.1

|652

|44%

|58%

Xenopus tropicalis

|Western-clawed Frog

|Anura

|351.7

|XP_002937042.2

|611

|42%

|56%

Bufo bufo

|Common Toad

|Anura

|351.7

|XP_040293432.1

|615

|39%

|54%

Scleropages formosus

|Asian Arowana

|Osteoglossiformes

|433

|NP_001003586.1

|585

|40%

|55%

Danio rerio

|Zebrafish

|Cypriniformes

|433

|XP_018612492.2

|542

|33%

|49%

Callorhinchus milii

|Australian Ghostfish

|Chimaeriformes

|465

|XP_042198386.1

|630

|41%

|57%

Petromyzon marinus

|Sea Lamprey

|Petromyzontiformes

|599

|XP_032823230.1

|735

|26%

|38%

Anneissia japonica

|Sea Lily

|Comatulida

|627

|XP_033125701.1

|616

|28%

|47%

Gigantopelta aegis

|Deep Sea Snail

|Neomphalida

|736

|XP_041362795.1

|637

|27%

|46%

Crassostrea gigas

|Pacific Oyster

|Ostreida

|736

|XP_011445242.2

|615

|25%

|44%

Octopus sinesis

|East Asian Common Octopus

|Octopoda

|736

|XP_029655326

|634

|24%

|43%

Tetranychus urticae

|Red Spider Mite

|Trombidiformes

|736

|XP_015795313.1

|872

|14%

|25%

Anaerolinealis

|Anaerolinealis bacterium

|Anaerolineales

|4090

|MBN1451601.1

|389

|19%

|34%

= Homologous gomains =

The main homologous domains found within the MFSD6L protein are the MFS regions. Since MFS includes a large amount of solute transporter proteins within its superfamily, there are many MFS proteins that have the same homologous MFS domains.

= Most distant homologs =

Through BLAST sequence analysis, the most distant homologs were the organisms within the Cnidaria phylum, which mainly consists of jellyfish, sea anemones, and corals. Searching with BLAST for the MFSD6L gene at an older diverging phylum, the Porifera, revealed no homologous MFSD6L protein.

= Predicted emergence date =

As a result of the MFSD6L protein's presence in Cnidaria and absence in Porifera, the estimated emergence date of the MFSD6L gene lies between 687 and 777 MYA, which are the divergence dates found from TimeTree.{{Cite web|title=TimeTree :: The Timescale of Life|url=http://www.timetree.org/|access-date=2021-12-17|website=www.timetree.org}} From the corrected % divergence chart and calculations of the corrected % divergence of the Homo sapiens MFSD6 paralog, the estimated date of emergence of the MFSD6L protein was found to be around 736 MYA.

File:MFSD6L mRNA Predicted 3' UTR secondary structure.jpg

Interacting proteins

The MFSD6L protein was not found to have any experimentally-verified protein-protein interactions.{{Cite web|title=MFSD6L protein (human) - STRING interaction network|url=https://string-db.org/network/9606.ENSP00000330051|access-date=2021-12-17|website=string-db.org}}

Function

The polypeptide sequence contains many transmembrane regions, identifying the MFSD6L protein as a transmembrane protein for transporting solutes across the plasma membrane of a cell. Tertiary structure prediction tools suggest that the structure of the MFSD6L protein is similar to 1PV6A, a β-galactosides symporter which uses proton gradients to transport solutes.{{Cite web|title=I-TASSER server for protein structure and function prediction|url=https://zhanggroup.org/I-TASSER/|access-date=2021-12-17|website=zhanggroup.org}}{{Cite web|title=lacY - Lactose permease - Escherichia coli (strain K12) - lacY gene & protein|url=https://www.uniprot.org/uniprot/P02920|access-date=2021-12-17|website=www.uniprot.org|language=en}} As a result, the function of the MFSD6L protein could possibly a sugar symporter. This is additionally supported by the fact that the expression of MFSD6L was upregulated due to glucose starvation.File:Tertiary structure of the MFSD6L Protein.png

Clinical significance

= Disease association =

A disease associated with the MFSD6L gene is the Tetralogy of Fallot, which is a series of four congenital heart defects that can cause low oxygenation of blood. This is due to a ventricular septal defect that causes the mixing of oxygenated and deoxygenated blood in the left ventricle of the heart.

The MFSD6L gene was also found to be a candidate gene taking part in the disease Pediatric Cataract.{{Cite journal|last1=Aldahmesh|first1=Mohammed A.|last2=Khan|first2=Arif O.|last3=Mohamed|first3=Jawahir Y.|last4=Hijazi|first4=Hadia|last5=Al-Owain|first5=Mohammed|last6=Alswaid|first6=Abdulrahman|last7=Alkuraya|first7=Fowzan S.|date=December 2012|title=Genomic analysis of pediatric cataract in Saudi Arabia reveals novel candidate disease genes|journal=Genetics in Medicine|language=en|volume=14|issue=12|pages=955–962|doi=10.1038/gim.2012.86|pmid=22935719|s2cid=45088616|issn=1530-0366|doi-access=free}}File:MFSD6L Divergence Chart.png

= Mutations =

Various SNP's were found within the encoding sequence of the MFSD6L protein sequence as shown below.

class="wikitable"

!Amino Acid Position

!mRNA Position

!Original Nucleotide

!SNP

!Original Amino Acid

!Variant Amino Acid

!Codon

328

|1212

|T

|T

|Ser [S]

|Leu [L]

|2

399

|1424

|G

|A

|Gly [G]

|Ser [S]

|1

406

|1446

|C

|T

|Ser [S]

|Leu [L]

|2

571

|1941

|C

|T

|Thr [T]

|Ile [I]

|2

574

|1949

|G

|A

|Asp [D]

|Asn [N]

|1

575

|1952

|T

|G

|Trp [W]

|Gly [G]

|1

{{-}}

References

{{Reflist}}