Genetic map function

In genetics, mapping functions are used to model the relationship between map distances (measured in map units or centimorgans) and recombination frequencies, particularly as these measurements relate to regions encompassed between genetic markers. One utility of this approach is that it allows one to obtain values for distances in genetic mapping units directly from recombination fractions, as map distances cannot typically be obtained from empirical experiments.{{Cite book |last1=Broman |first1=Karl W. |url=https://www.worldcat.org/title/669122118 |title=A guide to QTL mapping with R/qtl |last2=Sen |first2=Saunak |date=2009 |publisher=Springer |isbn=978-0-387-92124-2 |series=Statistics for biology and health |location=Dordrecht |pages=14 |oclc=669122118}}

The simplest mapping function is the Morgan Mapping Function, eponymously devised by Thomas Hunt Morgan. Other well-known mapping functions include the Haldane Mapping Function introduced by J. B. S. Haldane in 1919,{{Cite journal |last=Haldane |first=J.B.S. |date=1919 |title=The combination of linkage values, and the calculation of distances between the loci of linked factors |url=https://www.ias.ac.in/article/fulltext/jgen/008/04/0299-0309 |journal=Journal of Genetics |volume=8 |issue=29 |pages=299–309}} and the Kosambi Mapping Function introduced by Damodar Dharmananda Kosambi in 1944.{{Cite journal |last=Kosambi |first=D. D. |date=1943 |title=The Estimation of Map Distances from Recombination Values |url=https://onlinelibrary.wiley.com/doi/10.1111/j.1469-1809.1943.tb02321.x |journal=Annals of Eugenics |language=en |volume=12 |issue=1 |pages=172–175 |doi=10.1111/j.1469-1809.1943.tb02321.x |issn=2050-1420}}{{Cite book |last1=Wu |first1=Rongling |url=https://books.google.com/books?id=-NlGKOEQuEsC&dq=haldane%20mapping%20function&pg=PA65 |title=Statistical genetics of quantitative traits: linkage, maps, and QTL |last2=Ma |first2=Chang-Xing |last3=Casella |first3=George |date=2007 |publisher=Springer |isbn=978-0-387-20334-8 |location=New York |pages=65 |oclc=141385359}} Few mapping functions are used in practice other than Haldane and Kosambi. The main difference between them is in how crossover interference is incorporated.{{Cite journal |last1=Peñalba |first1=Joshua V. |last2=Wolf |first2=Jochen B. W. |date=2020 |title=From molecules to populations: appreciating and estimating recombination rate variation |url=https://www.nature.com/articles/s41576-020-0240-1 |journal=Nature Reviews Genetics |language=en |volume=21 |issue=8 |pages=476–492 |doi=10.1038/s41576-020-0240-1 |issn=1471-0064}}

Morgan Mapping Function

Where d is the distance in map units, the Morgan Mapping Function states that the recombination frequency r can be expressed as $\ r=d$ . This assumes that one crossover occurs, at most, in an interval between two loci, and that the probability of the occurrence of this crossover is proportional to the map length of the interval.

Where d is the distance in map units, the recombination frequency r can be expressed as:

$\ r = \frac{1}{2} [1-(1-2d)] = d$

The equation only holds when $\frac{1}{2} \geq d \geq 0$ as, otherwise, recombination frequency would exceed 50%. Therefore, the function cannot approximate recombination frequencies beyond short distances.

Haldane Mapping Function

= Overview =

Two properties of the Haldane Mapping Function is that it limits recombination frequency up to, but not beyond 50%, and that it represents a linear relationship between the frequency of recombination and map distance up to recombination frequencies of 10%.{{Cite web |title=mapping function |url=https://www.oxfordreference.com/display/10.1093/oi/authority.20110803100132641 |access-date=2024-04-29 |website=Oxford Reference |language=en }} It also assumes that crossovers occur at random positions and that they do so independent of one another. This assumption therefore also assumes no crossover interference takes place;{{Cite book |title=Mammalian genomics |date=2005 |publisher=CABI Pub |isbn=978-0-85199-910-4 |editor-last=Ruvinsky |editor-first=Anatoly |location=Wallingford, Oxfordshire, UK; Cambridge, MA, USA |pages=15 |editor-last2=Graves |editor-first2=Jennifer A. Marshall}} but using this assumption allows Haldane to model the mapping function using a Poisson distribution.

= Definitions =

r = recombination frequency
d = mean number of crossovers on a chromosomal interval
2d = mean number of crossovers for a tetrad
e^-2d = probability of no genetic exchange in a chromosomal interval

= Formula =

$\ r = \frac{1}{2} (1-e^{-2d})$

= Inverse =

$\ d = -\frac{1}{2} \ln (1-2r)$

Kosambi Mapping Function

= Overview =

The Kosambi mapping function was introduced to account for the impact played by crossover interference on recombination frequency. It introduces a parameter C, representing the coefficient of coincidence, and sets it equal to 2r. For loci which are strongly linked, interference is strong; otherwise, interference decreases towards zero. Interference declines according to the linear function i = 1 - 2r.{{Cite book |last1=Hartl |first1=Daniel L. |url=https://books.google.com/books?id=cfvILxY9tCIC&dq=haldane%20mapping%20function&pg=PA168 |title=Genetics: analysis of genes and genomes |last2=Jones |first2=Elizabeth W. |date=2005 |publisher=Jones and Bartlett |isbn=978-0-7637-1511-3 |edition=7th |location=Sudbury, Mass. |pages=168}}

= Formula =

$\ r = \frac{1}{2}\tanh(2d) = \frac{1}{2}\frac{e^{4d}-1}{e^{4d}+1}$

= Inverse =

$\ d = \frac{1}{2} \tanh^{-1} (2r) = \frac{1}{4}\ln(\frac{1+2r}{1-2r})$

Comparison and application

Below 10% recombination frequency, there is little mathematical difference between different mapping functions and the relationship between map distance and recombination frequency is linear (that is, 1 map unit = 1% recombination frequency). When genome-wide SNP sampling and mapping data is present, the difference between the functions is negligible outside of regions of high recombination, such as recombination hotspots or ends of chromosomes.

While many mapping functions now exist,{{Cite journal |last=Crow |first=J F |date=1990 |title=Mapping functions. |url=https://academic.oup.com/genetics/article/125/4/669/6000769 |journal=Genetics |language=en |volume=125 |issue=4 |pages=669–671 |doi=10.1093/genetics/125.4.669 |issn=1943-2631 |pmc=1204092 |pmid=2204577}}{{Cite journal |last=Felsenstein |first=Joseph |date=1979 |title=A Mathematically Tractable Family of Genetic Mapping Functions with Different Amounts of Interference |url=https://academic.oup.com/genetics/article/91/4/769/5993247 |journal=Genetics |language=en |volume=91 |issue=4 |pages=769–775 |doi=10.1093/genetics/91.4.769 |issn= |pmc=1216865 |pmid=17248911}}{{Cite journal |last1=Pascoe |first1=L. |last2=Morton |first2=N.E. |date=1987 |title=The use of map functions in multipoint mapping |journal=American Journal of Human Genetics |volume=40 |issue=2 |pages=174–183|pmid=3565379 |pmc=1684067 }} in practice functions other than Haldane and Kosambi are rarely used. More specifically, the Haldane function is preferred when distance between markers is relatively small, whereas the Kosambi function is preferred when distances between markers is larger and crossovers need to be accounted for.{{Cite book |url=https://books.google.com/books?id=3Ss-ws2Zm6IC&pg=SA17-PA11 |title=Handbook of computational molecular biology |date=2006 |publisher=CRC Press |isbn=978-1-58488-406-4 |editor-last=Aluru |editor-first=Srinivas |series= |location= |pages=17-10–17-11 |oclc=}}

Genetic map function

Morgan Mapping Function

Haldane Mapping Function

= Overview =

= Definitions =

= Formula =

= Inverse =

Kosambi Mapping Function

= Overview =

= Formula =

= Inverse =

Comparison and application

References

Further reading