Isochore (genetics)

In genetics, an isochore is a large region of genomic DNA (greater than 300 kilobases) with a high degree of uniformity in GC content; that is, guanine (G) and cytosine (C) bases. The distribution of bases within a genome is non-random: different regions of the genome have different amounts of G-C base pairs, such that regions can be classified and identified by the proportion of G-C base pairs they contain.

Bernardi and colleagues first noticed the compositional non-uniformity of vertebrate genomes using thermal melting and density gradient centrifugation.{{cite journal | author=Macaya, Thiery, and Bernardi | title=An approach to the organization of eukaryotic genomes at a macromolecular level | journal=Journal of Molecular Biology| volume=108 | pages=237–254 | year=1976 | pmid=826644 | doi=10.1016/S0022-2836(76)80105-2 | issue=1 }}

{{cite journal | author=Thiery, Macaya, and Bernardi | title=An analysis of eukaryotic genomes by density gradient centrifugation | journal=Journal of Molecular Biology| volume=108 | pages=219–235 | year=1976 | pmid=826643 | doi=10.1016/S0022-2836(76)80104-0 | issue=1 }}

{{cite journal | author=Bernardi | title=The mosaic genome of warm-blooded vertebrates | journal=Science| volume=228 | pages=953–958 | year=1985 | pmid=4001930| bibcode=1985Sci...228..953B | last2=Olofsson | first2=Birgitta | last3=Filipski | first3=Jan | last4=Zerial | first4=Marino | last5=Salinas | first5=Julio | last6=Cuny | first6=Gerard | last7=Meunier-Rotival | first7=Michele | last8=Rodier | first8=Francis | doi=10.1126/science.4001930 | issue=4702 |display-authors=etal}} The DNA fragments extracted by the gradient centrifugation were later termed "isochores",{{cite journal | author=Cuny | title=The major components of the mouse and human genomes: Preparation, basic properties and compositional heterogeneity | journal=European Journal of Biochemistry| volume=115 | pages=227–233 | year=1981 | pmid=7238506 | doi=10.1111/j.1432-1033.1981.tb05227.x | last2=Soriano | first2=P | last3=MacAya | first3=G | last4=Bernardi | first4=G | issue=2|display-authors=etal| doi-access=free }} which was subsequently defined as "very long (much greater than 200 KB) DNA segments" that "are fairly homogeneous in base composition and belong to a small number of major classes distinguished by differences in guanine-cytosine (GC) content". Subsequently, the isochores "grew" and were claimed to be ">300 kb in size."{{cite journal | author=Salinas | title=Nonrandom distribution of MMTV proviral sequences in the mouse genome | journal=Nucleic Acids Res| volume=15 | pages=3009–3022 | year=1987 | pmid=3031617 | doi=10.1093/nar/15.7.3009 | last2=Zerial | first2=M | last3=Filipski | first3=J | last4=Crepin | first4=M | last5=Bernardi | first5=G | issue=7 | pmc=340712|display-authors=etal}}

{{cite journal | author=Bernardi | title=The vertebrate genome: isochores and evolution | journal=Molecular Biology and Evolution| volume=10 | pages=186–204 | year=1993 | pmid=8450755 | issue=1 | doi=10.1093/oxfordjournals.molbev.a039994 | doi-access=free }} The theory proposed that the isochore composition of genomes varies markedly between "warm-blooded" (homeotherm) vertebrates and "cold-blooded" (poikilotherm) vertebrates and later became known as the isochore theory.

The thermodynamic stability hypothesis

The isochore theory purported that the genome of "warm-blooded" vertebrates (mammals and birds) are mosaics of long isochoric regions of alternating GC-poor and GC-rich composition, as opposed to the genome of "cold-blooded" vertebrates (fishes and amphibians) that were supposed to lack GC-rich isochores.{{cite journal | author=Bernardi | title=The human genome: organization and evolutionary history | journal=Annual Review of Genetics| volume=29 | pages=445–476 | year=1995 | pmid=8825483 | doi=10.1146/annurev.ge.29.120195.002305 }}

{{cite journal | author=Bernardi, Hughes, and Mouchiroud | title=The major compositional transitions in the vertebrate genome | journal=Journal of Molecular Evolution| volume=44 Suppl 1 | issue=S1 | pages=S44–51 | year=1997 | pmid=9071011 | doi=10.1007/PL00000051 | bibcode=1997JMolE..44S..44B | s2cid=25234820 }}

{{cite journal | author=Robinson, Gautier, and Mouchiroud | title=Evolution of isochores in rodents | journal=Molecular Biology and Evolution| volume=14 | pages=823–828 | year=1997 | pmid=9254920 | doi=10.1093/oxfordjournals.molbev.a025823 | issue=8 | doi-access=free }}

{{cite journal | author=Galtier and Mouchiroud | title=Isochore evolution in mammals: a human-like ancestral structure | journal=Genetics| volume=150 | pages=1577–1584 | year=1998 | pmid=9832533 | last2=Mouchiroud | first2=D | issue=4 | doi=10.1093/genetics/150.4.1577 | pmc=1460440 }}

{{cite journal | author=Oliver | title=Isochore chromosome maps of eukaryotic genomes | journal=Gene| volume=276 | pages=47–56 | year=2001 | pmid=11591471 | doi=10.1016/S0378-1119(01)00641-2 | last2=Bernaola-Galván | first2=P | last3=Carpena | first3=P | last4=Román-Roldán | first4=R | issue=1–2 |display-authors=etal| citeseerx=10.1.1.14.1712 }} These findings were explained by the thermodynamic stability hypothesis, attributing genomic structure to body temperature. GC-rich isochores were purported to be a form of adaptation to environmental pressures, as an increase in genomic GC-content could protect DNA, RNA, and proteins from degradation by heat.

Despite its attractive simplicity, the thermodynamic stability hypothesis has been repeatedly shown to be in error {{cite journal | author=Aota and Ikemura | title=Diversity in G + C content at the third position of codons in vertebrate genes and its cause | journal=Nucleic Acids Res| volume=14 | pages=6345–6355 | year=1986 | pmid=3748815 | doi=10.1093/nar/14.16.6345 | last2=Ikemura | first2=T | issue=16 | pmc=311650 }}

{{cite journal | author=Galtier and Lobry | title=Relationships Between Genomic G+C Content, RNA Secondary Structures, and Optimal Growth Temperature in Prokaryotes | journal=Journal of Molecular Evolution| volume=44 | pages=632–636 | year=1997 | pmid=9169555 | doi=10.1007/PL00006186 | last2=Lobry | first2=J.R. | issue=6| bibcode=1997JMolE..44..632G | s2cid=19054315 }}

{{cite journal | author=Hughes, Zelus, and Mouchiroud | title=Warm-blooded isochore structure in Nile crocodile and turtle | journal=Molecular Biology and Evolution| volume=16 | pages=1521–1527 | year=1999 | pmid = 10555283 | doi=10.1093/oxfordjournals.molbev.a026064 | issue=11 | doi-access=free }}

.{{cite journal | author=Eyre-Walker and Hurst | title=The evolution of isochores | journal=Nat Rev Genet| volume=2 | pages=549–555 | year=2001 | pmid = 11433361 | doi=10.1038/35080577 | last2=Hurst | first2=LD | issue=7 | s2cid=2203093 }}

{{cite journal | author=Hurst and Merchant | title=High guanine-cytosine content is not an adaptation to high temperature: a comparative analysis amongst prokaryotes | journal=Proceedings of the Royal Society B| volume=268 | pages=493–497 | year=2001 | pmid=11296861 | doi=10.1098/rspb.2000.1397 | last2=Merchant | first2=AR | issue=1466 | pmc=1088632}}

{{cite journal | author=Belle, Smith, and Eyre-Walker | title=Analysis of the phylogenetic distribution of isochores in vertebrates and a test of the thermal stability hypothesis | journal=Journal of Molecular Evolution| volume=55 | pages=356–363 | year=2002 | pmid = 12187388 | doi=10.1007/s00239-002-2333-1 | issue=3 | bibcode=2002JMolE..55..356B | s2cid=16596135 }}

{{cite journal | author=Ream, Johns, and Somero | title=Base Compositions of Genes Encoding {alpha}-Actin and Lactate Dehydrogenase-A from Differently Adapted Vertebrates Show No Temperature- Adaptive Variation in G + C Content | journal=Molecular Biology and Evolution| volume=20 | pages=105–110 | year=2003 | pmid=12519912 | url=http://mbe.oxfordjournals.org/cgi/content/abstract/20/1/105 | issue=1 | doi=10.1093/molbev/msg008| doi-access=free }}

{{cite journal | author=Belle | title=The decline of isochores in mammals: an assessment of the GC content variation along the mammalian phylogeny | journal=Journal of Molecular Evolution| volume=58 | pages=653–660 | year=2004 | pmid = 15461422 | doi=10.1007/s00239-004-2587-x | last2=Duret | first2=L | last3=Galtier | first3=N | last4=Eyre-Walker | first4=A | issue=6 |display-authors=etal| bibcode=2004JMolE..58..653B | citeseerx=10.1.1.333.2159 | s2cid=18281444 }} Many authors showed the absence of a relationship between temperature and GC-content in vertebrates, while others showed the existence of GC-rich domains in "cold-blooded" vertebrates such as crocodiles, amphibians, and fish.{{cite journal | author=Hughes, Friedman, and Murray | title=Genomewide pattern of synonymous nucleotide substitution in two complete genomes of Mycobacterium tuberculosis | journal=Emerging Infectious Diseases| volume=8 | pages=1342–1346 | year=2002 | pmid = 12453367 | issue=11 | pmc=2738538 | doi=10.3201/eid0811.020064}}

{{cite journal | author=Costantini, Auletta, and Bernardi | title=Isochore patterns and gene distributions in fish genomes | journal=Genomics| volume=90 | pages=364–371 | year=2007 | pmid = 17590311 | doi=10.1016/j.ygeno.2007.05.006 | issue=3| doi-access=free }}

{{cite journal | author=Symonová, Majtánová, Arias-Rodriguez , Mořkovský, Kořínková, Cavin, Johnson Pokorná, Doležálková, Flajšhans, Normandeau, Ráb, Meyer, and Bernatchez | title=Genome Compositional Organization in Gars Shows More Similarities to Mammals than to Other Ray-Finned Fish | journal=Journal of Experimental Zoology| volume=328 | pages=607–619 | year=2016 | issue=7 | pmid = 28035749 | doi=10.1002/jez.b.22719| url=http://nbn-resolving.de/urn:nbn:de:bsz:352-2-603ra8pvhw693 }}

Principles of the isochore theory

The isochore theory was the first to identify the nonuniformity of nucleotide composition within vertebrate genomes and predict that the genome of "warm-blooded" vertebrates such as mammals and birds are mosaic of isochores (Bernardi et al. 1985). The human genome, for example, was described as a mosaic of alternating low and high GC content isochores belonging to five compositional families, L1, L2, H1, H2, and H3, whose corresponding ranges of GC contents were said to be <38%, 38%-42%, 42%-47%, 47%-52%, and >52%, respectively.{{cite journal | author=Bernardi | title=Misunderstandings about isochores. Part 1 | journal=Gene| volume=276 | pages=3–13 | year=2001 | pmid = 11591466 | doi=10.1016/S0378-1119(01)00644-8 | issue=1–2 }}

The main predictions of the isochore theory are that:

  • GC content of the third codon position (GC3) of protein coding genes is correlated with the GC content of the isochores embedding the corresponding genes.
  • The genome organization of warm-blooded vertebrates is a mosaic of mostly GC-rich isochores.{{cite journal | author=Bernardi | title=Isochores and the evolutionary genomics of vertebrates | journal=Gene| volume=241 | pages=3–17 | year=2000 | pmid = 10607893 | doi=10.1016/S0378-1119(99)00485-0 | issue=1 }}{{cite journal | author=Bernardi | title=The compositional evolution of vertebrate genomes | journal=Gene| volume=259 | pages=31–43 | year=2000 | pmid = 11163959 | doi=10.1016/S0378-1119(00)00441-8 | issue=1–2}}
  • Genome organization of cold-blooded vertebrates is characterized by low GC content levels and lower compositional heterogeneity than warm-blooded vertebrates. Homogeneous domains do not reach the high GC levels attained by the genomes of warm-blooded vertebrates.

The neutralist-selectionist controversy

Two opposite explanations that endeavored to explain the formations of isochores were vigorously debated as part of the neutralist-selectionist controversy. The first view was that isochores reflect variable mutation processes among genomic regions consistent with the neutral model.{{cite journal | author=Wolfe, Sharp, and Li | title=Mutation rates differ among regions of the mammalian genome | journal=Nature| volume=337 | pages=283–285 | year=1989 | pmid = 2911369 | bibcode=1989Natur.337..283W | last2=Sharp | last3=Li | doi=10.1038/337283a0 | issue=6204| s2cid=4336541 }}

{{cite journal | author=Galtier | title=GC- content evolution in mammalian genomes: the biased gene conversion hypothesis | journal=Genetics| volume=159 | pages=907–911 | year=2001 | pmid = 11693127 | last2=Piganeau | first2=G | last3=Mouchiroud | first3=D | last4=Duret | first4=L | issue=2 | doi=10.1093/genetics/159.2.907 | pmc=1461818 |display-authors=etal}} Alternatively, isochores were posited as a result of natural selection for certain compositional environment required by certain genes.{{cite journal | author=Matassi, Sharp, and Gautier | title=Chromosomal location effects on gene sequence evolution in mammals | journal=Current Biology| volume=9 | pages=786–791 | year=1999 | pmid = 10469563 | doi=10.1016/S0960-9822(99)80361-3 | issue=15 | doi-access=free | bibcode=1999CBio....9..786M }} Several hypotheses derive from the selectionist view, such as the thermodynamic stability hypothesis {{cite journal | author=Bernardi and Bernardi | title=Compositional constraints and genome evolution | journal=Journal of Molecular Evolution| volume=24 | pages=1–11 | year=1986 | pmid = 3104608 | doi=10.1007/BF02099946 | last2=Bernardi | first2=G | issue=1–2 | bibcode=1986JMolE..24....1B | s2cid=26783774 }} and the biased gene conversion hypothesis. Thus far, none of the theories provides a comprehensive explanation to the genome structure, and the topic is still under debate.

The rise and fall of the isochore theory

The isochore theory became one of the most useful theories in molecular evolution for many years. It was the first and most comprehensive attempt to explain the long-range compositional heterogeneity of vertebrate genomes within an evolutionary framework. Despite the interest in the early years in the isochore model, in recent years, the theory’s methodology, terminology, and predictions have been challenged.

Because this theory was proposed in the 20th century before complete genomes were sequenced, it could not be fully tested for nearly 30 years. In the beginning of the 21st century, when the first genomes were made available it was clear that isochores do not exist in the human genome{{cite journal | title=Initial sequencing and analysis of the human genome | journal=Nature| volume=409 | pages=860–921 | year=2001 | pmid = 11237011 | doi=10.1038/35057062 | last2=Linton | first2=LM | last3=Birren | first3=B | last4=Nusbaum | first4=C | last5=Zody | first5=MC | last6=Baldwin | first6=J | last7=Devon | first7=K | last8=Dewar | first8=K | last9=Doyle | first9=M | issue=6822 |last10 = Fitzhugh| first10=W.| last11=Funke| first11=R.| last12=Gage| first12=D.| last13=Harris| first13=K.| last14=Heaford| first14=A.| last15=Howland| first15=J.| last16=Kann| first16=L.| last17=Lehoczky| first17=J.| last18=Levine| first18=R.| last19=McEwan| first19=P.| last20=McKernan| first20=K.| last21=Meldrim| first21=J.| last22=Mesirov| first22=J.P.| last23=Miranda| first23=C.| last24=Morris| first24=W.| last25=Naylor| first25=J.| last26=Raymond| first26=C.| last27=Rosetti| first27=M.| last28=Santos| first28=R.| last29=Sheridan| first29=A.| last30=Sougnez| first30=C.| display-authors=8 |author1 = Jean|bibcode = 2001Natur.409..860L | url=https://deepblue.lib.umich.edu/bitstream/2027.42/62798/1/409860a0.pdf| doi-access=free}}

nor in other mammalian genomes.{{cite journal | author=Elsik | title=The genome sequence of taurine cattle: a window to ruminant biology and evolution | journal=Science| volume=324 | pages=522–528 | year=2009 | pmid = 19390049 | bibcode=2009Sci...324..522A | last2=Elsik | first2=Christine G. | last3=Tellam | first3=Ross L. | last4=Worley | first4=Kim C. | last5=Gibbs | first5=Richard A. | last6=Elsik | first6=Christine G. | last7=Tellam | first7=Ross L. | last8=Gibbs | first8=Richard A. | last9=Muzny | first9=Donna M. | doi=10.1126/science.1169588 | issue=5926 | pmc=2943200 |display-authors=etal}} When failed to find isochores, many attacked the very existence of isochores.{{cite journal | author=Nekrutenko and Li | title=Assessment of compositional heterogeneity within and between eukaryotic genomes | journal=Genome Research| volume=10 | pages=1986–1995 | year=2000 | pmid = 11116093 | doi=10.1101/gr.10.12.1986 | last2=Li | first2=WH | issue=12 | pmc=313050 }}

{{cite journal | author=Häring and Kypr | title=No Isochores in the Human Chromosomes 21 and 22? | journal=Biochemical and Biophysical Research Communications| volume=280 | pages=567–573 | year=2001 | pmid= 11162557| doi=10.1006/bbrc.2000.4162 | issue=2 | last2=Kypr | first2=J }}

{{cite journal | author=Cohen | title=GC composition of the human genome: in search of isochores | journal=Molecular Biology and Evolution| volume=22 | pages=1260–1272 | year=2005 | pmid = 15728737 | doi=10.1093/molbev/msi115 | last2=Dagan | first2=T | last3=Stone | first3=L | last4=Graur | first4=D | issue=5 |display-authors=etal| doi-access=free }}

{{cite journal | author=Elhaik | title=Identifying compositionally homogeneous and nonhomogeneous domains within the human genome using a novel segmentation algorithm | journal=Nucleic Acids Res| volume=38 | pages=e158 | year=2010 | pmid = 20571085 | doi=10.1093/nar/gkq532 | last2=Graur | first2=D | last3=Josić | first3=K | last4=Landan | first4=G | issue=15 | pmc=2926622|display-authors=etal}} The most important predictor of isochores, GC3 was shown to have no predictable power {{cite journal | author=Elhaik, Landan, and Graur | title=Can GC Content at Third- Codon Positions Be Used as a Proxy for Isochore Composition? | journal=Molecular Biology and Evolution| volume=26 | pages=1829–1833 | year=2009 | pmid=19443854 | doi=10.1093/molbev/msp100 | issue=8 | doi-access=free }}

{{cite journal | author=Tatarinova | title=GC3 biology in corn, rice, sorghum and other grasses | journal=BMC Genomics| volume=11 | pages=308 | year=2010 | pmid = 20470436 | doi=10.1186/1471-2164-11-308 | last2=Alexandrov | first2=NN | last3=Bouck | first3=JB | last4=Feldmann | first4=KA | pmc=2895627|display-authors=etal | doi-access=free }} to the GC content of nearby genomic regions, refuting findings from over 30 years of research, which were the basis for many isochore studies. Isochore-originators replied that the term was misinterpreted {{cite journal | author=Li | title=Isochores merit the prefix 'iso' | journal=Computational Biology and Chemistry| volume=27 | pages=5–10 | year=2003 | pmid = 12798034 | bibcode=2002physics...9080L | last2=Bernaola-Galvan | first2=Pedro | last3=Carpena | first3=Pedro | last4=Oliver | first4=Jose L | arxiv=physics/0209080 | doi=10.1016/S1476-9271(02)00090-7 | issue=1 | s2cid=53305489 |display-authors=etal}}

{{cite journal | author=Clay and Bernardi | title=How Not to Search for Isochores: A Reply to Cohen et al | journal=Molecular Biology and Evolution| volume=22 | pages=2315–2317 | year=2005 |pmid= 16093569| doi=10.1093/molbev/msi231 | issue=12 | doi-access=free }} as isochores are not "homogeneous" but rather fairly homogeneous regions with a heterogeneous nature (especially) of GC-rich regions at the 5 kb scale,{{cite journal | author=Romiguier | title=Contrasting GC-content dynamics across 33 mammalian genomes: Relationship with life-history traits and chromosome sizes| journal=Genome Research| volume=20 | pages=1001–1009 | year=2010 | pmid = 20530252 | doi=10.1101/gr.104372.109 | last2=Ranwez | first2=V | last3=Douzery | first3=EJ | last4=Galtier | first4=N | issue=8 | pmc=2909565|display-authors=etal}} which only added to the already growing confusion. The reason for this ongoing frustration was the ambiguous definition of isochores as long and homogeneous, allowed some researchers to discover "isochores" and others to dismiss them, although both camps used the same data.

The unfortunate side effect of this controversy was an "arms race" in which isochores are frequently redefined and relabeled following conflicting findings that failed to reveal "mosaic of isochores." The unfortunate outcomes of this controversy and the following terminological-methodological mud were the loss of interest in isochores by the scientific community. When the most important core-concept in isochoric literature, the thermodynamic stability hypothesis, was rejected, the theory lost its appeal. Even today, there is no clear definition to isochores nor is there an algorithm that detects isochores.{{cite journal | author=Elhaik, Graur, and Josic | title=Comparative testing of DNA segmentation algorithms using benchmark simulations | journal=Molecular Biology and Evolution| volume=27 | issue=5 | pages=1015–1024 | year=2010 | pmid= 20018981| doi=10.1093/molbev/msp307| doi-access=free }} Isochores are detected manually by visual inspection of GC content curves ,{{cite journal | author=Costantini | title=An isochore map of human chromosomes | journal=Genome Research| volume=16 | pages=536–541 | year=2006 | pmid= 16597586| doi=10.1101/gr.4910606 | issue=4 | last2=Clay | first2=O | last3=Auletta | first3=F | last4=Bernardi | first4=G | pmc=1457033 |display-authors=etal}} however because this approach lacks scientific merit and is difficult to replicate by independent groups, the findings remain disputed.

The compositional domain model

{{main|Compositional domain model}}

As the study of isochores was de facto abandoned by most scientists, an alternative theory was proposed to describe the compositional organization of genomes in accordance with the most recent genomic studies. The Compositional Domain Model depicts genomes as a medley of short and long homogeneous and nonhomogeneous domains. The theory defines "compositional domains" as genomic regions with distinct GC-contents as determined by a computational segmentation algorithm. The homogeneity of compositional domains is compared to that of the chromosome on which they reside using the F-test, which separated them into compositionally homogeneous domains and compositionally nonhomogeneous domains based on the outcome of test. Compositionally homogeneous domains that are sufficiently long (≥ 300 kb) are termed isochores or isochoric domains. These terms are in accordance with the literature as they provide clear distinction between isochoric- and nonisochoric-domains.

A comprehensive study of the human genome unraveled a genomic organization where two-thirds of the genome is a mixture of many short compositionally homogeneous domains and relatively few long ones. The remaining portion of the genome is composed of nonhomogeneous domains. In terms of coverage, only 1% of the total number of compositionally homogeneous domains could be considered "isochores" which covered less than 20% of the genome.

Since its inception the theory received wide attention and was extensively used to explain findings emerging from over dozen new genome sequencing studies.{{cite journal | title=Insights into social insects from the genome of the honeybee Apis mellifera | journal=Nature| volume=443 | pages=931–949 | year=2006 | pmid = 17073008 | doi=10.1038/nature05260 | issue=7114 | pmc=2048586 | last1=Robinson | first1=Gene E. | last2=Gibbs | first2=Richard A. | last3=Weinstock | first3=George M. | last4=Worley | first4=Kim C. | last5=Evans | first5=Jay D. | last6=Maleszka | first6=Ryszard | bibcode = 2006Natur.443..931T }}

{{cite journal | author=Sodergren | title=The genome of the sea urchin Strongylocentrotus purpuratus | journal=Science| volume=314 | pages=941–952 | year=2006 | pmid = 17095691 | bibcode=2006Sci...314..941S | last2=Weinstock | first2=George M. | last3=Davidson | first3=Eric H. | last4=Cameron | first4=R. Andrew | last5=Gibbs | first5=Richard A. | last6=Weinstock | first6=George M. | last7=Angerer | first7=Robert C. | last8=Angerer | first8=Lynne M. | last9=Arnone | first9=Maria Ina | doi=10.1126/science.1133609 | issue=5801 | pmc=3159423 |display-authors=etal}}

{{cite journal | title=The genome of the model beetle and pest Tribolium castaneum | journal=Nature| volume=452 | pages=949–955 | year=2008 | pmid = 18362917 | bibcode=2008Natur.452..949R | last2=Gibbs | first2=Richard A. | last3=Weinstock | first3=George M. | last4=Brown | first4=Susan J. | last5=Denell | first5=Robin | last6=Beeman | first6=Richard W. | last7=Gibbs | first7=Richard | last8=Beeman | first8=Richard W. | last9=Brown | first9=Susan J. | last10=Bucher| first10=G.| last11=Friedrich| first11=M.| last12=Grimmelikhuijzen| first12=C.J.P.| last13=Klingler| first13=M.| last14=Lorenzen| first14=M.| last15=Richards| first15=S.| last16=Roth| first16=S.| last17=Schroder| first17=R.| last18=Tautz| first18=D.| last19=Zdobnov| first19=E.M.| last20=Muzny| first20=D.| last21=Gibbs| first21=R.A.| last22=Weinstock| first22=G.M.| last23=Attaway| first23=T.| last24=Bell| first24=S.| last25=Buhay| first25=C.J.| last26=Chandrabose| first26=M.N.| last27=Chavez| first27=D.| last28=Clerk-Blankenburg| first28=K.P.| last29=Cree| first29=A.| last30=Dao| first30=M.| doi=10.1038/nature06784 | issue=7190| display-authors=8 |last1 = Richards|first1 = S.| doi-access=free| hdl=11858/00-001M-0000-000F-D6B7-5| hdl-access=free}}

{{cite journal | author=Kirkness | title=Genome sequences of the human body louse and its primary endosymbiont provide insights into the permanent parasitic lifestyle | journal=Proceedings of the National Academy of Sciences of the United States of America| volume=107 | pages=12168–12173 | year=2010 | pmid = 20566863 | doi=10.1073/pnas.1003379107 | last2=Haas | first2=BJ | last3=Sun | first3=W | last4=Braig | first4=HR | last5=Perotti | first5=MA | last6=Clark | first6=JM | last7=Lee | first7=SH | last8=Robertson | first8=HM | last9=Kennedy | first9=RC | issue=27 | pmc=2901460|bibcode = 2010PNAS..10712168K |display-authors=etal| doi-access=free }}

{{cite journal | author=Werren | title=Functional and evolutionary insights from the genomes of three parasitoid Nasonia species | journal=Science| volume=327 | pages=343–348 | year=2010 | pmid = 20075255 | bibcode= 2010Sci...327..343.| doi=10.1126/science.1178028 | last2=Richards | first2=S | last3=Desjardins | first3=CA | last4=Niehuis | first4=O | last5=Gadau | first5=J | last6=Colbourne | first6=JK | last7=Nasonia Genome Working | first7=Group | last8=Werren | first8=JH | last9=Richards | first9=S | issue=5963 | pmc=2849982 |display-authors=etal}}

{{cite journal | author=Smith | title=Draft genome of the globally widespread and invasive Argentine ant (Linepithema humile) | journal=Proceedings of the National Academy of Sciences of the United States of America| volume=108 | pages=5673–5678 | year=2011 | pmid = 21282631 | bibcode=2011PNAS..108.5673S | last2=Zimin | first2=A. | last3=Holt | first3=C. | last4=Abouheif | first4=E. | last5=Benton | first5=R. | last6=Cash | first6=E. | last7=Croset | first7=V. | last8=Currie | first8=C. R. | last9=Elhaik | first9=E. | doi=10.1073/pnas.1008617108 | issue=14 | pmc=3078359 |display-authors=etal| doi-access=free }}

{{cite journal | author=Smith | title=Draft genome of the red harvester ant Pogonomyrmex barbatus | journal=Proceedings of the National Academy of Sciences of the United States of America| volume=108 | pages=5667–5672 | year=2011 | pmid = 21282651 | bibcode=2011PNAS..108.5667S | last2=Smith | first2=C. D. | last3=Robertson | first3=H. M. | last4=Helmkampf | first4=M. | last5=Zimin | first5=A. | last6=Yandell | first6=M. | last7=Holt | first7=C. | last8=Hu | first8=H. | last9=Abouheif | first9=E. | doi=10.1073/pnas.1007901108 | issue=14 | pmc=3078412 |display-authors=etal| doi-access=free }}

{{cite journal | author=Suen | title=The genome sequence of the leaf-cutter ant Atta cephalotes reveals insights into its obligate symbiotic lifestyle | journal=PLOS Genetics| volume=7 | pages=e1002007 | year=2011 | pmid = 21347285 | doi=10.1371/journal.pgen.1002007 | last2=Teiling | first2=C | last3=Li | first3=L | last4=Holt | first4=C | last5=Abouheif | first5=E | last6=Bornberg-Bauer | first6=E | last7=Bouffard | first7=P | last8=Caldera | first8=EJ | last9=Cash | first9=E | issue=2 | pmc=3037820 | editor1-last=Copenhaver | editor1-first=Gregory |display-authors=etal | doi-access=free }} However, many important questions remain open, such as which evolutionary forces shaped the structure of compositional domains and the ways they differ between different species.

References