Mitochondrial DNA SNPs associated with Schizophrenia exhibit Highly Variable Inter-allelic Haplogroup Affiliation and Nuclear Genogeographic Affinity: Bi-Genomic Linkage Disequilibrium raises Major Concerns for Link to Disease ================================================================================================================================================================================================================================== * Christian M Hagen * Vanessa F Gonçalves * Paula L Hedley * Jonas Bybjerg-Grauholm * Marie Bækvad-Hansen * Christine S Hansen * Jørgen K Kanters * Jimmi Nielsen * Ole Mors * Alfonso B Demur * Thomas D Als * Merete Nordentoft * Anders Børglum * Preben Bo Mortensen * James Kennedy * Thomas M Werge * David M Hougaard * Michael Christiansen ## Abstract Mitochondria play a significant role in human diseases. However, disease associations with mitochondrial DNA (mtDNA) SNPs have proven difficult to replicate. A reanalysis of eight schizophrenia-associated mtDNA SNPs, in 23,743 normal Danes and 2,538 schizophrenia patients, revealed marked inter-allelic differences in haplogroup affiliation and nuclear ancestry, genogeophraphic affinity (GGA). This bi-genomic linkage disequilibrium (2GLD) could entail population stratification. Only two mitochondrial SNPs, m. 15043A and m. 15218G, were significantly associated with schizophrenia. However, these associations disappeared when corrected for haplogroup affiliation. The extensive 2GLD documented is a major concern when interpreting historic as well as designing future mtDNA association studies. Keywords * bi-genomic linkage disequilibrium * single nucleotide polymorphism * psychiatric disease Genetic variants in mitochondrial DNA (mtDNA) – and in nuclear genes coding for mitochondrial function - have been associated with disease 1-3. More than 300 variants4,5 in mtDNA and genes involved in mitochondrial function6 have been reported to cause mitochondrial disease which is clinically characterised by complex metabolic, neurological, muscular and psychiatric symptoms7,8. SNPs in mtDNA and mitochondrial haplogroups (mtDNA hgs), which are evolutionarily fixed SNP sets with a characteristic geographical distribution, have been proposed as potential disease modifiers8. This has been reported in neurological degenerative diseases such as Alzheimer’s disease9-12 and Parkinson’s disease12-14, metabolic diseases and cancers15, as well as psychiatric diseases, notably schizophrenia (SZ) and bipolar disease16-18. Association studies of mtDNA variants and disease have been difficult to replicate8. However, the definition of a methodological paradigm for association studies with mtDNA variants19 implicitly assumes that mtDNA variants are independent of the nuclear genome. In a recent Danish study on mtDNA haplogroups and their nuclear ancestry or nuclear genogeographical affinity (GGA), we demonstrated a marked difference in nuclear ancestry between individual haplogroups20. This means that mtDNA hgs entail population stratification also at the level of gDNA. The effect of such a stratification on disease association, will depend on the admixture structure of the particular population, the population history, epidemiology and genetic epidemiology of the disease, as well as the number of persons included in the study. The extensive fine-scale heterogeneity of gDNA and significant admixture documented in the UK21 and Europe22 further increase the risk of spurious false positive associations, if the mtDNA/gDNA interaction is not corrected phenomenon for in association studies. Using DNA-array data from the Danish iPSYCH study on 2,538 schizophrenia patients and 23,743 population controls, we show that eight mtDNA SNPs, previously associated with SZ16-18,23, exhibit considerable inter-allelic differences both with respect to mtDNA hg affiliation and nuclear GGA. This phenomenon, that we name bi-genomic linkage disequilibrium (2GLD), affecting the association between an mtDNA SNP and mtDNA, as well as gDNA, can lead to both false negative and positive associations with disease. We demonstrate that for only two of the original eight SNPs is it possible to replicate the association with SZ in this cohort, when correcting for population stratification. Both mtDNA haplogroup affiliation and GGA affects the strength of association. Finally, we show that none of the SNPs are associated with SZ when examined on a particular the mtDNA hg background, with correction for 2GLD. As none of previous studies of mtDNA SNPs have been performed with correction for population stratification, let alone 2GDL, our results indicate that all such published associations should be considered preliminary. In principle, this conclusion should not be limited to associations with SZ. ## Results From a literature search of mtDNA SNPs previously associated with SZ, we identified eight that were also typed by the PsychChip, Table 1. PsychChip data from 23,743 normal Danes and 2,538 SZ patients (Detailed in Suppl Table 1) showed that the SNPs were present in the population with frequencies varying from 0.2 % – 20.6 %, Table 1. There was no appreciable difference in mtDNA haplogroup or GGA distribution between controls and SZ patients, Suppl table 1. View this table: [Table 1.](http://biorxiv.org/content/early/2017/06/12/149070/T1) Table 1. Schizophrenia associated mtDNA SNPs. ### Haplogroup distribution of mtDNA SNPs The potential affiliation, based on PhyloTree, of SNPs to different mtDNA haplogroups is shown in Table 1, and the actual mtDNA haplogroup distribution in the controls (not different from that of the SZ patients, Suppl Table 2) is shown in Table 2. For all SNPs there is a marked difference in the actual mtDNA haplogroup distribution between the two alleles at the same position. Thus, when comparing persons with either of two alleles at the same mtDNA position, the comparison is between groups with widely differing mtDNA distributions. A PCA analysis of the mtDNA sequences in persons with either the A or G allele at position 15,043 is shown in figure 1A. This analysis shows that the difference in mtDNA sequence is very extensive. View this table: [Table 2.](http://biorxiv.org/content/early/2017/06/12/149070/T2) Table 2. MtDNA haplogroup distribution for each SNP allele. ### Genogeographic affinity (ancestry) of mtDNA SNPs The average GGA (ancestry) of the nuclear genome in control persons as a function of each mtDNA allele revealed major differences, both between different positions (inter-SNP) in the mtDNA molecule and between different alleles (inter-allelic) at the same position, Table 3. Apart from subtle differences in the distributions of Greenlandic and Asian GGA affiliations, the distribution is similar in SZ patients, Suppl table 3. Most marked is the difference in Danish GGA for m.15043 A/G, with an inter-allelic difference ~ 20 percent points in Danish ancestry, and the A-allele has a prominent Central South Asian ancestry not seen in the G-allele, Table 3 and Suppl Table 3. PCA analyses of the nuclear genome, Figure 1 B, of the two alleles, reveal a striking difference in the ancestry of the nuclear genome between the two alleles. Thus, when comparing persons with either of two alleles at the same mtDNA position, there is a substantial risk for comparison between groups with widely different ancestries (GGA), thus increased risk of confounding by population structure. ![](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2017/06/12/149070/F1/graphic-3.medium.gif) [](http://biorxiv.org/content/early/2017/06/12/149070/F1/graphic-3) ![](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2017/06/12/149070/F1/graphic-4.medium.gif) [](http://biorxiv.org/content/early/2017/06/12/149070/F1/graphic-4) Figure 1. PCA of **A**. mtDNA (PC1 versus PC3) and **B**. nuclear genome (PC1 versus PC2), with the m.15043A (red) and m.15043G (blue) alleles. Only persons with “European” mtDNA haplogroup (H,HV,V,U,K,J & T) were included. The PCA was performed using PCs defined by control samples. View this table: [Table 3.](http://biorxiv.org/content/early/2017/06/12/149070/T3) Table 3. Distribution of genogeographic affinities of controls with specific SNPs. ### Association between schizophrenia and mtDNA SNPs The association of each mtDNA SNP with SZ was assessed (Table 4). In consequence of the inter-allelic differences in mtDNA hg affiliations and GGA demonstrated above, several association analyses were performed. The first (“ALL\_hgs”) comprised all persons irrespective of either mtDNA hg or GGA, the second (“EU_hgs”) only comprised persons with a “classical European” mtDNA hg, i.e. H, V, J, K, U, or T. Thirdly, we examined associations in persons with a Danish GGA (All-hgs_DANES), and finally, in persons with both a “European” mtDNA haplogroup and a Danish GGA (EU-hgs_DANES). Five SNPs, m.1438A, m.3197C, m.3666A, m.4769A, and m.9388G showed no association with SZ, both when all persons were included and where selection was made to reduce effects of varying mtDNA and GGA affiliations. The m.10398G SNP was marginally significantly, while m.15043A was significantly associated with a reduced risk for SZ irrespective of the grouping. The high level of diversity in the PCA analysis of m.15043A (and the remaining SNPs – data not shown), figure 1A, prompted us to examine whether a fixation of the mtDNA hg, i.e. limiting the analysis to persons with a specific hg, in cases where it was reasonable frequent, would result in significant associations. The result, table 5, was the contrary – on a fixed mtDNA haplogroup background, none of the SNPs exhibited a significant association with SZ. View this table: [Table 4.](http://biorxiv.org/content/early/2017/06/12/149070/T4) Table 4. Association of SNPs with SZ for different selections of cohort. View this table: [Table 5.](http://biorxiv.org/content/early/2017/06/12/149070/T5) Table 5. Association of SNPs with SZ in Danish GGA and a specific mtDNA haplogroup. ## Discussion Here we show that mtDNA SNPs are in bi-genomic linkage disequilibrium (2GLD) with mtDNA hgs and gDNA clusters due to population structure and shared demographic history of mtDNA and gDNA. This means that each mtDNA allele is associated with a unique distribution of mtDNA hgs and other associated mtDNA variants, and at the same time associated with a unique distribution of gDNA clusters. This has the consequence that an association between a particular allele in a specific SNP and disease is not exclusively a result of the presence of the particular allele of the SNP; rather it is the result of a combination of differences in mtDNA and gDNA sequences – a result of population stratification and admixtureIt is thus not – as is frequently done, and suggested as a paradigm for mtDNA association – sufficient to consider an association as evidence for a specific effect of an allele on the function of a protein or RNA coded for by the mtDNA and, consequently, as a cause of pathophysiological changes. The linkage disequilibrium between different alleles on a SNP and mtDNA hgs and sub-hgs is not surprising as hgs are defined by series of evolutionarily conserved SNPs. The particular distribution of subsets of mtDNA hgs, sub-hgs and individual SNPs, that are associated with a particular allele at a specific SNP will depend on the population history and the extent and source of admixture. In most countries, and in particular in Europe, such history is very complicated and incompletely clarified. m.10398G was found associated with SZ in Han Chinese, however, when the cohort was broken down with respect to haplogroups, the association disappeared24. This illustrates that a specific allele’s mtDNA haplogroup distribution may induce spurious association with SZ. Spurious associations between mtDNA SNPs and a particular phenotype, when restricted to persons of a specific mtDNA hg25, may however be due to population stratification at the sub-hg level. As mtDNA replicates without recombination, independently of the cell cycle, mtDNA hgs and SNPs should be independent of the nuclear genome, but only if the population were infinitely large and in the absence of population substructure. This is often not the case, due to geographical population substructure, recent admixture, socially and culturally defined restrictions in choice of spouse. A recent study showed that most Danish grandparents to present-day high school students chose spouses within a short distance of their birthplace26. This practice will with time lead to regionalisation, and a southwestern to northeastern gradient was found26. Furthermore, immigrants may seek a partner from within their ethnic community. Such effects have been sought eliminated in some studies by restricting the participants in mtDNA association studies to persons with a three-generation presence in the population. However, it has not been documented that this is sufficient to obviate association or linkage disequilibrium between mtDNA SNPs and specific gDNA clusters. Extensive gDNA micro-scale heterogeneity has been documented in the UK21 and Western France27 and admixture has been an important factor in the accretion of the present-day genomic variation of Europe22,28. The UK study21 showed that this is not just a result of recent demic changes; however, recent migrations may lead to widespread 2GLD. Schizophrenia is a complex syndromic disease29 with geographically varying prevalence30 and characterized by a markedly elevated prevalence among first and second generation immigrants31,32, particularly among persons with dark skin moving to Nordic lattitudes33. These epidemiological characteristics of SZ obviously increase the risk of spurious associations caused by subtle admixture and 2GLD. However, it does not *per se* refute the mitochondrial pathogenic paradigm34 where variation in mitochondrial function, believed to interfere with ATP production 35,36, inflammation and signaling37,38 as well as Ca2+-homeostasis39, and apoptosis38, is considered to be of paramount importance for development of disease. Several neuroanatomical post-mortem findings in SZ brains indicate perturbed mitochondrial function40, but such findings are difficult to distinguish from changes caused by drug treatment. The iPSYCH data are prospective and signs of immigration are apparent20, but they also showed that the variation in ancestry differed greatly between mtDNA hgs – even within traditional European hgs, i.e. mtDNA hg U, where ancient European sub-hgs occurred together with U-sub-hgs of recent Near Eastern and Central Asian origin20. Thus, 2GLD is likely to be a confounder and may lead both to false positive as well as false negative associations with disease. The method of correction for 2GLD in association studies will depend on the specific mtDNA SNP examined, the population structure and history, as well as the size of the study population. If population stratification involving gDNA is inherent when performing association studies with mtDNA SNPs, it should be expected that diseases with geographically varying prevalence would be likely to find associated with specific mtDNA SNPs. The largest mtDNA association study to data18 found mtDNA SNPs associated with ulcerative colitis, exhibiting a European North-South and East-West gradient41, and with multiple sclerosis exhibiting a longitudinal prevalence gradient42 and effect of immigration43. The same study found that the prevalence of mtDNA SNPs associated with Parkinson’s disease was lower in African and Asian people44. Furthermore, the incidence of primary biliary cirrhosis is very high in North East England, 50 % lower in the rest of England and Scandinavia, and 90 % lower in the Middle East and Asia45. A major problem with the interpretation of mtDNA SNP variants is the difficulties associated with performing a meaningful and reproducible assessment of mitochondrial function. In vitro studies of mitochondrial function, e.g. enzymatic activity measurements of OXPHOS components in cells, tissues46 or cybrids47 as well as allotopic expression15, are difficult to interpret as they also interfere with the inherent cellular control of mitochondrial function38. Furthermore, it should be noted, that mtDNA hgs and sub-hgs are cladistics groups and not functional units. Thus, in the Danish population, the U-hg is composed of a range of sub-hgs, e.g. U5a, U5b, U6, U7, and U8, with widely differing GGAs, reflecting migrations rather than selection20. It is thus meaningless to ascribe a specific functional effect to a particular mtDNA hg – without having carefully examined both mtDNA and nuclear genetic variation and corrected for stratifications in both. Previous conflicting studies of disease associations with mtDNA have been suggested to be the result of insufficient power48, insufficient stratification respect to sex, age, geographical background49 or population admixture50, or the use of small areas of recruitment risking “occult” founder effects51. The fact that careful control, as here, of these factors and the 2GLD, results in none of eight previously SZ associated mtDNA SNPs being associated with SZ in the very large Danish iPSYCH cohort, suggests that previously reported associations could indeed be spurious findings due to cryptic population stratification. Meta-analyses pooling studies from different populations15 does not necessarily solve this problem – it may aggravate it by introducing further sub-stratification of the total population analysed. The extensive 2GLD demonstrated in the Danish population makes this phenomenon the most parsimonious explanation of non-replicable associations with mtDNA variants, not only for associations with SZ, but obviously 2GLD can interfere with associations between mtDNA and all types of diseases and traits. ## Online Methods ### Ethics statement The iPSYCH cohort study ([www.ipsych.au.dk](http://www.ipsych.au.dk)) is register-based using data from Danish national health registries. The study was approved by the Scientific Ethics Committees of the Central Denmark Region ([www.komite.rm.dk](http://www.komite.rm.dk)) (J.nr.: 1-10-72-287-12) and executed according to guidelines from the Danish Data Protection Agency ([www.datatilsynet.dk](http://www.datatilsynet.dk)) (J.nr.: 2012-41-0110). Passive, but not informed, consent was obtained, in accordance with Danish Law nr. 593 of June 14, 2011, para 10, on the scientific ethics administration of projects within health research. ### SZ patients and controls As part of the iPSYCH recruitment protocol, 23,743 controls, born between May 1st 1981 and Dec 31st 2005 were selected at random from the Danish Central Person Registry. Among persons born within the same time span 2,538 persons assigned an ICD-10 F20 were identified in the Danish National Patient Registry. All were singletons, were alive one year after their birth, and had a mother registered in the Danish Central Person Registry. DNA should be available from DBS cards obtained from the Danish Neonatal Screening Biobank at Statens Serum Institut 52 Demographics of patients and controls are given in Suppl Table 1. ### Genetic analysis and mtDNA SNPing From each DBS card two 3.2-mm disks were excised from which DNA extracted using Extract-N-Amp Blood PCR Kit (Sigma-Aldrich, St Louis, MO, USA)(extraction volume: 200 μL). The extracted DNA samples were whole genome amplified (WGA) in triplicate using the REPLIg kit (Qiagen, Hilden, Germany), then pooled into a single aliquot. Finally, WGA DNA concentrations were estimated using the Quant-IT Picogreen dsDNA kit (Invitrogen, Carlsbad, CA, USA). The amplified samples were genotyped at the Broad Institute (MA, USA) using the PsychChip (Illumina, CA, USA) typing 588,454 variants, developed by the Psychiatric Genetic Consortia. We then isolated the 418 mitochondrial loci and reviewed the genotype calls, before exporting into the PED/MAP format using GenomeStudio (Illumina, CA, USA). Haplo-grouping of mtDNA was performed using the defining SNPs reported in [www.phylotree.org](http://www.phylotree.org) 53. ### Geno-geographic affinity (autosomal ancestry) Geno-geographic affinity (GGA) estimation was done using ADMIXTURE 1.3.050 in the supervised approach. Briefly, reference populations consisting of Human Genome Diversity Project (HGDP) ([http://www.hagsc.org/hgdp/](http://www.hagsc.org/hgdp/)), a Danish (716 individuals) and a Greenlandian (592 individuals) genotyping SNP data set were used. The final reference data set consisted of 103,268 autosomal SNPs and 2,248 individuals assigned to one of nine population groups: Africa, America, Central South Asia, Denmark, East Asia, non-Danish Europe, Greenland, Middle East and Oceania. The number of clusters, K was set to eight, based on principal component analysis clustering (data not shown). The subpopulations were merged with the reference population data set and analysed using ADMIXTURE. For prediction of the ancestry of individuals within the mtDNA haplogroups we created a random forest model54 based on the reference data set, with the clusters Q1-8 as predictors and population groups as outcome. Thus the ancestry analysis of the individual person was the result of a supervised prediction. Prediction was done using R3 version 3.2.2, using the Caret package. ### Statistics The statistical significance of differences in mtDNA SNP proportions between controls and SZ patients was assessed using a permutation version of Fisher’s exact test. Samples with missing sequence data were excluded. Calculations were performed using R. Principal component analysis (PCA), was prepared using PLINK(v.1.90b3.31). For the PCA the reference population variants were extracted from the iPSYCH control sample, LD pruned (indep-pairwise 50 5 0.5) and allowing only SNPs with 99% genotyping rate. Prior to PCA of mtDNA data, samples were loaded into GenomeStudio (version 2011.a), a custom cluster was created using Gentrain (version 2), following automatic clustering, all positions with heterozygotes were manually curated. The data was exported relative to the forward strand using PLINK Input Report Plug-in (version 2.1.3). Eigenvectors were calculated using PLINK (v1.90b3.31). PCA plots were created using the package ggplot2 (version 1.0.1) in R (version 3.1.3). ## Legends to tables and figures **Table 1.** The mtDNA alleles studied, their frequency, mtDNA haplogroup association, and link with disease. **Table 2.** Distribution of mtDNA haplogroups of controls with specific mtDNA SNP. **Table 3.** Autosomal genomic genogeographic affinity of studied mtDNA SNPs controls. **Table 4.** Association between individual mtDNA SNPs and schizophrenia as a function of the selection of the involved persons. All hgs: All persons, EU hgs: persons with a European haplogroup; Danish GGA: persons with a Danish genogeographic affinity and Danish GGA and Eu hgs: persons both with a European hg and a Danish GGA. **Table 5.** Association between mtDNA SNP and SZ in persons with Danish GGA and a defined mtDNA haplogroup. ## Acknowledgements The iPSYCH study was funded by The Lundbeck Foundation Initiative for Integrative Psychiatric Research. We further gratefully acknowledge the financial support of The Jascha Foundation, The Strategic Research Council (“Heart Safe”), The Augustinus Foundation, The Lundbeck Foundation (Grant no. R67-A6552), and Familien Hede Nielsens Fond. This research has been conducted using the Danish National Biobank resource, supported by the Novo Nordisk Foundation. * Received June 12, 2017. * Revision received June 12, 2017. * Accepted June 12, 2017. * © 2017, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.Wallace, D.C. et al. Mitochondrial DNA mutation associated with Leber’s hereditary optic neuropathy. Science 242, 1427-30 (1988). [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEzOiIyNDIvNDg4NC8xNDI3IjtzOjQ6ImF0b20iO3M6Mzc6Ii9iaW9yeGl2L2Vhcmx5LzIwMTcvMDYvMTIvMTQ5MDcwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 2. 2.Holt, I.J., Harding, A.E. & Morgan-Hughes, J.A. Deletions of muscle mitochondrial DNA in patients with mitochondrial myopathies. Nature 331, 717-9 (1988). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/331717a0&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=2830540&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=A1988M216000065&link_type=ISI) 3. 3.Bourgeron, T. et al. Mutation of a nuclear succinate dehydrogenase gene results in mitochondrial respiratory chain deficiency. Nat Genet 11, 144-9 (1995). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng1095-144&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=7550341&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=A1995RX80600015&link_type=ISI) 4. 4.Koopman, W.J., Willems, P.H. & Smeitink, J.A. Monogenic mitochondrial disorders. N Engl J Med 366, 1132-41 (2012). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1056/NEJMra1012478&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22435372&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000301769700012&link_type=ISI) 5. 5.Calvo, S.E. & Mootha, V.K. The mitochondrial proteome and human disease. Annu Rev Genomics Hum Genet 11, 25-44 (2010). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1146/annurev-genom-082509-141720&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20690818&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000282662200002&link_type=ISI) 6. 6.Pagliarini, D.J. et al. A mitochondrial protein compendium elucidates complex I disease biology. Cell 134, 112-23 (2008). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2008.06.016&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=18614015&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000257513100018&link_type=ISI) 7. 7.Vafai, S.B. & Mootha, V.K. Mitochondrial disorders as windows into an ancient organelle. Nature 491, 374-83 (2012). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature11707&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23151580&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000311031600034&link_type=ISI) 8. 8.Chinnery, P.F. & Hudson, G. Mitochondrial genetics. Br Med Bull 106, 135-59 (2013). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/bmb/ldt017&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23704099&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000320114900008&link_type=ISI) 9. 9.Chinnery, P.F. et al. Mitochondrial DNA haplogroups and susceptibility to AD and dementia with Lewy bodies. Neurology 55, 302-4 (2000). 10. 10.Carrieri, G. et al. Mitochondrial DNA haplogroups and APOE4 allele are non-independent variables in sporadic Alzheimer’s disease. Hum Genet 108, 194-8 (2001). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1007/s004390100463&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=11354629&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000168063800004&link_type=ISI) 11. 11.van der Walt, J.M. et al. Analysis of European mitochondrial haplogroups with Alzheimer disease risk. Neurosci Lett 365, 28-32 (2004). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.neulet.2004.04.051&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=15234467&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000222785700007&link_type=ISI) 12. 12.Coskun, P. et al. A mitochondrial etiology of Alzheimer and Parkinson disease. Biochim Biophys Acta 1820, 553-64 (2012). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.bbagen.2011.08.008&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21871538&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 13. 13.Hudson, G. et al. Two-stage association study and meta-analysis of mitochondrial DNA variants in Parkinson disease. Neurology 80, 2042-8 (2013). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1212/WNL.0b013e318294b434&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23645593&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 14. 14.Pyle, A. et al. Mitochondrial DNA haplogroup cluster UKJT reduces the risk of PD. Ann Neurol 57, 564-7 (2005). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1002/ana.20417&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=15786469&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000228053500016&link_type=ISI) 15. 15.Marom, S., Friger, M. & Mishmar, D. MtDNA meta-analysis reveals both phenotype specificity and allele heterogeneity: a model for differential association. Sci Rep 7, 43449 (2017). 16. 16.Sequeira, A. et al. Mitochondrial mutations and polymorphisms in psychiatric disorders. Front Genet 3, 103 (2012). [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22723804&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 17. 17.Rollins, B. et al. Mitochondrial variants in schizophrenia, bipolar disorder, and major depressive disorder. PLoS One 4, e4913 (2009). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0004913&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=19290059&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 18. 18.Hudson, G., Gomez-Duran, A., Wilson, I.J. & Chinnery, P.F. Recent mitochondrial DNA mutations increase the risk of developing common late-onset human diseases. PLoS Genet 10, e1004369 (2014). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1004369&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24852434&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 19. 19.Saxena, R. et al. Comprehensive association testing of common mitochondrial DNA variation in metabolic disease. Am J Hum Genet 79, 54-61 (2006). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1086/504926&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=16773565&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000238341200006&link_type=ISI) 20. 20.Bybjerg-Grauholm, J. et al. Complex spatio-temporal distribution and genogeographical affinity of mitochondrial DNA haplogroups in 24,216 danes. biorxiv/2017/148494 (2017). 21. 21.Leslie, S. et al. The fine-scale genetic structure of the British population. Nature 519, 309-14 (2015). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature14230&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25788095&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 22. 22.Busby, G.B. et al. The Role of Recent Admixture in Forming the Contemporary West Eurasian Genomic Landscape. Curr Biol 25, 2518-26 (2015). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.cub.2015.08.007&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=26387712&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 23. 23.Zhang, W. et al. A matrilineal genetic legacy from the last glacial maximum confers susceptibility to schizophrenia in Han Chinese. J Genet Genomics 41, 397-407 (2014). 24. 24.Du, C., El Harchi, A., Zhang, H. & Hancox, J.C. Modification by KCNE1 variants of the hERG potassium channel response to premature stimulation and to pharmacological inhibition. Physiol Rep 1, e00175 (2013). [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTE6InBoeXNyZXBvcnRzIjtzOjU6InJlc2lkIjtzOjEwOiIxLzYvZTAwMTc1IjtzOjQ6ImF0b20iO3M6Mzc6Ii9iaW9yeGl2L2Vhcmx5LzIwMTcvMDYvMTIvMTQ5MDcwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 25. 25.Ji, Y. et al. Mitochondrial DNA haplogroups M7b1′2 and M8a affect clinical expression of leber hereditary optic neuropathy in Chinese families with the m.11778G-->a mutation. Am J Hum Genet 83, 760-8 (2008). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2008.11.002&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=19026397&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000261822100011&link_type=ISI) 26. 26.Athanasiadis, G. et al. Nationwide Genomic Study in Denmark Reveals Remarkable Population Homogeneity. Genetics 204, 711-722 (2016). [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiZ2VuZXRpY3MiO3M6NToicmVzaWQiO3M6OToiMjA0LzIvNzExIjtzOjQ6ImF0b20iO3M6Mzc6Ii9iaW9yeGl2L2Vhcmx5LzIwMTcvMDYvMTIvMTQ5MDcwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 27. 27.Karakachoff, M. et al. Fine-scale human genetic structure in Western France. Eur J Hum Genet 23, 831-6 (2015). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ejhg.2014.175&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25182131&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 28. 28.Novembre, J. et al. Genes mirror geography within Europe. Nature 456, 98-101 (2008). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature07331&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=18758442&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000260674000048&link_type=ISI) 29. 29.Owen, M.J., Sawa, A. & Mortensen, P.B. Schizophrenia. Lancet 388, 86-97 (2016). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(15)01121-6&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=26777917&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 30. 30.McGrath, J., Saha, S., Chant, D. & Welham, J. Schizophrenia: a concise overview of incidence, prevalence, and mortality. Epidemiol Rev 30, 67-76 (2008). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/epirev/mxn001&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=18480098&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000260555900004&link_type=ISI) 31. 31.Cantor-Graae, E. & Pedersen, C.B. Full spectrum of psychiatric disorders related to foreign migration: a Danish population-based cohort study. JAMA Psychiatry 70, 427-35 (2013). 32. 32.Cantor-Graae, E. & Selten, J.P. Schizophrenia and migration: a meta-analysis and review. Am J Psychiatry 162, 12-24 (2005). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1176/appi.ajp.162.1.12&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=15625195&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000226262000003&link_type=ISI) 33. 33.Dealberto, M.J. Ethnic origin and increased risk for schizophrenia in immigrants to countries of recent and longstanding immigration. Acta Psychiatr Scand 121, 325-39 (2010). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1111/j.1600-0447.2009.01535.x&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20105146&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000276650800002&link_type=ISI) 34. 34.Wallace, D.C. A mitochondrial paradigm of metabolic and degenerative diseases, aging, and cancer: a dawn for evolutionary medicine. Annu Rev Genet 39, 359-407 (2005). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1146/annurev.genet.39.110304.095751&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=16285865&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000234685200017&link_type=ISI) 35. 35.Kenney, M.C. et al. Molecular and bioenergetic differences between cells with African versus European inherited mitochondrial DNA haplogroups: implications for population susceptibility to diseases. Biochim Biophys Acta 1842, 208-19 (2014). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.bbadis.2013.10.016&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24200652&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000331414900010&link_type=ISI) 36. 36.Larsen, S. et al. Increased intrinsic mitochondrial function in humans with mitochondrial haplogroup H. Biochim Biophys Acta 1837, 226-31 (2014). 37. 37.Atilano, S.R. et al. Mitochondrial DNA variants can mediate methylation status of inflammation, angiogenesis and signaling genes. Hum Mol Genet 24, 4491-503 (2015). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddv173&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25964427&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 38. 38.Kenney, M.C. et al. Inherited mitochondrial DNA variants can affect complement, inflammation and apoptosis pathways: insights into mitochondrial-nuclear interactions. Hum Mol Genet 23, 3537-51 (2014). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddu065&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24584571&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000338126300016&link_type=ISI) 39. 39.Raffaello, A., Mammucari, C., Gherardi, G. & Rizzuto, R. Calcium at the Center of Cell Signaling: Interplay between Endoplasmic Reticulum, Mitochondria, and Lysosomes. Trends Biochem Sci 41, 1035-1049 (2016). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.tibs.2016.09.001&link_type=DOI) 40. 40.Verge, B. et al. Mitochondrial DNA (mtDNA) and schizophrenia. Eur Psychiatry 26, 45-56 (2011). [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20980130&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 41. 41.Burisch, J. Crohn’s disease and ulcerative colitis. Occurrence, course and prognosis during the first year of disease in a European population-based inception cohort. Dan Med J 61, B4778 (2014). [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24393595&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 42. 42.Kurtzke, J.F. Geographic distribution of multiple sclerosis: An update with special reference to Europe and the Mediterranean region. Acta Neurol Scand 62, 65-80 (1980). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1111/j.1600-0404.1980.tb03006.x&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=6971038&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=A1980KK06900001&link_type=ISI) 43. 43.McLeod, J.G., Hammond, S.R. & Kurtzke, J.F. Migration and multiple sclerosis in immigrants to Australia from United Kingdom and Ireland: a reassessment. I. Risk of MS by age at immigration. J Neurol 258, 1140-9 (2011). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1007/s00415-010-5898-4&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21264474&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000290985300026&link_type=ISI) 44. 44.de Lau, L.M. & Breteler, M.M. Epidemiology of Parkinson’s disease. Lancet Neurol 5, 525-35 (2006). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/S1474-4422(06)70471-9&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=16713924&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000237906000021&link_type=ISI) 45. 45.Boonstra, K., Beuers, U. & Ponsioen, C.Y. Epidemiology of primary sclerosing cholangitis and primary biliary cirrhosis: a systematic review. J Hepatol 56, 1181-8 (2012). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.jhep.2011.10.025&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22245904&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000302980800025&link_type=ISI) 46. 46.Aidt, F.H. et al. Dysfunctional mitochondrial respiration in the striatum of the Huntington’s disease transgenic R6/2 mouse model. PLoS Curr 5(2013). 47. 47.Arduino, D.M., Esteves, A.R., Swerdlow, R.H. & Cardoso, S.M. A cybrid cell model for the assessment of the link between mitochondrial deficits and sporadic Parkinson’s disease. Methods Mol Biol 1265, 415-24 (2015). 48. 48.Samuels, D.C., Carothers, A.D., Horton, R. & Chinnery, P.F. The power to detect disease associations with mitochondrial DNA haplogroups. Am J Hum Genet 78, 713-20 (2006). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1086/502682&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=16532401&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000236755900017&link_type=ISI) 49. 49.Raule, N., Sevini, F., Santoro, A., Altilia, S. & Franceschi, C. Association studies on human mitochondrial DNA: methodological aspects and results in the most common age-related diseases. Mitochondrion 7, 29-38 (2007). 50. 50.Biffi, A. et al. Principal-component analysis for assessment of population stratification in mitochondrial medical genetics. Am J Hum Genet 86, 904-17 (2010). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2010.05.005&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20537299&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 51. 51.Achilli, A. et al. The molecular dissection of mtDNA haplogroup H confirms that the Franco-Cantabrian glacial refuge was a major source for the European gene pool. Am J Hum Genet 75, 910-8 (2004). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1086/425590&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=15382008&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000224303500018&link_type=ISI) 52. 52.Agerbo, E. et al. Modelling the contribution of family history and variation in single nucleotide polymorphisms to risk of schizophrenia: a Danish national birth cohort-based study. Schizophr Res 134, 246-52 (2012). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.schres.2011.10.025&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22108675&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 53. 53.van Oven, M. & Kayser, M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum Mutat 30, E386-94 (2009). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1002/humu.20921&link_type=DOI) [PubMed](http://biorxiv.org/lookup/external-ref?access_num=18853457&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F12%2F149070.atom) 54. 54.Breiman, L. Random forests. Mach Learn 45, 5-32 (2001). [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1023/A:1010933404324&link_type=DOI) [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000170489900001&link_type=ISI)