Estimating phenotypic polygenicity and causal effect size variance from GWAS summary statistics while accounting for inflation due to cryptic relatedness
=========================================================================================================================================================

* Dominic Holland
* Chun-Chieh Fan
* Oleksandr Frei
* Alexey A. Shadrin
* Olav B. Smeland
* V. S. Sundar
* Ole A. Andreassen
* Anders M. Dale

## Abstract

Of signal interest in the genetics of traits are estimating the proportion, *π*1, of causally associated single nucleotide polymorphisms (SNPs), and their effect size variance, ![Graphic][1]</img>, which are components of the mean heritabilities captured by the causal SNP. Here we present the first model, using detailed linkage disequilibrium structure, to estimate these quantities from genome-wide association studies (GWAS) summary statistics, assuming a Gaussian distribution of SNP effect sizes, *β*. We apply the model to three diverse phenotypes – schizophrenia, putamen volume, and educational attainment – and validate it with extensive simulations. We find that schizophrenia is highly polygenic, with ≃ 5 × 104 causal SNPs distributed with small effect size variance, ![Graphic][2]</img>(in units where the phenotype variance is normalized to 1), requiring a GWAS study with more than 1/2-million samples in each arm for full discovery. In contrast, putamen volume involves only ≃ 3 × 102 causal SNPs, but with ![Graphic][3]</img>, indicating a much larger proportion of the causal SNPs that are strongly associated. Educational attainment has similar polygenicity to schizophrenia, but with effects that are substantially weaker, ![Graphic][4]</img>, leading to much lower heritability. Thus the model is able to describe the broad genetic architecture of phenotypes where both polygenicity and effect size variance range over several orders of magnitude, shows why only small proportions of heritability have been explained for discovered SNPs, and provides a roadmap for future GWAS discoveries.

Keywords
*   GWAS
*   Polygenicity
*   Causal SNPs
*   Effect size
*   Linkage Disequlilbrium

## INTRODUCTION

The genetic components of complex traits or diseases arise from hundreds to likely many thousands of single nucleotide polymorphisms (SNPs) (Visscher et al., 2012), most of which have weak effects. As sample sizes increase, more of the associated SNPs are identifiable (they reach genomewide significance), though power for discovery varies widely across phenotypes. Of particular interest are estimating the proportion of SNPs (polygenicity) involved in any particular phenotype; their effective strength of association (discoverability); the proportion of variation in susceptibility, or phenotypic variation, captured additively by all common causal SNPs (approximately, the narrow sense heritability), and the fraction of that captured by genomewide significant SNPs – all of which are active areas of research (Stahl et al., 2012; Yang et al., 2015; So et al., 2011; Speed et al., 2012; Lee et al., 2011; Yang et al., 2011a; Kumar et al., 2016; Palla and Dudbridge, 2015). However, the effects of population structure (Price et al., 2010), combined with high polygenicity and linkage disequilibrium (LD), leading to spurious degrees of SNP association, or inflation, considerably complicate matters, and are also areas of much focus (Yang et al., 2011c; Bulik-Sullivan et al., 2015; Kang et al., 2010). Yet, despite recent significant advances, it has been difficult to develop a mathematical model of polygenic architecture based on GWAS that can be used for power estimated across human phenotypes.

Here, in a unified approach explicitly taking into account LD, we present a model relying on genome-wide association studies (GWAS) summary statistics (z-scores for SNP associations with a phenotype (Pasaniuc and Price, 2016)) to estimate polygenicity (*π*1) and discoverability (![Graphic][5]</img>), as well as any residual inflation of the z-scores arising from variance distortion induced by cryptic relatedness (![Graphic][6]</img>), which remains a concern in large-scale studies (Price et al., 2010). We estimate *π*1, ![Graphic][7]</img> and ![Graphic][8]</img>, by postulating a z-score probability distribution function (pdf) that explicitly depends on them, and fitting it to the actual distribution of GWAS z-scores.

Estimates of polygenicity and discoverability allow one to estimate compound quantities, like narrow-sense heritability captured by the SNPs (Witte et al., 2014); to predict the power of larger-scale GWAS to discover genomewide significant loci; and to understand why some phenotypes have higher power for SNP discovery and proportion of heritability explained than other phenotypes.

In previous work (Holland et al., 2016) we presented a related model that treated the overall effects of LD on z-scores in an approximate way. Here we take the details of LD explicitly into consideration, resulting in a conceptually more basic model to predict the distribution of z-scores. We apply the model to multiple phenotypes, in each case estimating the three model parameters and auxiliary quantities, including the overall inflation factor *λ*, (traditionally referred to as genomic control (Devlin and Roeder, 1999)) for pruned SNP sets, and narrow sense heritability, *h*2. We also perform extensive simulations on genotypes with realistic LD structure in order to validate the interpretation of the model parameters.

## METHODS

### The Model: Probability Distribution for z-scores

Consistent with the work of others (Yang et al., 2011c), we assume the causal SNPs are distributed randomly throughout the genome (an assumption that can be relaxed when explicitly considering different SNP categories, but that in the main is consistent with the additive variation explained by a given part of the genome being proportional to the length of DNA (Yang et al., 2011b)), and that their *β* coefficients in the GWAS framework are distributed normally with variance ![Graphic][9]</img>: ![Formula][10]</img>

(We use the symbol *β* to refer to a scalar or vector, with context indicating which.) Taking into account all SNPs (the remaining ones are all null by definition), this is equivalent to the two-component Gaussian mixture model ![Formula][11]</img> where ![Graphic][12]</img> is the Dirac delta function, so that considering all SNPs, the net variance is var ![Graphic][13]</img>. Ignoring LD, the association z-scores for causal SNPs can be decomposed into an effect *δ* and a residual term ![Graphic][14]</img>, assumed to be independent (Holland et al., 2016): ![Formula][15]</img> with ![Formula][16]</img> where *N* is the sample size and *H* is the SNP’s heterozygosity (frequency of the heterozygous genotype, *H* = 2*p*(1−*p*) where *p* is the frequency of either of the SNP’s alleles), so that ![Formula][17]</img> where ![Formula][18]</img>

Now consider the effects of LD on z-scores. Let *βeff* be the true effective *β*-coefficient for a tag SNP arising due to LD with neighboring causal SNPs. It is given by the sum of neighboring causal SNP *β*-coefficients, each weighted by its correlation with the tag SNP: ![Formula][19]</img>

Then, from Eq. 3, the z-score for the tag SNP’s association with the phenotype is given by: ![Formula][20]</img>

Thus, for example, if the SNP itself were not causal but were in LD with *k* known causal SNPs, where its LD with each of these was the same, given by some value *r*2 (0 < *r*2 ≤ 1), then *σ*2 will be given by ![Formula][21]</img>

For this idealized case, the marginal distribution, or pdf, of z-scores for a set of such associated SNPs is ![Formula][22]</img> where *ϕ*(·*, μ, σ*2) is the normal distribution with mean *μ* and variance *σ*2, and *L* is shorthand for the LD structure of such SNPs – in this case, denoting LD given by *r*2 with exactly *k* causals. If a proportion *a* of all tag SNPs are similarly associated with the phenotype while the remaining proportion are all null (not causal and not in LD with causal SNPs), then the marginal distribution for all SNP z-scores is the gaussian mixture ![Formula][23]</img> dropping the parameters for convenience.

For real genotypes, however, the LD structure is far more complicated, and of course the causal SNPs are generally numerous and unknown. As in our previous work, we incorporate the model parameter *π*1 for the fraction of all SNPs that are causal (Holland et al., 2016). Additionally, we calculate the actual LD structure for each SNP. That is, for each SNP we build a histogram of the numbers of other SNPs in LD with it for *w* equally-spaced *r*2-windows between 0.05 and 1; we use *L* again as a short-hand to represent all this. The value ![Graphic][24]</img> was chosen as a lower-bound for LD above the noise threshold; we find that *w* ≃ 10 is sufficient for converged results. For any given SNP, the set of SNPs thus determined to be in LD with it constitute its LD block, with their number given by *n* (LD with self is always 1, so *n* is at least 1). The pdf for z-scores, given *N, H, L* and the three model parameters *π*1*, σβ, σ*, will then be given by the sum of gaussians that are generalizations of Eq. 10 for different combinations of numbers of causals among the *w* LD windows, each gaussian scaled by the probability of the corresponding combination of causals among the LD windows, i.e., by the appropriate multinomial distribution term.

For *w r*2-windows, we must consider the possibilities where the tag SNP is in LD with all possible numbers of causal SNPs in each of these windows, or any combination thereof. There are thus *w* + 1 categories of SNPs: null SNPs (which *r*2-windows they are in is irrelevant), and causal SNPs, where it does matter which *r*2-windows they reside in. If window *i* has *ni* SNPs (![Graphic][25]</img>), and the overall fraction of SNPs that are causal is *π*1, then the probability of having simultaneously *k* null SNPs, *k*1 causal SNPs in window 1, and so on through *kw* causal SNPs in window *w*, for a nominal total of *K* causals (![Graphic][26]</img> and *k* = *n* − *K*), is given by the multinomial distribution, which we denote *M* (*k*, …, *kw*; *n*, …, *nw*; *π*1). For an LD block of *n* SNPs, the prior probability, *pi*, for a SNP to be causal and in window *i* is the product of the independent prior probabilities of a SNP being causal and being in window *i*: *pi* = *π*1*ni/n*. The prior probability of being null (regardless of *r*2-window) is simply *p* = (1 − *π*1). The probability of a given breakdown *k*, …, *kw* of the neighboring SNPs into the *w*+1 categories is then given by ![Formula][27]</img> and the corresponding gaussian is ![Formula][28]</img>

For a SNP with heterozygosity *H* and LD structure *L*, the pdf for its z-score, given *N* and the model parameters, is then given by summing over all possible numbers of total causals in LD with the SNP, and all possible distributions of those causals among the *w r*2-windows: ![Formula][29]</img> where *Kmax* in bounded above by *n*. Note again that *L* is shorthand for the linkage-disequilibrium structure of the SNP, giving the set {*ni*}, and hence, for a given *π*1, *pi*. Also there is the constraint ![Graphic][30]</img> on the second summation, and, for all *i*, max(*ki*) = max(*K, ni*), though generally – see below – *Kmax* ≪ *ni*. The number of ways of dividing *K* causal SNPs amongst *w* LD windows is given by the binomial coefficient ![Graphic][31]</img>, where *m* = *K* + *w* – 1 and *a* = *w* − 1, so the number of terms in the second summation grows rapidly with *K* and *w*. However, because *π*1 is small (often ≤10−3), we find that the upper bound on the first summation over total number of potential causals *K* in the LD block for the SNP can be limited to *Kmax* < min(10*, n*), even for large blocks with *n* ≃ 103. That is, ![Formula][32]</img>

Still, the number of terms is large; e.g., for *K* = 8 and *w* = 5 there are 495 terms. We approximate the sums in Eq. 14 with the simpler expression involving only sums over terms where the causal SNPs all reside in the same *r*2-window, plus a null term. The probability that any *K* of the *n* SNPs in the block are causal while the remainder *n* − *K* are null is given by the binomial distribution, *B*(*K, n*; *π*1): ![Formula][33]</img>

Multiplying this by *ni/n* approximates the probability of their being in the *i*-th *r*2-window. Multiplying these into the gaussian corresponding to *K* causals in window *i*, summing over both indices, and incorporating the null term, leads to the following approximation that is in good numerically agreement with Eq. 14: ![Formula][34]</img>

### Data Preparation

For real phenotypes, we calculated SNP minor allele frequency (MAF) and LD between SNPs using the 1000 Genomes phase 3 data set for 503 subjects/samples of European ancestry (Consortium et al., 2015, 2012; Sveinbjornsson et al., 2016). For simulations, we used HapGen2 (Li and Stephens, 2003; Spencer et al., 2009; Su et al., 2011) to generate genotypes; we calculated SNP MAF and LD structure from 1000 simulated samples. We elected to use the same intersecting set of SNPs for real data and simulation. For HapGen2, we eliminated SNPs for which more than 99% of genotypes were identical; for 1000 Genomes, we eliminated SNPs for which the call rate (percentage of samples with useful data) was less than 90%. This left *nsnp*=11,015,833 SNPs.

Sequentially moving through each chromosome in contiguous blocks of 5,000 SNPs, for each SNP in the block we calculated its Pearson *r*2 correlation coefficients with all SNPs in the central bock itself and with all SNPs in the pair of flanking blocks of size up to 50,000 each. For each SNP we calculated its total LD (TLD), given by the sum of LD *r*2’s thresholded such that if *r*2 < 0.05 we set that *r*2 to zero (zeroing out the noise). For each SNP we also built a histogram giving the numbers of SNPs in *w* = 8 equally-spaced *r*2-windows covering the range 0.05 ≤ *r*2 ≤ 1. These steps were carried out independently for both 1000 Genomes phase 3 and for HapGen2.

Employing a similar procedure, we also built binary (logical) LD matrices identifying all pairs of SNPs for which LD *r*2 > 0.8, a liberal threshold for SNPs being “synonymous”.

In applying the model to summary statistics, we restricted to SNPs for which TLD ≤ 600, MAF ≥ 0.005, and LD block size (defined by *r*2 ≥ 0.05) ≤ 2000.

We analyzed summary statistics for participants with European ancestry for: (1) schizophrenia from the Psychiatric Genomics Consortium (PGC) (Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014), with 35,476 cases and 46,839 controls (*Neff* ≡ 4/(1/*Ncases* +1/*Ncontrols*) = 76, 326) across 52 separate substudies, with imputation of SNPs using the 1000 Genomes Project reference panel (1000 Genomes Project Consortium, 2010) for a total of approximately 5,369,285 genotyped and imputed SNPs passing the above restrictions (Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014); (2) putamen volume, normalized by intracranial volume, using data from the Enhancing Neuro Imaging Genetics through Meta-Analysis (ENIGMA) consortium (Hibar et al., 2015), with 12,596 samples and a total of 4,196,831 SNPs; and (3) educational attainment, measured as the number of years of schooling completed, with 328,917 samples and a total of 5,361,110 SNPs, available at [https://www.thessgac.org](http://https://www.thessgac.org) (Okbay et al., 2016). Examples of SNP histograms for schizophrenia are in Supporting Material Fig. 5.

### Simulations

We generated genotypes for 105 unrelated simulated samples using HapGen2 (Su et al., 2011). For narrow-sense heritability *h*2 equal to 0.1, 0.4, and 0.7, we considered polygenicity *π*1 equal to 10−5, 10−4, 10−3, and 10−2. For each of these 12 combinations, we randomly selected *ncausal* = *π*1×*nsnp* “causal” SNPs and assigned them *β*-values drawn from the standard normal distribution (i.e., independent of *H*), with all other SNPs having *β* = 0. We repeated this ten times, giving ten independent instantiations of random vectors of *β*’s. Defining *Yg* = *Gβ*, where *G* is the genotype matrix and *β* here is the vector of coefficients over all SNPs, the total phenotype vector is constructed as *Y* =*Yg* +*ϵ*, where the residual random vector *ϵ* for each instantiation is drawn from a normal distribution such that *h*2 = var(*Yg*) /var(*Y*). For each of the instantiations this implicitly defines the “true” value ![Graphic][35]</img>.

The regression slope, *β*, and the Pearson correlation coefficient, *r*, are assumed to be t-distributed. These quantities have the same t-value: ![Graphic][36]</img>, with corresponding p-value from Student’s *t* cumulative distribution function (cdf) with *N* – 2 degrees of freedom: *p* = 2×tcdf(−|*t*|*, N* − 2). Since we are not here dealing with covariates, we calculated *p* from correlation, which is slightly faster than from estimating the regression coefficient. The t-value can be transformed to a z-value, giving the z-score for this *p*: *z* = −Φ−1(*p*/2) × sign(*r*), where Φ is the normal cdf (*z* and *t* have same p-value).

### Parameter Estimation

We randomly pruned SNPs using the threshold *r*2 > 0.8 to identify “synonymous” SNPs, performing ten such iterations. That is, for each of ten iterations, we randomly selected a SNP (not necessarily the one with largest z-score) to represent each subset of synonymous SNPs. For schizophrenia, for example, pruning resulted in approximately 1.3 million SNPs in each iteration.

The postulated pdf for a SNP’s z-score depends on the SNP’s heterozygosity, H, and detailed LD structure, i.e., its LD histogram, L. Given the data – the set of z-scores for all SNPs, as well as their heterozygosities and LD-structures – and the H- and L-dependent pdf for z-scores, the objective is to find the model parameters that best predict the distribution of all z-scores. H ranges between 0.05 and 0.5, and the amplitudes of L will vary over a wide range. A useful one-dimensional proxy for L is TLD, which ranges from 1 to 600. Since the model pdf explicitly predicts z-score distributions for particular values of H and L, instead of taking all the SNPs at once, we bin the SNPs with respect to a grid of these quantities; for any given (H,TLD) bin there will be a range of z-scores whose distribution the model it intended to predict. We find that a 5×5-grid of equally spaced bins is adequate for converged results. In lieu of or in addition to TLD binning, one can bin SNPs with respect to their total LD block size (total number of SNPs in LD, ranging from 1 to 2,000).

To find the model parameters that best fit the data, for a given (H,TLD) bin we binned the selected SNPs z-scores into equally-spaced bins of width *dz*=0.4 (between *zmin*=−38 and *zmax*=38, allowing for p-values near the numerical limit of 10−315), and from Eq. 17 calculated the probability for z-scores to be in each of those z-score bins (the prior probability for “success” in each z-score bin). Then, knowing the actual numbers of z-scores (numbers of “successes”) in each z-score bin, we calculated the multinomial probability, *pm*, for this outcome. The optimal model prameter values will be those that maximize the accrual of this probability over all (H,TLD) bins. We constructed a cost function by calculating, for a givem (H,TLD) bin, −ln(*pm*) and averaging over prunings, and then accumulating this over all (H,TLD) bins. Model parameters minimizing the cost were obtained from Nelder-Mead multi-dimensional unconstrained nonlinear minimization of the cost function, using the Matlab function fminsearch().

### Posterior Effect Sizes

Model posterior effect sizes were calculated using numerical integration over the random variable *δ*: ![Formula][37]</img>

Here, since ![Graphic][38]</img>, the posterior probability of *z* given *δ* is simply ![Formula][39]</img>

*P* (*z*) is shorthand for pdf(*z*|*N, H, L*; *π*1*, σβ, σ*), given by Eq. 17, and, also from Eq. 17, *P* (*δ*) is ![Formula][40]</img>

Similarly, ![Formula][41]</img> which is used in power calculations.

### GWAS Power

It is of interest to estimate the proportion of additive phenotypic variance arising from the *nsnp* SNPs under study (the chip heritability (Witte et al., 2014)) that can be explained by SNPs that reach genome-wide significance, *p*≤5×10−8 (i.e., for which |*z*|>*zt*=5.33) at a given sample size (Pe’er et al., 2008; McCarthy et al., 2008). For a SNP with genotype vector *g* (over *N* samples) and heterozygosity *H*, one has var(*Y* |*g*)=var(*βg*)=2*β*2*H* and ![Graphic][42]</img>. Using Eq. 21, let *C*≡*E*(*δ*2|*z, N*) *P* (*z, N*), emphasizing dependence on sample size, *N*. Then the proportion of chip heritability captured additively by genome-wide significant SNPs is ![Formula][43]</img>

The ratio in Eq. 22 should be accurate if the average effects of LD in the numerator and denominator cancel – which will always be true as the ratio approaches 1 for large *N*. Plotting *S*(*N*; *zt*) gives an indication of the power of future GWAS to capture chip heritability.

### Quantile-Quantile Plots and Genomic Control

One of the advantages of quantile-quantile (QQ) plots (also known as PP plots) is that on a logarithmic scale they emphasize behavior in the tails of a distribution, and provide a valuable visual aid in assessing the independent effects of polygenicity, strength of association, and cryptic relatedness – the roles played by the three model parameters – as well as showing how well a model fits data. QQ plots for the model were constructed using Eq. 17, replacing the normal pdf with the normal cdf, and replacing *z* with an equally-spaced vector ![Graphic][44]</img> of length 10,000 covering a wide range of nominal |*z*| values (0 through 38). SNPs were divided into a 5×5 grid of H×TLD bins, and the cdf vector (with elements corresponding to the z-values in ![Graphic][45]</img>) accumulated for each such bin (using mean values of H and TLD for SNPs in a given bin).

For a given set of samples and SNPs, the genomic control factor, *λ*, for the z-scores is defined as the median *z*2 divided by the median for the null distribution, 0.455 (Devlin and Roeder, 1999). This can also be calculated from the QQ plot. In the plots we present here, the abscissa gives the -log10 of the proportion, *q*, of SNPs whose z-scores exceed the two-tailed significance threshold *p*, transformed in the ordinate as -log10(*p*). The median is at *qmed* = 0.5, or −log10(*qmed*) ≃ 0.3; the corresponding empirical and model p-value thresholds (*pmed*) for the z-scores – and equivalently for the z-scores-squared – can be read off from the plots. The genomic inflation factor is then given by *λ* = [Φ−1(*pmed*/2)]2/0.455. Note that the values of *λ* reported here are for pruned SNP sets; these values will be lower than for the total GWAS SNP sets.

Knowing the total number, *ntot*, of p-values involved in a QQ plot (number of GWAS z-scores from pruned SNPs), any point (*q, p*) (log-transformed) on the plot gives the number, *np* = *qntot*, of p-values that are as extreme as or more extreme than the chosen p-value. This can be thought of as *np* “successes” out of *ntot* independent trials (thus ignoring LD) from a binomial distribution with prior probability *q*. To approximate the effects of LD, we estimate the number of independent SNPs as *ntot/f* where *f* ≃ 10. The 95% binomial confidence interval for *q* is calculated as the exact Clopper-Pearson 95% interval (Clopper and Pearson, 1934), which is similar to the normal approximation interval, ![Graphic][46]</img>.

### Narrow-sense Chip Heritability

Since we are treating the *β* coefficients as fixed effects in the simple linear regression GWAS formalism, with the phenotype vector standardized with mean zero and unit variance, the proportion of phenotypic variance explained by a particular causal SNP, *q*2=var(*y*|*g*), is given by *q*2 = *β*2*H*. The proportion of phenotypic variance explained additively by all causal SNPs is, by definition, the narrow sense chip heritability, *h*2. Since ![Graphic][47]</img> and *ncausal* = *π*1*nsnp*, and taking the mean heterozygosity over causal SNPs to be approximately equal to the mean over all SNPs, ![Graphic][48]</img>, the chip heritability can be estimated as ![Formula][49]</img>

For all-or-none traits like disease status, the estimated *h*2 from Eq. 23 for an ascertained case-control study is on the observed scale and is a function of the prevalence in the adult population, *K*, and the proportion of cases in the study, *P*. The heritability on the underlying continuous liability scale (Falconer, 1965), ![Graphic][50]</img>, is obtained by adjusting for ascertainment (multiplying by *K*(1 − *K*)/(*P* (1 − *P*)), the ratio of phenotypic variances in the population and in the study) and rescaling based on prevalence (Dempster and Lerner, 1950; Lee et al., 2011): ![Formula][51]</img> where *a* is the height of the standard normal pdf at the truncation point *zK* defined such that the area under the curve in the region to the right of *zK* is *K*.

## RESULTS

### Phenotypes

Figure 1 shows QQ plots for the z-scores for schizophrenia, educational attainment, and putamen volume, along with model estimates. In all cases, the model fit (yellow) closely tracks the data (dark blue). Figure 6 in Supporting Material shows QQ subplots for a 5 × 5 grid of *H* × *T LD* ranges for schizophrenia. 

![Figure 1:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2017/06/23/133132/F1.medium.gif)

[Figure 1:](http://biorxiv.org/content/early/2017/06/23/133132/F1)

Figure 1: 
QQ plots of z-scores for (A) schizophrenia, (B) educational attainment, and (C) putamen volume, (dark blue, 95% confidence interval in light blue) with model prediction (yellow). The dashed line is the expected QQ plot under null (no SNPs associated with the phenotype). *λ* and ![Graphic][52]</img> are the overall nominal genomic control factors calculated from the data and model plots, respectively. The three estimated model parameters are: polygenicity, ![Graphic][53]</img>; discoverability, ![Graphic][54]</img>; and SNP association *χ*2-statistic inflation factor, ![Graphic][55]</img>. ![Graphic][56]</img> is the estimated narrow-sense chip heritability (reexpressed as ![Graphic][57]</img> on the liability scale for schizophrenia assuming a prevalence of 1% in adult populations), and ![Graphic][58]</img> is the estimated number of causal SNPs. *nsnp* = 11,015,833 is the total number of SNPs, whose LD and MAF underlie the model; the GWAS z-scores are for subsets of these SNPs. Though the phenotypes are diverse (examples of a categorical mental disorder, a behavioural phenotype, and a cerebral subregional tissue volume), the model nevertheless provides good fits, even though estimated polygenicities differ by two orders of magnitude and discoverabilities differ by almost three orders of magnitude. *Nsamp* is the sample size, expressed as *Neff* for schizophrenia – see text. Reading the plots: on the vertical axis, choose a p-value threshold (more extreme values are further from the origin), then the horizontal axis gives the proportion of SNPs exceeding that threshold (higher proportions are closer to the origin).

The estimated number of causal SNPs is given by the polygenicity, *π*1, times the total number of SNPs, *nsnp*; the latter is given by the total number of SNPs that went into building the LD structure, *L* in Eq. 17, i.e., the approximately 11 million SNPs selected from the 1000 Genomes Phase 3 reference panel, not the number of SNPs in the particular GWAS. Thus, for schizophrenia, *π*1 = 5.0 × 10−3, so that ![Graphic][59]</img>, not all of which are in linkage equilibrium. Educational attainment has slightly greater polygenicity than schizophrenia, *π*1 = 7.7 × 10−3. In contrast, for putamen volume *π*1 = 2.6 × 10−5, so that ![Graphic][60]</img>.

The effective strength of SNP association with the phenotype (mean *β*2 for causals, the effective SNP “discoverability”) is ![Graphic][61]</img> for schizophrenia (in units where the variance of the phenotype is normalized to 1). It is an order of magnitude smaller for educational attainment, ![Graphic][62]</img>, but two orders of magnitude bigger for putamen volume: ![Graphic][63]</img>.

Note that for logistic linear regression coefficient *β*, the odds ratio for disease is OR = *eβ*; for a rare disease, this is approximately equal to the genotypic relative risk: GRR ≃ OR. Since ![Graphic][64]</img>, the mean relative risk ![Graphic][65]</img>. Thus, for schizophrenia, the mean relative risk is ≃ 1.0000175.

The narrow sense heritability from the ascertained case-control schizophrenia GWAS is estimated as *h*2=0.41 (with mean heterozygosity from the ~11 million SNPs, ![Graphic][66]</img>). Taking adult population prevalence of schizophrenia to be *K*=0.01 (Purcell et al., 2009; Whiteford et al., 2013), and given that there are 35,476 cases and 46,839 controls in the study, so that *P* =0.43, the heritability on the liability scale for schizophrenia from Eq. 24 is ![Graphic][67]</img>; for *K*=0.005 (Kinney et al., 2009), ![Graphic][68]</img>. For the quantitative endophenotype putamen volume, the heritability is estimated to be 7%, while for educational attainment the heritability is estimated to be 10%.

Figure 2 shows the sample size required to reach a given proportion of chip heritability for the phenotypes (assuming equal numbers of cases and controls for schizophrenia: *Neff* = 4/(1*/Ncases* + 1*/Ncontrols*), so that when *Ncases* = *Ncontrols*, *Neff* = *Ncases* +*Ncontrols* = *N*, the total sample size). At current sample sizes, only 10%, 10%, and 7% of narrow-sense chip heritability is captured for schizophrenia, educational attainment, and putamen volume, respectively. And to capture the preponderance of chip heritability for schizophrenia, for example, a sample with approximately half a million each of cases and controls would be needed. 

![Figure 2:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2017/06/23/133132/F2.medium.gif)

[Figure 2:](http://biorxiv.org/content/early/2017/06/23/133132/F2)

Figure 2: 
Proportion of narrow-sense chip heritability captured by genome-wide significant SNPs as a function of sample size, *N*. Left-to-right plot order is determined by decreasing ![Graphic][69]</img>. For current sample sizes, the proportions are: putamen volume, 0.064; schizophrenia, 0.096; educational attainment, 0.109.

The estimated total inflation factor for the pruned data, *λ*, is almost exactly predicted by the model. E.g., for schizophrenia, ![Graphic][70]</img>, whereas for educational attainment the values are *λ* = 1.16 and ![Graphic][71]</img>. Higher polygenicity, *π*1, mean strength of association, ![Graphic][72]</img>, and sample size, *N*, will all contribute to higher *λ*. Residual population structure in the form of cryptic relatedness will also contribute to genomic inflation. For schizophrenia, inflation from population structure is estimated to be ![Graphic][73]</img>. In contrast, for educational attainment ![Graphic][74]</img>, indicating essentially no residual inflation due to population structure.

### Simulations

Table 1 shows the simulation results, comparing true and estimated values for the model parameters, heritability, and the number of causal SNPs. In supporting material, Figure 3 shows QQ plots for a randomly chosen *β*-vector and phenotype instantiation for each of the twelve (*π*1, *h*2) scenarios. Most of the ![Graphic][75]</img> estimated are in reasonable agreement with the true values, though for *π*1 = 10−5 they are larger by about a factor of two for *h*2 equal to 0.4 and 0.7. The number of estimated causals are in correspondingly good agreement with the true values, ranging in increasing powers of 10 from 110 through 110,158. While the estimated polygenicities tend to be slight overestimates, the estimated discoverabilities, ![Graphic][76]</img>, tend to be under-estimates. From Supporting Material Figure 3, the tails of the QQ plots for the true parameters (dashed dark blue curves), particularly for the larger *π*1’s, deviate from the simulated data plots (solid dark blue curves), consistently over-estimating the proportion of SNPs with more extreme z-scores. The model fit, however, bends these curves down toward the data curves. Note that steeper tails have larger ![Graphic][77]</img>’s, and larger *π*1’s lead to earlier departure from the null line. In all cases, ![Graphic][78]</img> is close to 1, indicating no cryptic relatedness. Estimates of heritability, ![Graphic][79]</img>, show a tendency to decrease with increasing *π*1. In all cases, however, the value for genomic control, *λ*, estimated from the model is in very good agreement with the value estimated from the simulated data; these values increase both as *π*1 and ![Graphic][80]</img> (or *h*2, for fixed *π*1) increase. E.g., for *π*1 = 10−5 and *h*2 = 0.1, *λ* = 1.01 and ![Graphic][81]</img>, while for *π*1 = 10−2 and *h*2 = 0.7, *λ* = 1.26 and ![Graphic][82]</img>. 

![Figure 3:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2017/06/23/133132/F3.medium.gif)

[Figure 3:](http://biorxiv.org/content/early/2017/06/23/133132/F3)

Figure 3: 
Quantile-quantile plots for simulations. True polygenicity is specified for each row, and true heritability is specified for each column. QQ-plots for simulated data in dark blue, with 95% confidence interval in light blue; model prediction in yellow. The dashed blue curve is the QQ plot corresponding to the true parameters. ![Graphic][83]</img> and ![Graphic][84]</img> are the overall nominal genomic control factors calculated from the plots. The three estimated model parameters are: polygenicity, ![Graphic][85]</img>; discoverability, ![Graphic][86]</img> and SNP association *χ*2-statistic inflation factor, ![Graphic][87]</img>. ![Graphic][88]</img> is the estimated narrow-sense chip heritability, and ![Graphic][89]</img> is the estimated number of causal SNPs. The dotted black line is the expected plot under null. ![Graphic][90]</img> is the same as ![Graphic][91]</img> but with ![Graphic][92]</img> calculated from the known causal SNPs (instead of from all SNPs). Reading the plots: on the vertical axis, choose a p-value threshold (more extreme values are further from the origin), then the horizontal axis gives the proportion of SNPs exceeding that threshold (higher proportions are closer to the origin).

![Figure 4:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2017/06/23/133132/F4.medium.gif)

[Figure 4:](http://biorxiv.org/content/early/2017/06/23/133132/F4)

Figure 4: 
(A) Mean value of heterozygosity for given total LD (SNPs were binned based on TLD and the mean TLD for each bin plotted on the x-axis; the corresponding mean heterozygosity for SNPs in each bin was then plotted on the y-axis). (B) Mean value of total LD for given heterozygosity. Plots made for SNPs in the PGC2 schizophrenia GWAS; TLD and H calculated from 1000 Genomes phase 3 reference panel.

![Figure 5:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2017/06/23/133132/F5.medium.gif)

[Figure 5:](http://biorxiv.org/content/early/2017/06/23/133132/F5)

Figure 5: 
Histograms of SNPs in schizophrenia GWAS, by (A) total LD, and (B) heterozygosity.

![Figure 6:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2017/06/23/133132/F6.medium.gif)

[Figure 6:](http://biorxiv.org/content/early/2017/06/23/133132/F6)

Figure 6: 
QQ plots for schizophrenia, for a 5X5 grid of total LD X heterozygosity. *n* is the number of SNPs in each plot. *H* and *TLD* are the mean values in each plot. ![Graphic][93]</img> and ![Graphic][94]</img> are the genomic control values calculated from the QQ plots for the data and the model, respectively.

View this table:
[Table 1:](http://biorxiv.org/content/early/2017/06/23/133132/T1)

Table 1: 
Simulation results: comparison of mean (std) true and estimated (ˆ) model parameters and derived quantities. Results for each line, for specified heritability *h*2 and fraction *π*1 of causal SNPs, are from 10 independent instantiations with random selection of the *ncausal* causal SNPs that are assigned a *β*-value from the standard normal distribution. Defining *Yg* = *Gβ*, where *G* is the genotype matrix, the total phenotype vector is constructed as *Y* =*Yg* +*ϵ*, where the residual random vector *ϵ* for each instantiation is drawn from a normal distribution such that var(*Y*) = var(*Yg*) */h*2 for predefined *h*2. For each of the instantiations, *i*, this implicitly defines the true value ![Graphic][95]</img>, and ![Graphic][96]</img> is their mean.

## DISCUSSION

Building on our previous work and the work of others, here we present the first unified method based on GWAS summary statistics, incorporating detailed LD structure from a reference panel, for directly estimating phenotypic polygenicity, *π*1, “SNP discoverability” or strength of association (specifically, the variance of the underlying causal effects), ![Graphic][97]</img>, and residual inflation of the association statistics due to variance distortion induced by cryptic relatedness, ![Graphic][98]</img>.

We apply the model to three diverse phenotypes, one qualitative and two quantitative: schizophrenia, educational attainment, putamen volume. In each case, we estimate the polygenicity, discoverability, and residual inflation due to variance distortion; we also estimate the number of causal SNPs, *ncausal*, and the SNP heritability, *h*2 (for schizophrenia, we reexpress this as the proportion of population variance in disease liability, ![Graphic][99]</img>, under a liability threshold model, adjusted for ascertainment). In addition, we estimate the proportion of SNP heritability captured by genome-wide significant SNPs at current sample sizes, and predict future sample sizes needed to explain the preponderance of SNP heritability.

We find that schizophrenia is highly polygenic, with *π*1 = 5 × 10−3. This leads to an estimate of *ncausal* ≃ 55,000, which is in scale-agreement with a recent estimate that the number of causals is >20,000 (Loh et al., 2015). The SNP associations, however, are characterized by a narrow distribution, ![Graphic][100]</img>, indicating that most associations are of week effect, i.e., have low discoverability.

For educational attainment (Rietveld et al., 2013; Okbay et al., 2016; Cesarini and Visscher, 2017), the polygenicity is somewhat greater, *π*1 = 7.7 × 10−3, leading to an estimate of *ncausal* ≃ 85,000, which also is in scale-agreement with a recent estimate of the number of loci contributing to heritability of ≃ 70,000 (Rietveld et al., 2013). The variance of the distribution for causal effect sizes is an order of magnitude smaller than for schizophrenia, ![Graphic][101]</img>, indicating lower discoverability.

In marked contrast is putamen volume, which has very low polygenicity: *π*1 = 2.6 × 10−5, so that only 285 SNPs (out of ~11 million) are estimated to be causal. However, these SNPs are characterized by high discoverability, two-orders of magnitude larger than for schizophrenia: ![Graphic][102]</img>

The QQ plots (which are sample size dependent) reflect these differences in genetic architecture. For example, the early departure of the schizophrenia QQ plot from the null line indicates its high polygenicity, while the steep rise for putamen volume after its departure corresponds to its high SNP discoverability.

Despite the much stronger effects in putamen volume, the very high polygenicity for schizophrenia leads to its being more than three times as heritable. Our point estimate for liability-scale heritability of schizophrenia is ![Graphic][103]</img> (assuming a population risk of 0.01), and that 10% of this (i.e., 2.3% of overall disease liability) is explainable based on common SNPs reaching genome-wide significance at the current sample size. This ![Graphic][104]</img> estimate is in good agreement with a recent result, ![Graphic][105]</img> (Loh et al., 2015; Golan et al., 2014), also calculated from the PGC2 data set but using raw genotype data for 472,178 markers for a subset of 22,177 schizophrenia cases and 27,629 controls of European ancestry; and with an earlier result of ![Graphic][106]</img> from PGC1 raw genotype data for 915,354 markers for 9,087 schizophrenia cases and 12,171 controls (Lee et al., 2012; Yang et al., 2011a). Our estimate of 2.3% of overall variation on the liability scale for schizophrenia explainable by genome-wide significant loci is a little lower than the corresponding estimate of 3.4% based on risk profile scores (RPS) (Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014). Nevertheless, these results show that current sample sizes need to increase substantially in order for RPSs to have predictive utility, as the vast majority of associated SNPs remain undiscovered. Our power estimates indicate that ~500,000 cases and an equal number of controls would be needed to identify these SNPs (note thst there is a total of approximately 3 million cases in the US alone). The identified SNPs then need to be mapped to genes and their modality (e.g., regulatory or functional effects) determined, so that targeted therapeutics can be developed (Schubert et al., 2015). Greater power for discovery is achievable by using prior information involving SNP functional categories (Schork et al., 2013; Andreassen et al., 2013; Sveinbjornsson et al., 2016). However, it is not yet clear how significant a role genomics can play in psychiatric precision medicine (Breen et al., 2016). Noteworthy in this respect is that estimates of broad-sense heritability of schizophrenia from twin and family studies are in the range 0.6-0.8 (Sullivan et al., 2003; Lichtenstein et al., 2009), considerably higher than the narrow-sense chip heritability estimates from GWAS. Additionally, schizophrenia is considered a spectrum disorder with multiple phenotypic dimensions and diverse clinical presentation (MacDonald and Schulz, 2009; Peralta and Cuesta, 2001); GWAS might therefore benefit from considering continuous phenotypes rather than dichotomous variables in such situations (Edwards et al., 2016). More specifically in the context of the present model, if a nominally categorical phenotype can be decomposed into more than one subcategory, there is potential for enhanced power for discovery. The heritability estimated in a binary case-control design would be an average over heritabilities for the case subcategories. If those heritabilities are similar, then, since the union of the subcategory polygenicities gives the total polygenicity over all cases, the ![Graphic][107]</img> for any subcategory will be larger by a factor equal to the ratio of overall polygenicity to the subcategory polygenicity, and the corresponding power curve (as in Figure 2) will shift to the left.

For educational attainment, we estimate SNP heritability *h*2 = 0.10, in good agreement with the estimate of 11.5% given in (Okbay et al., 2016). As with schziophrenia, this is substantially less than the estimate of heritability from twin and family studies of ≃ 40% of the variance in educational attainment explained by genetic factors (Branigan et al., 2013; Rietveld et al., 2013).

For putamen volume, we estimate the SNP heritability *h*2 = 0.07, in reasonable agreement with an earlier estimate of 0.1 for the same overall data set (Hibar et al., 2015; So et al., 2011).

To assess the validity of the model, we conduct extensive simulations over a wide range of polygenicities and heritabilities for simulated quantitative traits, using the full set of SNPs used in the phenotype analyses with realistic LD structure. The simulations in general validate the model: with the true number of causals ranging over three orders of magnitude, 102-105 (while heritabilities range from 0.1 to 0.7), the estimated number of causals in each case is in reasonable agreement with the corresponding true value. Similarly, the true ![Graphic][108]</img> range over four orders of magnitued, and the estimated values are generally well within a factor of two of the corresponding true value. It should be noted that for all simulations, ![Graphic][109]</img> is close to 1.0 (indicating no variance distortion, and hence no inflation due to cryptic relatedness, as expected in HapGen), though there is a trend toward larger values for higer heritability and polygenicity. Thus, the higher inflation found for schizophrenia is unlikely to be an artifact of the model. The simulation QQ model plots in general agree with the simulation QQ data plots, though there is an overestimation of the proportion of more extreme z-scores, particularly at very high polygenicities. This might be an artifact of using the computationally simpler but less accurate Eq. 17 instead of Eq. 14, which is currently a limitation in the implementation of the model. A Monte Carlo approach to calculating the pdf in Eq. 14 might lead to more accurate QQ model plots.

## CONCLUSION

The SNP-level causal effects model we have presented is based on GWAS summary statistics and detailed LD structure, and assumes a Gaussian distribution of effect sizes at a fraction of SNPs randomly distributed across the autosomal genome. We have shown that it captures the broad genetic architecture of diverse complex traits, where polygenicities and the variance of the effect sizes range over orders of magnitude. In addition, the model provides a roadmap for discovery in future GWAS. The model was not designed to handle situations where the reversal of short sections of DNA underlies SNP association, as appears to be the case for some phenotypes, e.g, in chromosome 8p for neuroticism (Lo et al., 2016). Future extensions and refinements include modeling specific polygenicities and effect size variances for different SNP functional annotation categories (Schork et al., 2013; Andreassen et al., 2013; Sveinbjornsson et al., 2016), possible modified pdf for non-Gauassian distribution of effects at the tails of the z-score distributions, examining individual chromosomes and possible allele frequency dependencies in different phenotypes, and extension to pleiotropic analyses. Higher accuracy in characterizing causal alleles in turn will enable greater power for SNP discovery.

## Funding

Research Council of Norway (262656, 248984, 248778, 223273) and KG Jebsen Stiftelsen; ABCD-USA Consortium (5U24DA041123).

## Acknowledgments

We thank the Schizophrenia Working Group of the Psychiatric Genomics Consortium (PGC) for making available their GWAS summary statistics for schizophrenia; the Enhancing Neuro Imaging Genetics through Meta Analysis Consortium (ENIGMA) for making available their GWAS summary statistics for putamen volume; and the Social Science Genetic Association Consortium (SSGAC) for GWAS summary statistics on educational attainment.

*   Received May 23, 2017.
*   Revision received June 23, 2017.
*   Accepted June 23, 2017.


*   © 2017, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/)

## References

1.  1000 Genomes Project Consortium, 2010. A map of human genome variation from population-scale sequencing. Nature 467 (7319), 1061-1073.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature09534&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20981092&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000283548600039&link_type=ISI) 

2.  Andreassen, O. A., Djurovic, S., Thompson, W. K., Schork, A. J., Kendler, K. S., O’Donovan, M. C., Rujescu, D., Werge, T., van de Bunt, M., Morris, A. P., et al., 2013. Improved detection of common variants associated with schizophrenia by leveraging pleiotropy with cardiovascular-disease risk factors. The American Journal of Human Genetics 92 (2), 197-209.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2013.01.001&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23375658&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

3.  Branigan, A. R., McCallum, K. J., Freese, J., 2013. Variation in the heritability of educational attainment: An international meta-analysis. Social Forces, 109-140.
    
    
4.  Breen, G., Li, Q., Roth, B. L., O’Donnell, P., Didriksen, M., Dolmetsch, R., O’Reilly, P. F., Gaspar, H. A., Manji, H., Huebel, C., et al., 2016. Translating genome-wide association findings into new therapeutics for psychiatry. Nature Neuroscience 19 (11), 1392-1396.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nn.4411&link_type=DOI) 

5.  Bulik-Sullivan, B. K., Loh, P.-R., Finucane, H. K., Ripke, S., Yang, J., Patterson, N., Daly, M. J., Price, A. L., Neale, B. M., of the Psychiatric Genomics Consortium, S. W. G., et al., 2015. Ld score regression distinguishes confounding from polygenicity in genomewide association studies. Nature genetics 47 (3), 291-295.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.3211&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25642630&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

6.  Cesarini, D., Visscher, P. M., 2017. Genetics and educational attainment. npj Science of Learning 2 (1), 4.
    
    
7.  Clopper, C. J., Pearson, E. S., 1934. The use of confidence or fiducial limits illustrated in the case of the binomial. Biometrika 26 (4), 404-413.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/biomet/26.4.404&link_type=DOI) 

8.  Consortium, G. P., et al., 2012. An integrated map of genetic variation from 1,092 human genomes. Nature 491 (7422), 56-65.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature11632&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23128226&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000310434500030&link_type=ISI) 

9.  Consortium, G. P., et al., 2015. A global reference for human genetic variation. Nature 526 (7571), 68-74.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature15393&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=26432245&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

10. Dempster, E. R., Lerner, I. M., 1950. Heritability of threshold characters. Genetics 35 (2), 212.
    
    [FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo4OiJnZW5ldGljcyI7czo1OiJyZXNpZCI7czo4OiIzNS8yLzIxMiI7czo0OiJhdG9tIjtzOjM3OiIvYmlvcnhpdi9lYXJseS8yMDE3LzA2LzIzLzEzMzEzMi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

11. Devlin, B., Roeder, K., Dec 1999. Genomic control for association studies. Biometrics 55 (4), 997-1004.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1111/j.0006-341X.1999.00997.x&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=11315092&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000084218000001&link_type=ISI) 

12. Edwards, A. C., Bigdeli, T. B., Docherty, A. R., Bacanu, S., Lee, D., De Candia, T. R., Moscati, A., Thiselton, D. L., Maher, B. S., Wormley, B. K., et al., 2016. Meta-analysis of positive and negative symptoms reveals schizophrenia modifier genes. Schizophrenia bulletin 42 (2), 279-287.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/schbul/sbv119&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=26316594&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

13. Falconer, D. S., 1965. The inheritance of liability to certain diseases, estimated from the incidence among relatives. Annals of human genetics 29 (1), 51-76.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1111/j.1469-1809.1965.tb00500.x&link_type=DOI) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=A19656735800005&link_type=ISI) 

14. Golan, D., Lander, E. S., Rosset, S., 2014. Measuring missing heritability: inferring the contribution of common variants. Proceedings of the National Academy of Sciences 111 (49), E5272–E5281.
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMjoiMTExLzQ5L0U1MjcyIjtzOjQ6ImF0b20iO3M6Mzc6Ii9iaW9yeGl2L2Vhcmx5LzIwMTcvMDYvMjMvMTMzMTMyLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

15. Hibar, D. P., Stein, J. L., Renteria, M. E., Arias-Vasquez, A., Desrivières, S., Jahanshad, N., Toro, R., Wittfeld, K., Abramovic, L., Andersson, M., et al., 2015. Common genetic variants influence human subcortical brain structures. Nature.
    
    
16. Holland, D., Wang, Y., Thompson, W. K., Schork, A., Chen, C. H., Lo, M. T., Witoelar, A., Werge, T., O’Donovan, M., Andreassen, O. A., Dale, A. M., 2016. Estimating Effect Sizes and Expected Replication Probabilities from GWAS Summary Statistics. Front Genet 7, 15.
    
    
17. Kang, H. M., Sul, J. H., Service, S. K., Zaitlen, N. A., Kong, S.-y., Freimer, N. B., Sabatti, C., Eskin, E., et al., 2010. Variance component model to account for sample structure in genome-wide association studies. Nature genetics 42 (4), 348-354.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.548&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20208533&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000276150500016&link_type=ISI) 

18. Kinney, D. K., Teixeira, P., Hsu, D., Napoleon, S. C., Crowley, D. J., Miller, A., Hyman, W., Huang, E., 2009. Relation of schizophrenia prevalence to latitude, climate, fish consumption, infant mortality, and skin color: a role for prenatal vitamin d deficiency and infections? Schizophrenia bulletin, sbp023.
    
    
19. Kumar, S. K., Feldman, M. W., Rehkopf, D. H., Tuljapurkar, S., 2016. Limitations of gcta as a solution to the missing heritability problem. Proceedings of the National Academy of Sciences 113 (1), E61–E70.
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czo5OiIxMTMvMS9FNjEiO3M6NDoiYXRvbSI7czozNzoiL2Jpb3J4aXYvZWFybHkvMjAxNy8wNi8yMy8xMzMxMzIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

20. Lee, S. H., DeCandia, T. R., Ripke, S., Yang, J., Sullivan, P. F., Goddard, M. E., Keller, M. C., Visscher, P. M., Wray, N. R., Consortium, S. P. G.-W. A. S., et al., 2012. Estimating the proportion of variation in susceptibility to schizophrenia captured by common snps. Nature genetics 44 (3), 247-250.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.1108&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22344220&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

21. Lee, S. H., Wray, N. R., Goddard, M. E., Visscher, P. M., 2011. Estimating missing heritability for disease from genome-wide association studies. The American Journal of Human Genetics 88 (3), 294-305.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2011.02.002&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21376301&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000288589000007&link_type=ISI) 

22. Li, N., Stephens, M., 2003. Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data. Genetics 165 (4), 2213-2233.
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiZ2VuZXRpY3MiO3M6NToicmVzaWQiO3M6MTA6IjE2NS80LzIyMTMiO3M6NDoiYXRvbSI7czozNzoiL2Jpb3J4aXYvZWFybHkvMjAxNy8wNi8yMy8xMzMxMzIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

23. Lichtenstein, P., Yip, B. H., Björk, C., Pawitan, Y., Cannon, T. D., Sullivan, P. F., Hultman, C. M., 2009. Common genetic determinants of schizophrenia and bipolar disorder in swedish families: a population-based study. The Lancet 373 (9659), 234-239.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/s0140-6736(09)60072-6&link_type=DOI) 

24. Lo, M.-T., Hinds, D. A., Tung, J. Y., Franz, C., Fan, C.-C., Wang, Y., Smeland, O. B., Schork, A., Holland, D., Kauppi, K., et al., 2016. Genome-wide analyses for personality traits identify six genomic loci and show correlations with psychiatric disorders. Nature genetics.
    
    
25. Loh, P.-R., Bhatia, G., Gusev, A., Finucane, H. K., Bulik-Sullivan, B. K., Pollack, S. J., de Candia, T. R., Lee, S. H., Wray, N. R., Kendler, K. S., et al., 2015. Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance-components analysis. Nature genetics.
    
    
26. MacDonald, A. W., Schulz, S. C., 2009. What we know: findings that every theory of schizophrenia should explain. Schizophrenia Bulletin 35 (3), 493-508.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/schbul/sbp017&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=19329559&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000265277800006&link_type=ISI) 

27. McCarthy, M. I., Abecasis, G. R., Cardon, L. R., Goldstein, D. B., Little, J., Ioannidis, J. P., Hirschhorn, J. N., 2008. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nature reviews genetics 9 (5), 356-369.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nrg2344&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=18398418&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000255057300012&link_type=ISI) 

28. Okbay, A., Beauchamp, J. P., Fontana, M. A., Lee, J. J., Pers, T. H., Rietveld, C. A., Turley, P., Chen, G.-B., Emilsson, V., Meddens, S. F. W., et al., 2016. Genome-wide association study identifies 74 loci associated with educational attainment. Nature 533 (7604), 539-542.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature17671&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=27225129&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

29. Palla, L., Dudbridge, F., 2015. A fast method that uses polygenic scores to estimate the variance explained by genome-wide marker panels and the proportion of variants affecting a trait. The American Journal of Human Genetics 97 (2), 250-259.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2015.06.005&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=26189816&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

30. Pasaniuc, B., Price, A. L., 2016. Dissecting the genetics of complex traits using summary association statistics. Nature Reviews Genetics.
    
    
31. Pe’er, I., Yelensky, R., Altshuler, D., Daly, M. J., 2008. Estimation of the multiple testing burden for genomewide association studies of nearly all common variants. Genetic epidemiology 32 (4), 381-385.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1002/gepi.20303&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=18348202&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000255471100009&link_type=ISI) 

32. Peralta, V., Cuesta, M. J., 2001. How many and which are the psychopathological dimensions in schizophrenia? issues influencing their ascertainment. Schizophrenia research 49 (3), 269-285.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/S0920-9964(00)00071-2&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=11356588&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

33. Price, A. L., Zaitlen, N. A., Reich, D., Patterson, N., 2010. New approaches to population stratification in genome-wide association studies. Nature Reviews Genetics 11 (7), 459-463.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nrg2813&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20548291&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000278998500008&link_type=ISI) 

34. Purcell, S. M., Wray, N. R., Stone, J. L., Visscher, P. M., O’Donovan, M. C., Sullivan, P. F., Sklar, P., Purcell, S. M., Stone, J. L., Sullivan, P. F., et al., 2009. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature 460 (7256), 748-752.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature08185&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=19571811&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000268670300041&link_type=ISI) 

35. Rietveld, C. A., Medland, S. E., Derringer, J., Yang, J., Esko, T., Martin, N. W., Westra, H.-J., Shakhbazov, K., Abdellaoui, A., Agrawal, A., et al., 2013. Gwas of 126,559 individuals identifies genetic variants associated with educational attainment. science 340 (6139), 1467-1471.
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEzOiIzNDAvNjEzOS8xNDY3IjtzOjQ6ImF0b20iO3M6Mzc6Ii9iaW9yeGl2L2Vhcmx5LzIwMTcvMDYvMjMvMTMzMTMyLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

36. Schizophrenia Working Group of the Psychiatric Genomics Consortium, Jul 2014. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511 (7510), 421-427.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature13595&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25056061&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000339335700037&link_type=ISI) 

37. Schork, A. J., Thompson, W. K., Pham, P., Torkamani, A., Roddey, J. C., Sullivan, P. F., Kelsoe, J. R., O’Donovan, M. C., Furberg, H., Schork, N. J., et al., 2013. All snps are not created equal: genome-wide association studies reveal a consistent pattern of enrichment among functionally annotated snps. PLoS genetics 9 (4), e1003449.
    
    
38. Schubert, C. R., O’Donnell, P., Quan, J., Wendland, J. R., Xi, H. S., Winslow, A. R., Domenici, E., Essioux, L., Kam-Thong, T., Airey, D. C., et al., 2015. Brainseq: neurogenomics to drive novel target discovery for neuropsychiatric disorders. Neuron 88 (6), 1078.
    
    
39. So, H.-C., Li, M., Sham, P. C., 2011. Uncovering the total heritability explained by all true susceptibility variants in a genome-wide association study. Genetic epidemiology 35 (6), 447-456.
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21618601&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

40. Speed, D., Hemani, G., Johnson, M. R., Balding, D. J., 2012. Improved heritability estimation from genome-wide snps. The American Journal of Human Genetics 91 (6), 1011-1021.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2012.10.010&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23217325&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

41. Spencer, C. C., Su, Z., Donnelly, P., Marchini, J., 2009. Designing genome-wide association studies: sample size, power, imputation, and the choice of genotyping chip. PLoS Genet 5 (5), e1000477.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1000477&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=19492015&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

42. Stahl, E. A., Wegmann, D., Trynka, G., Gutierrez-Achury, J., Do, R., Voight, B. F., Kraft, P., Chen, R., Kallberg, H. J., Kurreeman, F. A., et al., 2012. Bayesian inference analyses of the polygenic architecture of rheumatoid arthritis. Nature genetics 44 (5), 483-489.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.2232&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22446960&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

43. Su, Z., Marchini, J., Donnelly, P., 2011. Hapgen2: simulation of multiple disease snps. Bioinformatics 27 (16), 2304-2305.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btr341&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21653516&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000293620800021&link_type=ISI) 

44. Sullivan, P. F., Kendler, K. S., Neale, M. C., 2003. Schizophrenia as a complex trait: evidence from a meta-analysis of twin studies. Archives of general psychiatry 60 (12), 1187-1192.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1001/archpsyc.60.12.1187&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=14662550&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000187022200002&link_type=ISI) 

45. Sveinbjornsson, G., Albrechtsen, A., Zink, F., Gudjonsson, S. A., Oddson, A., Másson, G., Holm, H., Kong, A., Thorsteinsdottir, U., Sulem, P., et al., 2016. Weighting sequence variants based on their annotation increases power of whole-genome association studies. Nature genetics.
    
    
46. Visscher, P. M., Brown, M. A., McCarthy, M. I., Yang, J., 2012. Five years of gwas discovery. The American Journal of Human Genetics 90 (1), 7-24.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2011.11.029&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22243964&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

47. Whiteford, H. A., Degenhardt, L., Rehm, J., Baxter, A. J., Ferrari, A. J., Erskine, H. E., Charlson, F. J., Norman, R. E., Flaxman, A. D., Johns, N., et al., 2013. Global burden of disease attributable to mental and substance use disorders: findings from the global burden of disease study 2010. The Lancet 382 (9904), 1575–1586.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(13)61611-6&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23993280&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

48. Witte, J. S., Visscher, P. M., Wray, N. R., 2014. The contribution of genetic variants to disease depends on the ruler. Nature Reviews Genetics 15 (11), 765–776.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nrg3786&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25223781&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

49. Yang, J., Bakshi, A., Zhu, Z., Hemani, G., Vinkhuyzen, A. A., Lee, S. H., Robinson, M. R., Perry, J. R., Nolte, I. M., van Vliet-Ostaptchouk, J. V., et al., 2015. Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nature genetics.
    
    
50. Yang, J., Lee, S. H., Goddard, M. E., Visscher, P. M., 2011a. Gcta: a tool for genome-wide complex trait analysis. The American Journal of Human Genetics 88 (1), 76–82.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2010.11.011&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21167468&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

51. Yang, J., Manolio, T. A., Pasquale, L. R., Boerwinkle, E., Caporaso, N., Cunningham, J. M., De Andrade, M., Feenstra, B., Feingold, E., Hayes, M. G., et al., 2011b. Genome partitioning of genetic variation for complex traits using common snps. Nature genetics 43 (6), 519-525.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.823&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21552263&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom) 

52. Yang, J., Weedon, M. N., Purcell, S., Lettre, G., Estrada, K., Willer, C. J., Smith, A. V., Ingelsson, E., O’connell, J. R., Mangino, M., et al., 2011c. Genomic inflation factors under polygenic inheritance. European Journal of Human Genetics 19 (7), 807-812.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ejhg.2011.39&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21407268&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F06%2F23%2F133132.atom)

 [1]: /embed/inline-graphic-1.gif
 [2]: /embed/inline-graphic-2.gif
 [3]: /embed/inline-graphic-3.gif
 [4]: /embed/inline-graphic-4.gif
 [5]: /embed/inline-graphic-5.gif
 [6]: /embed/inline-graphic-6.gif
 [7]: /embed/inline-graphic-7.gif
 [8]: /embed/inline-graphic-8.gif
 [9]: /embed/inline-graphic-9.gif
 [10]: /embed/graphic-1.gif
 [11]: /embed/graphic-2.gif
 [12]: /embed/inline-graphic-10.gif
 [13]: /embed/inline-graphic-11.gif
 [14]: /embed/inline-graphic-12.gif
 [15]: /embed/graphic-3.gif
 [16]: /embed/graphic-4.gif
 [17]: /embed/graphic-5.gif
 [18]: /embed/graphic-6.gif
 [19]: /embed/graphic-7.gif
 [20]: /embed/graphic-8.gif
 [21]: /embed/graphic-9.gif
 [22]: /embed/graphic-10.gif
 [23]: /embed/graphic-11.gif
 [24]: /embed/inline-graphic-13.gif
 [25]: /embed/inline-graphic-14.gif
 [26]: /embed/inline-graphic-15.gif
 [27]: /embed/graphic-12.gif
 [28]: /embed/graphic-13.gif
 [29]: /embed/graphic-14.gif
 [30]: /embed/inline-graphic-16.gif
 [31]: /embed/inline-graphic-17.gif
 [32]: /embed/graphic-15.gif
 [33]: /embed/graphic-16.gif
 [34]: /embed/graphic-17.gif
 [35]: /embed/inline-graphic-18.gif
 [36]: /embed/inline-graphic-19.gif
 [37]: /embed/graphic-18.gif
 [38]: /embed/inline-graphic-20.gif
 [39]: /embed/graphic-19.gif
 [40]: /embed/graphic-20.gif
 [41]: /embed/graphic-21.gif
 [42]: /embed/inline-graphic-21.gif
 [43]: /embed/graphic-22.gif
 [44]: /embed/inline-graphic-22.gif
 [45]: /embed/inline-graphic-23.gif
 [46]: /embed/inline-graphic-24.gif
 [47]: /embed/inline-graphic-25.gif
 [48]: /embed/inline-graphic-26.gif
 [49]: /embed/graphic-23.gif
 [50]: /embed/inline-graphic-27.gif
 [51]: /embed/graphic-24.gif
 [52]: F1/embed/inline-graphic-28.gif
 [53]: F1/embed/inline-graphic-29.gif
 [54]: F1/embed/inline-graphic-30.gif
 [55]: F1/embed/inline-graphic-31.gif
 [56]: F1/embed/inline-graphic-32.gif
 [57]: F1/embed/inline-graphic-33.gif
 [58]: F1/embed/inline-graphic-34.gif
 [59]: /embed/inline-graphic-35.gif
 [60]: /embed/inline-graphic-36.gif
 [61]: /embed/inline-graphic-37.gif
 [62]: /embed/inline-graphic-38.gif
 [63]: /embed/inline-graphic-39.gif
 [64]: /embed/inline-graphic-40.gif
 [65]: /embed/inline-graphic-41.gif
 [66]: /embed/inline-graphic-42.gif
 [67]: /embed/inline-graphic-43.gif
 [68]: /embed/inline-graphic-44.gif
 [69]: F2/embed/inline-graphic-45.gif
 [70]: /embed/inline-graphic-46.gif
 [71]: /embed/inline-graphic-47.gif
 [72]: /embed/inline-graphic-48.gif
 [73]: /embed/inline-graphic-49.gif
 [74]: /embed/inline-graphic-50.gif
 [75]: /embed/inline-graphic-51.gif
 [76]: /embed/inline-graphic-52.gif
 [77]: /embed/inline-graphic-53.gif
 [78]: /embed/inline-graphic-54.gif
 [79]: /embed/inline-graphic-55.gif
 [80]: /embed/inline-graphic-56.gif
 [81]: /embed/inline-graphic-57.gif
 [82]: /embed/inline-graphic-58.gif
 [83]: F3/embed/inline-graphic-59.gif
 [84]: F3/embed/inline-graphic-60.gif
 [85]: F3/embed/inline-graphic-61.gif
 [86]: F3/embed/inline-graphic-62.gif
 [87]: F3/embed/inline-graphic-63.gif
 [88]: F3/embed/inline-graphic-64.gif
 [89]: F3/embed/inline-graphic-65.gif
 [90]: F3/embed/inline-graphic-66.gif
 [91]: F3/embed/inline-graphic-67.gif
 [92]: F3/embed/inline-graphic-68.gif
 [93]: F6/embed/inline-graphic-69.gif
 [94]: F6/embed/inline-graphic-70.gif
 [95]: T1/embed/inline-graphic-71.gif
 [96]: T1/embed/inline-graphic-72.gif
 [97]: /embed/inline-graphic-73.gif
 [98]: /embed/inline-graphic-74.gif
 [99]: /embed/inline-graphic-75.gif
 [100]: /embed/inline-graphic-76.gif
 [101]: /embed/inline-graphic-77.gif
 [102]: /embed/inline-graphic-78.gif
 [103]: /embed/inline-graphic-79.gif
 [104]: /embed/inline-graphic-80.gif
 [105]: /embed/inline-graphic-81.gif
 [106]: /embed/inline-graphic-82.gif
 [107]: /embed/inline-graphic-83.gif
 [108]: /embed/inline-graphic-84.gif
 [109]: /embed/inline-graphic-85.gif