Charith Bhagya Karunarathna edited section_Methods_subsection_Data_simulation__.tex  over 7 years ago

Commit id: ebf6d206bc07b3e137126a475b6d1d8c03032488

deletions | additions      

       

$$  \item $D$ is disease status ($D = 1$, case; $D=0$, control).  \item $G=(G_1, G_2, \ldots , G_{m})$ is an individual's multi-locus genotype at $m$ risk SNVs, with $G_j$ being the number of copies of the derived allele at the $j^{th}$ risk SNV.  \item We select the intercept term to ensure that the probability of sporadic disease (i.e. $P(D=1|G=\underline{\tilde{0}})$) $P(D=1|G=\undertilde{0})$)  is approximately $20\%$. \end{itemize}  \item We randomly sampled SNVs from the middle region one at a time, until the disease prevalence was between $9.5−10.5\%$ in the $1500$ individuals.  \item After assigning disease status to the 1500 individuals, we sampled 50 case and 50 control individuals from all affected and unaffected individuals.