Charith Bhagya Karunarathna edited untitled.tex  almost 8 years ago

Commit id: c3c02aa5b81ab4e0c98309a1f4c7507f57d336e3

deletions | additions      

       

\end{itemize}  \item Logistic regression model of disease status.  \begin{itemize}  \item Assign disease status to the 1500 individuals based on randomly sampled disease causing risk  SNVs from the mid region (950kbp - 1050kbp) and a diploid model of disease risk. \item For disease causing risk  SNVs, the number of copies of the derived allele increases disease risk according to a logistic regression model, $$  {logit}\{P(D=1|G)\} = {logit}(0.2)+ \sum_{j=1}^{m} 2 \times G_j,\;\;\mbox{where,}  $$  \item $D$ is disease status ($D = 1$, case; $D=0$, control).  \item $G=(G_1, G_2, \ldots , G_{m})$ is an individual's multi-locus genotype at $m (=13)$ disease causing $m$ risk  SNVs, with $G_j$ being the number of copies of the derived allele at the $j^{th}$ disease causing risk  SNV. \item We obtain $13$ disease causing risk  SNVs by adding randomly sampled SNVs from the mid-region one-at-a-time, until the prevalence is between $9.5-10.5\%$ in the $1500$ individuals. \end{itemize}  \end{itemize}