Authorea

Awaiting Activation edited untitled.tex about 8 years ago

Commit id: 2c11f24eb1e1aaa65de3fa49e4888eebca05fba1

deletions | additions

\section{Overview} Beta regression is used when the output $y$ is between 0 and 1, that is $y\in[0,1]$. Common examples include the fraction of employees participating in a company 401k plan or any response that is a percentage. Linear regression has often been applied due to its simplicity but such data violates key assumptions. First, responses bounded by 0 and 1 are not normally distributed. Second, such data is usually heteroscedastic . For example, the variance shrinks as the mean approaches the boundary point 0, 1 \cite{Lockhart_2014}. 1. Instead, beta regression assumes the response to follow follows a beta distribution . distribution. The beta distribution is usually given by \begin{equation} {f(y;p,q)} = {\frac{\Gamma(p+q)}{\Gamma(p)\Gamma(q)}}y^{p-1}(1-y)^{q-1}, \quad 0 \end{equation}