chengds added The_technique_we_used_to__.tex  over 8 years ago

Commit id: 7fd939b55b3df0b3023f1d62353791a44ce64983

deletions | additions      

         

The technique we used to turn a regression problem (of the type: estimate the correct level of a trait) into a classification problem is to separate the distribution into three parts: low values, middle values, high values. However, instead of having three classes, we focused on recognizing the low/high values and ignoring the middle ones. To separate the values we use the first and the third quartiles of each distribution. The following tables show the values of these quartiles and the number of users selected for the two resulting classes.  \begin{tabular}{c|cc|}  \hline  & \multicolumn{2}{|c|}{Ope}  \hline  Self & 2.0 & 4.0 \\  \hline  Attr & -0.2 & 0.85 \\  \end{tabular}