Joel Bangalan edited Methodology_Information_and_Resources_Previous__.md  about 8 years ago

Commit id: 2045959adc0160a33db2eaafa1615d0de4fc4c08

deletions | additions      

       

Previous researches on gene expression and cancer classification as applied to the colon cancer and the leukemia data sets will serve as the foundational pieces in this research. A similar set of data involving lung tissues will be used, and classification algorithms developed based on these. Methods on feature selection will be considered, with focus on how R Programming and existing packages can be utilized in a high dimensional setting.   ## Data Collection   The colon and leukemia data sets are available as described and used  in the previous researches (see listed in  the preliminary bibliography list). reference section.  The lung cancer data set is proprietary and will be provided by a Cancer Research Organization.  ## Overview of the analytical approach