This shows you the differences between two versions of the page.
assignments:assignment5 [2015/10/31 08:30] asa |
assignments:assignment5 [2015/10/31 09:03] asa |
||
---|---|---|---|
Line 4: | Line 4: | ||
In this assignment you will compare several feature selection methods on several datasets. | In this assignment you will compare several feature selection methods on several datasets. | ||
- | The first dataset is the [[https://archive.ics.uci.edu/ml/datasets/Arcene| Arcene]] dataset which was used in a feature selection competition | + | The first dataset is the [[https://archive.ics.uci.edu/ml/datasets/Arcene| Arcene]] dataset, which was used in the 2003 NIPS feature selection competition. The dataset was produced by mass spectrometry of biological samples that come from different types of cancer. |
- | The datasets we will use are the yeast gene expression dataset | + | |
+ | The second dataset describes the expression of human genes in two types of leukemia. The original publication that describes the data is: | ||
+ | |||
+ | T. R. Golub, D. K. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek, J. P. Mesirov, H. Coller, M. L. Loh, J. R. Downing, M. A. Caligiuri, C. D. Bloomfield, and E. S. Lander. | ||
+ | [[https://www.broadinstitute.org/mpr/publications/projects/Leukemia/Golub_et_al_1999.pdf | Molecular classification of cancer: class discovery and class prediction by gene expression monitoring]]. | ||
+ | Science, 286(5439):531, 1999. | ||
===== Part 1: Filter methods ===== | ===== Part 1: Filter methods ===== |