Feature Construction and Selection using Genetic Programming and a Genetic Algorithm

  author =       "Matthew G. Smith and Larry Bull",
  title =        "Feature Construction and Selection using Genetic
                 Programming and a Genetic Algorithm",
  booktitle =    "Genetic Programming, Proceedings of EuroGP'2003",
  year =         "2003",
  editor =       "Conor Ryan and Terence Soule and Maarten Keijzer and
                 Edward Tsang and Riccardo Poli and Ernesto Costa",
  volume =       "2610",
  series =       "LNCS",
  pages =        "229--237",
  address =      "Essex",
  publisher_address = "Berlin",
  month =        "14-16 " # apr,
  organisation = "EvoNet",
  publisher =    "Springer-Verlag",
  keywords =     "genetic algorithms, genetic programming",
  ISBN =         "3-540-00971-X",
  URL =          "http://www.springerlink.com/openurl.asp?genre=article&issn=0302-9743&volume=2610&spage=229",
  DOI =          "10.1007/3-540-36599-0_21",
  abstract =     "The use of machine learning techniques to
                 automatically analyse data for information is becoming
                 increasingly widespread. In this paper we examine the
                 use of Genetic Programming and a Genetic Algorithm to
                 pre-process data before it is classified using the C4.5
                 decision tree learning algorithm. The Genetic
                 Programming is used to construct new features from
                 those available in the data, a potentially significant
                 process for data mining since it gives consideration to
                 hidden relationships between features. The Genetic
                 Algorithm is used to determine which such features are
                 the most predictive. Using ten well-known datasets we
                 show that our approach, in comparison to C4.5 alone,
                 provides marked improvement in a number of cases.",
