Intelligent Fusion of Structural and Citation-Based Evidence for Text Classification

Created by W.Langdon from gp-bibliography.bib Revision:1.4221

  author =       "Baoping Zhang and Marcos Andre Goncalves and 
                 Weiguo Fan and Yuxin Chen and Edward A. Fox and 
                 Pavel Calado and Marco Cristo",
  title =        "Intelligent Fusion of Structural and Citation-Based
                 Evidence for Text Classification",
  institution =  "Computer Science, Virginia Tech",
  year =         "2004",
  number =       "TR-04-16",
  keywords =     "genetic algorithms, genetic programming,
                 Classification, document similarity, citation analysis,
                 Computer Science, Information Retrieval, Digital
  URL =          "",
  size =         "9 pages",
  abstract =     "This paper investigates how citation-based information
                 and structural content (e.g., title, abstract) can be
                 combined to improve classification of text documents
                 into predefined categories. We evaluate different
                 measures of similarity, five derived from the citation
                 structure of the collection, and three measures derived
                 from the structural content, and determine how they can
                 be fused to improve classification effectiveness. To
                 discover the best fusion framework, we apply Genetic
                 Programming (GP) techniques. Our empirical experiments
                 using documents from the ACM digital library and the
                 ACM classification scheme show that we can discover
                 similarity functions that work better than any evidence
                 in isolation and whose combined performance through a
                 simple majority voting is comparable to that of Support
                 Vector Machine classifiers.",
  notes =        "See also \cite{Zhang05c}

                 ID Code: 693 Deposited By: Administrator, Eprints
                 Deposited On: 09 September 2005

                 Site Administrator:",

Genetic Programming entries for Baoping Zhang Marcos Andre Goncalves Weiguo Fan Yuxin (Jerry) Chen Edward A Fox Pavel Pereira Calado Marco Cristo