Automatic Threshold Selection using PSO for GA based Duplicate Record Detection

  author =       "K. Deepa and R. Rangarajan and M. Senthamil Selvi",
  title =        "Automatic Threshold Selection using PSO for GA based
                 Duplicate Record Detection",
  journal =      "International Journal of Computer Applications",
  year =         "2013",
  volume =       "62",
  number =       "4",
  month =        jan,
  keywords =     "genetic algorithms, genetic programming, GA, PSO,
                 similarity metrics, threshold",
  annote =       "The Pennsylvania State University CiteSeerX Archives",
  bibsource =    "OAI-PMH server at",
  language =     "en",
  oai =          "oai:CiteSeerX.psu:",
  rights =       "Metadata may be used without restrictions as long as
                 the oai identifier remains attached to it.",
  URL =          "",
  size =         "6 pages",
  abstract =     "Setting the threshold is an important issue in
                 applications that use similarity functions, and it
                 usually relies on human intervention. The proposed
                 work addresses two issues: first, it finds the
                 optimal equation using a Genetic Algorithm (GA);
                 next, it adopts an intelligence algorithm, Particle
                 Swarm Optimisation (PSO), to obtain the optimal
                 threshold, so that duplicate records are detected
                 more accurately and with less human intervention.
                 The Restaurant and CORA data repositories are used
                 to analyse the proposed algorithm, and its
                 performance is compared against the Marlin method
                 and genetic programming using evaluation metrics.",
  notes =        "Sri Ramakrishna Engg College, Coimbatore",
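
The threshold-selection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `f1_at_threshold` and `pso_threshold`, the use of F1 as the fitness function, the PSO coefficients, and the toy similarity scores are all assumptions made for illustration. The sketch supposes that some similarity function (for example, one evolved by a GA) has already assigned a score in [0, 1] to each candidate record pair.

```python
import random


def f1_at_threshold(scores, labels, threshold):
    """F1 when pairs scoring >= threshold are declared duplicates (assumed fitness)."""
    tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y)
    fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and not y)
    fn = sum(1 for s, y in zip(scores, labels) if s < threshold and y)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def pso_threshold(scores, labels, n_particles=10, n_iters=30,
                  w=0.7, c1=1.5, c2=1.5, seed=0):
    """One-dimensional PSO over thresholds in [0, 1]; coefficients are illustrative."""
    rng = random.Random(seed)
    pos = [rng.random() for _ in range(n_particles)]  # candidate thresholds
    vel = [0.0] * n_particles
    pbest = pos[:]                                     # best position per particle
    pbest_fit = [f1_at_threshold(scores, labels, p) for p in pos]
    g = max(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g], pbest_fit[g]          # best position of the swarm

    for _ in range(n_iters):
        for i in range(n_particles):
            r1, r2 = rng.random(), rng.random()
            # Standard velocity update: inertia + cognitive pull + social pull.
            vel[i] = (w * vel[i]
                      + c1 * r1 * (pbest[i] - pos[i])
                      + c2 * r2 * (gbest - pos[i]))
            # Keep the threshold inside the valid similarity range.
            pos[i] = min(1.0, max(0.0, pos[i] + vel[i]))
            fit = f1_at_threshold(scores, labels, pos[i])
            if fit > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i], fit
                if fit > gbest_fit:
                    gbest, gbest_fit = pos[i], fit
    return gbest, gbest_fit


if __name__ == "__main__":
    # Toy labelled pairs (made up for this sketch, not from the paper's data sets).
    toy_scores = [0.95, 0.90, 0.85, 0.40, 0.30, 0.20, 0.10]
    toy_labels = [True, True, True, False, False, False, False]
    threshold, fitness = pso_threshold(toy_scores, toy_labels)
    print(f"threshold={threshold:.3f} F1={fitness:.3f}")
```

On real data the fitness would be computed on a labelled training split, and the selected threshold would then be applied to unseen record pairs, which is where the reduction in manual tuning would come from.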

