An Evolutionary-Based Method for Reconstructing Conversation Threads in Email Corpora

  author =       "Mostafa Dehghani and Masoud Asadpour and 
                 Azadeh Shakery",
  booktitle =    "Advances in Social Networks Analysis and Mining
                 (ASONAM), 2012 IEEE/ACM International Conference on",
  title =        "An Evolutionary-Based Method for Reconstructing
                 Conversation Threads in Email Corpora",
  year =         "2012",
  pages =        "1132--1137",
  address =      "Istanbul",
  month =        "26-29 " # aug,
  isbn13 =       "978-1-4673-2497-7",
  DOI =          "10.1109/ASONAM.2012.195",
  size =         "6 pages",
  abstract =     "Email is a type of Web data which is produced in
                 enormous quantities. It is beneficial to detect
                 conversation threads contained in the email corpora for
                 various applications, including discussion search,
                 expert finding and even email clustering and
                 classification. A conversation thread in email corpora
                 can be defined as a cluster of emails exchanged among
                 the same group of people, by reply or forwarding, on
                 the same topic. According to this definition, we can
                 define a parent-child relation between emails, so email
                 conversation threads appear to have a tree
                 structure. This paper presents a new approach based on
                 genetic programming for reconstruction of conversation
                 threads in email data. This approach considers finding
                 email conversation threads as an optimisation problem,
                 and exploits genetic programming to search
                 intelligently in the space of possible solutions.
                 Unlike several earlier studies of this problem, this
                 work concentrates on detecting the accurate structure
                 of conversation threads with high recall. This paper
                 provides a comprehensive evaluation
                 on the BC3 data set. Preliminary results suggest that
                 our method provides acceptable precision and higher
                 recall than existing methods.",
  keywords =     "genetic algorithms, genetic programming, Internet,
                 electronic mail, pattern classification, pattern
                 clustering, BC3 data set, Web data, conversation thread
                 reconstruction, discussion search, email
                 classification, email clustering, email corpora,
                 evolutionary-based method, expert finding, optimisation
                 problem, parent-child relation, Biological cells,
                 Educational institutions, Electronic mail, Social
                 network services, Sociology, Statistics, conversation,
                 email, emails thread",
  notes =        "Also known as \cite{6425605}",

