School of Computer Science

Module 06-02495 (2016)

Natural Language Processing 1

Level 2/I

Mark Lee Semester 1 10 credits
Co-ordinator: Mark Lee
Reviewer: Peter Hancox

The Module Description is a strict subset of this Syllabus Page.

Outline

The module presents an overview of Natural Language Processing and its applications, followed by introductions to morphology, syntax and semantics. These topics are used to introduce some linguistic theory and appropriate algorithms for their computational implementation. Examples are mostly given using Prolog.


Aims

The aims of this module are to:

  • introduce Natural Language Processing as one of the components of Artificial Intelligence, both from engineering and cognitive viewpoints
  • provide foundations for the programming of Natural Language Processing techniques.

Learning Outcomes

On successful completion of this module, the student should be able to:

  1. Describe major concepts, trends, approaches/systems, and difficulties in Natural Language Processing and the study of language generally.
  2. Discuss and illustrate the potential distinctions between morphology, syntax, semantics and pragmatics
  3. Describe several standard methods of applying morphological and syntactic knowledge in Natural Language Processing systems, for instance: finite-state methods; probabilistic methods; context-free grammars and parsers, including the Active Chart Parsers; unification grammars and parsing; dependency parsing.
  4. Describe the basics of symbolic and statistical semantic algorithms.
  5. Demonstrate knowledge of at least one method for a task such as pronoun reference resolution, coreference resolution, or named-entity recognition as an example of a specific, core task in interpretation.
  6. Describe an application of natural language processing (for instance machine translation or document summarization) and show the place of syntactic, semantic and pragmatic processing.

Restrictions

None


Teaching methods

2 hrs/week lectures and exercise classes.

Contact Hours: 23


Assessment

Sessional: 1.5 hr examination (80%), continuous assessment (20%).

Supplementary (where allowed): By examination only.

The nature and timing of the continuous assessment will be specified on the module web page -- see under "Relevant Links".


Detailed Syllabus

  1. Introduction and Overview
  2. Nature of language, words, word classes, lexical ambiguity, morphology, etc.
  3. Part-of-speech tagging.
  4. Grammars and parsing.
  5. Semantics: symbolic and statistical approaches.
  6. Pragmatics, cohesion, discourse structure.
  7. Areas of special difficulty such as idioms, figurative language, speech acts, textese.
  8. Applications (may be interspersed through above topics).

Programmes containing this module