
Project Proposal: OCR for Scientific Documents

Supervisor: Alan Sexton

Keywords: Document Image Analysis

Brief Description:

Scientific documents are particularly difficult to recognise with OCR software because they contain diagrams, graphs, mathematical formulae and complex tables. Because such documents are so hard to analyse automatically, students with visual impairments are much more severely disadvantaged in studying the sciences than in studying many other subjects. Each project in this set (there are many different ones in this area) uses a particular scanned book (http://www.cs.bham.ac.uk/~aps/research/projects/as/) to develop tools, algorithms and solutions that assist this research.

Possible projects include: developing a character recogniser for the individual characters in the image; analysing the layout of the page to identify blocks as columns, headings, diagrams, formulae, etc.; recognising tables or formulae; and turning bitmap diagrams into vector diagrams.
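As an illustration of where a character recogniser project might begin, the sketch below isolates candidate glyphs in a binarised page image by finding 4-connected groups of foreground pixels and reporting their bounding boxes. This is only one possible starting point, not a method prescribed by the proposal. The function name connected_components, the list-of-rows image representation and the tiny sample page are all assumptions made for the example.

```python
from collections import deque


def connected_components(bitmap):
    """Return bounding boxes (top, left, bottom, right) of the 4-connected
    foreground regions in a binary image given as a list of rows of 0/1.

    Hypothetical helper for illustration; 1 is assumed to mean "ink".
    """
    height = len(bitmap)
    width = len(bitmap[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []

    for y in range(height):
        for x in range(width):
            if bitmap[y][x] != 1 or seen[y][x]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            queue = deque([(y, x)])
            seen[y][x] = True
            top, left, bottom, right = y, x, y, x
            while queue:
                cy, cx = queue.popleft()
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for ny, nx in ((cy - 1, cx), (cy + 1, cx),
                               (cy, cx - 1), (cy, cx + 1)):
                    inside = 0 <= ny < height and 0 <= nx < width
                    if inside and bitmap[ny][nx] == 1 and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


# Made-up 4x5 "page" containing three separate blobs of ink.
page = [
    [1, 1, 0, 0, 1],
    [1, 0, 0, 0, 1],
    [0, 0, 0, 0, 1],
    [0, 1, 1, 0, 0],
]
print(connected_components(page))
```

Each bounding box could then be cropped and passed to a classifier. Glyphs made of several disconnected parts (such as "i", or many mathematical symbols) would need their components merged in a later step, which is part of what makes scientific text harder than plain prose.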

Special Equipment: No special equipment requirements

Special Software: No special software requirements



Maintained by A.P.Sexton@cs.bham.ac.uk

Home Page: http://www.cs.bham.ac.uk/~aps
