Supervisor: Alan Sexton
Keywords: Document Image Analysis
Brief Description:
Scientific documents are particularly difficult to recognise with OCR
software. They contain diagrams, graphs, mathematical formulae and
complex tables. Because of the difficulty of analysing them, students
with visual impairments are much more severely disadvantaged in
studying the sciences than in studying many other subjects. This set
of projects (several distinct projects are possible in this area) uses a
particular scanned book (http://www.cs.bham.ac.uk/~aps/research/projects/as/)
to develop tools, algorithms and solutions that assist in this research.
Possible projects include: developing a character recogniser for the
individual characters in the image; analysing the layout of the page
to identify blocks as containing columns, headings, diagrams, formulae
etc.; recognising tables or formulae; and turning bitmap diagrams into
vector diagrams.
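As an illustration of the kind of first step a character recogniser or
layout analyser might take, the sketch below labels connected regions of
ink in a binarised page and returns their bounding boxes. It is only a
minimal sketch under stated assumptions: the page is assumed to be already
thresholded into a list of rows of 0 (background) and 1 (ink), and the
names connected_components and PAGE are hypothetical, not part of any
project code.

```python
from collections import deque


def connected_components(bitmap):
    """Return bounding boxes (top, left, bottom, right) of 4-connected ink regions.

    Assumption: bitmap is a list of equal-length rows holding 0 (background)
    or 1 (ink), i.e. the scanned page has already been binarised.
    """
    rows = len(bitmap)
    cols = len(bitmap[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if bitmap[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill starting from this unvisited ink pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < rows and 0 <= nx < cols
                    if inside and bitmap[ny][nx] == 1 and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


# Hypothetical toy page: a vertical stroke and a bracket-like shape.
PAGE = [
    [0, 1, 0, 0, 1, 1],
    [0, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 1],
]

if __name__ == "__main__":
    for box in connected_components(PAGE):
        print(box)
```

Each bounding box is a candidate glyph that could be passed to a
classifier. Real scanned pages would also need noise removal and handling
of touching or broken characters, which this sketch does not attempt.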
Special Equipment: No special equipment requirements
Special Software: No special software requirements
Maintained by A.P.Sexton@cs.bham.ac.uk
Home Page: http://www.cs.bham.ac.uk/~aps