23 Jan 2018, 11:00am - 12:30pm, S3, Alison Richard Building, Sidgwick Site

Description

Optical character recognition (OCR) refers to techniques for converting images containing printed or handwritten text into a format that can be searched and analysed computationally. Despite recent advances in OCR technology, the OCR tools available to researchers are not always as accurate as one might hope, and they cannot handle handwritten text without a substantial investment of time and large amounts of source material written in the same hand. Nevertheless, several computational tools can be applied to images and PDFs to enable text mining and to make scanned documents more searchable. This workshop will introduce several such tools, along with some practical techniques for using them, and will also highlight OCR and related services offered by the Digital Content Unit at the Cambridge University Library.

For more information and to register for a place, please use the online registration link on this page.
 


CENTRE FOR RESEARCH IN THE ARTS, SOCIAL SCIENCES AND HUMANITIES

Tel: +44 1223 766886
Email: enquiries@crassh.cam.ac.uk