OCR Report

Charlton, Ash

Files

OCR Report Final.pdf (839.63 KB)

Date

2023-07

Authors

Charlton, Ash

Abstract

Optical Character Recognition (OCR) is the most widely known method of extracting text from digitised documents in the cultural heritage sector. It is a process that transforms images of text into a machine-readable format. Traditionally, OCR software scans text and identifies letters individually, recognising one character at a time. More recent advancements have introduced aspects of machine learning into OCR, which change this dynamic slightly; these are explored in more detail later in this report. The report surveys OCR software options broadly, in addition to past and current OCR processes and workflows and the proposed future ones that the University of Edinburgh library may introduce.

URI

https://hdl.handle.net/1842/40899
http://dx.doi.org/10.7488/era/3652

This item appears in the following Collection(s)

Information Services