Guests: Bruce Robertson & Federico Boschetti

Bruce Robertson and Federico Boschetti from Mount Allison University and CNRS Pisa respectively walked us through their work on OCR of Ancient Greek text.
Bruce described some of the issues OCR engines face when scanning polytonic Greek (e.g. line segmentation), image requirements for optimal OCR scanning, spellchecking as well as his joint effort with Federico to combine results from different OCR engines in order to obtain the best possible output. Federico elaborated on the different correction and editing methodologies of OCR output, including crowd sourcing and data entry contracts. His introduction served to contextualise the OCR proof-reading tool (a web application) he has developed to help flag-up errors and suggest corrections.

Photo of Federico Boschetti

Federico Boschetti

Photo of Bruce Robertson

Bruce Robertson

Share postShare on FacebookShare on LinkedInTweet about this on TwitterEmail this to someone

Trackbacks/Pingbacks

  1. Bruce Robertson and Federico Boschetti at Digital Humanities Leipzig » Dynamic Variorum Editions - […] Read the Digital Humanities Leipzig blog entry. […]

Submit a Comment

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>