Guests: Francesco Mambrini & Giuseppe Celano

Summary authored and posted by Greta Franzini On 19 August our two Italian guests, Francesco and Giuseppe, talked to us about Greek and Latin treebanking. Francesco kicked off the session with an introduction to treebanking, where he described the differences between scholarly and standard annotation, constituency and dependency syntax, and walked us through his annotation work on Sophocle’s Ajax.     Giuseppe elaborated on Francesco’s introduction with a deeper discussion about treebanking, providing illustrative examples of Greek sentences available in the Alpheios tool.      ...

Guest: Nick Kallen

Summary authored and posted by Greta Franzini On 15th August Nick Kallen, a former Twitter engineer, stopped by to talk to us about the reading environment he has created to help teach himself Ancient Greek. The tool, available at www.iliados.com, allows users to perform grammatical and syntactical searches on the Perseus Greek Treebank data using the CSS3 query language. Nick will help Perseus rebuild its architecture so as to increase speed, improve overall performance and thus optimise derivative tools such as Iliados or Alpheios....

Guest: Hugh Cayless

Summary authored and posted by Greta Franzini On 9th August Hugh Cayless from Duke Collaboratory for Classics Computing joined us in Leipzig to talk about his Linked Ancient World Data project, Papyri.info. Papyri.info harvests papyrological resources and allows users to contribute and edit content. For those of you who don’t know, Hugh is one of the founders of EpiDoc, a customisation of the TEI Guidelines focussing on the encoding of ancient documents and...

Guests: Bridget Almas & Marie-Claire Beaulieu

Summary authored and posted by Greta Franzini Bridget Almas and Marie-Claire Beaulieu presented their work on Perseids, a project inspired by the Homer Multitext Project and Papyri.info project. Bridget and Marie-Claire were awarded a Mellon Foundation grant to create an open-source tool which would facilitate collaboration on and the publication of born-digital editions of Greek and Latin texts, as well as support and enable undergraduate research.  ...

Guests: Bruce Robertson & Federico Boschetti

Summary authored and posted by Greta Franzini Bruce Robertson and Federico Boschetti from Mount Allison University and CNRS Pisa respectively walked us through their work on OCR of Ancient Greek text. Bruce described some of the issues OCR engines face when scanning polytonic Greek (e.g. line segmentation), image requirements for optimal OCR scanning, spellchecking as well as his joint effort with Federico to combine results from different OCR engines in order to obtain the best possible output. Federico elaborated on the different correction and editing methodologies of OCR output, including crowd sourcing and data entry contracts. His introduction served to contextualise the OCR proof-reading tool (a web application) he has developed to help flag-up errors and suggest corrections. Federico Boschetti Bruce...