New Archive of 632 Greek Texts with OCR - Printable Version +- RomanArmyTalk (https://www.romanarmytalk.com/rat) +-- Forum: Recreational Arena (https://www.romanarmytalk.com/rat/forumdisplay.php?fid=6) +--- Forum: Off-Topic (https://www.romanarmytalk.com/rat/forumdisplay.php?fid=18) +--- Thread: New Archive of 632 Greek Texts with OCR (/showthread.php?tid=23364) |
New Archive of 632 Greek Texts with OCR - Sean Manning - 12-16-2013 Quote:Bruce Robertson at Mount Allison University has performed high-quality optical character recognition on over 600 volumes of ancient Greek in collaboration with Federico Boschetti of the CNR, Pisa. Page images with corresponding OCR output and freely downloadable archives of all stages of processing are available at the project website: http://heml.mta.ca/lace The quality of the OCR is varied, but they have photos as well as the scanned text, and where else can you read about the storied EQTA eNI hEBAS, or Dio's gripping account of the Bellum Piςaticum? For our martial members, they have a Greek Polyaenus, the Poeti Lyrici Graeci for Tyrtaeus and Alcaeus, and many historians. New Archive of 632 Greek Texts with OCR - Lyceum - 12-16-2013 Alcman* Yeah good catch this, I saw it advertised the other day and just had a play around with it. I don't know, its bloody impressive as a feat of technology and, as they say, the rights management is much freer than TLG and Perseus but...these texts, so many are out of date to be near unusable. I really couldn't recommend anything theyve got in lyric or drama since both editing and papyrus finds have drastically changed the game. I love some of the ancillary books they've put up though like Evangelinus Sophocles lexion...It might be out dated here and there but its well organised and a saviour for students and omg they've got a fully OCR Zosimus on there. This tool has some great potential. Thanks Sean. New Archive of 632 Greek Texts with OCR - D B Campbell - 12-16-2013 Hmmm ... had a look at a Loeb Arrian and it was just gibberish. :dizzy: New Archive of 632 Greek Texts with OCR - Sean Manning - 12-17-2013 Quote:Alcman*It is worse than that, since I was thinking of Alcaeus not Alcman! I wonder if they chose some books as test material for their OCR software. Yet even a bad edition is better than nothing, and not everyone has convenient access to a university library or the TLG. Their editions include the apparatus criticus, and that is valuable. Quote:Hmmm ... had a look at a Loeb Arrian and it was just gibberish. :dizzy:They seem to do better on the Teubner font with its four-stroke kappas and lunate sigmas. The errors on those texts seem to occur at a rate of one every few lines, which is rare enough that one can correct by hand. The FAQ says that they will do more proofreading once their access to a data centre expires. |