YDC2-1: Layout analysis research

Funded by: Yale University
Reference: Yale/CRS4 Agreement 20140325
Start: 2014-03-01
Duration: 12 months
Contractor: CRS4, Italy


Yale University, through the Yale Digital Collections Center (YDC2) and together with its international partners, is developing a worldwide community of interoperable repositories that provide consistent access to digital representations of manuscripts, so that scholars can easily view and compare manuscripts from any participating institution. The manuscript images are openly available on the web through Yale's Content Delivery Service, which is served by an IIIF-compliant scalable image server. In this context, CRS4 will research and develop computing tools and techniques to analyze and index cultural heritage databases of this kind. The main goal is to investigate methods for document layout analysis on a huge, heterogeneous corpus of illuminated medieval manuscripts that differ in writing style and language and that exhibit various problematic attributes, such as holes, spots, ink bleed-through, ornamentation, background noise, and overlapping text lines. In particular, the aim is to devise a robust per-book text-line segmentation framework, a technique to order the pages of a book by text density, and an interactive framework to search for words within a single manuscript.


[1] Holly Rushmeier, Ruggero Pintus, Ying Yang, Christiana Wong, and David Li. Examples of challenges and opportunities in visual analysis in the digital humanities. In Human Vision and Electronic Imaging XX, 2015. SPIE. 
[2] Ruggero Pintus, Ying Yang, and Holly Rushmeier. ATHENA: Automatic Text Height ExtractioN for the Analysis of text lines in old handwritten manuscripts. ACM Journal on Computing and Cultural Heritage (JOCCH), 8(1): 1:1-1:25, 2015. DOI: 10.1145/2659020
[3] Ruggero Pintus, Ying Yang, Enrico Gobbetti, and Holly Rushmeier. A TaLISMAN: Automatic Text and LIne Segmentation of historical MANuscripts. In The 12th Eurographics Workshop on Graphics and Cultural Heritage. Pages 35-44, October 2014. DOI: 10.2312/gch.20141302
[4] Ruggero Pintus, Kazim Pal, Ying Yang, Tim Weyrich, Enrico Gobbetti, and Holly Rushmeier. Geometric Analysis in Cultural Heritage. In The 12th Eurographics Workshop on Graphics and Cultural Heritage - STARS Proceedings. Pages 117-133, October 2014. DOI: 10.2312/gch.20141310
[5] Ying Yang, Ruggero Pintus, Holly Rushmeier, and Ioannis Ivrissimtzis. A Steganalytic Algorithm for 3D Polygonal Meshes. In 20th IEEE International Conference on Image Processing (ICIP). Pages 4782-4786, 2014. IEEE. DOI: 10.1109/ICIP.2014.7025969