This guide takes users through the practicalities of working with digitised archive newspapers: how are newspapers made text-searchable via OCR? What is METS/ALTO? A short checklist of questions helps users understand how collections are processed and what interactions are possible with different interfaces. Finally, the guide provides a few pointers on how natural language processing tools can be used for historical research with digitised newspapers.
PARTHENOS is a cluster project that brings together knowledge and resources from projects on research infrastructures and humanities. All the resources are freely available here.