Natural Language Processing for Cultural Heritage Domains
Article first published online: 6 SEP 2010
DOI: 10.1111/j.1749-818X.2010.00230.x
© 2010 The Author. Language and Linguistics Compass © 2010 Blackwell Publishing Ltd
Additional Information
How to Cite
Sporleder, C. (2010), Natural Language Processing for Cultural Heritage Domains. Language and Linguistics Compass, 4: 750–768. doi: 10.1111/j.1749-818X.2010.00230.x
Publication History
- Issue published online: 6 SEP 2010
- Article first published online: 6 SEP 2010
- Abstract
- Article
- References
- Cited By
Abstract
Museums, archives, libraries and other cultural heritage institutes maintain large collections of artefacts, which are valuable knowledge sources for both experts and interested lay persons. Recently, more and more cultural heritage institutes have started to digitise their collections, for instance to make them accessible via web portals. However, while digitisation is a necessary first step towards improved information access, to fully unlock the knowledge contained in these collections, users have to be able to easily browse, search and query these collections. This requires cleaning, linking and enriching the data, a process that is often too time-consuming to be performed manually. Information technology can help with (partially) automating this task. Because data processing and enrichment typically involve the textual metadata level, natural language processing has a key role to play in this endeavour. At the same time, cultural heritage domains pose significant challenges for language technology and call for the development of very robust and flexible solutions. Consequently, cultural heritage data can also serve as a good test-bed for the development of robust natural language processing tools.

1749-818X/asset/olbannerleft.jpg?v=1&s=0edfbd2e17ed9e42724131a7af63c93c12a64411)
1749-818X/asset/olbannerright.jpg?v=1&s=0e9c148378d2cf06eb08813be816b8e26bc8cdf8)
