The special theme of HELDIG Summit 2019 is From Text to Knowledge. Most content used in DH research and applications is available originally in more of less unstructured textual form, e.g., as books, articles, newspapers, legislation and legal documents, web pages, social media discussions, parliamentary materials, letters, and folklore. Even in databases, such as collections in museums, archives, and libraries, much of the content may be in unstructured textual form. A key task in using texts computationally is to extract – in one way or another – structure and meaning from strings using methods such as linguistic analysis and natural language understanding, named entity recognition and linking, relation and event extraction, data mining, machine learning, topic modelling, and reference and network analysis.
To set context, the day starts with the keynote "Square pegs and round holes: addressing the mismatch between humanities questions and the state-of-the-art in language technology" by Dr. Marieke van Erp from KNAW, Amsterdam. After this follows presentations regarding tools and infrastructures for digitizing, processing, and analyzing large text data, and services for publishing textual datasets.
In the afternoon, we have the keynote "Legislative data portals and linked data quality" by prof. Jose Emilio Labra Gayo from University of Oviedo. The day continues with presentations of research projects and applications using digital humanities approaches for analyzing texts. To end the day, bubbles and nibbles are served in a social networking event.
The event will be streamed at http://video.helsinki.fi/unitube/live-stream.html?room=l21
More information and registration: http://heldig.fi/summit-2019