EventKG+TL: Creating Cross-Lingual Timelines from an Event-Centric Knowledge Graph

05/03/2018 ∙ by Simon Gottschalk, et al. ∙ L3S Research Center 0

The provision of multilingual event-centric temporal knowledge graphs such as EventKG enables structured access to representations of a large number of historical and contemporary events in a variety of language contexts. Timelines provide an intuitive way to facilitate an overview of events related to a query entity - i.e. an entity or an event of user interest - over a certain period of time. In this paper, we present - a novel system that generates cross-lingual event timelines using EventKG and facilitates an overview of the language-specific event relevance and popularity along with the cross-lingual differences.

READ FULL TEXT VIEW PDF
POST COMMENT

Comments

There are no comments yet.

Authors

page 1

page 2

page 3

page 4

This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.

1 Introduction

The amount of event-centric information regarding contemporary and historical events of global importance, such as Brexit and the migration crisis in Europe, constantly grows on the Web, in Web archives, in the news as well as within emerging event-centric collections [2] and knowledge graphs generated from these sources (e.g. [4], [7]). An important research area in this context is cross-cultural and cross-lingual event analytics (e.g. see [6], [5] for case studies, and [3] for a cross-lingual user interface). These studies aim to analyze language-specific and community-specific representations and perceptions of historical and contemporary events including their popularity and relations in a language context as well as to better understand the cross-lingual differences.

EventKG [4] - a recently proposed multilingual event-centric temporal knowledge graph incorporating over thousand events in five languages - is an important knowledge source that can facilitate a variety of studies and applications related to cross-cultural and cross-lingual event analytics. However, given a query entity, i.e. an entity or an event of user interest, EventKG can contain hundreds of related events along with their descriptions in several language contexts, which makes the provision of a comprehensive cross-lingual overview and a selection of relevant events for further detailed analysis challenging.

Timelines are an intuitive way to provide an overview of events related to a query entity over a certain period of time. Timeline generation is an active research area [1], where the focus is to generate a timeline (i.e. a chronologically ordered selection) of events related to the query entity from a knowledge graph. However, existing timelines do not explicitly support a cross-lingual comparison of language-specific event representations, including their popularity and relation to the query entity in different language contexts.

EventKG+TL presented in this paper is a timeline generator that creates cross-lingual timelines for a query entity, while relying on EventKG to provide language-specific information with respect to the event popularity and the relation strength between the events and the query entity. To this extent, EventKG+TL conducts a language-specific event ranking and complements this ranking with a cross-lingual visual representation. The timelines generated by EventKG+TL facilitate efficient identification of relevant events based on their language-specific popularity, relation strength and the cross-lingual differences.

2 Scenarios & Timelines

A multilingual event-centric temporal knowledge graph is a labeled directed multigraph, where is a set of language contexts, is a set of nodes (i.e. events or entities), and is a multiset of directed edges (i.e. relations).

Given a query entity , the timelines generated by EventKG+TL can assist users in answering questions such as:

: What are the most popular events related to ?

: Which events are the most closely related to ?

: Which of the most popular events are the most closely related to ?

: How does the popularity of the identified events and the strength of their relations to the query entity differ across the language contexts?

The provision of EventKG+TL facilitates users to answer these questions with respect to a particular language context and enables a visual cross-lingual comparison. To answer these questions, the user of EventKG+TL can issue a timeline query that includes the following parameters:

  • a query entity ;

  • a set of the language contexts of user interest ;

  • the maximum number of the events to be selected per language context;

  • the ranking criterion to identify the top- most relevant events among all events related to in according to the questions .

The ranking criteria include:

  • is the popularity of an event in ;

  • is the relation strength between the query entity and an event in a language context ; and

  • is a combination of the event popularity of and the relation strength between and the query entity in .

The timelines generated by EventKG+TL complement the language-specific event ranking with a cross-lingual visual representation to address the question . To this extent, EventKG+TL utilizes labeled pie charts located on a timeline, where each pie chart represents an individual event. The size of the pie chart corresponds to an overall (i.e. language independent) relevance of the event according to the ranking criterion . Each slice of the pie chart represents a language context. The area of each slice is proportional to the contribution of the corresponding language context to the ranking criterion .

Fig. 1 exemplifies a Brexit timeline. We can observe that the most important event according to is the ”United Kingdom European Union membership referendum, 2016” that is nearly equally important in all considered language contexts. Some of the events are more important in the specific language contexts, e.g. ”European Migrant Crisis” in the German and ”Dutch Ukraine-European Union Association Agreement referendum 2016” in the Russian context.

Figure 1: An excerpt of an EventKG+TL timeline representing events related to the query entity ”Brexit” in the time interval 01/2015-01/2018, overall including the top-8 events from each of the five language contexts in EventKG ranked according to - i.e. a combination of the popularity and the relation strength of the events towards Brexit. Each event is represented by a labeled pie chart. The size of the pie chart corresponds to the language independent event relevance according to . The colored slices determine the ratio of the relevance in a language context (see the legend for the color encoding). The duration of events that lasted for more than a day is marked by a yellow interval. Upon click on a timeline entry, detailed information, including scores and link counts, is shown.

3 Timeline Generation

3.0.1 The Knowledge Graph

To answer a timeline query, EventKG+TL utilizes EventKG [4]. EventKG is a multilingual RDF knowledge graph incorporating over thousand events and over million temporal relations in V1.1 extracted from several large-scale entity-centric knowledge graphs (i.e. Wikidata, DBpedia in five language editions and YAGO), Wikipedia Current Event Portal (WCEP) and Wikipedia event lists. One of the key features of EventKG is the provision of event-centric information for historical and contemporary events, including their interlinking in the language-specific contexts to facilitate an assessment of relation strength and event popularity. The information on language-specific interlinking provided by EventKG is based on the corresponding Wikipedia language editions.

3.0.2 Event and Relation Retrieval

To retrieve relevant information from EventKG, EventKG+TL adopts SPARQL queries. First, EventKG+TL retrieves the query entity , including its existence time, if available. Second, EventKG+TL retrieves a set of events that are connected to via an EventKG relation as the subject or the object, along with the time information associated with these events. Third, the interlinking information related to the events in is retrieved from EventKG’s link relations and their eventKG-s:links and eventKG-s:mentions property values.

3.0.3 Event Ranking and Timeline Creation

The top- events related to are selected according to the ranking criterion. For each event and language , the language-specific relevance score is computed using the interlinking information provided by EventKG. The following link counts are used:

  • : Event link count, i.e. the number of links pointing to the event in a language context (via eventKG-s:links).

  • : Pair count, i.e. the number of links from to plus the number of links from to in , denoted by eventKG-s:links values.

  • : Mention count, i.e. the number of sentences in a language context that jointly link to and , denoted by eventKG-s:mentions.

Each count is normalized to by dividing its value by the highest value of this count related to the events in in the respective language. That way, the bias resulting from the differences in the language-specific coverage is reduced. To avoid the domination of the disproportionately often linked events (e.g. the World War II), a smoothing parameter , experimentally set to , is adopted. The scores are computed as follows:

(1)
(2)

The score () is computed as a linear combination of the two ranking criteria. We experimentally set its weight to .

(3)

The resulting timeline consists of a chronologically ordered list of the top- highest ranked events per language with respect to the ranking criterion.

3.0.4 System Implementation

The EventKG+TL system is accessible as an HTML5 website. It is implemented using the Java Spark web framework111http://sparkjava.com/. The timeline is visualized through the browser-based Javascript library vis.js222http://visjs.org/timeline_examples.html, the pie charts are created using the Google Charts Javascript library333https://developers.google.com/chart/interactive/docs/gallery/piechart and pop-ups showing detailed event information are based on Twitter Bootstrap444https://getbootstrap.com/.

4 Demonstration

In our demonstration we will primarily show how EventKG+TL works and how users can use it to create cross-lingual timelines. To highlight the advantages of our approach, we will ask our audience to create timelines for the entities and events of their choice using EventKG+TL based on the language-specific information contained in EventKG. Through the visual cross-lingual comparison provided by EventKG+TL, the audience can get an impression of the language-specific event representations, as well as their relation to the query entity and popularity in different language contexts.

Acknowledgements

This work was partially funded by the ERC (”ALEXANDRIA”, 339233) and BMBF (”Data4UrbanMobility”, 02K15A040).

References

  • [1] Althoff, T., Dong, X.L., Murphy, K., Alai, S., Dang, V., Zhang, W.: TimeMachine: Timeline Generation for Knowledge-base Entities. In: Proc. of SIGKDD’15 (2015)
  • [2] Gossen, G., Demidova, E., Risse, T.: iCrawl: Improving the Freshness of Web Collections by Integrating Social Web and Focused Web Crawling. In: JCDL’15 (2015)
  • [3] Gottschalk, S., Demidova, E.: MultiWiki: Interlingual Text Passage Alignment in Wikipedia. TWEB 11(1), 6:1–6:30 (2017)
  • [4] Gottschalk, S., Demidova, E.: EventKG: A Multilingual Event-Centric Temporal Knowledge Graph. In: Proc. of the ESWC’18 (2018)
  • [5] Gottschalk, S., Demidova, E., Bernacchi, V., Rogers, R.: Ongoing Events in Wikipedia: A Cross-lingual Case Study. In: Proc. of WebSci’17. pp. 387–388 (2017)
  • [6] Rogers, R.: Digital Methods. MIT Press (2013)
  • [7] Rospocher, M., et al.: Building Event-centric Knowledge Graphs from News. Web Semantics 37, 132–151 (2016)