Lest We Forget: A Dataset of Coronavirus-Related News Headlines in Swiss Media

06/25/2020
by   Alireza Ghasemi, et al.
0

We release our COVID-19 news dataset, containing more than 10,000 links to news articles related to the Coronavirus pandemic published in the Swiss media since early January 2020. This collection can prove beneficial in mining and analysis of the reaction of the Swiss media and the COVID-19 pandemic and extracting insightful information for further research. We hope this dataset helps researchers and the public deliver results that will help analyse the pandemic and potentially lead to a better understanding of the events.

READ FULL TEXT VIEW PDF

Authors

page 1

page 2

02/12/2021

When no news is bad news – Detection of negative events from news media content

During the first wave of Covid-19 information decoupling could be observ...
07/28/2020

A System for Worldwide COVID-19 Information Aggregation

The global pandemic of COVID-19 has made the public pay close attention ...
03/19/2022

Understanding COVID-19 News Coverage using Medical NLP

Being a global pandemic, the COVID-19 outbreak received global media att...
05/05/2021

ExcavatorCovid: Extracting Events and Relations from Text Corpora for Temporal and Causal Analysis for COVID-19

Timely responses from policy makers to mitigate the impact of the COVID-...
11/24/2021

Detecting Frames in News Headlines and Lead Images in U.S. Gun Violence Coverage

News media structure their reporting of events or issues using certain p...
06/10/2020

Pandemic Pulse: Unraveling and Modeling Social Signals during the COVID-19 Pandemic

We present and begin to explore a collection of social data that represe...
11/15/2021

Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection

A drastic rise in potentially life-threatening misinformation has been a...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.

I Introduction

The COVID-19 pandemic started in Switzerland on February 25th 2020, when the first infection was officially reported in the Italian-speaking canton of Ticino [7, 3]. Soon the pandemic spread around the country on all cantons, and Switzerland became one of the most infected countries on a per-capita basis [2, 8].

The Swiss government started putting in place various measures to control and suppress the pandemic. Gatherings were limited and later totally banned, following by closure of all except essential business, and finally closing land borders with neighbouring countries [6].

These measures helped control the spread of the virus and significantly decreased the number of active and daily new cases in Switzerland. With the success confirmed, government started gradually lifting the established restrictions from late April [4]. Finally, the June 15th re-opening land borders with the neighbouring countries marked the ”end” of the pandemic in Switzerland, at least for the first wave [5].

Since the first recorded case of the COVID-19 virus in Switzerland and far before as it was gaining attention around the world, The Swiss media started covering the topic from various aspects, including the everyday news about the state of the country, the immediate effects, and longer-term consequences of the pandemic. Given the multi-lingual and multi-cultural nature of Switzerland, interesting analyses can be accomplished to see how the media coverage of the pandemic has been managed and what topics in respect to the pandemic have been important to the Swiss media, and hopefully, by proxy to the Swiss public.

In order to help the research community and the public be able to analyse and seek answers to the above questions, we at ELCA decided to release our COVID-19 news dataset, containing more than 10,000 links to news articles related to the Coronavirus pandemic published in the Swiss media since early January 2020.

We hope this dataset helps researchers make insightful analyses on the reaction of the Swiss public to the pandemic and deliver results that help shape a better response in the prospective future cases.

Ii The Data

We tried to cover the most popular Swiss newspapers and news websites. Therefore, we chose a total of 10 news sources in German, five in French, three in Italian, and also two English-speaking Swiss news websites in order to make the data more accessible to to researchers outside Switzerland. Table I depicts the list of venues and some information about them.

News Source Language Number of Articles
Blick German 2080
Neue Zürcher Zeitung German 1961
Tages Anzeiger German 1109
Aargauer Zeitung German 1099
Basler Zeitung German 859
SRF German 718
20 Minuten German 553
Berner Zeitung German 281
20 Minutes French 225
Tribune de Genève French 223
Corriere del Ticino Italian 199
24 heures French 189
World Radio Switzerland English 176
Il portale del Ticino Italian 156
RTS French 188
SWI swissinfo.ch English 112
Le Temps French 89
SWI swissinfo.ch French 71
SWI swissinfo.ch German 55
SWI swissinfo.ch Italian 52
TABLE I: Selected News Sources

Ii-a Selection of Relevant Articles

We backdated our data collection to late 2019, and started scanning front pages of the selected news sources in consecutive days, extracting headlines of the articles. Initially, articles with any of the following keywords in the title were deemed ”Coronavirus-related”:

  • Coronavirus,

  • Covid,

  • Lockdown,

  • Pandem* (To account for different spellings of the concept in different languages).

This inevitably leads to false negatives. In order to reduce such false negatives, we read at a later stage the synopsis of the article and searched for the keywords also in the body, yielding more positive results. The distribution of the languages in the dataset is depicted in Figure 1.

German

French

Italian

English

Fig. 1: Language Distribution of Articles in the Dataset

The first article we could find in the Swiss media has been published on January 8th in the French-speaking news portal 20 Minutes, titled ”A new Coronavirus appears in China” [1]. We have made a web application to simplify exploring and browsing the data, and reading the collected news articles. The web application is available at https://covidnewsdataset.herokuapp.com/.

Summary

We explained in this article our Swiss COVID-19 dataset and how it has been collected. We publish the dataset hereby for public use, along with an online visualisation application to help explore and look at the news articles of Swiss media during the pandemic in Switzerland. We hope this dataset proves useful in analysis of the pandemic era and the public response to it in Switzerland.

=0mu plus 1mu

References

  • [1] 20 Minutes (January 8, 2020 (accessed June 21, 2020)) Un nouveau coronavirus apparaît en Chine. External Links: Link Cited by: §II-A.
  • [2] 24 Heures (March 20, 2020 (accessed June 21, 2020)) Vaud enregistre 7 décès, 32 personnes aux soins intensifs. External Links: Link Cited by: §I.
  • [3] Blick (February 25, 2020 (accessed June 21, 2020)) Erster bestätigter fall in der schweiz. External Links: Link Cited by: §I.
  • [4] RTE (April 16, 2020 (accessed June 21, 2020)) Switzerland announces gradual easing of COVID-19 restrictions. External Links: Link Cited by: §I.
  • [5] SRF (June 14, 2020 (accessed June 21, 2020)) Wiedereröffnung der Grenzen: Das müssen Reisende aus der Schweiz jetzt wissen. External Links: Link Cited by: §I.
  • [6] Swiss Confederation (March 14, 2020 (accessed June 21, 2020)) Bundesrat verschärft massnahmen gegen das coronavirus zum schutz der gesundheit und unterstützt betroffene branchen. External Links: Link Cited by: §I.
  • [7] The Local (February 25, 2020 (accessed June 21, 2020)) BREAKING: switzerland confirms first case of coronavirus. External Links: Link Cited by: §I.
  • [8] The Local (March 7, 2020 (accessed June 21, 2020)) Coronavirus in Switzerland: Number of cases rises above 260. External Links: Link Cited by: §I.