Automatically Selecting Striking Images for Social Cards

03/08/2021
by   Shawn M. Jones, et al.
0

To allow previewing a web page, social media platforms have developed social cards: visualizations consisting of vital information about the underlying resource. At a minimum, social cards often include features such as the web resource's title, text summary, striking image, and domain name. News and scholarly articles on the web are frequently subject to social card creation when being shared on social media. However, we noticed that not all web resources offer sufficient metadata elements to enable appealing social cards. For example, the COVID-19 emergency has made it clear that scholarly articles, in particular, are at an aesthetic disadvantage in social media platforms when compared to their often more flashy disinformation rivals. Also, social cards are often not generated correctly for archived web resources, including pages that lack or predate standards for specifying striking images. With these observations, we are motivated to quantify the levels of inclusion of required metadata in web resources, its evolution over time for archived resources, and create and evaluate an algorithm to automatically select a striking image for social cards. We find that more than 40 the NEWSROOM dataset and 22 Central dataset fail to supply striking images. We demonstrate that we can automatically predict the striking image with a Precision@1 of 0.83 for news articles from NEWSROOM and 0.78 for scholarly articles from the open access journal PLOS ONE.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/08/2021

It's All About The Cards: Sharing on Social Media Probably Encouraged HTML Metadata Growth

In a perfect world, all articles consistently contain sufficient metadat...
research
10/03/2012

Logical segmentation for article extraction in digitized old newspapers

Newspapers are documents made of news item and informative articles. The...
research
05/29/2019

Using Micro-collections in Social Media to Generate Seeds for Web Archive Collections

In a Web plagued by disappearing resources, Web archive collections prov...
research
08/01/2020

MementoEmbed and Raintale for Web Archive Storytelling

For traditional library collections, archivists can select a representat...
research
07/01/2021

When Curation Becomes Creation: Algorithms, Microcontent, and the Vanishing Distinction between Platforms and Creators

Ever since social activity on the Internet began migrating from the wild...
research
08/21/2020

DApp for Rating

Lots of existing web applications include a component for rating interne...
research
10/09/2019

Disciplinary Variations in Altmetric Coverage of Scholarly Articles

The popular social media platforms are now making it possible for schola...

Please sign up or login with your details

Forgot password? Click here to reset