Limiting Tags Fosters Efficiency

by Tiago Santos, et al.

Tagging facilitates information retrieval in social media and other online communities by allowing users to organize and describe online content. Researchers found that the efficiency of tagging systems steadily decreases over time, because tags become less precise in identifying specific documents, i.e., they lose their descriptiveness. However, previous works did not answer how or even whether community managers can improve the efficiency of tags. In this work, we use information-theoretic measures to track the descriptive and retrieval efficiency of tags on Stack Overflow, a question-answering system that strictly limits the number of tags users can specify per question. We observe that tagging efficiency stabilizes over time, while tag content and descriptiveness both increase. To explain this observation, we hypothesize that limiting the number of tags fosters novelty and diversity in tag usage, two properties which are both beneficial for tagging efficiency. To provide qualitative evidence supporting our hypothesis, we present a statistical model of tagging that demonstrates how novelty and diversity lead to greater tag efficiency in the long run. Our work offers insights into policies to improve information organization and retrieval in online communities.
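To make the notion of information-theoretic tag efficiency concrete, the following is a minimal sketch, not the authors' implementation. It assumes that tag assignments are available as (document, tag) pairs and estimates two standard quantities from their empirical frequencies: the tag entropy H(T) and the conditional entropy H(D|T) of documents given tags. The function name `tag_entropies` and the input format are illustrative assumptions; the exact measures used in the paper may differ.

```python
import math
from collections import Counter


def tag_entropies(assignments):
    """Estimate H(T) and H(D|T) in bits from (document_id, tag) pairs.

    Hypothetical helper for illustration: probabilities are plain
    empirical frequencies over the observed tag assignments.
    """
    pairs = list(assignments)
    n = len(pairs)
    if n == 0:
        raise ValueError("need at least one (document, tag) assignment")

    tag_counts = Counter(tag for _, tag in pairs)
    pair_counts = Counter(pairs)

    # H(T) = -sum_t p(t) * log2 p(t)
    h_tags = -sum((c / n) * math.log2(c / n) for c in tag_counts.values())

    # H(D|T) = -sum_{d,t} p(d,t) * log2 p(d|t), with p(d|t) = count(d,t) / count(t)
    h_docs_given_tags = -sum(
        (c / n) * math.log2(c / tag_counts[tag])
        for (_, tag), c in pair_counts.items()
    )
    return h_tags, h_docs_given_tags


# Toy data (made up): four questions, one tag each.
toy = [("q1", "python"), ("q2", "python"), ("q3", "rust"), ("q4", "go")]
h_t, h_d_given_t = tag_entropies(toy)
print(f"H(T) = {h_t:.2f} bits, H(D|T) = {h_d_given_t:.2f} bits")
```

Under this reading, a lower H(D|T) means that knowing a tag leaves less uncertainty about which document is meant, so tags that lose their descriptiveness would show up as a rising H(D|T) over time.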






