Deep multi-modal networks for book genre classification based on its cover

11/15/2020
by   Chandra Kundu, et al.
0

Book covers are usually the very first impression to its readers and they often convey important information about the content of the book. Book genre classification based on its cover would be utterly beneficial to many modern retrieval systems, considering that the complete digitization of books is an extremely expensive task. At the same time, it is also an extremely challenging task due to the following reasons: First, there exists a wide variety of book genres, many of which are not concretely defined. Second, book covers, as graphic designs, vary in many different ways such as colors, styles, textual information, etc, even for books of the same genre. Third, book cover designs may vary due to many external factors such as country, culture, target reader populations, etc. With the growing competitiveness in the book industry, the book cover designers and typographers push the cover designs to its limit in the hope of attracting sales. The cover-based book classification systems become a particularly exciting research topic in recent years. In this paper, we propose a multi-modal deep learning framework to solve this problem. The contribution of this paper is four-fold. First, our method adds an extra modality by extracting texts automatically from the book covers. Second, image-based and text-based, state-of-the-art models are evaluated thoroughly for the task of book cover classification. Third, we develop an efficient and salable multi-modal framework based on the images and texts shown on the covers only. Fourth, a thorough analysis of the experimental results is given and future works to improve the performance is suggested. The results show that the multi-modal framework significantly outperforms the current state-of-the-art image-based models. However, more efforts and resources are needed for this classification task in order to reach a satisfactory level.

READ FULL TEXT

page 3

page 16

page 17

research
11/21/2020

Deep learning for video game genre classification

Video game genre classification based on its cover and textual descripti...
research
06/24/2019

Serif or Sans: Visual Font Analytics on Book Covers and Online Advertisements

In this paper, we conduct a large-scale study of font statistics in book...
research
08/25/2018

How do Convolutional Neural Networks Learn Design?

In this paper, we aim to understand the design principles in book cover ...
research
05/19/2021

Font Style that Fits an Image – Font Generation Based on Image Context

When fonts are used on documents, they are intentionally selected by des...
research
01/15/2020

Evaluating image matching methods for book cover identification

Humans are capable of identifying a book only by looking at its cover, b...
research
05/24/2023

Decomposing Complex Queries for Tip-of-the-tongue Retrieval

When re-finding items, users who forget or are uncertain about identifyi...
research
08/03/2023

Interleaving GANs with knowledge graphs to support design creativity for book covers

An attractive book cover is important for the success of a book. In this...

Please sign up or login with your details

Forgot password? Click here to reset