PAM: Understanding Product Images in Cross Product Category Attribute Extraction

06/08/2021
by   Rongmei Lin, et al.

Understanding product attributes plays an important role in improving the online shopping experience for customers and is integral to constructing a product knowledge graph. Most existing methods focus on attribute extraction from text descriptions or use visual information from product images, such as shape and color. Compared to the inputs considered in prior work, a product image in fact contains more information, represented by a rich mixture of words and visual clues with a layout carefully designed to impress customers. This work proposes a more inclusive framework that fully utilizes these different modalities for attribute extraction. Inspired by recent work in visual question answering, we use a transformer-based sequence-to-sequence model to fuse representations of product text, Optical Character Recognition (OCR) tokens, and visual objects detected in the product image. The framework is further extended to extract attribute values across multiple product categories with a single model, by training the decoder to predict both the product category and the attribute value and conditioning its output on the product category. The model provides a unified attribute extraction solution suited to an e-commerce platform that offers numerous product categories with a diverse body of product attributes. We evaluated the model on two product attributes, one with many possible values and one with a small set of possible values, over 14 product categories and found the model could achieve a 15% gain on recall and a 10% gain on F1 score compared to existing methods using text-only features.
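To make the data flow described in the abstract more concrete, here is a minimal Python sketch. It is not the authors' implementation: the names `Token`, `build_input_sequence`, `build_target_sequence`, and `pick_value` are hypothetical, and the category conditioning is approximated by filtering candidate values against a per-category list, which is only one simple way such conditioning might be realized.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    text: str
    modality: str  # "text", "ocr", or "object" (hypothetical tags)


def build_input_sequence(text_tokens, ocr_tokens, object_labels):
    """Concatenate the three modalities into one sequence for a fusion encoder."""
    return (
        [Token(t, "text") for t in text_tokens]
        + [Token(t, "ocr") for t in ocr_tokens]
        + [Token(t, "object") for t in object_labels]
    )


def build_target_sequence(category, value_tokens):
    """Decoder target: the product category first, then the attribute value."""
    return [category] + list(value_tokens) + ["<eos>"]


def pick_value(scores, category, allowed_by_category):
    """Return the best-scoring candidate value that is valid for the category.

    Assumption: conditioning is modeled as a hard filter over candidates.
    """
    allowed = allowed_by_category.get(category, set())
    candidates = {v: s for v, s in scores.items() if v in allowed}
    if not candidates:
        return None
    return max(candidates, key=candidates.get)
```

In this sketch, a score table that favors "dark roast" overall would still yield "tea tree" for a shampoo, because the candidate set is restricted by the predicted category before the best value is chosen.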

research
09/15/2020

Multimodal Joint Attribute Prediction and Value Extraction for E-commerce Product

Product attribute values are essential in many e-commerce scenarios, suc...
research
06/01/2023

Large Scale Generative Multimodal Attribute Extraction for E-commerce Attributes

E-commerce websites (e.g. Amazon) have a plethora of structured and unst...
research
08/15/2022

Exploring Generative Models for Joint Attribute Value Extraction from Product Titles

Attribute values of the products are an essential component in any e-com...
research
06/04/2021

AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding

Automatic extraction of product attribute values is an important enablin...
research
04/15/2020

TXtract: Taxonomy-Aware Knowledge Extraction for Thousands of Product Categories

Extracting structured knowledge from product profiles is crucial for var...
research
03/07/2022

Multi-Modal Attribute Extraction for E-Commerce

To improve users' experience as they navigate the myriad of options offe...
research
06/28/2022

Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction

A key challenge in attribute value extraction (AVE) from e-commerce site...
