Creative Procedural-Knowledge Extraction From Web Design Tutorials

04/18/2019
by   Longqi Yang, et al.
0

Complex design tasks often require performing diverse actions in a specific order. To (semi-)autonomously accomplish these tasks, applications need to understand and learn a wide range of design procedures, i.e., Creative Procedural-Knowledge (CPK). Prior knowledge base construction and mining have not typically addressed the creative fields, such as design and arts. In this paper, we formalize an ontology of CPK using five components: goal, workflow, action, command and usage; and extract components' values from online design tutorials. We scraped 19.6K tutorial-related webpages and built a web application for professional designers to identify and summarize CPK components. The annotated dataset consists of 819 unique commands, 47,491 actions, and 2,022 workflows and goals. Based on this dataset, we propose a general CPK extraction pipeline and demonstrate that existing text classification and sequence-to-sequence models are limited in identifying, predicting and summarizing complex operations described in heterogeneous styles. Through quantitative and qualitative error analysis, we discuss CPK extraction challenges that need to be addressed by future research.

READ FULL TEXT

page 5

page 6

page 7

research
10/18/2019

Using Local Knowledge Graph Construction to Scale Seq2Seq Models to Multi-Document Inputs

Query-based open-domain NLP tasks require information synthesis from lon...
research
02/06/2018

Investigations on Knowledge Base Embedding for Relation Prediction and Extraction

We report an evaluation of the effectiveness of the existing knowledge b...
research
05/02/2020

A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos

Procedural knowledge, which we define as concrete information about the ...
research
06/12/2018

Sequence-to-Sequence Learning for Task-oriented Dialogue with Dialogue State Representation

Classic pipeline models for task-oriented dialogue system require explic...
research
05/17/2020

Semi-Automating Knowledge Base Construction for Cancer Genetics

In this work, we consider the exponentially growing subarea of genetics ...
research
07/21/2021

COfEE: A Comprehensive Ontology for Event Extraction from text, with an online annotation tool

Data is published on the web over time in great volumes, but majority of...
research
01/28/2019

OpenHowNet: An Open Sememe-based Lexical Knowledge Base

In this paper, we present an open sememe-based lexical knowledge base Op...

Please sign up or login with your details

Forgot password? Click here to reset