Hierarchical Neural Network for Extracting Knowledgeable Snippets and Documents

08/22/2018
by   Ganbin Zhou, et al.
0

In this study, we focus on extracting knowledgeable snippets and annotating knowledgeable documents from Web corpus, consisting of the documents from social media and We-media. Informally, knowledgeable snippets refer to the text describing concepts, properties of entities, or relations among entities, while knowledgeable documents are the ones with enough knowledgeable snippets. These knowledgeable snippets and documents could be helpful in multiple applications, such as knowledge base construction and knowledge-oriented service. Previous studies extracted the knowledgeable snippets using the pattern-based method. Here, we propose the semantic-based method for this task. Specifically, a CNN based model is developed to extract knowledgeable snippets and annotate knowledgeable documents simultaneously. Additionally, a "low-level sharing, high-level splitting" structure of CNN is designed to handle the documents from different content domains. Compared with building multiple domain-specific CNNs, this joint model not only critically saves the training time, but also improves the prediction accuracy visibly. The superiority of the proposed method is demonstrated in a real dataset from Wechat public platform.

READ FULL TEXT
research
12/11/2020

KOSMOS: Knowledge-graph Oriented Social media and Mainstream media Overview System

We introduce KOSMOS, a knowledge retrieval system based on the construct...
research
09/19/2020

Extracting Summary Knowledge Graphs from Long Documents

Knowledge graphs capture entities and relations from long documents and ...
research
02/23/2021

Page Layout Analysis System for Unconstrained Historic Documents

Extraction of text regions and individual text lines from historic docum...
research
09/16/2021

SenTag: a Web-based Tool for Semantic Annotation of Textual Documents

In this work, we present SenTag, a lightweight web-based tool focused on...
research
08/20/2020

Constructing a Knowledge Graph from Unstructured Documents without External Alignment

Knowledge graphs (KGs) are relevant to many NLP tasks, but building a re...
research
10/23/2018

Towards a Ranking Model for Semantic Layers over Digital Archives

Archived collections of documents (like newspaper archives) serve as imp...
research
10/15/2019

On Constructing a Knowledge Base of Chinese Criminal Cases

We are developing a knowledge base over Chinese judicial decision docume...

Please sign up or login with your details

Forgot password? Click here to reset