An Efficient Approach to Learning Chinese Judgment Document Similarity Based on Knowledge Summarization

08/06/2018
by   Yinglong Ma, et al.
0

A previous similar case in common law systems can be used as a reference with respect to the current case such that identical situations can be treated similarly in every case. However, current approaches for judgment document similarity computation failed to capture the core semantics of judgment documents and therefore suffer from lower accuracy and higher computation complexity. In this paper, a knowledge block summarization based machine learning approach is proposed to compute the semantic similarity of Chinese judgment documents. By utilizing domain ontologies for judgment documents, the core semantics of Chinese judgment documents is summarized based on knowledge blocks. Then the WMD algorithm is used to calculate the similarity between knowledge blocks. At last, the related experiments were made to illustrate that our approach is very effective and efficient in achieving higher accuracy and faster computation speed in comparison with the traditional approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2012

Content-based Text Categorization using Wikitology

A major computational burden, while performing document clustering, is t...
research
12/29/2011

Document Clustering based on Topic Maps

Importance of document clustering is now widely acknowledged by research...
research
04/03/2023

A Comparison of Document Similarity Algorithms

Document similarity is an important part of Natural Language Processing ...
research
08/06/2015

Privacy-Preserving Multi-Document Summarization

State-of-the-art extractive multi-document summarization systems are usu...
research
02/01/2019

Dating Documents using Graph Convolution Networks

Document date is essential for many important tasks, such as document re...
research
02/25/2023

HADES: Homologous Automated Document Exploration and Summarization

This paper introduces HADES, a novel tool for automatic comparative docu...
research
03/15/2022

FastKASSIM: A Fast Tree Kernel-Based Syntactic Similarity Metric

Syntax is a fundamental component of language, yet few metrics have been...

Please sign up or login with your details

Forgot password? Click here to reset