Improving Vietnamese Legal Question–Answering System based on Automatic Data Enrichment

06/08/2023
by   Thi-Hai-Yen Vuong, et al.
0

Question answering (QA) in law is a challenging problem because legal documents are much more complicated than normal texts in terms of terminology, structure, and temporal and logical relationships. It is even more difficult to perform legal QA for low-resource languages like Vietnamese where labeled data are rare and pre-trained language models are still limited. In this paper, we try to overcome these limitations by implementing a Vietnamese article-level retrieval-based legal QA system and introduce a novel method to improve the performance of language models by improving data quality through weak labeling. Our hypothesis is that in contexts where labeled data are limited, efficient data enrichment can help increase overall performance. Our experiments are designed to test multiple aspects, which demonstrate the effectiveness of the proposed technique.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/04/2022

Miko Team: Deep Learning Approach for Legal Question Answering in ALQAC 2022

We introduce efficient deep learning-based methods for legal document pr...
research
09/09/2020

Aspect Classification for Legal Depositions

Attorneys and others have a strong interest in having a digital library ...
research
09/16/2023

NOWJ1@ALQAC 2023: Enhancing Legal Task Performance with Classic Statistical Models and Pre-trained Language Models

This paper describes the NOWJ1 Team's approach for the Automated Legal Q...
research
11/27/2022

Improving Low-Resource Question Answering using Active Learning in Multiple Stages

Neural approaches have become very popular in the domain of Question Ans...
research
01/19/2022

Expert Finding in Legal Community Question Answering

Expert finding has been well-studied in community question answering (QA...
research
04/19/2022

Retrieval Enhanced Data Augmentation for Question Answering on Privacy Policies

Prior studies in privacy policies frame the question answering (QA) task...
research
12/15/2021

Lex Rosetta: Transfer of Predictive Models Across Languages, Jurisdictions, and Legal Domains

In this paper, we examine the use of multi-lingual sentence embeddings t...

Please sign up or login with your details

Forgot password? Click here to reset