Patent Sentiment Analysis to Highlight Patent Paragraphs

11/06/2021
by   Renukswamy Chikkamath, et al.
0

Given a patent document, identifying distinct semantic annotations is an interesting research aspect. Text annotation helps the patent practitioners such as examiners and patent attorneys to quickly identify the key arguments of any invention, successively providing a timely marking of a patent text. In the process of manual patent analysis, to attain better readability, recognising the semantic information by marking paragraphs is in practice. This semantic annotation process is laborious and time-consuming. To alleviate such a problem, we proposed a novel dataset to train Machine Learning algorithms to automate the highlighting process. The contributions of this work are: i) we developed a multi-class, novel dataset of size 150k samples by traversing USPTO patents over a decade, ii) articulated statistics and distributions of data using imperative exploratory data analysis, iii) baseline Machine Learning models are developed to utilize the dataset to address patent paragraph highlighting task, iv) dataset and codes relating to this task are open-sourced through a dedicated GIT web page: https://github.com/Renuk9390/Patent_Sentiment_Analysis and v) future path to extend this work using Deep Learning and domain specific pre-trained language models to develop a tool to highlight is provided. This work assist patent practitioners in highlighting semantic information automatically and aid to create a sustainable and efficient patent analysis using the aptitude of Machine Learning.

READ FULL TEXT
research
06/24/2023

L3Cube-MahaSent-MD: A Multi-domain Marathi Sentiment Analysis Dataset and Transformer Models

The exploration of sentiment analysis in low-resource languages, such as...
research
06/17/2021

IFCNet: A Benchmark Dataset for IFC Entity Classification

Enhancing interoperability and information exchange between domain-speci...
research
02/07/2023

Applying BERT and ChatGPT for Sentiment Analysis of Lyme Disease in Scientific Literature

This chapter presents a practical guide for conducting Sentiment Analysi...
research
06/06/2023

How Good is the Model in Model-in-the-loop Event Coreference Resolution Annotation?

Annotating cross-document event coreference links is a time-consuming an...
research
05/11/2021

Addressing "Documentation Debt" in Machine Learning Research: A Retrospective Datasheet for BookCorpus

Recent literature has underscored the importance of dataset documentatio...
research
11/16/2020

WikiAsp: A Dataset for Multi-domain Aspect-based Summarization

Aspect-based summarization is the task of generating focused summaries b...
research
08/29/2019

HARE: a Flexible Highlighting Annotator for Ranking and Exploration

Exploration and analysis of potential data sources is a significant chal...

Please sign up or login with your details

Forgot password? Click here to reset