A Cognitive Study on Semantic Similarity Analysis of Large Corpora: A Transformer-based Approach

07/24/2022
by   Praneeth Nemani, et al.
0

Semantic similarity analysis and modeling is a fundamentally acclaimed task in many pioneering applications of natural language processing today. Owing to the sensation of sequential pattern recognition, many neural networks like RNNs and LSTMs have achieved satisfactory results in semantic similarity modeling. However, these solutions are considered inefficient due to their inability to process information in a non-sequential manner, thus leading to the improper extraction of context. Transformers function as the state-of-the-art architecture due to their advantages like non-sequential data processing and self-attention. In this paper, we perform semantic similarity analysis and modeling on the U.S Patent Phrase to Phrase Matching Dataset using both traditional and transformer-based techniques. We experiment upon four different variants of the Decoding Enhanced BERT - DeBERTa and enhance its performance by performing K-Fold Cross-Validation. The experimental results demonstrate our methodology's enhanced performance compared to traditional techniques, with an average Pearson correlation score of 0.79.

READ FULL TEXT
research
04/20/2019

Language Models with Transformers

The Transformer architecture is superior to RNN-based models in computat...
research
04/04/2021

TransfoRNN: Capturing the Sequential Information in Self-Attention Representations for Language Modeling

In this paper, we describe the use of recurrent neural networks to captu...
research
10/13/2019

T-GSA: Transformer with Gaussian-weighted self-attention for speech enhancement

Transformer neural networks (TNN) demonstrated state-of-art performance ...
research
04/19/2019

ERNIE: Enhanced Representation through Knowledge Integration

We present a novel language representation model enhanced by knowledge c...
research
06/09/2021

Phraseformer: Multimodal Key-phrase Extraction using Transformer and Graph Embedding

Background: Keyword extraction is a popular research topic in the field ...
research
06/01/2023

Boosting the Performance of Transformer Architectures for Semantic Textual Similarity

Semantic textual similarity is the task of estimating the similarity bet...
research
05/28/2022

A New High-Performance Approach to Approximate Pattern-Matching for Plagiarism Detection in Blockchain-Based Non-Fungible Tokens (NFTs)

We are presenting a fast and innovative approach to performing approxima...

Please sign up or login with your details

Forgot password? Click here to reset