Reduction of Parameter Redundancy in Biaffine Classifiers with Symmetric and Circulant Weight Matrices

10/18/2018
by   Tomoki Matsuno, et al.
0

Currently, the biaffine classifier has been attracting attention as a method to introduce an attention mechanism into the modeling of binary relations. For instance, in the field of dependency parsing, the Deep Biaffine Parser by Dozat and Manning has achieved state-of-the-art performance as a graph-based dependency parser on the English Penn Treebank and CoNLL 2017 shared task. On the other hand, it is reported that parameter redundancy in the weight matrix in biaffine classifiers, which has O(n^2) parameters, results in overfitting (n is the number of dimensions). In this paper, we attempted to reduce the parameter redundancy by assuming either symmetry or circularity of weight matrices. In our experiments on the CoNLL 2017 shared task dataset, our model achieved better or comparable accuracy on most of the treebanks with more than 16

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/06/2016

Deep Biaffine Attention for Neural Dependency Parsing

This paper builds off recent work from Kiperwasser & Goldberg (2016) usi...
research
07/18/2016

Dependency Language Models for Transition-based Dependency Parsing

In this paper, we present an approach to improve the accuracy of a stron...
research
02/22/2017

Improving a Strong Neural Parser with Conjunction-Specific Features

While dependency parsers reach very high overall accuracy, some dependen...
research
07/09/2021

Levi Graph AMR Parser using Heterogeneous Attention

Coupled with biaffine decoders, transformers have been effectively adapt...
research
11/10/2019

Rethinking Self-Attention: An Interpretable Self-Attentive Encoder-Decoder Parser

Attention mechanisms have improved the performance of NLP tasks while pr...
research
04/17/2021

Question Decomposition with Dependency Graphs

QDMR is a meaning representation for complex questions, which decomposes...
research
05/23/2022

Efficient Update of Redundancy Matrices for Truss and Frame Structures

Redundancy matrices provide insights into the load carrying behavior of ...

Please sign up or login with your details

Forgot password? Click here to reset