API Entity and Relation Joint Extraction from Text via Dynamic Prompt-tuned Language Model

01/10/2023
by Qing Huang, et al.

Extracting Application Programming Interfaces (APIs) and their semantic relations from unstructured text (e.g., Stack Overflow) is fundamental to software engineering tasks such as API recommendation. However, existing approaches are either rule-based or sequence-labeling-based. They require manually enumerating rules or labeling data for a wide range of sentence patterns, which incurs significant labor overhead and is exacerbated by morphological and common-word ambiguity. Rather than matching or labeling API entities and relations, this paper formulates the heterogeneous API extraction and API relation extraction tasks as a sequence-to-sequence generation task and proposes AERJE, an API entity-relation joint extraction model based on a large pre-trained language model. After training on a small amount of ambiguous but correctly labeled data, AERJE builds a multi-task architecture that extracts API entities and relations from unstructured text using dynamic prompts. We systematically evaluate AERJE on a set of long and ambiguous sentences from Stack Overflow. The experimental results show that AERJE achieves high accuracy and discrimination ability in API entity-relation joint extraction, even with zero- or few-shot fine-tuning.
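
To make the sequence-to-sequence framing concrete, the following is a minimal sketch of the input and output sides of such a pipeline, not AERJE's actual implementation. It assumes a prompt is assembled per sentence from candidate labels and that the generator emits `head | relation | tail` records separated by `;`. The marker tokens (`[ent]`, `[rel]`, `[text]`), the label names, and both function names are hypothetical; the abstract does not specify any of them.

```python
from typing import List, Tuple

Triple = Tuple[str, str, str]


def build_prompt(entity_types: List[str], relation_types: List[str], sentence: str) -> str:
    """Assemble a per-sentence prompt (hypothetical format).

    The candidate labels are prepended to the text so that a seq2seq model
    could condition its generated output on them.
    """
    ents = " ".join(f"[ent] {t}" for t in entity_types)
    rels = " ".join(f"[rel] {t}" for t in relation_types)
    return f"{ents} {rels} [text] {sentence}"


def parse_output(generated: str) -> List[Triple]:
    """Parse 'head | relation | tail' records separated by ';' (assumed format).

    Records that do not have exactly three non-empty fields are skipped,
    so malformed generations do not raise.
    """
    triples: List[Triple] = []
    for record in generated.split(";"):
        fields = [f.strip() for f in record.split("|")]
        if len(fields) == 3 and all(fields):
            triples.append((fields[0], fields[1], fields[2]))
    return triples


if __name__ == "__main__":
    prompt = build_prompt(
        ["class", "method"],           # hypothetical entity labels
        ["function-similarity"],       # hypothetical relation label
        "StringBuilder.append() works like String.concat().",
    )
    print(prompt)
    # A generated string is hard-coded here in place of a real model call.
    print(parse_output("StringBuilder.append() | function-similarity | String.concat()"))
```

In this sketch, entities and relations come out of one generated string, which is what lets a single model handle both tasks jointly; a real system would replace the hard-coded string with the output of a fine-tuned generator.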
