A Robust Semantic Frame Parsing Pipeline on a New Complex Twitter Dataset

12/18/2022
by Yu Wang, et al.

Most recent semantic frame parsing systems for spoken language understanding (SLU) are built on recurrent neural networks. These systems perform reasonably well on benchmark SLU datasets such as ATIS and SNIPS, which contain short utterances with relatively simple patterns. However, current semantic frame parsing models lack a mechanism for handling out-of-distribution (OOD) patterns and out-of-vocabulary (OOV) tokens. In this paper, we introduce a robust semantic frame parsing pipeline that can handle both OOD patterns and OOV tokens, together with a new complex Twitter dataset that contains long tweets with more OOD patterns and OOV tokens. The new pipeline achieves much better results than state-of-the-art baseline SLU models on both the SNIPS dataset and the new Twitter dataset (the new Twitter dataset can be downloaded from https://1drv.ms/u/s!AroHb-W6_OAlavK4begsDsMALfE?e=c8f2XX). Finally, we build an end-to-end (E2E) application to demonstrate the feasibility of our algorithm and show why it is useful in real applications.
