User Information Augmented Semantic Frame Parsing using Coarse-to-Fine Neural Networks
Semantic frame parsing is a crucial component in spoken language understanding (SLU) to build spoken dialog systems. It has two main tasks: intent detection and slot filling. Although state-of-the-art approaches showed good results, they require large annotated training data and long training time. In this paper, we aim to alleviate these drawbacks for semantic frame parsing by utilizing the ubiquitous user information. We design a novel coarse-to-fine deep neural network model to incorporate prior knowledge of user information intermediately to better and quickly train a semantic frame parser. Due to the lack of benchmark dataset with real user information, we synthesize the simplest type of user information (location and time) on ATIS benchmark data. The results show that our approach leverages such simple user information to outperform state-of-the-art approaches by 0.25 0.31 training data, the performance improvement on intent detection and slot filling reaches up to 1.35 achieve similar performance as state-of-the-art approaches by using less than 80 performance is also reduced by over 60
READ FULL TEXT