SDST: Successive Decoding for Speech-to-text Translation

09/21/2020 ∙ by Qianqian Dong, et al.

End-to-end speech-to-text translation (ST), which translates source-language speech directly into target-language text, has attracted intensive attention recently. However, combining speech recognition and machine translation in a single model places a heavy burden on the direct cross-modal, cross-lingual mapping. To reduce the learning difficulty, we propose SDST, an integral framework with Successive Decoding for the end-to-end Speech-to-text Translation task. The method is evaluated on two mainstream datasets. Experiments show that the proposed method improves on the previous state-of-the-art methods by large margins.


