Finnish Paraphrase Corpus

03/24/2021
by   Jenna Kanerva, et al.
0

In this paper, we introduce the first fully manually annotated paraphrase corpus for Finnish containing 53,572 paraphrase pairs harvested from alternative subtitles and news headings. Out of all paraphrase pairs in our corpus 98 context, if not in all contexts. Additionally, we establish a manual candidate selection method and demonstrate its feasibility in high quality paraphrase selection in terms of both cost and quality.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset