Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop

02/14/2018
by   Odette Scharenborg, et al.
0

We summarize the accomplishments of a multi-disciplinary workshop exploring the computational and scientific issues surrounding the discovery of linguistic units (subwords and words) in a language without orthography. We study the replacement of orthographic transcriptions by images and/or translated text in a well-resourced language to help unsupervised discovery from raw speech.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/11/2017

Proceedings of the 2017 AdKDD & TargetAd Workshop

Proceedings of the 2017 AdKDD and TargetAd Workshop held in conjunction ...
research
04/27/2020

A Summary of the First Workshop on Language Technology for Language Documentation and Revitalization

Despite recent advances in natural language processing and other languag...
research
10/12/2020

Towards Induction of Structured Phoneme Inventories

This extended abstract surveying the work on phonological typology was p...
research
07/29/2020

Exploiting Cross-Lingual Knowledge in Unsupervised Acoustic Modeling for Low-Resource Languages

(Short version of Abstract) This thesis describes an investigation on un...
research
06/02/2022

Proceedings of the 2022 Workshop on Resource AWareness of Systems and Society (RAW)

Proceedings of the 2022 Workshop on Resource AWareness of Systems and So...
research
07/01/2016

Throwing fuel on the embers: Probability or Dichotomy, Cognitive or Linguistic?

Prof. Robert Berwick's abstract for his forthcoming invited talk at the ...

Please sign up or login with your details

Forgot password? Click here to reset