
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop

by Odette Scharenborg, et al.
Radboud Universiteit

We summarize the accomplishments of a multi-disciplinary workshop exploring the computational and scientific issues surrounding the discovery of linguistic units (subwords and words) in a language without orthography. We study the replacement of orthographic transcriptions by images and/or translated text in a well-resourced language to help unsupervised discovery from raw speech.



