Incremental Learning for Fully Unsupervised Word Segmentation Using Penalized Likelihood and Model Selection

07/20/2016
by   Ruey-Cheng Chen, et al.
0

We present a novel incremental learning approach for unsupervised word segmentation that combines features from probabilistic modeling and model selection. This includes super-additive penalties for addressing the cognitive burden imposed by long word formation, and new model selection criteria based on higher-order generative assumptions. Our approach is fully unsupervised; it relies on a small number of parameters that permits flexible modeling and a mechanism that automatically learns parameters from the data. Through experimentation, we show that this intricate design has led to top-tier performance in both phonemic and orthographic word segmentation.

READ FULL TEXT

page 8

page 10

research
06/11/2015

Generalized Additive Model Selection

We introduce GAMSEL (Generalized Additive Model Selection), a penalized ...
research
09/01/2023

Subjectivity in Unsupervised Machine Learning Model Selection

Model selection is a necessary step in unsupervised machine learning. De...
research
07/24/2023

Consistent model selection in the spiked Wigner model via AIC-type criteria

Consider the spiked Wigner model X = ∑_i = 1^k λ_i u_i u_i^⊤ + ...
research
06/04/2020

Model selection criteria for regression models with splines and the automatic localization of knots

In this paper we propose a model selection approach to fit a regression ...
research
04/18/2021

Non-asymptotic model selection in block-diagonal mixture of polynomial experts models

Model selection, via penalized likelihood type criteria, is a standard t...
research
12/13/2011

Large Scale Correlation Clustering Optimization

Clustering is a fundamental task in unsupervised learning. The focus of ...
research
03/03/2023

Online simulator-based experimental design for cognitive model selection

The problem of model selection with a limited number of experimental tri...

Please sign up or login with your details

Forgot password? Click here to reset