Optimal Weighting of Multi-View Data with Low Dimensional Hidden States

09/25/2012
by Yichao Lu, et al.

In Natural Language Processing (NLP) tasks, data often has the following two properties. First, data can be split into multiple views, a structure that has been used successfully for dimension reduction. For example, in topic classification, every paper can be split into the title, the main text and the references. However, it is common that some views are less noisy than others for supervised learning problems. Second, unlabeled data are easy to obtain while labeled data are relatively rare. For example, articles published in the New York Times over the past 10 years are easy to collect, but classifying them as 'Politics', 'Finance' or 'Sports' requires human labor. Hence it is preferable to obtain less noisy features before running supervised learning methods. In this paper we propose an unsupervised algorithm which optimally weights features from different views when these views are generated from a low dimensional hidden state, a setting that occurs in widely used models such as the Gaussian Mixture Model, the Hidden Markov Model (HMM) and Latent Dirichlet Allocation (LDA).

research
11/11/2018

Semi-supervised Deep Representation Learning for Multi-View Problems

While neural networks for learning representation of multi-view data hav...
research
11/15/2019

The Similarity-Consensus Regularized Multi-view Learning for Dimension Reduction

During the last decades, learning a low-dimensional space with discrimin...
research
01/05/2019

Auto-weighted Mutli-view Sparse Reconstructive Embedding

With the development of multimedia era, multi-view data is generated in ...
research
05/20/2019

Multi-view Locality Low-rank Embedding for Dimension Reduction

During the last decades, we have witnessed a surge of interests of learn...
research
06/28/2016

Multi-View Kernel Consensus For Data Analysis and Signal Processing

The input data features set for many data driven tasks is high-dimension...
research
02/04/2012

A Reconstruction Error Formulation for Semi-Supervised Multi-task and Multi-view Learning

A significant challenge to make learning techniques more suitable for ge...
research
02/22/2015

Using NLP to measure democracy

This paper uses natural language processing to create the first machine-...
