Determining the Unithood of Word Sequences using Mutual Information and Independence Measure

10/01/2008
by   Wilson Wong, et al.
0

Most works related to unithood were conducted as part of a larger effort for the determination of termhood. Consequently, the number of independent research that study the notion of unithood and produce dedicated techniques for measuring unithood is extremely small. We propose a new approach, independent of any influences of termhood, that provides dedicated measures to gather linguistic evidence from parsed text and statistical evidence from Google search engine for the measurement of unithood. Our evaluations revealed a precision and recall of 98.68 95.42

READ FULL TEXT
research
10/01/2008

Determining the Unithood of Word Sequences using a Probabilistic Approach

Most research related to unithood were conducted as part of a larger eff...
research
03/15/2022

On Suspicious Coincidences and Pointwise Mutual Information

Barlow (1985) hypothesized that the co-occurrence of two events A and B ...
research
11/10/2018

Formal Limitations on the Measurement of Mutual Information

Motivate by applications to unsupervised learning, we consider the probl...
research
08/04/2018

Implementation and Analysis of Stable PUFs Using Gate Oxide Breakdown

We implement and analyze highly stable PUFs using two random gate oxide ...
research
03/03/2021

Optimizing Multi-task Peer Prediction

In the setting where we ask participants multiple similar possibly subje...
research
09/04/2018

Pointwise HSIC: A Linear-Time Kernelized Co-occurrence Norm for Sparse Linguistic Expressions

In this paper, we propose a new kernel-based co-occurrence measure that ...
research
03/20/2015

On measuring linguistic intelligence

This work addresses the problem of measuring how many languages a person...

Please sign up or login with your details

Forgot password? Click here to reset