Sampling Strategies for Mining in Data-Scarce Domains

04/22/2002
by   Naren Ramakrishnan, et al.
0

Data mining has traditionally focused on the task of drawing inferences from large datasets. However, many scientific and engineering domains, such as fluid dynamics and aircraft design, are characterized by scarce data, due to the expense and complexity of associated experiments and simulations. In such data-scarce domains, it is advantageous to focus the data collection effort on only those regions deemed most important to support a particular data mining objective. This paper describes a mechanism that interleaves bottom-up data mining, to uncover multi-level structures in spatial data, with top-down sampling, to clarify difficult decisions in the mining process. The mechanism exploits relevant physical properties, such as continuity, correspondence, and locality, in a unified framework. This leads to effective mining and sampling decisions that are explainable in terms of domain knowledge and data characteristics. This approach is demonstrated in two diverse applications -- mining pockets in spatial data, and qualitative determination of Jordan forms of matrices.

READ FULL TEXT
research
09/17/2020

Multi-source Data Mining for e-Learning

Data mining is the task of discovering interesting, unexpected or valuab...
research
04/26/2002

Qualitative Analysis of Correspondence for Experimental Algorithmics

Correspondence identifies relationships among objects via similarities a...
research
04/10/2023

DASS Good: Explainable Data Mining of Spatial Cohort Data

Developing applicable clinical machine learning models is a difficult ta...
research
10/27/2020

Towards Active Simulation Data Mining

Simulations have recently been considered as data generators for machine...
research
10/21/2020

Effective Data Scraping Strategies and Resources for Digital Marketers

Data scraping is not a new practice. It pre-dates the internet and exist...
research
05/18/2008

Symmetry in Data Mining and Analysis: A Unifying View based on Hierarchy

Data analysis and data mining are concerned with unsupervised pattern fi...

Please sign up or login with your details

Forgot password? Click here to reset