
Tell Me Something I Don't Know: Randomization Strategies for Iterative Data Mining
There is a wide variety of data mining methods available, and it is gene...
read it

A Gap Analysis of LowCost Outdoor Air Quality Sensor InField Calibration
In recent years, interest in monitoring air quality has been growing. Tr...
read it

Estimating regression errors without ground truth values
Regression analysis is a standard supervised machine learning method use...
read it

Guided Visual Exploration of Relations in Data Sets
Efficient explorative data analysis systems must take into account both ...
read it

Randomisation Algorithms for Large Sparse Matrices
In many domains it is necessary to generate surrogate networks, e.g., fo...
read it

Humanguided data exploration using randomisation
An explorative data analysis system should be aware of what the user alr...
read it

HumanGuided Data Exploration
The outcome of the explorative data analysis (EDA) phase is vital for su...
read it

Interactive Visual Data Exploration with Subjective Feedback: An InformationTheoretic Approach
Visual exploration of highdimensional realvalued datasets is a fundame...
read it

Subjectively Interesting Subgroup Discovery on Realvalued Targets
Deriving insights from highdimensional data is one of the core problems...
read it

Interpreting Classifiers through Attribute Interactions in Datasets
In this work we present the novel ASTRID method for investigating which ...
read it

Multivariate Confidence Intervals
Confidence intervals are a popular way to visualize and analyze data dis...
read it

Clustering with Confidence: Finding Clusters with Statistical Guarantees
Clustering is a widely used unsupervised learning method for finding str...
read it

Finding Statistically Significant Attribute Interactions
In many data exploration tasks it is meaningful to identify groups of at...
read it

TwoWay Latent Grouping Model for User Preference Prediction
We introduce a novel latent grouping model for predicting the relevance ...
read it

Inference with Discriminative Posterior
We study Bayesian discriminative inference given a model family p(c,, θ)...
read it

An Approximation Ratio for Biclustering
The problem of biclustering consists of the simultaneous clustering of r...
read it
Kai Puolamäki
is this you? claim profile