
Tell Me Something I Don't Know: Randomization Strategies for Iterative Data Mining
There is a wide variety of data mining methods available, and it is gene...
A Gap Analysis of Low-Cost Outdoor Air Quality Sensor In-Field Calibration
In recent years, interest in monitoring air quality has been growing. Tr...
Estimating regression errors without ground truth values
Regression analysis is a standard supervised machine learning method use...
Guided Visual Exploration of Relations in Data Sets
Efficient explorative data analysis systems must take into account both ...
Randomisation Algorithms for Large Sparse Matrices
In many domains it is necessary to generate surrogate networks, e.g., fo...
Human-guided data exploration using randomisation
An explorative data analysis system should be aware of what the user alr...
Human-Guided Data Exploration
The outcome of the explorative data analysis (EDA) phase is vital for su...
Interactive Visual Data Exploration with Subjective Feedback: An Information-Theoretic Approach
Visual exploration of high-dimensional real-valued datasets is a fundame...
Subjectively Interesting Subgroup Discovery on Real-valued Targets
Deriving insights from high-dimensional data is one of the core problems...
Interpreting Classifiers through Attribute Interactions in Datasets
In this work we present the novel ASTRID method for investigating which ...
Multivariate Confidence Intervals
Confidence intervals are a popular way to visualize and analyze data dis...
Clustering with Confidence: Finding Clusters with Statistical Guarantees
Clustering is a widely used unsupervised learning method for finding str...
Finding Statistically Significant Attribute Interactions
In many data exploration tasks it is meaningful to identify groups of at...
Two-Way Latent Grouping Model for User Preference Prediction
We introduce a novel latent grouping model for predicting the relevance ...
Inference with Discriminative Posterior
We study Bayesian discriminative inference given a model family p(c, x, θ)...
An Approximation Ratio for Biclustering
The problem of biclustering consists of the simultaneous clustering of r...
Kai Puolamäki
