Juliana Freire

research

∙ 08/11/2023

Simple Analysis of Priority Sampling

We prove a tight upper bound on the variance of the priority sampling me...

0 Majid Daliri, et al. ∙

research

∙ 04/17/2023

eTOP: Early Termination of Pipelines for Faster Training of AutoML Systems

Recent advancements in software and hardware technologies have enabled t...

0 Haoxiang Zhang, et al. ∙

research

∙ 01/14/2023

Weighted Minwise Hashing Beats Linear Sketching for Inner Product Estimation

We present a new approach for computing compact sketches that can be use...

0 Aline Bessa, et al. ∙

research

∙ 03/27/2022

GPU-Powered Spatial Database Engine for Commodity Hardware: Extended Version

Given the massive growth in the volume of spatial data, there is a great...

0 Harish Doraiswamy, et al. ∙

research

∙ 11/03/2021

AlphaD3M: Machine Learning Pipeline Synthesis

We introduce AlphaD3M, an automatic machine learning (AutoML) system bas...

18 Iddo Drori, et al. ∙

research

∙ 05/13/2021

DataExposer: Exposing Disconnect between Data and Systems

As data is a central component of many modern systems, the cause of a sy...

0 Sainyam Galhotra, et al. ∙

research

∙ 04/07/2021

Correlation Sketches for Approximate Join-Correlation Queries

The increasing availability of structured datasets, from Web tables and ...

0 Aécio Santos, et al. ∙

research

∙ 02/10/2021

Auctus: A Dataset Search Engine for Data Augmentation

Machine Learning models are increasingly being adopted in many applicati...

0 Fernando Chirigati, et al. ∙

research

∙ 09/01/2020

Towards Evaluating Exploratory Model Building Process with AutoML Systems

The use of Automated Machine Learning (AutoML) systems are highly open-e...

0 Sungsoo Ray Hong, et al. ∙

research

∙ 05/01/2020

PipelineProfiler: A Visual Analytics Tool for the Exploration of AutoML Pipelines

In recent years, a wide variety of automated machine learning (AutoML) m...

0 Jorge Piazentin Ono, et al. ∙

research

∙ 04/12/2020

BugDoc: Algorithms to Debug Computational Processes

Data analysis for scientific experiments and enterprises, large-scale si...

0 Raoni Lourenco, et al. ∙

research

∙ 04/07/2020

A GPU-friendly Geometric Data Model and Algebra for Spatial Queries: Extended Version

The availability of low cost sensors has led to an unprecedented growth ...

0 Harish Doraiswamy, et al. ∙

research

∙ 02/11/2020

Debugging Machine Learning Pipelines

Machine learning tasks entail the use of complex computational pipelines...

73 Raoni Lourenco, et al. ∙

research

∙ 10/19/2019

Efficient Discovery of Meaningful Outlier Relationships

We propose PODS (Predictable Outliers in Data-trendS), a method that, gi...

0 Aline Bessa, et al. ∙

research

∙ 10/08/2019

AutoML using Metadata Language Embeddings

As a human choosing a supervised learning algorithm, it is natural to be...

36 Iddo Drori, et al. ∙

research

∙ 07/05/2019

Visus: An Interactive System for Automatic Machine Learning Model Building and Curation

While the demand for machine learning (ML) applications is booming, ther...

0 Aécio Santos, et al. ∙

research

∙ 05/24/2019

Automatic Machine Learning by Pipeline Synthesis using Model-Based Reinforcement Learning and a Grammar

Automatic machine learning is an important problem in the forefront of m...

25 Iddo Drori, et al. ∙

research

∙ 05/02/2019

A Topic-Agnostic Approach for Identifying Fake News Pages

Fake news and misinformation have been increasingly used to manipulate p...

0 Sonia Castelo, et al. ∙

research

∙ 02/25/2019

Bootstrapping Domain-Specific Content Discovery on the Web

The ability to continuously discover domain-specific content from the We...

0 Kien Pham, et al. ∙

research

∙ 08/04/2018

ReproServer: Making Reproducibility Easier and Less Intensive

Reproducibility in the computational sciences has been stymied because o...

0 Remi Rampin, et al. ∙

research

∙ 08/09/2017

A Collaborative Approach to Computational Reproducibility

Although a standard in natural science, reproducibility has been only ep...

0 Fernando Chirigati, et al. ∙

Juliana Freire

Featured Co-authors

Sign in with Google

Consider DeepAI Pro