Automatic Machine Learning Derived from Scholarly Big Data

03/06/2020
by   Asnat Greenstein-Messica, et al.
0

One of the challenging aspects of applying machine learning is the need to identify the algorithms that will perform best for a given dataset. This process can be difficult, time consuming and often requires a great deal of domain knowledge. We present Sommelier, an expert system for recommending the machine learning algorithms that should be applied on a previously unseen dataset. Sommelier is based on word embedding representations of the domain knowledge extracted from a large corpus of academic publications. When presented with a new dataset and its problem description, Sommelier leverages a recommendation model trained on the word embedding representation to provide a ranked list of the most relevant algorithms to be used on the dataset. We demonstrate Sommelier's effectiveness by conducting an extensive evaluation on 121 publicly available datasets and 53 classification algorithms. The top algorithms recommended for each dataset by Sommelier were able to achieve on average 97.7

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/05/2018

A Review of Different Word Embeddings for Sentiment Classification using Deep Learning

The web is loaded with textual content, and Natural Language Processing ...
research
08/23/2020

Augmenting Semantic Representation of Depressive Language: from Forums to Microblogs

We discuss and analyze the process of creating word embedding feature re...
research
03/29/2022

An Evaluation Dataset for Legal Word Embedding: A Case Study On Chinese Codex

Word embedding is a modern distributed word representations approach wid...
research
12/05/2017

EmTaggeR: A Word Embedding Based Novel Method for Hashtag Recommendation on Twitter

The hashtag recommendation problem addresses recommending (suggesting) o...
research
10/24/2018

Clinical Concept Extraction with Contextual Word Embedding

Automatic extraction of clinical concepts is an essential step for turni...
research
10/22/2018

LAMVI-2: A Visual Tool for Comparing and Tuning Word Embedding Models

Tuning machine learning models, particularly deep learning architectures...
research
05/01/2023

Logion: Machine Learning for Greek Philology

This paper presents machine-learning methods to address various problems...

Please sign up or login with your details

Forgot password? Click here to reset