PyKale: Knowledge-Aware Machine Learning from Multiple Sources in Python

06/17/2021
by Haiping Lu, et al.

Machine learning is a general-purpose technology holding promise for many interdisciplinary research problems. However, significant barriers exist in crossing disciplinary boundaries when most machine learning tools are developed separately in different areas. We present PyKale, a Python library for knowledge-aware machine learning on graphs, images, texts, and videos to enable and accelerate interdisciplinary research. We formulate new green machine learning guidelines based on standard software engineering practices and propose a novel pipeline-based application programming interface (API). PyKale focuses on leveraging knowledge from multiple sources for accurate and interpretable prediction, thus supporting multimodal learning and transfer learning (particularly domain adaptation) with the latest deep learning and dimensionality reduction models. We build PyKale on PyTorch and leverage the rich PyTorch ecosystem. Our pipeline-based API design enforces standardization and minimalism, embracing green machine learning concepts by reducing repetition and redundancy, reusing existing resources, and recycling learning models across areas. We demonstrate its interdisciplinary nature via examples in bioinformatics, knowledge graphs, image/video recognition, and medical imaging.
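
As a rough illustration of what a pipeline-based API can look like, the sketch below chains named stages in plain Python. It is not PyKale's actual API: the `Pipeline` class, the stage functions (`load_data`, `preprocess`, `embed`, `predict`, `evaluate`), and the toy data are all hypothetical names and values chosen for this example only.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, List, Tuple

# A stage is a (name, function) pair; both are hypothetical, not PyKale names.
Stage = Tuple[str, Callable[[Any], Any]]


@dataclass
class Pipeline:
    """Runs named stages in order, feeding each output into the next stage."""

    stages: List[Stage] = field(default_factory=list)

    def add(self, name: str, fn: Callable[[Any], Any]) -> "Pipeline":
        self.stages.append((name, fn))
        return self  # returning self allows chained .add(...) calls

    def run(self, data: Any = None) -> Any:
        for _name, fn in self.stages:
            data = fn(data)
        return data


def load_data(_: Any) -> list:
    # Toy samples made up for illustration: (features, label).
    return [([0.0, 2.0], 0), ([8.0, 10.0], 1), ([1.0, 1.0], 0), ([9.0, 7.0], 1)]


def preprocess(samples: list) -> list:
    # Scale all features into [0, 1] by the largest value in the data.
    peak = max(max(x) for x, _ in samples)
    return [([v / peak for v in x], y) for x, y in samples]


def embed(samples: list) -> list:
    # Stand-in for a learned representation: the mean of each feature vector.
    return [(sum(x) / len(x), y) for x, y in samples]


def predict(samples: list) -> list:
    # Threshold the one-dimensional embedding to get a class prediction.
    return [(int(z > 0.5), y) for z, y in samples]


def evaluate(pairs: list) -> float:
    # Fraction of predictions that match the labels.
    return sum(p == y for p, y in pairs) / len(pairs)


pipeline = (
    Pipeline()
    .add("load", load_data)
    .add("preprocess", preprocess)
    .add("embed", embed)
    .add("predict", predict)
    .add("evaluate", evaluate)
)
accuracy = pipeline.run()
print(f"accuracy = {accuracy:.2f}")
```

Because every stage shares the same call shape, a stage such as `preprocess` or `embed` could in principle be reused unchanged in another pipeline for a different data type, which is the kind of reuse and reduced repetition the abstract describes.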

