EfficientBioAI: Making Bioimaging AI Models Efficient in Energy, Latency and Representation

by Yu Zhou, et al.

Artificial intelligence (AI) is now widely used in bioimage analysis, but the efficiency of AI models, such as their energy consumption and latency, can no longer be ignored given growing model size and complexity and the fast-growing analysis needs of modern biomedical studies. Just as we can compress large images for efficient storage and sharing, we can also compress AI models for efficient application and deployment. In this work, we present EfficientBioAI, a plug-and-play toolbox that compresses a given bioimaging AI model so that it runs with significantly reduced energy cost and inference time on both CPU and GPU, without compromising accuracy. In some cases, prediction accuracy can even increase after compression, since the compression procedure can remove redundant information in the model representation and thereby reduce overfitting. Across four different bioimage analysis applications, we observed roughly 2-5 times speed-up during inference and 30-80% savings in energy. Cutting the runtime of large-scale bioimage analysis from days to hours, or completing a two-minute bioimaging AI model inference in near real time, will open new doors for method development and biomedical discovery. We hope our toolbox will facilitate resource-constrained bioimaging AI, accelerate large-scale AI-based quantitative biological studies in an eco-friendly way, and stimulate further research on the efficiency of bioimaging AI.
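To make the idea of compressing a model concrete, the sketch below shows symmetric 8-bit quantization of a handful of weights in plain Python. This is only an illustration: the abstract does not state which compression techniques EfficientBioAI applies, so the choice of quantization, the function names `quantize_int8` and `dequantize`, and the example weights are all assumptions made here, not the toolbox's API.

```python
def quantize_int8(weights):
    """Map floats to integer codes in [-127, 127] plus a single shared scale.

    Illustrative assumption: symmetric per-tensor quantization, one common
    form of model compression. Not taken from the EfficientBioAI toolbox.
    """
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float weights from the integer codes."""
    return [c * scale for c in codes]


# Hypothetical weights, chosen only for the demonstration.
weights = [0.12, -0.5, 0.33, 0.0, 0.49]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)

# The rounding error per weight is bounded by half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
```

Storing each weight as an 8-bit code instead of a 32-bit float takes a quarter of the memory, at the cost of a rounding error of at most half a quantization step per weight. Whether such a reduction translates into speed-ups or energy savings depends on the hardware and inference engine.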




