Exploiting the relationship between visual and textual features in social networks for image classification with zero-shot deep learning

07/08/2021
by Luis Lucas, et al.

One of the main issues in unsupervised machine learning is the cost of processing large datasets and extracting useful information from them. In this work, we propose a classifier ensemble based on the transfer learning capabilities of the CLIP neural network architecture in multimodal environments (image and text) from social media. For this purpose, we used the InstaNY100K dataset and proposed a validation approach based on sampling techniques. Our experiments, which consist of image classification tasks using the labels of the Places dataset, first consider only the visual part and then add the associated texts as support. The results show that pretrained neural networks such as CLIP can be successfully applied to image classification with little fine-tuning, and that taking into account the texts associated with the images can help improve accuracy, depending on the goal. This appears to be a promising research direction.
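To illustrate the general idea of classifying first from the image alone and then with the associated text as support, here is a minimal sketch in Python. It is not the authors' ensemble: it assumes that image, caption, and label-prompt embeddings have already been produced by a CLIP-like encoder into a shared space, and it uses toy 3-dimensional vectors in their place. The function names, the `temperature` value, and the fusion weight `alpha` are all hypothetical choices made for this example.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def softmax(scores, temperature=1.0):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp((s - m) / temperature) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def zero_shot_probs(query_emb, label_embs, temperature=0.1):
    """Probability of each label given similarity to its prompt embedding."""
    return softmax([cosine(query_emb, e) for e in label_embs], temperature)

def classify(image_emb, caption_emb, label_embs, labels, alpha=0.5):
    """Zero-shot label from the image, optionally supported by the caption.

    alpha is an assumed fusion weight: 0 uses only the image,
    1 uses only the caption. Pass caption_emb=None for image-only mode.
    """
    p_img = zero_shot_probs(image_emb, label_embs)
    if caption_emb is None:
        fused = p_img
    else:
        p_txt = zero_shot_probs(caption_emb, label_embs)
        fused = [(1 - alpha) * pi + alpha * pt for pi, pt in zip(p_img, p_txt)]
    best = max(range(len(labels)), key=fused.__getitem__)
    return labels[best], fused

# Toy stand-ins for encoder outputs (hypothetical values, not real CLIP embeddings).
labels = ["park", "restaurant", "subway station"]
label_embs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
image_emb = [0.6, 0.5, 0.1]    # visually ambiguous between the first two labels
caption_emb = [0.1, 0.9, 0.1]  # caption points clearly at the second label

image_only_label, image_only_probs = classify(image_emb, None, label_embs, labels)
fused_label, fused_probs = classify(image_emb, caption_emb, label_embs, labels)
```

With these toy vectors the image alone favors "park", while adding the caption shifts the prediction to "restaurant". Whether text support helps on real data depends on the task, as the abstract notes.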

