b'Rishabh Jain'

research

∙ 09/19/2023

MAGIC-TBR: Multiview Attention Fusion for Transformer-based Bodily Behavior Recognition in Group Settings

Bodily behavioral language is an important social cue, and its automated...

0 Surbhi Madan, et al. ∙

research

∙ 09/07/2023

Automatic Concept Embedding Model (ACEM): No train-time concepts, No issue!

Interpretability and explainability of neural networks is continuously i...

0 Rishabh Jain, et al. ∙

research

∙ 07/24/2023

Adaptation of Whisper models to child speech recognition

Automatic Speech Recognition (ASR) systems often struggle with transcrib...

0 Rishabh Jain, et al. ∙

research

∙ 07/18/2023

Neural Priority Queues for Graph Neural Networks

Graph Neural Networks (GNNs) have shown considerable success in neural a...

0 Rishabh Jain, et al. ∙

research

∙ 07/10/2023

FODVid: Flow-guided Object Discovery in Videos

Segmentation of objects in a video is challenging due to the nuances suc...

0 Silky Singh, et al. ∙

research

∙ 03/27/2023

Parameter Efficient Local Implicit Image Function Network for Face Segmentation

Face parsing is defined as the per-pixel labeling of images containing h...

0 Mausoom Sarkar, et al. ∙

research

∙ 11/17/2022

UMFuse: Unified Multi View Fusion for Human Editing applications

The vision community has explored numerous pose guided human editing met...

0 Rishabh Jain, et al. ∙

research

∙ 11/13/2022

VGFlow: Visibility guided Flow Network for Human Reposing

The task of human reposing involves generating a realistic image of a pe...

0 Rishabh Jain, et al. ∙

research

∙ 08/30/2022

Analysis of Distributed Deep Learning in the Cloud

We aim to resolve this problem by introducing a comprehensive distribute...

0 Aakash Sharma, et al. ∙

research

∙ 04/06/2022

Can Self-Supervised Learning solve the problem of child speech recognition?

Despite recent advancements in deep learning technologies, Child Speech ...

0 Rishabh Jain, et al. ∙

research

∙ 03/22/2022

A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis

Speech synthesis has come a long way as current text-to-speech (TTS) mod...

0 Rishabh Jain, et al. ∙

research

∙ 09/14/2021

ZFlow: Gated Appearance Flow-based Virtual Try-on with 3D Priors

Image-based virtual try-on involves synthesizing perceptually convincing...

0 Ayush Chopra, et al. ∙

research

∙ 07/24/2020

Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data

Can we develop visually grounded dialog agents that can efficiently adap...

6 Michael Cogswell, et al. ∙

research

∙ 09/23/2019

On Model Stability as a Function of Random Seed

In this paper, we focus on quantifying model stability as a function of ...

0 Pranava Madhyastha, et al. ∙

research

∙ 06/18/2019

Model Explanations under Calibration

Explaining and interpreting the decisions of recommender systems are bec...

0 Rishabh Jain, et al. ∙

research

∙ 02/10/2019

EvalAI: Towards Better Evaluation Systems for AI Agents

We introduce EvalAI, an open source platform for evaluating and comparin...

24 Deshraj Yadav, et al. ∙

research

∙ 12/20/2018

nocaps: novel object captioning at scale

Image captioning models have achieved impressive results on datasets con...

46 Harsh Agrawal, et al. ∙

Rishabh Jain

Featured Co-authors

Sign in with Google

Consider DeepAI Pro