Srijan Das

research

∙ 08/25/2023

Attending Generalizability in Course of Deep Fake Detection by Exploring Multi-task Learning

This work explores various ways of exploring multi-task learning (MTL) t...

0 Pranav Balaji, et al. ∙

research

∙ 06/15/2023

Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers

Human perception of surroundings is often guided by the various poses pr...

0 Dominick Reilly, et al. ∙

research

∙ 07/01/2022

Video + CLIP Baseline for Ego4D Long-term Action Anticipation

In this report, we introduce our adaptation of image-text models for lon...

0 Srijan Das, et al. ∙

research

∙ 06/23/2022

Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space

Humans are remarkably flexible in understanding viewpoint changes due to...

0 Jinghuan Shang, et al. ∙

research

∙ 12/07/2021

STC-mix: Space, Time, Channel mixing for Self-supervised Video Representation

Contrastive representation learning of videos highly relies on the avail...

0 Srijan Das, et al. ∙

research

∙ 12/07/2021

ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints

Learning self-supervised video representation predominantly focuses on d...

0 Srijan Das, et al. ∙

research

∙ 12/07/2021

MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection

Action detection is an essential and challenging task, especially for de...

5 Rui Dai, et al. ∙

research

∙ 10/26/2021

CTRN: Class-Temporal Relational Network for Action Detection

Action detection is an essential and challenging task, especially for de...

0 Rui Dai, et al. ∙

research

∙ 08/20/2021

Weakly-supervised Joint Anomaly Detection and Classification

Anomaly activities such as robbery, explosion, accidents, etc. need imme...

0 Snehashis Majhi, et al. ∙

research

∙ 08/08/2021

Learning an Augmented RGB Representation with Cross-Modal Knowledge Distillation for Action Detection

In video understanding, most cross-modal knowledge distillation (KD) met...

0 Rui Dai, et al. ∙

research

∙ 05/17/2021

VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living

Many attempts have been made towards combining RGB and 3D poses for the ...

0 Srijan Das, et al. ∙

research

∙ 10/28/2020

Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity Detection

This work aims at building a large scale dataset with daily-living activ...

14 Rui Dai, et al. ∙

research

∙ 07/06/2020

VPN: Learning Video-Pose Embedding for Activities of Daily Living

In this paper, we focus on the spatio-temporal aspect of recognizing Act...

10 Srijan Das, et al. ∙

research

∙ 02/01/2018

A Fusion of Appearance based CNNs and Temporal evolution of Skeleton with LSTM for Daily Living Action Recognition

In this paper, we propose efficient method which combines skeleton infor...

0 Srijan Das, et al. ∙

Srijan Das

Featured Co-authors

Sign in with Google

Consider DeepAI Pro