Yoichi Sato

research

∙ 05/11/2023

Surgical tool classification and localization: results and methods from the MICCAI 2022 SurgToolLoc challenge

The ability to automatically detect and track surgical instruments in en...

17 Aneeq Zia, et al. ∙

research

∙ 03/10/2023

Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction

The Multiplane Image (MPI), containing a set of fronto-parallel RGBA lay...

0 Mingfang Zhang, et al. ∙

research

∙ 02/07/2023

Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos

Object affordance is an important concept in hand-object interaction, pr...

0 Zecheng Yu, et al. ∙

research

∙ 08/04/2022

Surgical Skill Assessment via Video Semantic Aggregation

Automated video-based assessment of surgical skills is a promising task ...

8 Zhenqiang Li, et al. ∙

research

∙ 07/23/2022

CompNVS: Novel View Synthesis with Scene Completion

We introduce a scalable framework for novel view synthesis from RGB-D im...

0 Zuoyue Li, et al. ∙

research

∙ 07/12/2022

Compound Prototype Matching for Few-shot Action Recognition

Few-shot action recognition aims to recognize novel action classes using...

0 Yifei Huang, et al. ∙

research

∙ 06/11/2022

Precise Affordance Annotation for Egocentric Action Video Datasets

Object affordance is an important concept in human-object interaction, p...

0 Zecheng Yu, et al. ∙

research

∙ 06/10/2022

Object Instance Identification in Dynamic Environments

We study the problem of identifying object instances in a dynamic enviro...

0 Takuma Yagi, et al. ∙

research

∙ 06/05/2022

Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey

In this survey, we present comprehensive analysis of 3D hand pose estima...

0 Takehiko Ohkawa, et al. ∙

research

∙ 03/16/2022

Domain Adaptive Hand Keypoint and Pixel Localization in the Wild

We aim to improve the performance of regressing hand keypoints and segme...

0 Takehiko Ohkawa, et al. ∙

research

∙ 02/28/2022

Background Mixup Data Augmentation for Hand and Object-in-Contact Detection

Detecting the positions of human hands and objects-in-contact (hand-obje...

0 Koya Tango, et al. ∙

research

∙ 12/02/2021

Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips

First-person action recognition is a challenging task in video understan...

0 Lijin Yang, et al. ∙

research

∙ 12/02/2021

Leveraging Human Selective Attention for Medical Image Analysis with Limited Training Data

The human gaze is a cost-efficient physiological data that reveals human...

15 Yifei Huang, et al. ∙

research

∙ 10/19/2021

Hand-Object Contact Prediction via Motion-Based Pseudo-Labeling and Guided Progressive Label Correction

Every hand-object interaction begins with contact. Despite predicting th...

7 Takuma Yagi, et al. ∙

research

∙ 09/01/2021

Spatio-Temporal Perturbations for Video Attribution

The attribution method provides a direction for interpreting opaque neur...

6 Zhenqiang Li, et al. ∙

research

∙ 07/06/2021

Foreground-Aware Stylization and Consensus Pseudo-Labeling for Domain Adaptation of First-Person Hand Segmentation

Hand segmentation is a crucial task in first-person vision. Since first-...

8 Takehiko Ohkawa, et al. ∙

research

∙ 06/18/2021

EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021: Team M3EM Technical Report

In this report, we describe the technical details of our submission to t...

0 Lijin Yang, et al. ∙

research

∙ 01/18/2021

GO-Finder: A Registration-Free Wearable System for Assisting Users in Finding Lost Objects via Hand-Held Object Discovery

People spend an enormous amount of time and effort looking for lost obje...

18 Takuma Yagi, et al. ∙

research

∙ 05/01/2020

A Comprehensive Study on Visual Explanations for Spatio-temporal Networks

Identifying and visualizing regions that are significant for a given dee...

9 Zhenqiang Li, et al. ∙

research

∙ 01/09/2019

Manipulation-skill Assessment from Videos with Spatial Attention Network

Recent advances in computer vision have made it possible to automaticall...

6 Zhenqiang Li, et al. ∙

research

∙ 01/07/2019

Mutual Context Network for Jointly Estimating Egocentric Gaze and Actions

In this work, we address two coupled tasks of gaze prediction and action...

8 Yifei Huang, et al. ∙

research

∙ 07/22/2018

Understanding hand-object manipulation by modeling the contextual relationship between actions, grasp types and object attributes

This paper proposes a novel method for understanding daily hand-object m...

0 Minjie Cai, et al. ∙

research

∙ 03/24/2018

Predicting Gaze in Egocentric Video by Learning Task-dependent Attention Transition

We present a new computational model for gaze prediction in egocentric v...

0 Yifei Huang, et al. ∙

research

∙ 11/30/2017

Future Person Localization in First-Person Videos

We present a new task that predicts future locations of people observed ...

0 Takuma Yagi, et al. ∙

research

∙ 07/05/2017

Fast Multi-frame Stereo Scene Flow with Motion Segmentation

We propose a new multi-frame method for efficiently computing scene flow...

0 Tatsunori Taniai, et al. ∙

research

∙ 06/14/2017

Hierarchical Gaussian Descriptors with Application to Person Re-Identification

Describing the color and textural information of a person image is one o...

0 Tetsu Matsukawa, et al. ∙

research

∙ 04/07/2017

Privacy-Preserving Visual Learning Using Doubly Permuted Homomorphic Encryption

We propose a privacy-preserving framework for learning visual classifier...

0 Ryo Yonetani, et al. ∙

research

∙ 06/15/2016

Ego-Surfing: Person Localization in First-Person Videos Using Ego-Motion Signatures

We envision a future time when wearable cameras are worn by the masses a...

0 Ryo Yonetani, et al. ∙

research

∙ 03/28/2016

Continuous 3D Label Stereo Matching using Local Expansion Moves

We present an accurate stereo matching method using local expansion move...

0 Tatsunori Taniai, et al. ∙
