
-
"Train one, Classify one, Teach one" – Cross-surgery transfer learning for surgical step recognition
Prior work demonstrated the ability of machine learning to automatically...
read it
-
Orientation Matters: 6-DoF Autonomous Camera Movement for Minimally Invasive Surgery
We propose a new method for six-degree-of-freedom (6-DoF) autonomous cam...
read it
-
SAFCAR: Structured Attention Fusion for Compositional Action Recognition
We present a general framework for compositional action recognition – i....
read it
-
Fine-grained activity recognition for assembly videos
In this paper we address the task of recognizing assembly actions as a s...
read it
-
Nothing But Geometric Constraints: A Model-Free Method for Articulated Object Pose Estimation
We propose an unsupervised vision-based system to estimate the joint con...
read it
-
Autonomously Navigating a Surgical Tool Inside the Eye by Learning from Demonstration
A fundamental challenge in retinal surgery is safely navigating a surgic...
read it
-
Deep Hiearchical Multi-Label Classification Applied to Chest X-Ray Abnormality Taxonomies
CXRs are a crucial and extraordinarily common diagnostic tool, leading t...
read it
-
Learning Representations of Endoscopic Videos to Detect Tool Presence Without Supervision
In this work, we explore whether it is possible to learn representations...
read it
-
Opportunities and Challenges for Next Generation Computing
Computing has dramatically changed nearly every aspect of our lives, fro...
read it
-
Anatomy-Aware Siamese Network: Exploiting Semantic Asymmetry for Accurate Pelvic Fracture Detection in X-ray Images
Visual cues of enforcing bilaterally symmetric anatomies as normal findi...
read it
-
Learning Geocentric Object Pose in Oblique Monocular Images
An object's geocentric pose, defined as the height above ground and orie...
read it
-
Artificial Intelligence-based Clinical Decision Support for COVID-19 – Where Art Thou?
The COVID-19 crisis has brought about new clinical questions, new workfl...
read it
-
Semantic Image Manipulation Using Scene Graphs
Image manipulation can be considered a special case of image generation ...
read it
-
Extremely Dense Point Correspondences using a Learned Feature Descriptor
High-quality 3D reconstructions from endoscopy video play an important r...
read it
-
Car Pose in Context: Accurate Pose Estimation with Ground Plane Constraints
Scene context is a powerful constraint on the geometry of objects within...
read it
-
Zero-shot Recognition of Complex Action Sequences
Zero-shot video classification for fine-grained activity recognition has...
read it
-
RSA: Randomized Simulation as Augmentation for Robust Human Action Recognition
Despite the rapid growth in datasets for video activity, stable robust a...
read it
-
Action Recognition Using Volumetric Motion Representations
Traditional action recognition models are constructed around the paradig...
read it
-
"Good Robot!": Efficient Reinforcement Learning for Multi-Step Visual Tasks via Reward Shaping
In order to learn effectively, robots must be able to extract the intang...
read it
-
Self-supervised Dense 3D Reconstruction from Monocular Endoscopic Video
We present a self-supervised learning-based pipeline for dense 3D recons...
read it
-
Automated Surgical Activity Recognition with One Labeled Sequence
Prior work has demonstrated the feasibility of automated activity recogn...
read it
-
sharpDARTS: Faster and More Accurate Differentiable Architecture Search
Neural Architecture Search (NAS) has been a source of dramatic improveme...
read it
-
Artificial Intelligence for Social Good
The Computing Community Consortium (CCC), along with the White House Off...
read it
-
Semantic Stereo for Incidental Satellite Images
The increasingly common use of incidental satellite images for stereo re...
read it
-
Evaluating Methods for End-User Creation of Robot Task Plans
How can we enable users to create effective, perception-driven task plan...
read it
-
Training Frankenstein's Creature to Stack: HyperTree Architecture Search
We propose HyperTrees for the low cost automatic design of multiple-inpu...
read it
-
Towards automatic initialization of registration algorithms using simulated endoscopy images
Registering images from different modalities is an active area of resear...
read it
-
Unsupervised Learning for Surgical Motion by Learning to Predict the Future
We show that it is possible to learn meaningful representations of surgi...
read it
-
Surgical Data Science: A Consensus Perspective
Surgical data science is a scientific discipline with the objective of i...
read it
-
Endoscopic navigation in the absence of CT imaging
Clinical examinations that involve endoscopic exploration of the nasal c...
read it
-
Visual Robot Task Planning
Prospection, the act of predicting the consequences of many possible fut...
read it
-
Guide Me: Interacting with Deep Networks
Interaction and collaboration between humans and intelligent machines ha...
read it
-
A Unified Framework for Multi-View Multi-Class Object Pose Estimation
One core challenge in object pose estimation is to ensure accurate and r...
read it
-
Occupancy Map Prediction Using Generative and Fully Convolutional Networks for Vehicle Navigation
Fast, collision-free motion through unknown environments remains a chall...
read it
-
Deep Supervision with Intermediate Concepts
Recent data-driven approaches to scene interpretation predominantly pose...
read it
-
Learning to Imagine Manipulation Goals for Robot Task Planning
Prospection is an important part of how humans come up with new task pla...
read it
-
Adversarial Deep Structured Nets for Mass Segmentation from Mammograms
Mass segmentation provides effective morphological features which are im...
read it
-
Temporal and Physical Reasoning for Perception-Based Robotic Manipulation
Accurate knowledge of object poses is crucial to successful robotic mani...
read it
-
Advances in Artificial Intelligence Require Progress Across all of Computer Science
Advances in Artificial Intelligence require progress across all of compu...
read it
-
Analyzing and Exploiting NARX Recurrent Neural Networks for Long-Term Dependencies
Recurrent neural networks (RNNs) have achieved state-of-the-art performa...
read it
-
Regularizing Face Verification Nets For Pain Intensity Regression
Limited labeled data are available for the research of estimating facial...
read it
-
Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing
Monocular 3D object parsing is highly desirable in various scenarios inc...
read it
-
Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses
Many prediction tasks contain uncertainty. In some cases, uncertainty is...
read it
-
Temporal Convolutional Networks for Action Segmentation and Detection
The ability to identify and temporally segment fine-grained human action...
read it
-
Anatomically Constrained Video-CT Registration via the V-IMLOP Algorithm
Functional endoscopic sinus surgery (FESS) is a surgical procedure used ...
read it
-
Temporal Convolutional Networks: A Unified Approach to Action Segmentation
The dominant paradigm for video-based action segmentation is composed of...
read it
-
SANTIAGO: Spine Association for Neuron Topology Improvement and Graph Optimization
Developing automated and semi-automated solutions for reconstructing wir...
read it
-
Recognizing Surgical Activities with Recurrent Neural Networks
We apply recurrent neural networks to the task of recognizing surgical a...
read it
-
Segmental Spatiotemporal CNNs for Fine-grained Action Segmentation
Joint segmentation and classification of fine-grained actions is importa...
read it
-
Automated Objective Surgical Skill Assessment in the Operating Room Using Unstructured Tool Motion
Previous work on surgical skill assessment using intraoperative tool mot...
read it