Real-Time Cattle Interaction Recognition via Triple-stream Network

09/06/2022
by   Yang Yang, et al.
0

In stockbreeding of beef cattle, computer vision-based approaches have been widely employed to monitor cattle conditions (e.g. the physical, physiology, and health). To this end, the accurate and effective recognition of cattle action is a prerequisite. Generally, most existing models are confined to individual behavior that uses video-based methods to extract spatial-temporal features for recognizing the individual actions of each cattle. However, there is sociality among cattle and their interaction usually reflects important conditions, e.g. estrus, and also video-based method neglects the real-time capability of the model. Based on this, we tackle the challenging task of real-time recognizing interactions between cattle in a single frame in this paper. The pipeline of our method includes two main modules: Cattle Localization Network and Interaction Recognition Network. At every moment, cattle localization network outputs high-quality interaction proposals from every detected cattle and feeds them into the interaction recognition network with a triple-stream architecture. Such a triple-stream network allows us to fuse different features relevant to recognizing interactions. Specifically, the three kinds of features are a visual feature that extracts the appearance representation of interaction proposals, a geometric feature that reflects the spatial relationship between cattle, and a semantic feature that captures our prior knowledge of the relationship between the individual action and interaction of cattle. In addition, to solve the problem of insufficient quantity of labeled data, we pre-train the model based on self-supervised learning. Qualitative and quantitative evaluation evidences the performance of our framework as an effective method to recognize cattle interaction in real time.

READ FULL TEXT

page 1

page 6

page 7

research
05/22/2018

Pose-Based Two-Stream Relational Networks for Action Recognition in Videos

Recently, pose-based action recognition has gained more and more attenti...
research
01/04/2019

Intelligent Intersection: Two-Stream Convolutional Networks for Real-time Near Accident Detection in Traffic Video

In Intelligent Transportation System, real-time systems that monitor and...
research
07/22/2023

Two-stream Multi-level Dynamic Point Transformer for Two-person Interaction Recognition

As a fundamental aspect of human life, two-person interactions contain m...
research
02/04/2018

Human Action Adverb Recognition: ADHA Dataset and A Three-Stream Hybrid Model

We introduce the first benchmark for a new problem --- recognizing human...
research
06/24/2021

Exploring Stronger Feature for Temporal Action Localization

Temporal action localization aims to localize starting and ending time w...
research
03/07/2023

SKGHOI: Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection

Detecting human-object interactions (HOIs) is a challenging problem in c...
research
07/02/2023

Human-to-Human Interaction Detection

A comprehensive understanding of interested human-to-human interactions ...

Please sign up or login with your details

Forgot password? Click here to reset