Where is this? Video geolocation based on neural network features

10/22/2018
by Salvador Medina, et al.

In this work we propose a method that geolocates videos within a delimited, widespread area based solely on the frames' visual content. Our method tackles video geolocation through traditional image retrieval techniques, using Google Street View as the reference. To this end, we represent images with the deep learning features obtained from NetVLAD, since the similarity between two such feature vectors can be measured by the L2 norm of their difference. We also propose a family of voting-based methods that aggregate frame-wise geolocation results and thereby boost the video geolocation result. The best aggregation found in our experiments considers both NetVLAD and SIFT similarity, as well as the geolocation density of the most similar results. To test our method, we gathered a new video dataset from the downtown Pittsburgh area to benefit and stimulate more work in this area. Our system achieved a precision of 90 from the original position.
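The retrieval-plus-voting idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`top_k_matches`, `geolocate_video`), the parameters `k` and `radius_m`, and the use of planar coordinates in meters are all assumptions made for illustration, and the SIFT re-ranking mentioned in the abstract is left out. Each frame descriptor retrieves its nearest reference descriptors by L2 distance, and the candidate location with the most neighboring candidates (a simple density vote) is returned.

```python
import numpy as np


def top_k_matches(frame_desc, ref_descs, k=5):
    """Return indices and L2 distances of the k reference descriptors
    closest to a single frame descriptor."""
    dists = np.linalg.norm(ref_descs - frame_desc, axis=1)
    order = np.argsort(dists)[:k]
    return order, dists[order]


def geolocate_video(frame_descs, ref_descs, ref_coords, k=5, radius_m=50.0):
    """Aggregate frame-wise retrievals with a density-based vote.

    Assumptions (hypothetical, not from the paper):
    - ref_coords are planar (x, y) positions in meters, so plain
      Euclidean distance is a reasonable proximity measure;
    - every retrieved candidate gets one vote per candidate that lies
      within radius_m of it, including itself.
    """
    # Collect the locations of the top-k matches of every frame.
    candidates = np.vstack(
        [ref_coords[top_k_matches(desc, ref_descs, k)[0]] for desc in frame_descs]
    )
    # Pairwise distances between all candidate locations.
    diffs = candidates[:, None, :] - candidates[None, :, :]
    pair_dists = np.linalg.norm(diffs, axis=2)
    # Density vote: count candidates inside the radius of each candidate.
    votes = (pair_dists <= radius_m).sum(axis=1)
    # Ties are broken in favor of the earliest candidate.
    return candidates[np.argmax(votes)]
```

In this sketch a single outlier frame cannot win the vote as long as the remaining frames retrieve locations that cluster together, which is the intuition behind aggregating over frames rather than trusting any one of them.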

research
08/20/2019

ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning

In this paper we introduce ViSiL, a Video Similarity Learning architectu...
research
08/05/2019

A Fast Content-Based Image Retrieval Method Using Deep Visual Features

Fast and scalable Content-Based Image Retrieval using visual features is...
research
04/16/2021

Self-supervised Video Retrieval Transformer Network

Content-based video retrieval aims to find videos from a large video dat...
research
06/21/2023

Key Frame Extraction with Attention Based Deep Neural Networks

Automatic keyframe detection from videos is an exercise in selecting sce...
research
09/22/2016

Pose-Selective Max Pooling for Measuring Similarity

In this paper, we deal with two challenges for measuring the similarity ...
research
03/13/2023

Unsupervised HDR Image and Video Tone Mapping via Contrastive Learning

Capturing high dynamic range (HDR) images (videos) is attractive because...
research
02/10/2018

2-gram-based Phonetic Feature Generation for Convolutional Neural Network in Assessment of Trademark Similarity

A trademark is a mark used to identify various commodities. If same or s...
