Simple feature pyramid network for weakly supervised object localization using multi-scale information

A Two-Branch Network for Weakly Supervised Object Localization

Electronics ◽

10.3390/electronics9060955 ◽

2020 ◽

Vol 9 (6) ◽

pp. 955

Author(s):

Chang Sun ◽

Yibo Ai ◽

Sheng Wang ◽

Weidong Zhang

Keyword(s):

Ground Truth ◽

Object Localization ◽

Feature Extracting ◽

Multi Scale ◽

Main Challenge ◽

Branch Model ◽

Localization Precision ◽

Object Parts ◽

Weakly Supervised ◽

Scale Detection

Weakly supervised object localization (WSOL) has attracted intense interest in computer vision for instance level annotations. As a hot research topic, a number of existing works concentrated on utilizing convolutional neural network (CNN)-based methods, which are powerful in extracting and representing features. The main challenge in CNN-based WSOL methods is to obtain features covering the entire target objects, not only the most discriminative object parts. To overcome this challenge and to improve the detection performance of feature extracting related WSOL methods, a CNN-based two-branch model was presented in this paper to locate objects using supervised learning. Our method contained two branches, including a detection branch and a self-attention branch. During the training process, the two branches interacted with each other by regarding the segmentation mask from the other branch as the pseudo ground truth labels of itself. Our model was able to focus on capturing the information of all the object parts due to the self-attention mechanism. Additionally, we embedded multi-scale detection into our two-branch method to output two-scale features. We evaluated our two-branch network on the CUB-200-2011 and VOC2007 datasets. The pointing localization, intersection over union (IoU) localization, and correct localization precision (CorLoc) results demonstrated competitive performance with other state-of-the-art methods in WSOL.

Download Full-text

Multi-Scale Low-Discriminative Feature Reactivation for Weakly Supervised Object Localization

IEEE Transactions on Image Processing ◽

10.1109/tip.2021.3091833 ◽

2021 ◽

pp. 1-1

Author(s):

Bo Wang ◽

Chunfeng Yuan ◽

Bing Li ◽

Xinmiao Ding ◽

Zeya Lia ◽

...

Keyword(s):

Object Localization ◽

Multi Scale ◽

Discriminative Feature ◽

Weakly Supervised

Download Full-text

Contrastive consistent feature learning for weakly supervised object localization semantic segmentation

Neurocomputing ◽

10.1016/j.neucom.2021.03.023 ◽

2021 ◽

Author(s):

Minsong Ki ◽

Youngjung Uh ◽

Wonyoung Lee ◽

Hyeran Byun

Keyword(s):

Feature Learning ◽

Semantic Segmentation ◽

Object Localization ◽

Consistent Feature ◽

Weakly Supervised

Download Full-text

Feature Pyramid Hierarchies for Multi-scale Temporal Action Detection

2020 25th International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr48806.2021.9411986 ◽

2021 ◽

Author(s):

Jiayu He ◽

Guohui Li ◽

Jun Lei

Keyword(s):

Action Detection ◽

Multi Scale ◽

Feature Pyramid ◽

Temporal Action

Download Full-text

Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet

Sensors ◽

10.3390/s21051820 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1820

Author(s):

Xiaotao Shao ◽

Qing Wang ◽

Wei Yang ◽

Yun Chen ◽

Yi Xie ◽

...

Keyword(s):

Semantic Information ◽

Detection System ◽

Pedestrian Detection ◽

Detection Accuracy ◽

The Public ◽

Scale Feature ◽

Detection Algorithms ◽

Multi Scale ◽

Art Works ◽

Feature Pyramid

The existing pedestrian detection algorithms cannot effectively extract features of heavily occluded targets which results in lower detection accuracy. To solve the heavy occlusion in crowds, we propose a multi-scale feature pyramid network based on ResNet (MFPN) to enhance the features of occluded targets and improve the detection accuracy. MFPN includes two modules, namely double feature pyramid network (FPN) integrated with ResNet (DFR) and repulsion loss of minimum (RLM). We propose the double FPN which improves the architecture to further enhance the semantic information and contours of occluded pedestrians, and provide a new way for feature extraction of occluded targets. The features extracted by our network can be more separated and clearer, especially those heavily occluded pedestrians. Repulsion loss is introduced to improve the loss function which can keep predicted boxes away from the ground truths of the unrelated targets. Experiments carried out on the public CrowdHuman dataset, we obtain 90.96% AP which yields the best performance, 5.16% AP gains compared to the FPN-ResNet50 baseline. Compared with the state-of-the-art works, the performance of the pedestrian detection system has been boosted with our method.

Download Full-text

Weakly Supervised Object Localization on grocery shelves using simple FCN and Synthetic Dataset

Proceedings of the 11th Indian Conference on Computer Vision, Graphics and Image Processing ◽

10.1145/3293353.3293367 ◽

2018 ◽

Cited By ~ 1

Author(s):

Srikrishna Varadarajan ◽

Muktabh Mayank Srivastava

Keyword(s):

Synthetic Dataset ◽

Object Localization ◽

Weakly Supervised

Download Full-text

Where to Look?: Mining Complementary Image Regions for Weakly Supervised Object Localization

2021 IEEE Winter Conference on Applications of Computer Vision (WACV) ◽

10.1109/wacv48630.2021.00105 ◽

2021 ◽

Author(s):

Sadbhavana Babar ◽

Sukhendu Das

Keyword(s):

Object Localization ◽

Weakly Supervised ◽

Complementary Image

Download Full-text

MM-FPN: Multi-path and Multi-scale Feature Pyramid Network for Object Detection

10.1109/isceic53685.2021.00072 ◽

2021 ◽

Author(s):

Sheng Dong ◽

Jiaxin Zhang ◽

Zehui Qu

Keyword(s):

Object Detection ◽

Scale Feature ◽

Multi Scale ◽

Feature Pyramid

Download Full-text

Rethinking Class Activation Mapping for Weakly Supervised Object Localization

Computer Vision – ECCV 2020 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-58555-6_37 ◽

2020 ◽

pp. 618-634

Author(s):

Wonho Bae ◽

Junhyug Noh ◽

Gunhee Kim

Keyword(s):

Object Localization ◽

Weakly Supervised ◽

Activation Mapping

Download Full-text

Extended Feature Pyramid Network with Adaptive Scale Training Strategy and Anchors for Object Detection in Aerial Images

Remote Sensing ◽

10.3390/rs12050784 ◽

2020 ◽

Vol 12 (5) ◽

pp. 784 ◽

Cited By ~ 1

Author(s):

Wei Guo ◽

Weihong Li ◽

Weiguo Gong ◽

Jinkai Cui

Keyword(s):

Neural Network ◽

Object Detection ◽

Semantic Information ◽

Aerial Images ◽

Training Strategy ◽

The Public ◽

Multi Scale ◽

Shallow Layer ◽

The Neural Network ◽

Feature Pyramid

Multi-scale object detection is a basic challenge in computer vision. Although many advanced methods based on convolutional neural networks have succeeded in natural images, the progress in aerial images has been relatively slow mainly due to the considerably huge scale variations of objects and many densely distributed small objects. In this paper, considering that the semantic information of the small objects may be weakened or even disappear in the deeper layers of neural network, we propose a new detection framework called Extended Feature Pyramid Network (EFPN) for strengthening the information extraction ability of the neural network. In the EFPN, we first design the multi-branched dilated bottleneck (MBDB) module in the lateral connections to capture much more semantic information. Then, we further devise an attention pathway for better locating the objects. Finally, an augmented bottom-up pathway is conducted for making shallow layer information easier to spread and further improving performance. Moreover, we present an adaptive scale training strategy to enable the network to better recognize multi-scale objects. Meanwhile, we present a novel clustering method to achieve adaptive anchors and make the neural network better learn data features. Experiments on the public aerial datasets indicate that the presented method obtain state-of-the-art performance.

Download Full-text