Simultaneous Image Reconstruction and Feature Learning with 3D-CNNs for Image Set–Based Classification

2021 ◽  
Vol 2 (2) ◽  
pp. 1-13
Author(s):  
Xinyu Zhang ◽  
Xiaocui Li ◽  
Xiao-Yuan Jing ◽  
Li Cheng

Image set–based classification has attracted substantial research interest because of its broad applications. Recently, many methods based on feature learning or dictionary learning have been developed for this problem, and some have achieved encouraging results. However, most of them transform the image set into a 2D matrix or use 2D convolutional neural networks (CNNs) for feature learning, so spatial and temporal information is lost. Moreover, these methods extract features from the original images, which may exhibit large intra-class diversity. To address these issues, we propose simultaneous image reconstruction with deep learning and feature learning with 3D-CNNs (SIRFL) for image set classification. The proposed SIRFL approach consists of a deep image reconstruction network and a 3D-CNN-based feature learning network. The deep image reconstruction network reduces the diversity of images from the same set, while the feature learning network effectively retains spatial and temporal information by using 3D-CNNs. Extensive experimental results on five widely used datasets show that our SIRFL approach is a strong competitor to state-of-the-art image set classification methods.
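The key point above is that a 3D convolution slides over the temporal axis as well as the spatial ones, so the stacked image set is not flattened. A minimal numpy sketch (not the SIRFL architecture itself, just the underlying operation) makes the shape behavior concrete:

```python
import numpy as np

def conv3d_valid(volume, kernel):
    """Naive 'valid' 3D convolution (cross-correlation, as in CNNs).

    volume: (T, H, W) stack of frames from one image set.
    kernel: (kt, kh, kw) filter.
    Output has shape (T-kt+1, H-kh+1, W-kw+1): the temporal axis
    survives, unlike a 2D convolution applied to a flattened matrix.
    """
    T, H, W = volume.shape
    kt, kh, kw = kernel.shape
    out = np.zeros((T - kt + 1, H - kh + 1, W - kw + 1))
    for t in range(out.shape[0]):
        for i in range(out.shape[1]):
            for j in range(out.shape[2]):
                out[t, i, j] = np.sum(volume[t:t + kt, i:i + kh, j:j + kw] * kernel)
    return out

stack = np.random.rand(8, 16, 16)                 # 8 frames, 16x16 each
feat = conv3d_valid(stack, np.ones((3, 3, 3)) / 27.0)
print(feat.shape)                                  # (6, 14, 14)
```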

Author(s):  
Yan Bai ◽  
Yihang Lou ◽  
Yongxing Dai ◽  
Jun Liu ◽  
Ziqian Chen ◽  
...  

Vehicle Re-Identification (ReID) has attracted considerable research effort because of its great significance to public security. In vehicle ReID, we aim to learn features that are powerful in discriminating the subtle differences between visually similar vehicles, and also robust to different orientations of the same vehicle. However, these two characteristics are hard to encapsulate in a single feature representation under unified supervision. Here we propose a Disentangled Feature Learning Network (DFLNet) to learn orientation-specific and common features concurrently, which are discriminative at the detail level and invariant to orientation, respectively. Moreover, to use these two types of features effectively for ReID, we further design a feature metric alignment scheme that ensures the consistency of their metric scales. Experiments show the effectiveness of our method, which achieves state-of-the-art performance on three challenging datasets.
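The abstract does not spell out how the metric scales are aligned; one common way to put two feature types on a comparable scale before fusing them is per-vector L2 normalization, so distances computed on either half are bounded alike. This is an illustrative sketch of that idea, not DFLNet's actual scheme:

```python
import numpy as np

def align_and_fuse(f_specific, f_common):
    """Project each feature type onto the unit sphere so that distances
    computed on either part live on a comparable scale, then concatenate."""
    a = f_specific / (np.linalg.norm(f_specific) + 1e-12)
    b = f_common / (np.linalg.norm(f_common) + 1e-12)
    return np.concatenate([a, b])

fused = align_and_fuse(np.array([3.0, 4.0]), np.array([0.0, 5.0]))
```

Without such normalization, the feature type with the larger raw magnitude would dominate any distance-based matching.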


Author(s):  
D. Franklin Vinod ◽  
V. Vasudevan

Background: With the explosive growth of global data, the term Big Data describes datasets whose enormous size demands detailed analysis. Big data analytics reveals hidden patterns and latent correlations among values. The major challenges in Big Data analysis stem from increases in volume, variety, and velocity. Capturing images with multi-directional views motivates image set classification, an attractive research topic in volumetric-based medical image processing. Methods: This paper proposes Local N-ary Ternary Patterns (LNTP) and a Modified Deep Belief Network (MDBN) to alleviate dimensionality and robustness issues. Initially, the proposed LNTP-MDBN uses a filtering technique to identify and remove dependent and independent noise from the images. Smoothing and normalization of the filtered images then improve their intensity. Results: LNTP-based feature extraction categorizes the heterogeneous images into different categories and extracts features from each category. Based on the extracted features, the modified DBN finally classifies the normal and abnormal categories in the image set. Conclusion: A comparative analysis of the proposed LNTP-MDBN against existing pattern extraction and DBN learning models, in terms of classification accuracy and runtime, confirms its effectiveness in mining applications.
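The paper's LNTP variant is not specified in the abstract; the classic local ternary pattern it builds on codes each neighbour of a pixel as +1, 0, or -1 depending on whether it exceeds, matches, or falls below the centre within a tolerance. A minimal sketch of that base operator, under the assumption of a 3x3 neighbourhood and a fixed threshold:

```python
import numpy as np

def ternary_code(patch, thresh=5):
    """Classic local ternary pattern for one 3x3 patch.

    Each of the 8 neighbours is coded against the centre value c:
      +1 if neighbour > c + thresh, -1 if neighbour < c - thresh, else 0.
    Returns the 8 codes in row-major order (centre excluded).
    """
    c = patch[1, 1]
    neigh = np.delete(patch.flatten(), 4)   # drop the centre pixel
    return np.where(neigh > c + thresh, 1,
                    np.where(neigh < c - thresh, -1, 0))

patch = np.arange(9).reshape(3, 3) * 10     # centre value is 40
codes = ternary_code(patch, thresh=5)
```

The ternary codes are typically split into a positive and a negative binary map and histogrammed over the image to form the final descriptor.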


Electronics ◽  
2021 ◽  
Vol 10 (3) ◽  
pp. 325
Author(s):  
Zhihao Wu ◽  
Baopeng Zhang ◽  
Tianchen Zhou ◽  
Yan Li ◽  
Jianping Fan

In this paper, we develop a practical approach for the automatic detection of discrimination actions in social images. Firstly, an image set is established in which various discrimination actions and relations are manually labeled. To the best of our knowledge, this is the first work to create a dataset for discrimination action recognition and relationship identification. Secondly, a practical approach is developed to automatically detect and identify discrimination actions and relationships in social images. Thirdly, the task of relationship identification is seamlessly integrated with discrimination action recognition into a single network, the Co-operative Visual Translation Embedding++ network (CVTransE++). We also compared our proposed method with numerous state-of-the-art methods, and the experimental results demonstrate that it significantly outperforms them.
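The network name suggests a visual translation embedding, where a relation triple (subject, predicate, object) is modeled by requiring subject + predicate ≈ object in a learned vector space. A toy sketch of that scoring rule (an assumption based on the VTransE family, not the CVTransE++ architecture itself):

```python
import numpy as np

def relation_score(subj, pred, obj):
    """Translation-embedding plausibility score for a (s, p, o) triple:
    smaller ||s + p - o|| means the relation fits the embedding better."""
    return float(np.linalg.norm(subj + pred - obj))

s = np.array([1.0, 0.0])
p = np.array([0.0, 1.0])
good = relation_score(s, p, s + p)      # consistent triple -> 0.0
bad = relation_score(s, p, np.array([5.0, 5.0]))
```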


Author(s):  
Jorge F. Lazo ◽  
Aldo Marzullo ◽  
Sara Moccia ◽  
Michele Catellani ◽  
Benoit Rosa ◽  
...  

Abstract Purpose Ureteroscopy is an efficient endoscopic minimally invasive technique for the diagnosis and treatment of upper tract urothelial carcinoma. During ureteroscopy, automatic segmentation of the hollow lumen is of primary importance, since it indicates the path the endoscope should follow. In order to obtain an accurate segmentation of the hollow lumen, this paper presents an automatic method based on convolutional neural networks (CNNs). Methods The proposed method is based on an ensemble of four parallel CNNs that simultaneously process single-frame and multi-frame information. Two architectures serve as core models, namely a U-Net based on residual blocks (m1) and Mask-RCNN (m2), which are fed with single still frames I(t). The other two models (M1, M2) are modifications of the former, consisting of an additional stage that uses 3D convolutions to process temporal information. M1 and M2 are fed with triplets of frames (I(t-1), I(t), I(t+1)) to produce the segmentation for I(t). Results The proposed method was evaluated on a custom dataset of 11 videos (2673 frames) collected and manually annotated from 6 patients. We obtain a Dice similarity coefficient of 0.80, outperforming previous state-of-the-art methods. Conclusion The obtained results show that spatio-temporal information can be effectively exploited by the ensemble model to improve hollow-lumen segmentation in ureteroscopic images. The method is also effective in the presence of poor visibility, occasional bleeding, or specular reflections.
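The Dice similarity coefficient reported above is the standard overlap metric for binary segmentation masks; a minimal reference implementation:

```python
import numpy as np

def dice(pred, target, eps=1e-7):
    """Dice similarity coefficient between two binary masks:
    2*|A ∩ B| / (|A| + |B|), with eps guarding the empty-mask case."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    inter = np.logical_and(pred, target).sum()
    return (2.0 * inter + eps) / (pred.sum() + target.sum() + eps)

a = np.array([[1, 1], [0, 0]])
b = np.array([[1, 0], [0, 0]])
score = dice(a, b)   # overlap of 1 pixel over 3 total -> 2/3
```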


2021 ◽  
Vol 16 (1) ◽  
pp. 1-23
Author(s):  
Bo Liu ◽  
Haowen Zhong ◽  
Yanshan Xiao

Multi-view classification aims to design a multi-view learning strategy that trains a classifier from multi-view data, which are easily collected in practice. Most existing works address multi-view classification by assuming the multi-view data are collected with precise information. In real-life applications, however, we often collect uncertain multi-view data because the collection process is corrupted by noise. For this case, this article proposes a novel approach, called uncertain multi-view learning with support vector machine (UMV-SVM), to cope with the problem of multi-view learning with uncertain data. The method first enforces agreement among all the views to seek complementary information in the multi-view data, and takes the uncertainty of the data into consideration by modeling the reachability area of the noise. It then proposes an iterative framework to solve the UMV-SVM model and obtain the multi-view classifier for prediction. Extensive experiments on real-life datasets show that the proposed UMV-SVM achieves better performance for uncertain multi-view classification than state-of-the-art multi-view classification methods.
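The abstract's "reachability area of the noise" idea is reminiscent of robust SVM formulations, where a sample known only up to a noise ball of radius r must satisfy the margin in the worst case: y(w·x + b) ≥ 1 becomes y(w·x + b) ≥ 1 + r‖w‖. A sketch of the corresponding per-sample hinge loss (an assumed robust-SVM formulation for illustration, not the paper's exact UMV-SVM objective):

```python
import numpy as np

def robust_hinge(w, b, x, y, radius):
    """Hinge loss with the margin inflated by the worst-case displacement
    of x inside an L2 noise ball of the given radius."""
    margin = y * (np.dot(w, x) + b)
    return max(0.0, 1.0 + radius * np.linalg.norm(w) - margin)

w = np.array([1.0, 0.0])
x = np.array([2.0, 0.0])
loss_clean = robust_hinge(w, 0.0, x, +1, radius=0.0)   # ordinary hinge
loss_noisy = robust_hinge(w, 0.0, x, +1, radius=2.0)   # stricter margin
```

With radius 0 this reduces to the ordinary hinge loss; a larger radius demands a wider margin, so points near the boundary incur loss even when correctly classified.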


2021 ◽  
Vol 13 (8) ◽  
pp. 1455
Author(s):  
Jifang Pei ◽  
Weibo Huo ◽  
Chenwei Wang ◽  
Yulin Huang ◽  
Yin Zhang ◽  
...  

Multiview synthetic aperture radar (SAR) images contain much richer information for automatic target recognition (ATR) than a single view. It is desirable to establish a reasonable multiview ATR scheme and design an effective ATR algorithm that thoroughly learns and extracts this classification information, so that superior SAR ATR performance can be achieved. Hence, a general processing framework applicable to multiview SAR ATR is first given in this paper, which provides an effective approach to ATR system design. Then, a new ATR method using a multiview deep feature learning network is designed based on the proposed framework. The proposed neural network has a multiple-input parallel topology and several distinct deep feature learning modules, with which the significant classification features, i.e., the intra-view and inter-view features existing in the input multiview SAR images, are learned simultaneously and thoroughly. Therefore, the proposed multiview deep feature learning network can achieve excellent SAR ATR performance. Experimental results demonstrate the superiority of the proposed multiview SAR ATR method under various operating conditions.
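The multiple-input parallel topology described above can be pictured as one feature-extraction branch per view, with a cross-view fusion step capturing inter-view structure. A minimal numpy sketch of that wiring (the extractors here are placeholders, not the paper's deep feature learning modules):

```python
import numpy as np

def multiview_features(views, extractors):
    """Parallel-topology feature extraction over a set of views.

    views: list of per-view input vectors.
    extractors: one feature function per view (parallel branches).
    Returns the concatenation of all intra-view features plus an
    inter-view summary (here simply the mean across branches).
    """
    intra = [f(v) for f, v in zip(extractors, views)]  # intra-view features
    inter = np.mean(intra, axis=0)                     # inter-view feature
    return np.concatenate(intra + [inter])

views = [np.array([1.0, 2.0]), np.array([3.0, 4.0])]
branches = [lambda v: v, lambda v: v]                  # placeholder extractors
fused = multiview_features(views, branches)
```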

