scholarly journals IOU-Guided Siamese Tracking

2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Jianjun Bao ◽  
Haibo Wang ◽  
Chen Lv ◽  
Ke Luo ◽  
Xiaolin Shen

Target tracking is currently a hot research topic in machine vision. The traditional target tracking algorithm based on the generative model selects target features manually, which has a simple structure and fast running speed, but it cannot meet the requirements of algorithm accuracy in complex scenes. Compared with traditional algorithms, due to the good performance, the tracking method based on full convolutional network has become one of the important methods of target tracking. However, the RPN-based Siamese network lacks positional reliability when predicting the target area. Aiming at the low tracking accuracy of the RPN-based Siamese network, this paper proposes an improved framework model named IoU-guided SiamRPN (IG-SiamRPN). In the proposed IG-SiamRPN, the IoU-guided branch is first constructed and sample pairs are generated through data augmentation. Then, the Jittered RoI is constructed to train the network to realize the direct prediction of the localization confidence of the candidate area. Subsequently, a target selection method based on predicted IoU scores is proposed, which uses predicted IoU scores instead of classification scores to optimize the target decision strategy of the Siamese network. Finally, an optimization-based fine-tuning method for the Siamese network frame is proposed, which solves the problem of location degradation and improves the performance of the algorithm. Compared with other state-of-the-art target tracking algorithms, experimental results on popular databases demonstrate that the proposed IG-SiamRPN can achieve better performance in both tracking accuracy and robustness.

Author(s):  
Ding Ma ◽  
Yong Zhou ◽  
Rui Yao ◽  
Jiaqi Zhao ◽  
Bing Liu ◽  
...  

This paper combines the boundary box regression with the training data occlusion processing method, the occlusion problem is more accurate and the tracking accuracy is improved. The occlusion problem is now the major challenge in target tracking. This paper puts forward a weakly monitoring framework to address this problem. The main idea is to randomly hide the most discriminating patches in the input images, forcing the network to focus on other relevant parts. Our method only needs to modify the inputs, no need to hide any patches during the test.


2020 ◽  
Vol 6 (1) ◽  
Author(s):  
Svenja Ipsen ◽  
Sven Böttger ◽  
Holger Schwegmann ◽  
Floris Ernst

AbstractUltrasound (US) imaging, in contrast to other image guidance techniques, offers the distinct advantage of providing volumetric image data in real-time (4D) without using ionizing radiation. The goal of this study was to perform the first quantitative comparison of three different 4D US systems with fast matrix array probes and real-time data streaming regarding their target tracking accuracy and system latency. Sinusoidal motion of varying amplitudes and frequencies was used to simulate breathing motion with a robotic arm and a static US phantom. US volumes and robot positions were acquired online and stored for retrospective analysis. A template matching approach was used for target localization in the US data. Target motion measured in US was compared to the reference trajectory performed by the robot to determine localization accuracy and system latency. Using the robotic setup, all investigated 4D US systems could detect a moving target with sub-millimeter accuracy. However, especially high system latency increased tracking errors substantially and should be compensated with prediction algorithms for respiratory motion compensation.


Diagnostics ◽  
2021 ◽  
Vol 11 (6) ◽  
pp. 1052
Author(s):  
Leang Sim Nguon ◽  
Kangwon Seo ◽  
Jung-Hyun Lim ◽  
Tae-Jun Song ◽  
Sung-Hyun Cho ◽  
...  

Mucinous cystic neoplasms (MCN) and serous cystic neoplasms (SCN) account for a large portion of solitary pancreatic cystic neoplasms (PCN). In this study we implemented a convolutional neural network (CNN) model using ResNet50 to differentiate between MCN and SCN. The training data were collected retrospectively from 59 MCN and 49 SCN patients from two different hospitals. Data augmentation was used to enhance the size and quality of training datasets. Fine-tuning training approaches were utilized by adopting the pre-trained model from transfer learning while training selected layers. Testing of the network was conducted by varying the endoscopic ultrasonography (EUS) image sizes and positions to evaluate the network performance for differentiation. The proposed network model achieved up to 82.75% accuracy and a 0.88 (95% CI: 0.817–0.930) area under curve (AUC) score. The performance of the implemented deep learning networks in decision-making using only EUS images is comparable to that of traditional manual decision-making using EUS images along with supporting clinical information. Gradient-weighted class activation mapping (Grad-CAM) confirmed that the network model learned the features from the cyst region accurately. This study proves the feasibility of diagnosing MCN and SCN using a deep learning network model. Further improvement using more datasets is needed.


2014 ◽  
Vol 904 ◽  
pp. 325-329
Author(s):  
Hong Wei Quan ◽  
Lin Chen ◽  
Dong Liang Peng

This paper addresses the problem of the joint target tracking and classification based on data fusion. In traditional methods, a separate suite of sensors and system models are used, target tracking and target classification are usually treated as separate problems. In our JTC framework, the link between target state and class is considered and the feasibility of JTC techniques is discussed. The tracking accuracy and classification probability are improved to some extent with the more accurate classification results from classifier based on data fusion feedback to state filter.


2020 ◽  
Author(s):  
Luis H. S. Vogado ◽  
Rodrigo M. S. Veras ◽  
Kelson R. T. Aires

Leukemia is a disorder that affects the bone marrow, causing uncontrolled production of leukocytes, impairing the transport of oxygen and causing blood coagulation problems. In this article, we propose a new computational tool, named LeukNet, a Convolutional Neural Network (CNN) architecture based on the VGG-16 convolutional blocks, to facilitate the leukemia diagnosis from blood smear images. We evaluated different architectures and fine-tuning methods using 18 datasets containing 3536 images with distinct characteristics of color, texture, contrast, and resolution. Additionally, data augmentation operations were applied to increase the training set by up to 20 times. The k-fold cross-validation (k = 5) results achieved 98.28% of accuracy. A cross-dataset validation technique, named LeaveOne-Dataset-Out Cross-Validation (LODOCV), is also proposed to evaluate the developed model’s generalization capability. The accuracy of using LODOCV on the ALL-IDB 1, ALL-IDB 2, and UFG datasets was 97.04%, 82.46%, and 70.24%, respectively, overcoming the current state-of-the-art results and offering new guidelines for image-based computer-aided diagnosis (CAD) systems in this area.


2020 ◽  
Author(s):  
Tuan Pham

Chest X-rays have been found to be very promising for assessing COVID-19 patients, especially for resolving emergency-department and urgent-care-center overcapacity. Deep-learning (DL) methods in artificial intelligence (AI) play a dominant role as high-performance classifiers in the detection of the disease using chest X-rays. While many new DL models have been being developed for this purpose, this study aimed to investigate the fine tuning of pretrained convolutional neural networks (CNNs) for the classification of COVID-19 using chest X-rays. Three pretrained CNNs, which are AlexNet, GoogleNet, and SqueezeNet, were selected and fine-tuned without data augmentation to carry out 2-class and 3-class classification tasks using 3 public chest X-ray databases. In comparison with other recently developed DL models, the 3 pretrained CNNs achieved very high classification results in terms of accuracy, sensitivity, specificity, precision, F1 score, and area under the receiver-operating-characteristic curve. AlexNet, GoogleNet, and SqueezeNet require the least training time among pretrained DL models, but with suitable selection of training parameters, excellent classification results can be achieved without data augmentation by these networks. The findings contribute to the urgent need for harnessing the pandemic by facilitating the deployment of AI tools that are fully automated and readily available in the public domain for rapid implementation.


2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Haibo Pang ◽  
Qi Xuan ◽  
Meiqin Xie ◽  
Chengming Liu ◽  
Zhanbo Li

Target tracking is a significant topic in the field of computer vision. In this paper, the target tracking algorithm based on deep Siamese network is studied. Aiming at the situation that the tracking process is not robust, such as drift or miss the target, the tracking accuracy and robustness of the algorithm are improved by improving the feature extraction part and online update part. This paper adds SE-block and temporal attention mechanism (TAM) to the framework of Siamese neural network. SE-block can refine and extract features; different channels are given different weights according to their importance which can improve the discrimination of the network and the recognition ability of the tracker. Temporal attention mechanism can update the target state by adjusting the weights of samples at current frame and historical frame to solve the model drift caused by the existence of similar background. We use cross-entropy loss to distinguish the targets in different sequences so that their distance in the feature domains is longer and the features are easier to identify. We train and test the network on three benchmarks and compare with several state-of-the-art tracking methods. The experimental results demonstrate that the algorithm proposed is superior to other methods in tracking effect diagram and evaluation criteria. The proposed algorithm can solve the occlusion problem effectively while ensuring the real-time performance in the process of tracking.


Counterfeit note has a disastrous impact on a country’s economy. The circulation of such fake notes not only diminishes the value of genuine note but also results in inflation. The feasible solution to this burning issue is to create awareness about the counterfeit notes among public and to equip them with a technology to detect fake notes on their own. Though there exist numerous research articles on detection of fake notes, they are not handy. The reason for this could be the unavailability or unaffordability in acquiring the equipment for the same. This paper proposes an approach whose implementation can easily be deployed on a smart phone and hence anyone with access to them can use the application to detect the fake notes. The proposed approach consists of the processing phases including image procurement, pre-processing, data augmentation, feature extraction and classification. ₹500 notes are considered for experimentation analysis. Out of 17 distinctive features, 3 such from the obverse side are considered to evaluate the genuineness of the note. Siamese neural network is employed to build a model for effective classification of the notes. The performance of the proposed approach is evaluated at 85% with respect to accuracy.


2021 ◽  
Vol 7 ◽  
pp. e371
Author(s):  
Elia Cano ◽  
José Mendoza-Avilés ◽  
Mariana Areiza ◽  
Noemi Guerra ◽  
José Longino Mendoza-Valdés ◽  
...  

Skin lesions are one of the typical symptoms of many diseases in humans and indicative of many types of cancer worldwide. Increased risks caused by the effects of climate change and a high cost of treatment, highlight the importance of skin cancer prevention efforts like this. The methods used to detect these diseases vary from a visual inspection performed by dermatologists to computational methods, and the latter has widely used automatic image classification applying Convolutional Neural Networks (CNNs) in medical image analysis in the last few years. This article presents an approach that uses CNNs with a NASNet architecture to recognize in a more accurate way, without segmentation, eight skin diseases. The model was trained end-to-end on Keras with augmented skin diseases images from the International Skin Imaging Collaboration (ISIC). The CNN architectures were initialized with weight from ImageNet, fine-tuned in order to discriminate well among the different types of skin lesions, and then 10-fold cross-validation was applied. Finally, some evaluation metrics are calculated as accuracy, sensitivity, and specificity and compare with other CNN trained architectures. This comparison shows that the proposed system offers higher accuracy results, with a significant reduction on the training paraments. To the best of our knowledge and based in the state-of-art recompiling in this work, the application of the NASNet architecture training with skin image lesion from ISIC archive for multi-class classification and evaluated by cross-validation, represents a novel skin disease classification system.


2021 ◽  
Vol 2107 (1) ◽  
pp. 012064
Author(s):  
S.M. Othman ◽  
Noorhazirah Sunar ◽  
Hassrizal H.B ◽  
A.H. Ismail ◽  
M.N. Ayob ◽  
...  

Abstract Electro-Hydraulic Actuator (EHA) system is a third order non-linear system which is highly suffer from system uncertainties such as Coulomb friction, viscous friction and pump leakage coefficient which makes this system more complicated for the designing of the controller. The Proportional-Integral-Derivative (PID) controller has proposed in this paper to control EHA system and main problem in its application is to tune the parameter to its optimum value. Two different methods are used to tune the PID controller which are trial and error and Ziegler-Nichols method. MATLAB Simulink is used to simulate the system. In order to determine the performance of EHA system for the position tracking. 3 different of external disturbance such as 0N, 5000N and 10000N has been injected into the system. Simulation results show that the Ziegler-Nichols fine tuning method provides the better tracking performance when compared to the trial and error method for every specific disturbance setting. The Ziegler Nichols method provides better disturbance rejection as the performances indexes such as percentage overshoot, settling time and steady state error are not affected by the varying of disturbance.


Sign in / Sign up

Export Citation Format

Share Document