scholarly journals Hybrid CNN-LSTM Model for Answer Identification

2019 ◽  
Vol 8 (3) ◽  
pp. 1163-1166

User quest for information has led to development of Question Answer (QA) system to provide relevant answers to user questions. The QA task are different than normal NLP tasks as they heavily depend to semantics and context of given data. Retrieving and predicting answers to verity of questions require understanding of question, relevance with context and identifying and retrieving of suitable answers. Deep learning helps to produce impressive performance as it employs deep neural network with automatic feature extraction methods. The paper proposes a hybrid model to identify suitable answer for posed question. The proposes power exploits the power of CNN for extracting features and ability of LSTM for considering long term dependencies and semantic of context and question. Paper provides a comparative analysis on deep learning methods useful for predicting answer with the proposed method .The model is implemented on twenty tasks of babI dataset of Facebook .

Author(s):  
G. Rama Janani

The paper is based on classification of respiratory illness like covid 19 and pneumonia by using deep learning. The symptoms of COVID-19 and pneumonia are similar. Due to this, it is often difficult to identify what is causing your condition without being tested for COVID-19 or other respiratory infections. To find out how COVID-19 and pneumonia differs from one another, this paper presents that a novel Convolutional Neural Network in Tensor Flow and Keras based Covid-19 pneumonia classification. The proposed system supported implements CNN using Pneumonia images to classify the Covid-19, normal, pneumonia. The knowledge from these studies can potentially help in diagnosis of the concerned disease. It is predicted that the success of the anticipated results will increase if the CNN method is supported by adding extra feature extraction methods for classifying covid-19 and pneumonia successfully thereby improving the efficacy and potential of using deep CNN to pictures.


2020 ◽  
Vol 11 (1) ◽  
Author(s):  
Takuya Maekawa ◽  
Kazuya Ohara ◽  
Yizhe Zhang ◽  
Matasaburo Fukutomi ◽  
Sakiko Matsumoto ◽  
...  

Abstract A comparative analysis of animal behavior (e.g., male vs. female groups) has been widely used to elucidate behavior specific to one group since pre-Darwinian times. However, big data generated by new sensing technologies, e.g., GPS, makes it difficult for them to contrast group differences manually. This study introduces DeepHL, a deep learning-assisted platform for the comparative analysis of animal movement data, i.e., trajectories. This software uses a deep neural network based on an attention mechanism to automatically detect segments in trajectories that are characteristic of one group. It then highlights these segments in visualized trajectories, enabling biologists to focus on these segments, and helps them reveal the underlying meaning of the highlighted segments to facilitate formulating new hypotheses. We tested the platform on a variety of trajectories of worms, insects, mice, bears, and seabirds across a scale from millimeters to hundreds of kilometers, revealing new movement features of these animals.


2021 ◽  
Vol 13 (13) ◽  
pp. 2575
Author(s):  
Jiangbo Xi ◽  
Ming Cong ◽  
Okan K. Ersoy ◽  
Weibao Zou ◽  
Chaoying Zhao ◽  
...  

Recently, deep learning has been successfully and widely used in hyperspectral image (HSI) classification. Considering the difficulty of acquiring HSIs, there are usually a small number of pixels used as the training instances. Therefore, it is hard to fully use the advantages of deep learning networks; for example, the very deep layers with a large number of parameters lead to overfitting. This paper proposed a dynamic wide and deep neural network (DWDNN) for HSI classification, which includes multiple efficient wide sliding window and subsampling (EWSWS) networks and can grow dynamically according to the complexity of the problems. The EWSWS network in the DWDNN was designed both in the wide and deep direction with transform kernels as hidden units. These multiple layers of kernels can extract features from the low to high level, and because they are extended in the wide direction, they can learn features more steadily and smoothly. The sliding windows with the stride and subsampling were designed to reduce the dimension of the features for each layer; therefore, the computational load was reduced. Finally, all the weights were only from the fully connected layer, and the iterative least squares method was used to compute them easily. The proposed DWDNN was tested with several HSI data including the Botswana, Pavia University, and Salinas remote sensing datasets with different numbers of instances (from small to big). The experimental results showed that the proposed method had the highest test accuracies compared to both the typical machine learning methods such as support vector machine (SVM), multilayer perceptron (MLP), radial basis function (RBF), and the recently proposed deep learning methods including the 2D convolutional neural network (CNN) and the 3D CNN designed for HSI classification.


Sensors ◽  
2021 ◽  
Vol 21 (8) ◽  
pp. 2846
Author(s):  
Anik Sen ◽  
Kaushik Deb ◽  
Pranab Kumar Dhar ◽  
Takeshi Koshiba

Recognizing the sport of cricket on the basis of different batting shots can be a significant part of context-based advertisement to users watching cricket, generating sensor-based commentary systems and coaching assistants. Due to the similarity between different batting shots, manual feature extraction from video frames is tedious. This paper proposes a hybrid deep-neural-network architecture for classifying 10 different cricket batting shots from offline videos. We composed a novel dataset, CricShot10, comprising uneven lengths of batting shots and unpredictable illumination conditions. Impelled by the enormous success of deep-learning models, we utilized a convolutional neural network (CNN) for automatic feature extraction, and a gated recurrent unit (GRU) to deal with long temporal dependency. Initially, conventional CNN and dilated CNN-based architectures were developed. Following that, different transfer-learning models were investigated—namely, VGG16, InceptionV3, Xception, and DenseNet169—which freeze all the layers. Experiment results demonstrated that the VGG16–GRU model outperformed the other models by attaining 86% accuracy. We further explored VGG16 and two models were developed, one by freezing all but the final 4 VGG16 layers, and another by freezing all but the final 8 VGG16 layers. On our CricShot10 dataset, these two models were 93% accurate. These results verify the effectiveness of our proposed architecture compared with other methods in terms of accuracy.


Author(s):  
A. Sokolova ◽  
A. Konushin

In this work we investigate the problem of people recognition by their gait. For this task, we implement deep learning approach using the optical flow as the main source of motion information and combine neural feature extraction with the additional embedding of descriptors for representation improvement. In order to find the best heuristics, we compare several deep neural network architectures, learning and classification strategies. The experiments were made on two popular datasets for gait recognition, so we investigate their advantages and disadvantages and the transferability of considered methods.


Author(s):  
Dong-Dong Chen ◽  
Wei Wang ◽  
Wei Gao ◽  
Zhi-Hua Zhou

Deep neural networks have witnessed great successes in various real applications, but it requires a large number of labeled data for training. In this paper, we propose tri-net, a deep neural network which is able to use massive unlabeled data to help learning with limited labeled data. We consider model initialization, diversity augmentation and pseudo-label editing simultaneously. In our work, we utilize output smearing to initialize modules, use fine-tuning on labeled data to augment diversity and eliminate unstable pseudo-labels to alleviate the influence of suspicious pseudo-labeled data. Experiments show that our method achieves the best performance in comparison with state-of-the-art semi-supervised deep learning methods. In particular, it achieves 8.30% error rate on CIFAR-10 by using only 4000 labeled examples.


2019 ◽  
Vol 9 (5) ◽  
pp. 940 ◽  
Author(s):  
Huseyin Polat ◽  
Homay Danaei Mehr

Lung cancer is the most common cause of cancer-related deaths worldwide. Hence, the survival rate of patients can be increased by early diagnosis. Recently, machine learning methods on Computed Tomography (CT) images have been used in the diagnosis of lung cancer to accelerate the diagnosis process and assist physicians. However, in conventional machine learning techniques, using handcrafted feature extraction methods on CT images are complicated processes. Hence, deep learning as an effective area of machine learning methods by using automatic feature extraction methods could minimize the process of feature extraction. In this study, two Convolutional Neural Network (CNN)-based models were proposed as deep learning methods to diagnose lung cancer on lung CT images. To investigate the performance of the two proposed models (Straight 3D-CNN with conventional softmax and hybrid 3D-CNN with Radial Basis Function (RBF)-based SVM), the altered models of two-well known CNN architectures (3D-AlexNet and 3D-GoogleNet) were considered. Experimental results showed that the performance of the two proposed models surpassed 3D-AlexNet and 3D-GoogleNet. Furthermore, the proposed hybrid 3D-CNN with SVM achieved more satisfying results (91.81%, 88.53% and 91.91% for accuracy rate, sensitivity and precision respectively) compared to straight 3D-CNN with softmax in the diagnosis of lung cancer.


Author(s):  
E.Yu. Shchetinin ◽  
A.V. Demidova ◽  
D.S. Kulyabov ◽  
L.A. Sevastyanov

In this paper, we propose an approach to solving the problem of recognizing skin lesions, namely melanoma, based on the analysis of dermoscopic images using deep learning methods. For this purpose, the architecture of a deep convolutional neural network was developed, which was applied to the processing of dermoscopic images of various skin lesions contained in the HAM10000 data set. The data under study were preprocessed to eliminate noise, contamination, and change the size and format of images. In addition, since the disease classes are unbalanced, a number of transformations were performed to balance them. The data obtained in this way were divided into two classes: Melanoma and Benign. Computer experiments using the built deep neural network based on the data obtained in this way have shown that the proposed approach provides 94% accuracy on the test sample, which exceeds similar results obtained by other algorithms.


2020 ◽  
Vol 02 (03) ◽  
pp. 7-12
Author(s):  
Elcin Nizami Huseyn ◽  

Generally, Parkinson’s disease (PD) in medicine is a long-term neurodegenerative and progressive disorder. In some brain parts, as the dopamine generating neurons die or they are damaged. Then people begin to have difficulty in walking, writing, speaking or making other basic missions Some of the indications of the disease worsen over time and thus result in increased acuteness of Parkinson's disease. We have proposed a methodology for the prognosis of Parkinson’s disease acuteness. In this scientific article, we used deep neural networks in UCI's Parkinson's telemonitoring voice dataset patients. We have utilized Keras and TensorFlow in Python deep learning library to implement our neural network for prognosis the PD acuteness. The correctness values obtained with our method are preferable than the correctness values specified in the previous research test. Key words: Parkinson's disease, Deep Learning, UCI, Python, Deep Neural Network, Keras, TensorFlow, UPDRS


Sign in / Sign up

Export Citation Format

Share Document