DNCON2: Improved protein contact prediction using two-level deep convolutional neural networks

Mapping Intimacies ◽

10.1101/222893 ◽

2017 ◽

Cited By ~ 4

Author(s):

Badri Adhikari ◽

Jie Hou ◽

Jianlin Cheng

Keyword(s):

Neural Networks ◽

Ab Initio ◽

Convolutional Neural Networks ◽

Structure Prediction ◽

Supplementary Information ◽

Deep Convolutional Neural Networks ◽

Arbitrary Length ◽

Contact Maps ◽

Link Type ◽

Improved Performance

AbstractMotivationSignificant improvements in the prediction of protein residue-residue contacts are observed in the recent years. These contacts, predicted using a variety of coevolution-based and machine learning methods, are the key contributors to the recent progress in ab initio protein structure prediction, as demonstrated in the recent CASP experiments. Continuing the development of new methods to reliably predict contact maps is essential to further improve ab initio structure prediction.ResultsIn this paper we discuss DNCON2, an improved protein contact map predictor based on two-level deep convolutional neural networks. It consists of six convolutional neural networks – the first five predict contacts at 6, 7.5, 8, 8.5, and 10 Å distance thresholds, and the last one uses these five predictions as additional features to predict final contact maps. On the free-modeling datasets in CASP10, 11, and 12 experiments, DNCON2 achieves mean precisions of 35%, 50%, and 53.4%, respectively, higher than 30.6% by MetaPSICOV on CASP10 dataset, 34% by MetaPSICOV on CASP11 dataset, and 46.3% by Raptor-X on CASP12 dataset, when top L/5 long-range contacts are evaluated. We attribute the improved performance of DNCON2 to the inclusion of short- and medium-range contacts into training, two-level approach to prediction, use of the state-of-the-art optimization and activation functions, and a novel deep learning architecture that allows each filter in a convolutional layer to access all the input features of a protein of arbitrary length.AvailabilityThe web server of DNCON2 is at http://sysbio.rnet.missouri.edu/dncon2/ where training and testing datasets as well as the predictions for CASP10, 11, and 12 free-modeling datasets can also be downloaded. Its source code is available at https://github.com/multicom-toolbox/DNCON2/[email protected] informationSupplementary data are available online.

Download Full-text

Improving prediction of protein secondary structure, backbone angles, solvent accessibility and contact numbers by using predicted contact maps and an ensemble of recurrent and residual convolutional neural networks

Bioinformatics ◽

10.1093/bioinformatics/bty1006 ◽

2018 ◽

Vol 35 (14) ◽

pp. 2403-2410 ◽

Cited By ~ 35

Author(s):

Jack Hanson ◽

Kuldip Paliwal ◽

Thomas Litfin ◽

Yuedong Yang ◽

Yaoqi Zhou

Keyword(s):

Neural Networks ◽

Protein Structure ◽

Secondary Structure ◽

Structure Prediction ◽

Rapid Expansion ◽

Solvent Accessible Surface Area ◽

Supplementary Information ◽

Test Set ◽

Contact Maps ◽

Unseen Data

Abstract Motivation Sequence-based prediction of one dimensional structural properties of proteins has been a long-standing subproblem of protein structure prediction. Recently, prediction accuracy has been significantly improved due to the rapid expansion of protein sequence and structure libraries and advances in deep learning techniques, such as residual convolutional networks (ResNets) and Long-Short-Term Memory Cells in Bidirectional Recurrent Neural Networks (LSTM-BRNNs). Here we leverage an ensemble of LSTM-BRNN and ResNet models, together with predicted residue-residue contact maps, to continue the push towards the attainable limit of prediction for 3- and 8-state secondary structure, backbone angles (θ, τ, ϕ and ψ), half-sphere exposure, contact numbers and solvent accessible surface area (ASA). Results The new method, named SPOT-1D, achieves similar, high performance on a large validation set and test set (≈1000 proteins in each set), suggesting robust performance for unseen data. For the large test set, it achieves 87% and 77% in 3- and 8-state secondary structure prediction and 0.82 and 0.86 in correlation coefficients between predicted and measured ASA and contact numbers, respectively. Comparison to current state-of-the-art techniques reveals substantial improvement in secondary structure and backbone angle prediction. In particular, 44% of 40-residue fragment structures constructed from predicted backbone Cα-based θ and τ angles are less than 6 Å root-mean-squared-distance from their native conformations, nearly 20% better than the next best. The method is expected to be useful for advancing protein structure and function prediction. Availability and implementation SPOT-1D and its data is available at: http://sparks-lab.org/. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

DeepAffinity: Interpretable Deep Learning of Compound-Protein Affinity through Unified Recurrent and Convolutional Neural Networks

10.1101/351601 ◽

2018 ◽

Cited By ~ 7

Author(s):

Mostafa Karimi ◽

Di Wu ◽

Zhangyang Wang ◽

Yang shen

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Supplementary Information ◽

Seamless Integration ◽

Source Codes ◽

Link Type ◽

Protein Affinity ◽

Protein Classes ◽

Deep Learning Model

AbstractMotivationDrug discovery demands rapid quantification of compound-protein interaction (CPI). However, there is a lack of methods that can predict compound-protein affinity from sequences alone with high applicability, accuracy, and interpretability.ResultsWe present a seamless integration of domain knowledges and learning-based approaches. Under novel representations of structurally-annotatedprotein sequences, a semi-supervised deep learning model that unifies recurrent and convolutional neural networks has been proposed to exploit both unlabeled and labeled data, for jointly encoding molecular representations and predicting affinities. Our representations and models outperform conventional options in achieving relative error in IC50 within 5-fold for test cases and 20-fold for protein classes not included for training. Performances for new protein classes with few labeled data are further improved by transfer learning. Furthermore, separate and joint attention mechanisms are developed and embedded to our model to add to its interpretability, as illustrated in case studies for predicting and explaining selective drug-target interactions. Lastly, alternative representations using protein sequences or compound graphs and a unified RNN/GCNN-CNN model using graph CNN (GCNN) are also explored to reveal algorithmic challenges ahead.AvailabilityData and source codes are available at https://github.com/Shen-Lab/[email protected] informationSupplementary data are available at http://shen-lab.github.io/deep-affinity-bioinf18-supp-rev.pdf.

Download Full-text

Deep learning methods for protein prediction problem

10.32469/10355/65461 ◽

2017 ◽

Author(s):

◽

Son Phong Nguyen

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Structure Prediction ◽

State Of The Art ◽

Protein Structures ◽

Distance Matrix ◽

Loop Modeling ◽

Deep Convolutional Neural Networks

[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT AUTHOR'S REQUEST.] Computational protein structure prediction is very important for many applications in bioinformatics. Many prediction methods have been developed, including Modeller, HHpred, I-TASSER, Robetta, and MUFOLD. In the process of predicting protein structures, it is essential to accurately assess the quality of generated models. Consensus quality assessment (QA) methods, such as Pcons-net and MULTICOM-refine, which are based on structure similarity, performed well on QA tasks. The drawback of consensus QA methods is that they require a pool of diverse models to work well, which is not always available. More importantly, they cannot evaluate the quality of a single protein model, which is a very common task in protein predictions and other applications. Although many single-model quality assessment methods, such as ProQ2, MQAPmulti, OPUS-CA, DOPE, DFIRE, and RW, etc. have been developed to address that problem, their accuracy is not good enough for most real applications. In this dissertation, based on the idea of using C-[alpha] atoms distance matrix and deep learning methods, two methods have been proposed for assessing quality of protein structures. First, a novel algorithm based on deep learning techniques, called DL-Pro, is proposed. From training examples of distance matrices corresponding to good and bad models, DL-Pro learns a stacked autoencoder network as a classifier. In experiments on selected targets from the Critical Assessment of Structure Prediction (CASP) competition, DL-Pro obtained promising results, outperforming state-of-the-art energy/scoring functions, including OPUS-CA, DOPE, DFIRE, and RW. Second, a new method DeepCon-QA is developed to predict quality of single protein model. Based on the idea of using protein vector representation and distance matrix, DeepCon-QA was able to achieve comparable performance with the best state-of-the-art QA method in our experiments. It also takes advantage the strength of deep convolutional neural networks to â€œlearnâ€ and â€œunderstandâ€ the input data to be able to predict output data precisely. On the other hand, this dissertation also proposes several new methods for solving loop modeling problem. Five new loop modeling methods based on machine learning techniques, called NearLooper, ConLooper, ResLooper, HyLooper1 and HyLooper2 are proposed. NearLooper is based on the nearest neighbor technique; ConLooper applies deep convolutional neural networks to predict CÎ± atoms distance matrix as an orientation-independent representation of protein structure; ResLooper uses residual neural networks instead of deep convolutional neural networks; HyLooper1 combines the results of NearLooper and ConLooper while HyLooper2 combines NearLooper and ResLooper. Three commonly used benchmarks for loop modeling are used to compare the performance between these methods and existing state-of-the-art methods. The experiment results show promising performance in which our best method improves existing state-of-the-art methods by 28% and 54% of average RMSD on two datasets while being comparable on the other one.

Download Full-text

Protein secondary structure prediction based on two dimensional deep convolutional neural networks

2017 3rd IEEE International Conference on Computer and Communications (ICCC) ◽

10.1109/compcomm.2017.8322886 ◽

2017 ◽

Cited By ~ 3

Author(s):

Yihui Liu ◽

Jinyong Cheng ◽

Yuming Ma ◽

Yehong Chen

Keyword(s):

Neural Networks ◽

Secondary Structure ◽

Convolutional Neural Networks ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Protein Secondary Structure ◽

Two Dimensional ◽

Protein Secondary Structure Prediction ◽

Deep Convolutional Neural Networks

Download Full-text

AUTOMATIC DIAGNOSIS OF BREAST CANCER IN HISTOLOGY IMAGES USING DEEP CONVOLUTIONAL NEURAL NETWORKS

KỶ YẾU HỘI NGHỊ KHOA HỌC CÔNG NGHỆ QUỐC GIA LẦN THỨ XI NGHIÊN CỨU CƠ BẢN VÀ ỨNG DỤNG CÔNG NGHỆ THÔNG TIN ◽

10.15625/vap.2018.0009 ◽

2018 ◽

Author(s):

Hung Le Minh ◽

Manh Mai Van ◽

Toan Tran Dinh ◽

Tot Tran Dac ◽

Tran Van Lang

Keyword(s):

Breast Cancer ◽

Neural Networks ◽

Convolutional Neural Networks ◽

Deep Convolutional Neural Networks ◽

Automatic Diagnosis

Download Full-text

CNN-based Classification of Degraded Images

Electronic Imaging ◽

10.2352/issn.2470-1173.2020.10.ipas-028 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 28-1-28-7 ◽

Cited By ~ 1

Author(s):

Kazuki Endo ◽

Masayuki Tanaka ◽

Masatoshi Okutomi

Keyword(s):

Neural Networks ◽

Image Restoration ◽

Image Classification ◽

Convolutional Neural Networks ◽

Deep Convolutional Neural Networks ◽

Alternative Approach ◽

Degraded Image ◽

Degraded Images ◽

Straightforward Approach

Classification of degraded images is very important in practice because images are usually degraded by compression, noise, blurring, etc. Nevertheless, most of the research in image classification only focuses on clean images without any degradation. Some papers have already proposed deep convolutional neural networks composed of an image restoration network and a classification network to classify degraded images. This paper proposes an alternative approach in which we use a degraded image and an additional degradation parameter for classification. The proposed classification network has two inputs which are the degraded image and the degradation parameter. The estimation network of degradation parameters is also incorporated if degradation parameters of degraded images are unknown. The experimental results showed that the proposed method outperforms a straightforward approach where the classification network is trained with degraded images only.

Download Full-text

Deep Convolutional Neural Networks Approach for Classification of Lung Diseases using X-Rays: COVID-19, Pneumonia, and Tuberculosis

International Journal of Performability Engineering ◽

10.23940/ijpe.20.09.p2.13321340 ◽

2020 ◽

Vol 16 (9) ◽

pp. 1332

Author(s):

Patil Narayani ◽

Ingole Kalyani ◽

Mangala T. Rajani

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Lung Diseases ◽

X Rays ◽

Deep Convolutional Neural Networks

Download Full-text

Dorsal Hand Vein Identification Based on Deep Convolutional Neural Networks and Visualizing Intermediate Layer Activations

European Journal of Science and Technology ◽

10.31590/ejosat.713601 ◽

2020 ◽

pp. 512-521

Author(s):

Khaled ALASHIK ◽

Saleh HUSSIN ◽

Remzi YILDIRIM ◽

Abubaker ALGUTTAR

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Intermediate Layer ◽

Deep Convolutional Neural Networks ◽

Dorsal Hand ◽

Hand Vein ◽

Dorsal Hand Vein

Download Full-text

Deep convolutional neural networks for cardiovascular vulnerable plaque detection

MATEC Web of Conferences ◽

10.1051/matecconf/201927702024 ◽

2019 ◽

Vol 277 ◽

pp. 02024 ◽

Cited By ~ 1

Author(s):

Lincan Li ◽

Tong Jia ◽

Tianqi Meng ◽

Yizhe Liu

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Vulnerable Plaque ◽

Recall Rate ◽

Superior Performance ◽

Learning Approaches ◽

Deep Convolutional Neural Networks ◽

Vulnerable Plaques ◽

Plaque Detection

In this paper, an accurate two-stage deep learning method is proposed to detect vulnerable plaques in ultrasonic images of cardiovascular. Firstly, a Fully Convonutional Neural Network (FCN) named U-Net is used to segment the original Intravascular Optical Coherence Tomography (IVOCT) cardiovascular images. We experiment on different threshold values to find the best threshold for removing noise and background in the original images. Secondly, a modified Faster RCNN is adopted to do precise detection. The modified Faster R-CNN utilize six-scale anchors (122,162,322,642,1282,2562) instead of the conventional one scale or three scale approaches. First, we present three problems in cardiovascular vulnerable plaque diagnosis, then we demonstrate how our method solve these problems. The proposed method in this paper apply deep convolutional neural networks to the whole diagnostic procedure. Test results show the Recall rate, Precision rate, IoU (Intersection-over-Union) rate and Total score are 0.94, 0.885, 0.913 and 0.913 respectively, higher than the 1st team of CCCV2017 Cardiovascular OCT Vulnerable Plaque Detection Challenge. AP of the designed Faster RCNN is 83.4%, higher than conventional approaches which use one-scale or three-scale anchors. These results demonstrate the superior performance of our proposed method and the power of deep learning approaches in diagnose cardiovascular vulnerable plaques.

Download Full-text

An automated framework for efficiently designing deep convolutional neural networks in genomics

Nature Machine Intelligence ◽

10.1038/s42256-021-00316-z ◽

2021 ◽

Author(s):

Zijun Zhang ◽

Christopher Y. Park ◽

Chandra L. Theesfeld ◽

Olga G. Troyanskaya

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Deep Convolutional Neural Networks

Download Full-text