DANNP: an efficient artificial neural network pruning tool

2017 ◽  
Vol 3 ◽  
pp. e137 ◽  
Author(s):  
Mona Alshahrani ◽  
Othman Soufan ◽  
Arturo Magana-Mora ◽  
Vladimir B. Bajic

Background: Artificial neural networks (ANNs) are a robust class of machine learning models and are a frequent choice for solving classification problems. However, determining the structure of an ANN is not trivial, as a large number of weights (connection links) may lead to overfitting the training data. Although several ANN pruning algorithms have been proposed for the simplification of ANNs, these algorithms are not able to efficiently cope with the intricate ANN structures required for complex classification problems.
Methods: We developed DANNP, a web-based tool that implements parallelized versions of several ANN pruning algorithms. The DANNP tool uses a modified version of the Fast Compressed Neural Network software, implemented in C++, to considerably reduce the running time of the ANN pruning algorithms we implemented. In addition to the performance evaluation of the pruned ANNs, we systematically compared the set of features that remained in the pruned ANN with those obtained by different state-of-the-art feature selection (FS) methods.
Results: Although the ANN pruning algorithms are not entirely parallelizable, DANNP was able to speed up ANN pruning by up to eight times on a 32-core machine, compared to the serial implementations. To assess the impact of ANN pruning by the DANNP tool, we used 16 datasets from different domains. In eight out of the 16 datasets, DANNP significantly reduced the number of weights by 70%–99%, while maintaining a competitive or better model performance compared to the unpruned ANN. Finally, we used a naïve Bayes classifier derived with the features selected as a byproduct of the ANN pruning and demonstrated that its accuracy is comparable to that obtained by classifiers trained with the features selected by several state-of-the-art FS methods. The FS ranking methodology proposed in this study allows users to identify the most discriminant features of the problem at hand.
To the best of our knowledge, DANNP (publicly available at www.cbrc.kaust.edu.sa/dannp) is the only available and on-line accessible tool that provides multiple parallelized ANN pruning options. Datasets and DANNP code can be obtained at www.cbrc.kaust.edu.sa/dannp/data.php and https://doi.org/10.5281/zenodo.1001086.
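As a rough illustration of what removing weights from an ANN means, the sketch below applies magnitude-based pruning to a single weight matrix. This is a common baseline technique chosen here only for illustration; the abstract does not name the pruning algorithms implemented in DANNP, so the function `prune_by_magnitude` and the example weights are assumptions, not the tool's method.

```python
def prune_by_magnitude(weights, fraction):
    """Zero out the given fraction of weights with the smallest absolute value.

    `weights` is a list of rows (one layer's weight matrix). This is a generic
    baseline, not one of the algorithms implemented in DANNP.
    """
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    magnitudes = [abs(w) for row in weights for w in row]
    n_prune = int(len(magnitudes) * fraction)
    if n_prune == 0:
        return [row[:] for row in weights]
    # Largest magnitude that still falls inside the pruned set.
    threshold = sorted(magnitudes)[n_prune - 1]
    pruned, removed = [], 0
    for row in weights:
        new_row = []
        for w in row:
            # The counter keeps the number of removed weights exact under ties.
            if abs(w) <= threshold and removed < n_prune:
                new_row.append(0.0)
                removed += 1
            else:
                new_row.append(w)
        pruned.append(new_row)
    return pruned


# Hypothetical 2x3 weight matrix; prune the smallest 50% of the weights.
example_weights = [[0.8, -0.05, 0.3], [0.01, -0.6, 0.02]]
pruned_weights = prune_by_magnitude(example_weights, 0.5)
```

In practice a pruned network is usually retrained or re-evaluated after each pruning step, which is where the running-time cost discussed in the abstract arises.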

2020 ◽  
Vol 25 (2) ◽  
pp. 145-152
Author(s):  
Yan Kuchin ◽  
Ravil Mukhamediev ◽  
Kirill Yakunin ◽  
Janis Grundspenkis ◽  
Adilkhan Symagulov

Abstract Machine learning (ML) methods are nowadays widely used to automate geophysical studies. Some ML algorithms are used to solve lithological classification problems during the uranium mining process. One of the key aspects of using classical ML methods is choosing data features and estimating their influence on the classification. This paper presents a quantitative assessment of the impact of expert opinions on the classification process. In other words, we prepared the data, identified the experts, and performed a series of experiments with and without supplying the expert identifier to the input of the automatic classifier during training and testing. A feedforward artificial neural network (ANN) was used as the classifier. The results of the experiments show that the ANN's “knowledge” of which expert interpreted the data improves the quality of the automatic classification in terms of accuracy (by 5%) and recall (by 20%). However, because the input parameters of the model may depend on each other, the SHapley Additive exPlanations (SHAP) method was used to further assess the impact of the expert identifier. SHAP made it possible to assess the degree of parameter influence. It revealed that the expert ID is at least two times more influential than any of the other input parameters of the neural network. This circumstance imposes significant restrictions on the application of ANNs to the task of lithological classification at uranium deposits.
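SHAP attributions are based on Shapley values, which can be computed exactly when the number of inputs is small. The following is a minimal sketch under stated assumptions: the toy linear model, its weights, and the three-feature input (with the last feature standing in for an expert identifier) are hypothetical and are not taken from the paper. Features absent from a coalition are replaced by a baseline value, which is one common convention.

```python
from itertools import combinations
from math import factorial


def shapley_values(model, x, baseline):
    """Exact Shapley values of `model` at input `x` relative to `baseline`.

    Enumerates every coalition of features, so it is only practical for a
    handful of inputs. The SHAP library approximates these values.
    """
    n = len(x)

    def value(subset):
        # Features outside the coalition are set to their baseline value.
        mixed = [x[i] if i in subset else baseline[i] for i in range(n)]
        return model(mixed)

    phi = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        for size in range(n):
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                coalition = set(subset)
                total += weight * (value(coalition | {i}) - value(coalition))
        phi.append(total)
    return phi


def toy_model(features):
    # Hypothetical linear score; the third input plays the role of an expert ID.
    log_a, log_b, expert_id = features
    return 1.0 * log_a + 0.5 * log_b + 2.0 * expert_id


phi = shapley_values(toy_model, x=[1.0, 1.0, 1.0], baseline=[0.0, 0.0, 0.0])
```

For a linear model each Shapley value reduces to the weight times the distance from the baseline, so the third feature receives the largest attribution here, and the attributions sum to the difference between the model output at `x` and at the baseline.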


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Zhonghui Thong ◽  
Jolena Ying Ying Tan ◽  
Eileen Shuzhen Loo ◽  
Yu Wei Phua ◽  
Xavier Liang Shun Chan ◽  
...  

Abstract Regression models are often used to predict the age of an individual based on methylation patterns. However, artificial neural networks (ANNs) were recently shown to be more accurate for age prediction. Additionally, the impact of ethnicity and sex on our previous regression model has not been studied. Furthermore, there is currently no age prediction study investigating the lower limit of input DNA at the bisulfite treatment stage prior to pyrosequencing. Herein, we evaluated both regression and ANN models, and the impact of ethnicity and sex on age prediction, for 333 local blood samples using three loci on the pyrosequencing platform. Subsequently, we trained a one-locus ANN model to reduce the amount of DNA used. We demonstrated that the ANN model has a higher accuracy of age prediction than the regression model. Additionally, we showed that ethnicity did not affect age prediction among local Chinese, Malays and Indians. Although the predicted ages of males were marginally overestimated, sex did not impact the accuracy of age prediction. Lastly, we present a one-locus, dual-CpG model using 25 ng of input DNA that is sufficient for forensic age prediction. In conclusion, the two validated ANN models would be useful for age prediction to provide forensic intelligence leads.


Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1688
Author(s):  
Luqman Ali ◽  
Fady Alnajjar ◽  
Hamad Al Jassmi ◽  
Munkhjargal Gochoo ◽  
Wasif Khan ◽  
...  

This paper proposes a customized convolutional neural network for crack detection in concrete structures. The proposed method is compared to four existing deep learning methods based on training data size, data heterogeneity, network complexity, and the number of epochs. The performance of the proposed convolutional neural network (CNN) model is evaluated and compared to pretrained networks, i.e., the VGG-16, VGG-19, ResNet-50, and Inception V3 models, on eight datasets of different sizes, created from two public datasets. For each model, the evaluation considered computational time, crack localization results, and classification measures, e.g., accuracy, precision, recall, and F1-score. Experimental results demonstrated that training data size and heterogeneity among data samples significantly affect model performance. All models demonstrated promising performance on a limited amount of diverse training data; however, increasing the training data size and reducing diversity reduced generalization performance and led to overfitting. The proposed customized CNN and VGG-16 models outperformed the other methods in terms of classification, localization, and computational time on a small amount of data, and the results indicate that these two models demonstrate superior crack detection and localization for concrete structures.
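The classification measures named above follow directly from the confusion-matrix counts. A minimal sketch for the binary crack / no-crack case is given below; the label vectors are made-up illustrative data, not results from the paper, and the function name is an assumption.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall and F1-score for a binary classifier."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    accuracy = (tp + tn) / len(pairs)
    # Guard against empty denominators when a class is never predicted/present.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Illustrative labels only: 1 = crack, 0 = no crack.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
metrics = classification_metrics(y_true, y_pred)
```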


2021 ◽  
Author(s):  
Sascha Flaig ◽  
Timothy Praditia ◽  
Alexander Kissinger ◽  
Ulrich Lang ◽  
Sergey Oladyshkin ◽  
...  

To prevent possible negative impacts of water abstraction in an ecologically sensitive moor south of Munich (Germany), a “predictive control” scheme is in place. We design an artificial neural network (ANN) to provide predictions of moor water levels and to separate hydrological from anthropogenic effects. As the moor is a dynamic system, we adopt the “long short-term memory” (LSTM) architecture.
To find the best LSTM setup, we train, test and compare LSTMs with two different structures: (1) the non-recurrent one-to-one structure, where the series of inputs is accumulated and fed into the LSTM; and (2) the recurrent many-to-many structure, where inputs gradually enter the LSTM (including LSTM forecasts from previous forecast time steps). The outputs of our LSTMs then feed into a readout layer that converts the hidden states into water level predictions. We hypothesize that the recurrent structure is the better structure because it better resembles the typical structure of differential equations for dynamic systems, as they would usually be used for hydro(geo)logical systems. We evaluate the comparison with the mean squared error as the test metric, and conclude that the recurrent many-to-many LSTM performs better for the analyzed complex situations. It also produces plausible predictions with reasonable accuracy for a seven-day prediction horizon.
Furthermore, we analyze the impact of preprocessing meteorological data to evapotranspiration data using typical ETA models. Inserting knowledge into the LSTM in the form of ETA models (rather than implicitly having the LSTM learn the ETA relations) leads to superior prediction results. This finding aligns well with current ideas on physically-inspired machine learning.
As an additional validation step, we investigate whether our ANN is able to correctly identify both anthropogenic and natural influences and their interaction. To this end, we investigate two comparable pumping events under different meteorological conditions. Results indicate that all individual and combined influences of input parameters on water levels can be represented well. The neural networks recognize correctly that the predominant precipitation and lower evapotranspiration during one pumping event lead to a smaller decrease in the hydrograph.
To further demonstrate the capability of the trained neural network, scenarios of pumping events are created and simulated.
In conclusion, we show that more robust and accurate predictions of moor water levels can be obtained if available physical knowledge of the modeled system is used to design and train the neural network. The artificial neural network can be a useful instrument to assess the impact of water abstraction by quantifying the anthropogenic influence.
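The idea of feeding forecasts from previous time steps back into the model, and scoring the result with the mean squared error, can be sketched without any deep learning library. In the sketch below `toy_step_model` is a hypothetical stand-in for the trained LSTM plus readout layer, and the water levels and driver values are invented for illustration.

```python
def recursive_forecast(step_model, last_level, drivers):
    """Roll a one-step model forward, feeding each forecast back as input.

    `drivers` holds one external input per forecast step (for example a net
    recharge value), so the horizon equals len(drivers).
    """
    forecasts = []
    level = last_level
    for driver in drivers:
        level = step_model(level, driver)  # the forecast becomes the next input
        forecasts.append(level)
    return forecasts


def mean_squared_error(observed, predicted):
    """Average squared difference between observations and predictions."""
    return sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)


def toy_step_model(previous_level, net_recharge):
    # Hypothetical dynamics: the level decays by half and gains the recharge.
    return 0.5 * previous_level + net_recharge


# Seven-step horizon, matching the seven-day horizon mentioned above.
drivers = [0.0, 1.0, 0.0, 0.5, 0.0, 0.0, 0.0]
forecast = recursive_forecast(toy_step_model, last_level=4.0, drivers=drivers)
```

Because each forecast depends on the previous one, errors can accumulate over the horizon, which is one reason the prediction horizon is reported alongside the error metric.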


Author(s):  
Jung-eui Hong ◽  
Cihan H. Dagli ◽  
Kenneth M. Ragsdell

Abstract The primary function of the Wheatstone bridge is to measure an unknown resistance. The elements of this well-known measurement circuit will take on different values depending upon the range and accuracy required for a particular application. The Taguchi approach to parameter design is used to select values for the measurement circuit elements so as to reduce measurement error. Next, we introduce the use of an artificial neural network to extrapolate limited experimental results to predict system response over a wide range of applications. This approach can be employed for on-line quality control of the manufacture of such devices.
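For readers unfamiliar with the circuit, the balance condition of the bridge can be written in a few lines. The sketch assumes a particular labelling that is not given in the abstract: one divider consists of `r1` (top) and `r2` (bottom), the other of `r3` (top) and the unknown `rx` (bottom), and the detector sits between the two midpoints. The resistor values are illustrative.

```python
def bridge_output_voltage(v_in, r1, r2, r3, rx):
    """Voltage between the two divider midpoints; zero when the bridge is balanced."""
    return v_in * (rx / (r3 + rx) - r2 / (r1 + r2))


def unknown_resistance(r1, r2, r3):
    """Value of rx that balances the bridge, from r1 / r2 == r3 / rx."""
    return r2 * r3 / r1


# Illustrative values in ohms and volts (assumed, not from the paper).
rx_balanced = unknown_resistance(r1=100.0, r2=200.0, r3=150.0)
v_out = bridge_output_voltage(5.0, 100.0, 200.0, 150.0, rx_balanced)
```

Tolerances in `r1`, `r2` and `r3` propagate into the estimate of `rx`, which is the kind of measurement error that the choice of circuit element values is meant to reduce.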


2018 ◽  
Vol 215 ◽  
pp. 01011
Author(s):  
Sitti Amalia

This research designs and implements a voice pattern recognition system for spoken numbers with offline pronunciation. Artificial intelligence with the backpropagation algorithm was used in the simulation test. The test was carried out on 100 voice files obtained from 10 speakers, each pronouncing 10 different numbers, 0 to 9. The trials varied artificial neural network parameters such as the tolerance value and the number of neurons. The best result was obtained when the tolerance value was varied and the number of neurons was fixed: with the optimal architecture and network parameters, the percentages obtained were 82.2% for the training data and 53.3% for new data. When the tolerance value was fixed and the number of neurons was varied, the percentages were 82.2% for the training data and 54.4% for new data.


Author(s):  
Hadjira Maouz ◽  
Asma Adda ◽  
Salah Hanini ◽  
...  

The concentration of carbonyl is one of the most important properties contributing to the detection of the thermal aging of the polymer ethylene propylene diene monomer (EPDM). In this publication, an artificial neural network (ANN) model was developed to predict the concentration of carbonyl during the thermal aging of EPDM using a database consisting of seven input variables. The best fit to the training data was obtained with an architecture of 7 input neurons, 10 hidden neurons, and 1 output neuron. A Levenberg-Marquardt (LM) learning algorithm and a hyperbolic tangent transfer function were used at the hidden and output layers. The optimal ANN was obtained with a high correlation coefficient (R = 0.995) and a very low root mean square error (RMSE = 0.0148 mol/l) during the generalization phase. The comparison between the experimental and calculated results shows that the ANN model is able to predict the concentration of carbonyl during the thermal aging of EPDM.
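The two quality measures quoted above, the correlation coefficient R and the RMSE, can be computed from paired measured and predicted values as in the sketch below. The sample numbers are invented for illustration and are not the paper's data.

```python
from math import sqrt


def rmse(measured, predicted):
    """Root mean square error, in the same unit as the data (e.g. mol/l)."""
    squared = [(m - p) ** 2 for m, p in zip(measured, predicted)]
    return sqrt(sum(squared) / len(squared))


def pearson_r(measured, predicted):
    """Pearson correlation coefficient between measured and predicted values."""
    n = len(measured)
    mean_m = sum(measured) / n
    mean_p = sum(predicted) / n
    cov = sum((m - mean_m) * (p - mean_p) for m, p in zip(measured, predicted))
    var_m = sum((m - mean_m) ** 2 for m in measured)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / sqrt(var_m * var_p)


# Illustrative values only.
measured = [1.0, 2.0, 3.0]
r_perfect = pearson_r(measured, [2.0, 4.0, 6.0])
error = rmse(measured, [1.0, 2.0, 5.0])
```

Note that R measures how well the predictions track the measurements linearly, so a model can reach R close to 1 while still being biased; reporting RMSE alongside R covers that case.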

