Image-based taxonomic classification of bulk biodiversity samples using deep learning and domain adaptation

Complex bulk samples of invertebrates from biodiversity surveys present a great challenge for taxonomic identification, especially if obtained from unexplored ecosystems. High-throughput imaging combined with machine learning for rapid classification could overcome this bottleneck. Developing such procedures requires that taxonomic labels from an existing source data set are used for model training and prediction of an unknown target sample. Yet the feasibility of transfer learning for the classification of unknown samples remains to be tested. Here, we assess the efficiency of deep learning and domain transfer algorithms for family-level classification of below-ground bulk samples of Coleoptera from understudied forests of Cyprus. We trained neural network models with images from local surveys versus global databases of above-ground samples from tropical forests and evaluated how prediction accuracy was affected by: (a) the quality and resolution of images, (b) the size and complexity of the training set and (c) the transferability of identifications across very disparate source-target pairs that do not share any species or genera. Within-dataset classification accuracy reached 98% and depended on the number and quality of training images and on dataset complexity. The accuracy of between-datasets predictions was reduced to a maximum of 82% and depended greatly on the standardisation of the imaging procedure. When the source and target images were of similar quality and resolution, albeit from different faunas, the reduction of accuracy was minimal. Application of algorithms for domain adaptation significantly improved the prediction performance of models trained by non-standardised, low-quality images. Our findings demonstrate that existing databases can be used to train models and successfully classify images from unexplored biota, when the imaging conditions and classification algorithms are carefully considered. Also, our results provide guidelines for data acquisition and algorithmic development for high-throughput image-based biodiversity surveys.

Download Full-text

A Profile Analysis of User Interaction in Social Media Using Deep Learning

Traitement du signal ◽

10.18280/ts.380101 ◽

2021 ◽

Vol 38 (1) ◽

pp. 1-11

Author(s):

Hafzullah İş ◽

Taner Tuncer

Keyword(s):

Social Networks ◽

Social Media ◽

Deep Learning ◽

Profile Analysis ◽

User Interaction ◽

Network Models ◽

User Profiles ◽

Success Rates ◽

Neural Network Models ◽

Data Set

It is highly important to detect malicious account interaction in social networks with regard to political, social and economic aspects. This paper analyzed the profile structure of social media users using their data interactions. A total of 10 parameters including diameter, density, reciprocity, centrality and modularity were used to comprehensively characterize the interactions of Twitter users. Moreover, a new data set was formed by visualizing the data obtained with these parameters. User profiles were classified using Convolutional Neural Network models with deep learning. Users were divided into active, passive and malicious classes. Success rates for the algorithms used in the classification were estimated based on the hyper parameters and application platforms. The best model had a success rate of 98.67%. The methodology demonstrated that Twitter user profiles can be classified successfully through user interaction-based parameters. It is expected that this paper will contribute to published literature in terms of behavioral analysis and the determination of malicious accounts in social networks.

Download Full-text

Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters

Deep Learning and Neural Networks ◽

10.4018/978-1-7998-0414-7.ch017 ◽

2020 ◽

pp. 274-294

Author(s):

Yilin Yan ◽

Min Chen ◽

Saad Sadiq ◽

Mei-Ling Shyu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Imbalanced Data ◽

Network Models ◽

Multimedia Data ◽

Neural Network Models ◽

Minority Class ◽

Imbalanced Data Classification

The classification of imbalanced datasets has recently attracted significant attention due to its implications in several real-world use cases. The classifiers developed on datasets with skewed distributions tend to favor the majority classes and are biased against the minority class. Despite extensive research interests, imbalanced data classification remains a challenge in data mining research, especially for multimedia data. Our attempt to overcome this hurdle is to develop a convolutional neural network (CNN) based deep learning solution integrated with a bootstrapping technique. Considering that convolutional neural networks are very computationally expensive coupled with big training datasets, we propose to extract features from pre-trained convolutional neural network models and feed those features to another full connected neutral network. Spark implementation shows promising performance of our model in handling big datasets with respect to feasibility and scalability.

Download Full-text

Arabic text summarization using deep learning approach

Journal Of Big Data ◽

10.1186/s40537-020-00386-7 ◽

2020 ◽

Vol 7 (1) ◽

Author(s):

Molham Al-Maleh ◽

Said Desouki

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Language Processing ◽

Network Models ◽

Arabic Language ◽

Text Summarization ◽

Neural Network Models ◽

Data Set ◽

Learning Techniques ◽

Arabic Text Summarization

AbstractNatural language processing has witnessed remarkable progress with the advent of deep learning techniques. Text summarization, along other tasks like text translation and sentiment analysis, used deep neural network models to enhance results. The new methods of text summarization are subject to a sequence-to-sequence framework of encoder–decoder model, which is composed of neural networks trained jointly on both input and output. Deep neural networks take advantage of big datasets to improve their results. These networks are supported by the attention mechanism, which can deal with long texts more efficiently by identifying focus points in the text. They are also supported by the copy mechanism that allows the model to copy words from the source to the summary directly. In this research, we are re-implementing the basic summarization model that applies the sequence-to-sequence framework on the Arabic language, which has not witnessed the employment of this model in the text summarization before. Initially, we build an Arabic data set of summarized article headlines. This data set consists of approximately 300 thousand entries, each consisting of an article introduction and the headline corresponding to this introduction. We then apply baseline summarization models to the previous data set and compare the results using the ROUGE scale.

Download Full-text

Deep-Learning-Based Automated Sedimentary Geometry Characterization From Borehole Images

Petrophysics – The SPWLA Journal of Formation Evaluation and Reservoir Description ◽

10.30632/pjv62n6-2020a4 ◽

2021 ◽

Vol 62 (6) ◽

pp. 636-650

Author(s):

Marie Lefranc ◽

◽

Zikri Bayraktar ◽

Morten Kristensen ◽

Hedi Driss ◽

...

Keyword(s):

Deep Learning ◽

Flow Direction ◽

Synthetic Data ◽

Network Models ◽

Training Data ◽

Proper Class ◽

Neural Network Models ◽

Data Set ◽

Sedimentary Structures ◽

Automated Method

Sedimentary geometry on borehole images usually summarizes the arrangement of bed boundaries, erosive surfaces, crossbedding, sedimentary dip, and/or deformed beds. The interpretation, very often manual, requires a good level of expertise, is time consuming, can suffer from user bias, and becomes very challenging when dealing with highly deviated wells. Bedform geometry interpretation from crossbed data is rarely completed from a borehole image. The purpose of this study is to develop an automated method to interpret sedimentary structures, including the bedform geometry resulting from the change in flow direction from borehole images. Automation is achieved in this unique interpretation methodology using deep learning (DL). The first task comprised the creation of a training data set of 2D borehole images. This library of images was then used to train deep neural network models. Testing different architectures of convolutional neural networks (CNN) showed the ResNet architecture to give the best performance for the classification of the different sedimentary structures. The validation accuracy was very high, in the range of 93 to 96%. To test the developed method, additional logs of synthetic data were created as sequences of different sedimentary structures (i.e., classes) associated with different well deviations, with the addition of gaps. The model was able to predict the proper class in these composite logs and highlight the transitions accurately.

Download Full-text

Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters

International Journal of Multimedia Data Engineering and Management ◽

10.4018/ijmdem.2017010101 ◽

2017 ◽

Vol 8 (1) ◽

pp. 1-20 ◽

Cited By ~ 15

Author(s):

Yilin Yan ◽

Min Chen ◽

Saad Sadiq ◽

Mei-Ling Shyu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Imbalanced Data ◽

Network Models ◽

Multimedia Data ◽

Neural Network Models ◽

Minority Class ◽

Imbalanced Data Classification

Download Full-text

Deep learning and complex network theory based analysis on socialized manufacturing resources utilisations and an application case study

Concurrent Engineering ◽

10.1177/1063293x211003194 ◽

2021 ◽

pp. 1063293X2110031

Author(s):

Maolin Yang ◽

Auwal H Abubakar ◽

Pingyu Jiang

Keyword(s):

Deep Learning ◽

Complex Network ◽

Network Models ◽

Neural Network Models ◽

Complex Network Theory ◽

New Type ◽

Manufacturing Resource ◽

Manufacturing Resources ◽

Resource Characteristics

Social manufacturing is characterized by its capability of utilizing socialized manufacturing resources to achieve value adding. Recently, a new type of social manufacturing pattern emerges and shows potential for core factories to improve their limited manufacturing capabilities by utilizing the resources from outside socialized manufacturing resource communities. However, the core factories need to analyze the resource characteristics of the socialized resource communities before making operation plans, and this is challenging due to the unaffiliated and self-driven characteristics of the resource providers in socialized resource communities. In this paper, a deep learning and complex network based approach is established to address this challenge by using socialized designer community for demonstration. Firstly, convolutional neural network models are trained to identify the design resource characteristics of each socialized designer in designer community according to the interaction texts posted by the socialized designer on internet platforms. During the process, an iterative dataset labelling method is established to reduce the time cost for training set labelling. Secondly, complex networks are used to model the design resource characteristics of the community according to the resource characteristics of all the socialized designers in the community. Two real communities from RepRap 3D printer project are used as case study.

Download Full-text

Classification of Germination Images of Pear Pollen Using Random Forest and Convolution Neural Network Models

IEEE Access ◽

10.1109/access.2021.3067677 ◽

2021 ◽

Vol 9 ◽

pp. 45993-45999

Author(s):

Ung Yang ◽

Seungwon Oh ◽

Seung Gon Wi ◽

Bok-Rye Lee ◽

Sang-Hyun Lee ◽

...

Keyword(s):

Neural Network ◽

Random Forest ◽

Network Models ◽

Convolution Neural Network ◽

Neural Network Models

Download Full-text

RISC-V Virtual Platform-Based Convolutional Neural Network Accelerator Implemented in SystemC

Electronics ◽

10.3390/electronics10131514 ◽

2021 ◽

Vol 10 (13) ◽

pp. 1514

Author(s):

Seung-Ho Lim ◽

WoonSik William Suh ◽

Jin-Young Kim ◽

Sang-Young Cho

Keyword(s):

Neural Network ◽

Deep Learning ◽

Network Model ◽

Neural Network Model ◽

Deep Neural Network ◽

System Level ◽

Neural Network Models ◽

Data Set ◽

Embedded Device ◽

Virtual Platform

The optimization for hardware processor and system for performing deep learning operations such as Convolutional Neural Networks (CNN) in resource limited embedded devices are recent active research area. In order to perform an optimized deep neural network model using the limited computational unit and memory of an embedded device, it is necessary to quickly apply various configurations of hardware modules to various deep neural network models and find the optimal combination. The Electronic System Level (ESL) Simulator based on SystemC is very useful for rapid hardware modeling and verification. In this paper, we designed and implemented a Deep Learning Accelerator (DLA) that performs Deep Neural Network (DNN) operation based on the RISC-V Virtual Platform implemented in SystemC in order to enable rapid and diverse analysis of deep learning operations in an embedded device based on the RISC-V processor, which is a recently emerging embedded processor. The developed RISC-V based DLA prototype can analyze the hardware requirements according to the CNN data set through the configuration of the CNN DLA architecture, and it is possible to run RISC-V compiled software on the platform, can perform a real neural network model like Darknet. We performed the Darknet CNN model on the developed DLA prototype, and confirmed that computational overhead and inference errors can be analyzed with the DLA prototype developed by analyzing the DLA architecture for various data sets.

Download Full-text

Classification of binary document images into textual or nontextual data blocks using neural network models

Machine Vision and Applications ◽

10.1007/bf01211490 ◽

1995 ◽

Vol 8 (5) ◽

pp. 289-304 ◽

Cited By ~ 12

Author(s):

Daniel X. Le ◽

George R. Thoma ◽

Harry Wechsler

Keyword(s):

Neural Network ◽

Network Models ◽

Document Images ◽

Neural Network Models

Download Full-text

Feature extraction-based image steganalysis using deep learning

WEENTECH Proceedings in Energy ◽

10.32438/wpe.182021 ◽

2021 ◽

pp. 188-198

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Secure Communication ◽

Information Technologies ◽

Network Models ◽

Error Rates ◽

Multimedia Data ◽

Secret Message ◽

Neural Network Models

The innovations in advanced information technologies has led to rapid delivery and sharing of multimedia data like images and videos. The digital steganography offers ability to secure communication and imperative for internet. The image steganography is essential to preserve confidential information of security applications. The secret image is embedded within pixels. The embedding of secret message is done by applied with S-UNIWARD and WOW steganography. Hidden messages are reveled using steganalysis. The exploration of research interests focused on conventional fields and recent technological fields of steganalysis. This paper devises Convolutional neural network models for steganalysis. Convolutional neural network (CNN) is one of the most frequently used deep learning techniques. The Convolutional neural network is used to extract spatio-temporal information or features and classification. We have compared steganalysis outcome with AlexNet and SRNeT with same dataset. The stegnalytic error rates are compared with different payloads.

Download Full-text