Combining Supervised Learning Techniques to Key-Phrase Extraction for Biomedical Full-Text

Key-phrase extraction plays a useful a role in research areas of Information Systems (IS) like digital libraries. Short metadata like key phrases are beneficial for searchers to understand the concepts found in the documents. This paper evaluates the effectiveness of different supervised learning techniques on biomedical full-text: Sequential Minimal Optimization (SMO) and K-Nearest Neighbor, both of which could be embedded inside an information system for document search. The authors use these techniques to extract key phrases from PubMed and evaluate the performance of these systems using the holdout validation method. This paper compares different classifier techniques and performance differences between the full-text and it’s abstract. Compared with the authors’ previous work, which investigated the performance of Naïve Bayes, Linear Regression and SVM(reg1/2), this paper finds that SVMreg-1 performs best in key-phrase extraction for full-text, whereas Naïve Bayes performs best for abstracts. These techniques should be considered for use in information system search functionality. Additional research issues also are identified.

Download Full-text

Combining Supervised Learning Techniques to Key-Phrase Extraction for Biomedical Full-Text

Organizational Efficiency through Intelligent Information Technologies ◽

10.4018/978-1-4666-2047-6.ch003 ◽

2012 ◽

pp. 33-44

Author(s):

Yanliang Qi ◽

Min Song ◽

Suk-Chung Yoon ◽

Lori deVersterre

Keyword(s):

Information System ◽

Supervised Learning ◽

Full Text ◽

Naive Bayes ◽

Naïve Bayes ◽

Phrase Extraction ◽

Learning Techniques ◽

Key Phrase Extraction ◽

And Performance ◽

Key Phrases

Download Full-text

Sentiment Analysis using various Machine Learning and Deep Learning Techniques

Journal of the Nigerian Society of Physical Sciences ◽

10.46481/jnsps.2021.308 ◽

2021 ◽

pp. 385-394

Author(s):

V Umarani ◽

A Julian ◽

J Deepa

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

Analysis Process ◽

Learning Techniques

Sentiment analysis has gained a lot of attention from researchers in the last year because it has been widely applied to a variety of application domains such as business, government, education, sports, tourism, biomedicine, and telecommunication services. Sentiment analysis is an automated computational method for studying or evaluating sentiments, feelings, and emotions expressed as comments, feedbacks, or critiques. The sentiment analysis process can be automated using machine learning techniques, which analyses text patterns faster. The supervised machine learning technique is the most used mechanism for sentiment analysis. The proposed work discusses the flow of sentiment analysis process and investigates the common supervised machine learning techniques such as multinomial naive bayes, Bernoulli naive bayes, logistic regression, support vector machine, random forest, K-nearest neighbor, decision tree, and deep learning techniques such as Long Short-Term Memory and Convolution Neural Network. The work examines such learning methods using standard data set and the experimental results of sentiment analysis demonstrate the performance of various classifiers taken in terms of the precision, recall, F1-score, RoC-Curve, accuracy, running time and k fold cross validation and helps in appreciating the novelty of the several deep learning techniques and also giving the user an overview of choosing the right technique for their application.

Download Full-text

Klasifikasi Metagenom dengan Metode Naïve Bayes Classifier

Jurnal Ilmu Komputer dan Agri-Informatika ◽

10.29244/jika.3.1.9-17 ◽

2017 ◽

Vol 3 (1) ◽

pp. 9

Author(s):

Dian Kartika Utami ◽

Wisnu Ananta Kusuma ◽

Agus Buono

Keyword(s):

Supervised Learning ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

Studi metagenom merupakan langkah penting pada pengelompokan taksonomi. Pengelompokan pada metagenom dapat dilakukan dengan menggunakan metode binning. Binning diperlukan untuk mengelompokkan contigs yang dimiliki oleh masing-masing kelompok spesies filogenetik. Pada penelitian ini, binning dilakukan dengan menggunakan pendekatan komposisi berdasarkan supervised learning (pembelajaran dengan contoh). Metode supervised learning yang digunakan yaitu Naïve Bayes Classifier. Adapun metode yang digunakan untuk ekstraksi ciri adalah dengan melakukan perhitungan frekuensi k-mer. Klasifikasi pada metagenom dilakukan berdasarkan tingkat takson genus. Dari proses klasifikasi yang dilakukan, akurasi yang diperoleh dengan menggunakan fragmen pendek (400 bp) adalah 49.34 % untuk ekstraksi ciri 3-mer dan 53.95 % untuk ekstrasi ciri 4-mer. Sementara itu, untuk fragmen panjang (10 kbp), akurasi mengalami peningkatan yaitu 82.23 % untuk ekstraksi ciri 3-mer dan 85.89 % untuk esktraski ciri 4-mer. Dari hasil tersebut dapat disimpulkan bahwa akurasi semakin tinggi seiring dengan semakin panjangnya ukuran fragmen. Selain itu, penelitian ini juga menyimpulkan bahwa metode ekstrasi ciri yang memberikan hasil paling maksimal adalah dengan menggunakan ekstraksi ciri 4-mer.<br /><br />Kata Kunci: metagenom, k-mer, Naïve Bayes Classifier, binning, klasifikasi

Download Full-text

Comparison of Various Classification Techniques for Prediction of the Agriculture Production Based on Different Parameters Rainfall, Temperature

Journal of University of Shanghai for Science and Technology ◽

10.51201/jusst/21/04247 ◽

2021 ◽

Vol 23 (04) ◽

pp. 356-372

Author(s):

Manpreet Kaur ◽

◽

Dr. Dinesh Kumar ◽

Keyword(s):

Naive Bayes ◽

Confusion Matrix ◽

Big Data Analysis ◽

Naïve Bayes ◽

The Other ◽

Machine Learning Techniques ◽

True Positive ◽

True Negative ◽

Classification Techniques ◽

Learning Techniques

The classification techniques based on various machine learning techniques are having use for the Big data analysis. This will be useful in identifying the classification and then finally the prediction which will be useful for the decision managers for having quality decisions. There are various types of supervised and unsupervised learning techniques which are having capabilities in the terms of driving the analysis. This analysis will be useful for having identification of relationship between the various attributes which is required to device the analysis. There are various supervised learning techniques which are useful to drive the analysis. These techniques are SVM, Logistic regression, KNN, Naïve Bayes, Tree, Neural network. The relative comparison of this technique is done in the terms of various parameters for example AUC, CA, F1, Recall and precision. The accuracy in the terms of AUC, CA is highest for the Naïve Bayes. This shows the Naïve Bayes is having higher true positives, true negative ratio. The proposed technique is having higher accuracy of 81% which is far above than all the remaining techniques. The confusion matrix for the Naïve Bayes is having true positive count as 729, true negative at 103. This shows that the true positive and true negative count is far above for this technique compared to the other techniques.

Download Full-text

Twitter Sentiment Analysis using Machine Learning Techniques

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.c6281.029320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 4205-4209

Keyword(s):

Logistic Regression ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Machine Learning Techniques ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Learning Techniques ◽

Social Media Platforms

Nowadays people share their views and opinions in twitter and other social media platforms, the way of recognizing sentiments and speculation in tweets is Twitter Sentiment Analysis. Determining the contradiction or sentiment of the tweets and then listing them into positive, negative and neutral tweets is the main classifying step in this process. The issue related to sentiment analysis is the naming of the correct congruous sentiment classifier algorithm to list the tweets. The foundation classifier techniques like Logistic regression, Naive Bayes classifier, Random Forest and SVMs are normally used. In this paper, the Naïve Bayes classifier and Logistic Regression has been used to perform sentiment analysis and classify based on the better accuracy of catagorizing Technique. The outcome shows that Naive Bayes classifier works better for this approach. Data pre-processing and feature extraction is realized as a portion of task.

Download Full-text

Sentiment Analysis of Tweets on the COVID-19 Pandemic Using Machine Learning Techniques

Handbook of Research on Innovations and Applications of AI, IoT, and Cognitive Technologies - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-7998-6870-5.ch021 ◽

2021 ◽

pp. 310-320

Author(s):

Jothikumar R. ◽

Vijay Anand R. ◽

Visu P. ◽

Kumar R. ◽

Susi S. ◽

...

Keyword(s):

Machine Learning ◽

Decision Tree ◽

Respiratory Tract ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Machine Learning Techniques ◽

Respiratory Tract Diseases ◽

Thought Processes ◽

Learning Techniques

Sentiment evaluation alludes to separate the sentiments from the characteristic language and to perceive the mentality about the exact theme. Novel corona infection, a harmful malady ailment, is spreading out of the blue through the quarter, which thought processes respiratory tract diseases that can change from gentle to extraordinary levels. Because of its quick nature of spreading and no conceived cure, it ushered in a vibe of stress and pressure. In this chapter, a framework perusing principally based procedure is utilized to discover the musings of the tweets related to COVID and its effect lockdown. The chapter examines the tweets identified with the hash tags of crown infection and lockdown. The tweets were marked fabulous, negative, or fair, and a posting of classifiers has been utilized to investigate the precision and execution. The classifiers utilized have been under the four models which incorporate decision tree, regression, helpful asset vector framework, and naïve Bayes forms.

Download Full-text

Classification of multi-lingual tweets, into multi-class model using Naïve Bayes and semi-supervised learning

Multimedia Tools and Applications ◽

10.1007/s11042-020-09512-2 ◽

2020 ◽

Vol 79 (43-44) ◽

pp. 32749-32767

Author(s):

Ayaz H. Khan ◽

Muhammad Zubair

Keyword(s):

Supervised Learning ◽

Naive Bayes ◽

Naïve Bayes ◽

Class Model

Download Full-text

Chronic Kidney Disease using Machine Learning Techniques

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.f3359.059720 ◽

2020 ◽

Vol 9 (7) ◽

pp. 683-`686

Keyword(s):

Kidney Disease ◽

Naive Bayes ◽

Naïve Bayes ◽

Machine Learning Techniques ◽

Disease Area ◽

Course Of Action ◽

Learning Techniques ◽

Probabilistic Classifier ◽

Stage Renal Disease ◽

End Stage

Interminable Kidney Disease (CKD) proposes the realm of kidney chance which may even crumble by means of time and through implying the factors. If it continues finishing all the more dreadful Dialysis is and most desperate conclusive outcomes believable it'd flash off kidney misery (End-Stage Renal Disease). Area of CKD in a starting period should help in filtering by means of the complexities and harm.In the pastwork portrayal applied are SVM and Naïve Bayes, it happened that the execution time took by methods for Naïve Bayes is irrelevant appeared differently in relation to SVM, confused events are substantially less with SVM that results in less request execution of Naïve Bayes, inferable from gentle exactness distinction. It can be corrected by methods for taking less improvements. Unsuspecting Bayes is a probabilistic classifier a fundamental count by utilizing Bayes Theorem with a prohibitive independence supposition. The artistic creations for the most segment brings around growing symptomatic exactness and decrease commitment time, this is the guideline factor. An undertaking is made to develop a form evaluating CKD data collected from a particular course of action of people. From the model data, recognizing verification should be conceivable. This work has enchanted on developing up a system relying upon gathering procedures: SVM, Naïve Bayes, glomerular filtration rate (GFR) is the best pointer of how well the kidneys are working.CKD has got no cure but it can be treated based on symptoms to reduce complicationsand

Download Full-text

Peningkatan Performa Pendeteksian Anomali Menggunakan Ensemble Learning dan Feature Selection

Creative Information Technology Journal ◽

10.24076/citec.2020v7i1.238 ◽

2021 ◽

Vol 7 (1) ◽

pp. 1

Author(s):

Ripto Sudiyarno ◽

Arief Setyanto ◽

Emha Taufiq Luthfi

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Ensemble Learning ◽

Naive Bayes ◽

Confusion Matrix ◽

Naïve Bayes ◽

Machine Learning Techniques ◽

Detection Systems ◽

Learning Techniques ◽

Performance Results

Intrusion detection systems (IDS) atau Sistem pendeteksian intrusi dikenal sebagai teknik yang sangat menonjol dan terkemuka untuk menemukan malicious activities pada jaringan komputer, tidak seperti firewall konvensional, IDS berbeda dalam hal pengidentifikasian serangan secara cerdas dengan pendekatan analitik seperti data mining dan teknik machine learning. Dalam beberapa dekade terakhir, ensemble learning sangat memajukan penelitian pada machine learning dan klasifikasi pola, serta menunjukan peningkatan hasil kinerja dibandingkan single classifier. Pada Penelitian ini dilakukan percobaan peningkatan nilai akurasi terhadap sistem pendeteksian anomali, pertama dilakukan klasifikasi menggunakan single classifier untuk didapati hasil nilai akurasi yang nantinya dibandingkan dengan hasil dari ensemble learning dan feature selection. Penggunaan ensemble learning bertujuan untuk mendapatkan nilai akurasi yang terbaik dari single classifier. Hasil didapatkan dari nilai confusion matrix dan akan dilakukan pengujian dengan cara membandingkan nilai kedua metode diatas. Penelitian berhasil mendapatkan nilai akurasi single classifier (naïve bayes) yaitu 77,4% dan nilai ensemble learning 96,8%. Kata Kunci— ensemble learning, nsl-kdd, naïve bayes, anomali, feature selectionIntrusion detection systems (IDS) are known as very prominent and leading techniques for finding malicious activities on computer networks, unlike conventional firewalls, IDS differs in terms of identifying attacks intelligently with analytic approaches such as machine learning techniques. In the last few decades, ensemble learning has greatly advanced research in machine learning and pattern classification it has shown an improve in performance results compared to a single classifier. In this study an attempt was made to increase the accuracy of anomalous detection systems, first by classification using a single classifier to find the results of accuracy which will be compared with the results of ensemble learning and feature selection. The use of ensemble learning aims to get the best accuracy value from a single classifier. The results are obtained from the value of the confusion matrix and will be tested by comparing the values of the two methods above. The research succeeded in getting a single classifier accuracy value of 77,4% and ensemble learning 96,8%. Keywords— ensemble learning, nsl-kdd, naïve bayes, anomali, feature selection

Download Full-text

Identification of Violent Response with Stretch Sensor Data from a Smart-Jacket using Naïve Bayes Algorithm

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.a9244.119119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 5265-5270

Keyword(s):

Supervised Learning ◽

Naive Bayes ◽

Learning Algorithm ◽

Pressure Sensors ◽

Naïve Bayes ◽

Sensor Data ◽

Body Movements ◽

Bayes Algorithm ◽

Do So

In this paper, a smart-jacket using stretch sensors, pressure sensors was built for purpose of generating body-movements data and in order to record different kinds of signals and the distribution of the same on the jacket. Every degree of motion, when exercised, generates voltage changes in the stretch sensors as it is its property to do so. This data is collected in a flora chip set, which is Arduino based. The collected data is processed, pruned and filtered for outliers. This paper concerns with a supervised learning algorithm called Naive Bayes, which is applied over independent datasets, meaning one set of observation has no direct relations to each other. The placement of sensor are on the shoulders and elbows and the responses from each are independent of each other. Using Naive Bayes, the date has been classified for the violent response and the normal action.

Download Full-text