Application of machine learning methods in big data analytics at management of contracts in the construction industry

The number of experts who realize the importance of big data continues to increase in various fields of the economy. Experts begin to use big data more frequently for the solution of their specific objectives. One of the probable big data tasks in the construction industry is the determination of the probability of contract execution at a stage of its establishment. The contract holder cannot guarantee execution of the contract. Therefore it leads to a lot of risks for the customer. This article is devoted to the applicability of machine learning methods to the task of determination of the probability of a successful contract execution. Authors try to reveal the factors influencing the possibility of contract default and then try to define the following corrective actions for a customer. In the problem analysis, authors used the linear and non-linear algorithms, feature extraction, feature transformation and feature selection. The results of investigation include the prognostic models with a predictive force based on the machine learning algorithms such as logistic regression, decision tree, randomize forest. Authors have validated models on available historical data. The developed models have the potential for practical use in the construction organizations while making new contracts.

Download Full-text

Big Data Analytics of Identifying Geochemical Anomalies Supported by Machine Learning Methods

Natural Resources Research ◽

10.1007/s11053-017-9357-0 ◽

2017 ◽

Vol 27 (1) ◽

pp. 5-13 ◽

Cited By ~ 37

Author(s):

Renguang Zuo ◽

Yihui Xiong

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Learning Methods ◽

Machine Learning Methods ◽

Geochemical Anomalies

Download Full-text

Machine Learning Methods in Precision Medicine Targeting Epigenetic Diseases

Current Pharmaceutical Design ◽

10.2174/1381612824666181112114228 ◽

2019 ◽

Vol 24 (34) ◽

pp. 3998-4006

Author(s):

Shijie Fan ◽

Yu Chen ◽

Cheng Luo ◽

Fanwang Meng

Keyword(s):

Machine Learning ◽

Big Data ◽

Precision Medicine ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Learning Methods ◽

Advantages And Disadvantages ◽

Machine Learning Methods ◽

Accelerated Studies ◽

Applications Of Machine Learning

Background: On a tide of big data, machine learning is coming to its day. Referring to huge amounts of epigenetic data coming from biological experiments and clinic, machine learning can help in detecting epigenetic features in genome, finding correlations between phenotypes and modifications in histone or genes, accelerating the screen of lead compounds targeting epigenetics diseases and many other aspects around the study on epigenetics, which consequently realizes the hope of precision medicine. Methods: In this minireview, we will focus on reviewing the fundamentals and applications of machine learning methods which are regularly used in epigenetics filed and explain their features. Their advantages and disadvantages will also be discussed. Results: Machine learning algorithms have accelerated studies in precision medicine targeting epigenetics diseases. Conclusion: In order to make full use of machine learning algorithms, one should get familiar with the pros and cons of them, which will benefit from big data by choosing the most suitable method(s).

Download Full-text

Usages of Spark Framework with Different Machine Learning Algorithms

Computational Intelligence and Neuroscience ◽

10.1155/2021/1896953 ◽

2021 ◽

Vol 2021 ◽

pp. 1-7

Author(s):

Mohamed Ali Mohamed ◽

Ibrahim Mahmoud El-henawy ◽

Ahmad Salah

Keyword(s):

Machine Learning ◽

Big Data ◽

Internet Of Things ◽

Large Data ◽

Machine Learning Algorithms ◽

Smart Devices ◽

The Internet ◽

Learning Methods ◽

Machine Learning Methods ◽

The Internet Of Things

Sensors, satellites, mobile devices, social media, e-commerce, and the Internet, among others, saturate us with data. The Internet of Things, in particular, enables massive amounts of data to be generated more quickly. The Internet of Things is a term that describes the process of connecting computers, smart devices, and other data-generating equipment to a network and transmitting data. As a result, data is produced and updated on a regular basis to reflect changes in all areas and activities. As a consequence of this exponential growth of data, a new term and idea known as big data have been coined. Big data is required to illuminate the relationships between things, forecast future trends, and provide more information to decision-makers. The major problem at present, however, is how to effectively collect and evaluate massive amounts of diverse and complicated data. In some sectors or applications, machine learning models are the most frequently utilized methods for interpreting and analyzing data and obtaining important information. On their own, traditional machine learning methods are unable to successfully handle large data problems. This article gives an introduction to Spark architecture as a platform that machine learning methods may utilize to address issues regarding the design and execution of large data systems. This article focuses on three machine learning types, including regression, classification, and clustering, and how they can be applied on top of the Spark platform.

Download Full-text

Big Data Technology To Improve The Reliability Of Solutions When Choosing A Contractor

Promyshlennoe i Grazhdanskoe Stroitel stvo ◽

10.33622/0869-7019.2019.10.65-70 ◽

2019 ◽

pp. 65-70

Keyword(s):

Machine Learning ◽

Big Data ◽

Big Data Analytics ◽

Machine Learning Algorithms ◽

Prognostic Models ◽

Modeling And Analysis ◽

Current Economic Situation ◽

Contract Failure ◽

Contract Implementation ◽

The Impact

Considering current economic situation, the level of competition among different companies is great. In order to gain a higher position in the ratings, to attract more new customers, to predict the demand for products, and finally to protect themselves from wrong decisions, companies are increasingly turning to big data analytics. In the sphere of construction an opportunity to foresee the probability of contract implementation before its conclusion is always relevant. The higher the probability, the more attractive the contractor and lower the risks of the customer. Developing the topic of applicability of machine learning methods to the problem of determining the probability of successful completion of the contract, the authors are experimenting with a set of analyzed indicators assessing the impact of each of them on the decision on the possibility of contract failure. The article considers in detail the stages of data preparation for modeling, direct modeling and analysis of the results obtained. The authors tested the adequacy of the models on actual data and set the metrics by which it is possible to customize and improve the models for the needs of a particular organization. The prognostic models with a predictive power, based on machine learning algorithms, such as logistic regression, decision tree, random forest, developed by the authors, have the potential for practical use in construction organizations at the stage of contract conclusion.

Download Full-text

“Big Data” And Machine Learning When Managing Default Contract Risks In Construction Industry

Promyshlennoe i Grazhdanskoe Stroitel stvo ◽

10.33622/0869-7019.2019.05.81-87 ◽

2019 ◽

pp. 81-87

Keyword(s):

Machine Learning ◽

Big Data ◽

Construction Industry ◽

Predictive Power ◽

Machine Learning Algorithms ◽

Successful Completion ◽

Machine Learning Methods ◽

Construction Organizations ◽

Big Data Technologies ◽

Random Decision Forest

Big data technologies are currently being used in various sectors of the economy. In the construction industry, big data technologies can be used to determine the probability of contract execution at the stage of its conclusion. The contractor cannot guarantee the performance of the contract. This imposes certain risks on the customer. The article discusses the applicability of machine learning methods to the problem of determining the probability of successful completion of the contract. An attempt is made to identify the factors influencing the possible failure of the contract and to form corrective actions. To solve this problem, linear and nonlinear algorithms were used, the choice and transformation of classification features was carried out. The result of the study is predictive models with predictive power, which are based on such machine learning algorithms as logistic regression, decision tree, random decision forest, developed by the authors of the article. The adequacy of the models on the data of the official website of the unified information system in the field of procurement was checked. The possibility of using the obtained model to solve the set problem is established, as well as a generalized evaluation of the main results is made. The proposed models have the potential for practical use in construction organizations at the stage of conclusion of contracts.

Download Full-text

Advanced Interpretable Machine Learning Methods for Clinical NGS Big Data of Complex Hereditary Diseases

10.3389/978-2-88966-274-6 ◽

2020 ◽

Keyword(s):

Machine Learning ◽

Big Data ◽

Hereditary Diseases ◽

Learning Methods ◽

Machine Learning Methods ◽

Interpretable Machine Learning

Download Full-text

Identifying Cancer Targets Based on Machine Learning Methods via Chou’s 5-steps Rule and General Pseudo Components

Current Topics in Medicinal Chemistry ◽

10.2174/1568026619666191016155543 ◽

2019 ◽

Vol 19 (25) ◽

pp. 2301-2317 ◽

Cited By ~ 2

Author(s):

Ruirui Liang ◽

Jiayang Xie ◽

Chi Zhang ◽

Mengying Zhang ◽

Hai Huang ◽

...

Keyword(s):

Machine Learning ◽

Growth Rate ◽

Big Data ◽

Human Genome Project ◽

Genome Project ◽

Support Vector ◽

Successful Implementation ◽

Learning Methods ◽

Machine Learning Methods ◽

Vector Machines

In recent years, the successful implementation of human genome project has made people realize that genetic, environmental and lifestyle factors should be combined together to study cancer due to the complexity and various forms of the disease. The increasing availability and growth rate of ‘big data’ derived from various omics, opens a new window for study and therapy of cancer. In this paper, we will introduce the application of machine learning methods in handling cancer big data including the use of artificial neural networks, support vector machines, ensemble learning and naïve Bayes classifiers.

Download Full-text

Systematic literature review of machine learning methods used in the analysis of real-world data for patient-provider decision making

BMC Medical Informatics and Decision Making ◽

10.1186/s12911-021-01403-2 ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Alan Brnabic ◽

Lisa M. Hess

Keyword(s):

Machine Learning ◽

Decision Making ◽

Literature Review ◽

Systematic Literature Review ◽

Real World ◽

Learning Algorithms ◽

External Validation ◽

Machine Learning Algorithms ◽

Learning Methods ◽

Machine Learning Methods

Abstract Background Machine learning is a broad term encompassing a number of methods that allow the investigator to learn from the data. These methods may permit large real-world databases to be more rapidly translated to applications to inform patient-provider decision making. Methods This systematic literature review was conducted to identify published observational research of employed machine learning to inform decision making at the patient-provider level. The search strategy was implemented and studies meeting eligibility criteria were evaluated by two independent reviewers. Relevant data related to study design, statistical methods and strengths and limitations were identified; study quality was assessed using a modified version of the Luo checklist. Results A total of 34 publications from January 2014 to September 2020 were identified and evaluated for this review. There were diverse methods, statistical packages and approaches used across identified studies. The most common methods included decision tree and random forest approaches. Most studies applied internal validation but only two conducted external validation. Most studies utilized one algorithm, and only eight studies applied multiple machine learning algorithms to the data. Seven items on the Luo checklist failed to be met by more than 50% of published studies. Conclusions A wide variety of approaches, algorithms, statistical software, and validation strategies were employed in the application of machine learning methods to inform patient-provider decision making. There is a need to ensure that multiple machine learning approaches are used, the model selection strategy is clearly defined, and both internal and external validation are necessary to be sure that decisions for patient care are being made with the highest quality evidence. Future work should routinely employ ensemble methods incorporating multiple machine learning algorithms.

Download Full-text

Machine-learning based prediction of Cushing’s syndrome in dogs attending UK primary-care veterinary practice

Scientific Reports ◽

10.1038/s41598-021-88440-z ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Imogen Schofield ◽

David C. Brodbelt ◽

Noel Kennedy ◽

Stijn J. M. Niessen ◽

David B. Church ◽

...

Keyword(s):

Machine Learning ◽

Cushing’S Syndrome ◽

Clinical Decision Making ◽

Predictive Performance ◽

Clinical Decision ◽

Cushing's Syndrome ◽

Machine Learning Algorithms ◽

Learning Methods ◽

Machine Learning Methods ◽

Clinical Records

AbstractCushing’s syndrome is an endocrine disease in dogs that negatively impacts upon the quality-of-life of affected animals. Cushing’s syndrome can be a challenging diagnosis to confirm, therefore new methods to aid diagnosis are warranted. Four machine-learning algorithms were applied to predict a future diagnosis of Cushing's syndrome, using structured clinical data from the VetCompass programme in the UK. Dogs suspected of having Cushing's syndrome were included in the analysis and classified based on their final reported diagnosis within their clinical records. Demographic and clinical features available at the point of first suspicion by the attending veterinarian were included within the models. The machine-learning methods were able to classify the recorded Cushing’s syndrome diagnoses, with good predictive performance. The LASSO penalised regression model indicated the best overall performance when applied to the test set with an AUROC = 0.85 (95% CI 0.80–0.89), sensitivity = 0.71, specificity = 0.82, PPV = 0.75 and NPV = 0.78. The findings of our study indicate that machine-learning methods could predict the future diagnosis of a practicing veterinarian. New approaches using these methods could support clinical decision-making and contribute to improved diagnosis of Cushing’s syndrome in dogs.

Download Full-text

A Research on Machine Learning Methods for Big Data Processing

Proceedings of the 4th International Conference on Information Technology and Management Innovation ◽

10.2991/icitmi-15.2015.155 ◽

2015 ◽

Author(s):

Junfei Qiu ◽

Youming Sun

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Processing ◽

Learning Methods ◽

Big Data Processing ◽

Machine Learning Methods

Download Full-text