Forecasting Software Vulnerabilities Using Time-Series Techniques

Author(s):  
Baidyanath Biswas

This chapter discusses time-series applications and forecasting in the context of information systems security. The primary objective in such a formulation is to train the models and then use them for efficient prediction. Although economic and financial forecasting problems use time-series extensively, predicting software vulnerabilities is a novel idea. The chapter also provides guidelines for implementing and adapting univariate time-series methods for information security. To achieve this, the authors focus on the following techniques: autoregressive (AR), moving average (MA), autoregressive integrated moving average (ARIMA), and exponential smoothing. The analysis considers a unique data set of publicly exposed software vulnerabilities, available from the U.S. Dept. of Homeland Security. The problem is presented first, followed by a general framework to identify the problem and estimate the best-fit parameters of the model. The chapter concludes with an illustrative example from the above dataset to familiarize readers with the business problem.
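Of the techniques listed above, exponential smoothing is the simplest to write out. The following is a minimal sketch of simple (single) exponential smoothing, not the chapter's implementation; the function name, the smoothing weight `alpha`, and the monthly counts are all hypothetical and chosen only for illustration.

```python
def simple_exponential_smoothing(series, alpha):
    """Return the smoothed levels; the last level is the one-step-ahead forecast."""
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    if not series:
        raise ValueError("series must be non-empty")
    level = series[0]  # assumption: initialise the level with the first observation
    levels = [level]
    for y in series[1:]:
        # New level = weighted mix of the latest observation and the previous level.
        level = alpha * y + (1.0 - alpha) * level
        levels.append(level)
    return levels


# Hypothetical monthly vulnerability counts (illustrative values, not real data).
monthly_counts = [120, 135, 128, 150, 160, 155]
levels = simple_exponential_smoothing(monthly_counts, alpha=0.5)
forecast_next = levels[-1]
print(forecast_next)
```

A larger `alpha` makes the forecast follow recent observations more closely, while a smaller one smooths more heavily.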

2020 ◽  
Vol 5 (1) ◽  
pp. 374
Author(s):  
Pauline Jin Wee Mah ◽  
Nur Nadhirah Nanyan

The main purpose of this study is to compare the performance of univariate and bivariate models on four time series variables of the crude palm oil industry in Peninsular Malaysia. The monthly data for the four variables, namely crude palm oil production, price, import and export, were obtained from the Malaysian Palm Oil Board (MPOB) and the Malaysian Palm Oil Council (MPOC). In the first part of this study, univariate time series models, namely the autoregressive integrated moving average (ARIMA), fractionally integrated autoregressive moving average (ARFIMA) and autoregressive autoregressive (ARAR) algorithm, were used for modelling and forecasting. Subsequently, the dependence between any two of the four variables was checked using the residuals' sample cross-correlation functions before modelling the bivariate time series. Transfer function models were used to model the bivariate time series and make predictions. The forecast accuracy criteria used to evaluate the models were the mean absolute error (MAE), root mean square error (RMSE) and mean absolute percentage error (MAPE). The univariate results showed that the best model for predicting production was ARIMA, while the ARAR algorithm was the best forecast model for both the import and export of crude palm oil. However, ARIMA appeared to be the best forecast model for price based on the MAE and MAPE values, while ARFIMA emerged as the best model based on the RMSE value. For the bivariate time series models, production was dependent on import, while export was dependent on either price or import. The results showed that the bivariate models performed better than the univariate models for production and export of crude palm oil based on the forecast accuracy criteria used.
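The three accuracy criteria named above have standard definitions, sketched below in plain Python. The function names and the sample values are illustrative assumptions and do not come from the study.

```python
import math


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean square error."""
    return math.sqrt(
        sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    )


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (undefined if any actual is 0)."""
    return 100.0 * sum(
        abs((a - p) / a) for a, p in zip(actual, predicted)
    ) / len(actual)


# Hypothetical hold-out values and forecasts (illustrative only).
actual = [100.0, 200.0, 400.0]
predicted = [110.0, 190.0, 380.0]
print(mae(actual, predicted), rmse(actual, predicted), mape(actual, predicted))
```

Because RMSE squares the errors, it penalises large misses more than MAE or MAPE do, which is one reason different criteria can favour different models, as in the price series above.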


Energies ◽  
2020 ◽  
Vol 14 (1) ◽  
pp. 141
Author(s):  
Jacob Hale ◽  
Suzanna Long

Energy portfolios are overwhelmingly dependent on fossil fuel resources that perpetuate the consequences associated with climate change. Therefore, it is imperative to transition to more renewable alternatives to limit further harm to the environment. This study presents a univariate time series prediction model that evaluates sustainability outcomes of partial energy transitions. Future electricity generation at the state-level is predicted using exponential smoothing and autoregressive integrated moving average (ARIMA). The best prediction results are then used as an input for a sustainability assessment of a proposed transition by calculating carbon, water, land, and cost footprints. Missouri, USA was selected as a model testbed due to its dependence on coal. Of the time series methods, ARIMA exhibited the best performance and was used to predict annual electricity generation over a 10-year period. The proposed transition consisted of a one-percent annual decrease of coal’s portfolio share to be replaced with an equal share of solar and wind supply. The sustainability outcomes of the transition demonstrate decreases in carbon and water footprints but increases in land and cost footprints. Decision makers can use the results presented here to better inform strategic provisioning of critical resources in the context of proposed energy transitions.


2019 ◽  
Vol 2 (1) ◽  
pp. 25-44 ◽  
Author(s):  
S. Mohanasundaram ◽  
G. Suresh Kumar ◽  
Balaji Narasimhan

Groundwater level prediction and forecasting using univariate time series models are useful for effective groundwater management under data-limited conditions. Seasonal autoregressive integrated moving average (SARIMA) models are widely used for modeling groundwater level data because groundwater level signals exhibit a seasonal pattern. Alternatively, deseasonalized autoregressive and moving average (Ds-ARMA) models can be fitted to deseasonalized groundwater level signals, in which the seasonal component is estimated and removed from the raw signals. The seasonal component is traditionally estimated as the long-term average of the values for each month of the year. This traditional estimate may not be appropriate for non-stationary groundwater level signals. Thus, in this study, an improved way of estimating the seasonal component, using a 13-month moving average trend and a corresponding confidence interval approach, has been attempted. To test the proposed approach, two representative observation wells from the Adyar basin, India were modeled by both the traditional and the proposed methods. The proposed model's prediction performance was better than the traditional model's, with R² values of 0.82 and 0.93 for the two wells' groundwater level data.
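The abstract does not give the details of the proposed procedure, so the following is only a rough sketch of the 13-month moving average idea: a centered 13-month average serves as the trend, and the seasonal component is taken as the mean detrended value for each month. The function names are hypothetical, the series is assumed to start at month index 0, and the confidence interval step is not reproduced.

```python
def centered_moving_average(series, window=13):
    """Centered moving average; positions without a full window are None."""
    if window % 2 == 0:
        raise ValueError("window must be odd so it can be centered")
    half = window // 2
    trend = [None] * len(series)
    for i in range(half, len(series) - half):
        trend[i] = sum(series[i - half:i + half + 1]) / window
    return trend


def monthly_seasonal_component(series, trend, period=12):
    """Mean detrended value for each position in the seasonal cycle."""
    buckets = [[] for _ in range(period)]
    for i, (y, t) in enumerate(zip(series, trend)):
        if t is not None:
            buckets[i % period].append(y - t)  # deviation from the local trend
    return [sum(b) / len(b) if b else None for b in buckets]


# Hypothetical monthly groundwater levels: a pure linear trend, so the
# estimated seasonal component should be zero for every month.
levels = [float(t) for t in range(25)]
trend = centered_moving_average(levels, window=13)
seasonal = monthly_seasonal_component(levels, trend)
print(trend[6], seasonal[0])
```

Subtracting a local trend before averaging by month keeps a drifting water table from leaking into the seasonal estimate, which is the concern the abstract raises about non-stationary signals.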


MAUSAM ◽  
2021 ◽  
Vol 68 (2) ◽  
pp. 349-356
Author(s):  
J. HAZARIKA ◽  
B. PATHAK ◽  
A. N. PATOWARY

Understanding rainfall patterns is essential to solving several regional environmental issues in water resources management, with implications for agriculture, climate change, and natural calamities such as floods and droughts. Statistical computing, modeling, and forecasting are key instruments for studying these patterns. Time series analysis and forecasting has become a major tool in hydrological and environmental applications. Among the most effective approaches for analyzing time series data is the ARIMA (Autoregressive Integrated Moving Average) model introduced by Box and Jenkins. In this study, the Box-Jenkins methodology is used to build an ARIMA model for monthly rainfall data from Dibrugarh for the period 1980-2014, a total of 420 points. We found that the ARIMA(0, 0, 0)(0, 1, 1)₁₂ model is suitable for the given data set. This model can therefore be used to forecast the pattern of monthly rainfall in the coming years, which can help decision makers set priorities for agriculture, flood control, water demand management, etc.
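Written out, a seasonal ARIMA(0, 0, 0)(0, 1, 1) model with period 12 has the form below. This is the generic textbook form in the Box-Jenkins sign convention, assuming no constant term; the symbols $y_t$ (monthly rainfall), $B$ (backshift operator), $\Theta_1$ (seasonal moving average coefficient) and $\varepsilon_t$ (white noise) are notation introduced here, and the fitted value of $\Theta_1$ is not given in the abstract.

```latex
(1 - B^{12})\, y_t = (1 - \Theta_1 B^{12})\, \varepsilon_t
\quad\Longleftrightarrow\quad
y_t = y_{t-12} + \varepsilon_t - \Theta_1\, \varepsilon_{t-12}
```

In words, each month's rainfall is modeled as the same month's value one year earlier, adjusted by the current shock and a fraction of the shock from twelve months before.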


Author(s):  
Mohammad Karim Ahmadzai

Wheat is the most important food crop in Afghanistan, whether consumed by the bulk of the people or used in various sectors. The problem is that Afghanistan has a significant shortfall between domestic wheat production and consumption. Thus, the present study looks at the issue of achieving self-sufficiency for the whole population given these wheat shortages. To do so, we employ time series analysis, which can produce highly accurate short-run predictions from a significant quantity of data on the variables in question. ARIMA models are versatile and widely used in univariate time series analysis. The ARIMA model combines three processes: (i) the auto-regressive (AR) process, (ii) the differencing process, and (iii) the moving average (MA) process. These processes are referred to as primary univariate time series models in the statistical literature and are widely employed in various applications. Predicting future wheat requirements is one of the most important tools that decision-makers can use to assess wheat requirements and then design measures to close the gap between supply and consumption. The present study seeks to model Production, Consumption, and Population for the period 2002-2017 and to forecast the values of these variables for 2018-2030.
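The three processes listed above combine into the general ARIMA(p, d, q) equation, shown here in a common textbook form with the Box-Jenkins sign convention. The symbols are notation introduced for this sketch and are not taken from the study: $B$ is the backshift operator, $d$ the order of differencing, $\phi_i$ and $\theta_j$ the AR and MA coefficients, $c$ a constant, and $\varepsilon_t$ white noise.

```latex
\underbrace{\left(1 - \phi_1 B - \dots - \phi_p B^{p}\right)}_{\text{(i) AR}}
\;\underbrace{(1 - B)^{d}}_{\text{(ii) differencing}}\; y_t
= c +
\underbrace{\left(1 - \theta_1 B - \dots - \theta_q B^{q}\right)}_{\text{(iii) MA}}
\;\varepsilon_t
```

Setting $d = 0$ reduces this to an ARMA(p, q) model, and setting $p = q = 0$ leaves only the differencing step.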


Author(s):  
Sudip Singh

India, with a population of over 1.38 billion, is facing a high number of daily confirmed COVID-19 cases. In this chapter, the authors apply the ARIMA (auto-regressive integrated moving average) model to predict daily confirmed COVID-19 cases in India. A detailed univariate time series analysis was conducted on daily confirmed data from 19.03.2020 to 28.07.2020, and the predictions from the model were satisfactory, with a root mean square error (RMSE) of 7,103. Data for this study was obtained from various reliable sources, including the Ministry of Health and Family Welfare (MoHFW) and http://covid19india.org/. The model identified was ARIMA(1,1,1), based on time series decomposition, the autocorrelation function (ACF), and the partial autocorrelation function (PACF).
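To make the ARIMA(1,1,1) structure concrete, the sketch below computes a one-step-ahead forecast for given coefficients. It is an illustration under stated assumptions rather than the chapter's fitted model: there is no constant term, the MA term uses the Box-Jenkins minus sign, the first residual is set to zero, and the coefficients `phi`, `theta` and the sample series are hypothetical.

```python
def arima_111_one_step(series, phi, theta):
    """One-step-ahead forecast from an ARIMA(1,1,1) with known coefficients.

    Model on first differences w_t = y_t - y_{t-1}:
        w_t = phi * w_{t-1} + e_t - theta * e_{t-1}
    """
    if len(series) < 3:
        raise ValueError("need at least three observations")
    # d = 1: work with first differences of the series.
    diffs = [b - a for a, b in zip(series, series[1:])]
    resid = 0.0  # assumption: residual of the first difference is zero
    for prev_w, w in zip(diffs, diffs[1:]):
        fitted = phi * prev_w - theta * resid
        resid = w - fitted  # one-step residual at this time point
    next_diff = phi * diffs[-1] - theta * resid
    # Undo the differencing to get back to the level of the series.
    return series[-1] + next_diff


# Hypothetical daily counts and coefficients (illustrative only).
daily_cases = [10.0, 12.0, 15.0, 19.0]
print(arima_111_one_step(daily_cases, phi=0.5, theta=0.5))
```

In practice the coefficients would be estimated from the data, for example by maximum likelihood, after the ACF and PACF plots have suggested the orders.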


Author(s):  
T. Warren Liao

In this chapter, we present genetic algorithm (GA) based methods developed for clustering univariate time series of equal or unequal length as an exploratory step of data mining. These methods essentially implement the k-medoids algorithm. Each chromosome encodes in binary the data objects serving as the k-medoids. To compare their performance, both fixed-parameter and adaptive GAs were used. We first employed the synthetic control chart data set to investigate the performance of three fitness functions, two distance measures, and other GA parameters such as population size, crossover rate, and mutation rate. Two more sets of time series, with or without a known number of clusters, were also tested: one is the cylinder-bell-funnel data and the other is the novel battle simulation data. The clustering results are presented and discussed.
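One way to read the chromosome encoding described above is as one bit per data object, with a 1 marking that object as a medoid. The sketch below evaluates such a chromosome by the total distance from each series to its nearest medoid. This is an assumed interpretation: the abstract does not specify the chapter's three fitness functions or two distance measures, so the Euclidean distance (which requires equal-length series) and all names here are illustrative choices.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length series."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def decode_medoids(chromosome):
    """Indices of the data objects flagged as medoids (bit == 1)."""
    return [i for i, bit in enumerate(chromosome) if bit == 1]


def total_distance(chromosome, series_list, distance=euclidean):
    """Sum of distances from each series to its nearest medoid (lower is better)."""
    medoids = decode_medoids(chromosome)
    if not medoids:
        return math.inf  # a chromosome with no medoid is infeasible
    return sum(
        min(distance(s, series_list[m]) for m in medoids) for s in series_list
    )


# Four tiny hypothetical series forming two obvious groups.
series_list = [[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]]
print(total_distance([1, 0, 1, 0], series_list))  # one medoid per group
print(total_distance([1, 1, 0, 0], series_list))  # both medoids in one group
```

A GA would then apply selection, crossover, and mutation to a population of such bit strings, keeping the number of 1 bits equal to k when the number of clusters is known.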


2021 ◽  
Vol 6 (3) ◽  
pp. 22-33
Author(s):  
Atiqa Nur Azza Mahmad Azan ◽  
Nur Faizatul Auni Mohd Zulkifly Mototo ◽  
Pauline Jin Wee Mah

Gold is known as the most valuable commodity in the world because it is a universal currency recognized by every bank across the globe. Thus, many people are interested in investing in gold, since the gold market has been steadier than other investments (Khamis and Awang, 2020). However, the credibility of gold has been questioned because gold prices change under a variety of circumstances (Henriksen, 2018). Hence, information on the inflation of gold prices is needed to understand the trend and to plan for the future in accordance with international gold price standards. The aims of this study were to identify the trend of Kijang Emas monthly average prices in Malaysia from 2010 to 2021, to determine the best-fit time series model for Kijang Emas prices in Malaysia, and to forecast Kijang Emas prices in Malaysia using univariate time series models. The ARIMA and ARFIMA models were used in this study to model and forecast the prices of gold (Kijang Emas) in Malaysia. Each of the actual monthly Kijang Emas prices for 2021 was found to be within the 95% prediction intervals for both the ARIMA and ARFIMA models. The performance of each model was checked using the values of MAE, RMSE and MAPE. All three measures showed that the ARFIMA model was the better model for forecasting Kijang Emas prices in Malaysia compared to the ARIMA model.

