Forecasting Model for Stock Market Based on Probabilistic Linguistic Logical Relationship and Distance Measurement
The fluctuation of the stock market has a symmetrical characteristic. To improve the performance of self-forecasting, it is crucial to summarize and accurately express internal fluctuation rules from the historical time series dataset. However, due to the influence of external interference factors, these internal rules are difficult to express by traditional mathematical models. In this paper, a novel forecasting model is proposed based on probabilistic linguistic logical relationships generated from historical time series dataset. The proposed model introduces linguistic variables with positive and negative symmetrical judgements to represent the direction of stock market fluctuation. Meanwhile, daily fluctuation trends of a stock market are represented by a probabilistic linguistic term set, which consist of daily status and its recent historical statuses. First, historical time series of a stock market is transformed into a fluctuation time series (FTS) by the first-order difference transformation. Then, a fuzzy linguistic variable is employed to represent each value in the fluctuation time series, according to predefined intervals. Next, left hand sides of fuzzy logical relationships between currents and their corresponding histories can be expressed by probabilistic linguistic term sets and similar ones can be grouped to generate probabilistic linguistic logical relationships. Lastly, based on the probabilistic linguistic term set expression of the current status and the corresponding historical statuses, distance measurement is employed to find the most proper probabilistic linguistic logical relationship for future forecasting. For the convenience of comparing the prediction performance of the model from the perspective of accuracy, this paper takes the closing price dataset of Taiwan Stock Exchange Capitalization Weighted Stock Index (TAIEX) as an example. Compared with the prediction results of previous studies, the proposed model has the advantages of stable prediction performance, simple model design, and an easy to understand platform. In order to test the performance of the model for other datasets, we use the prediction of the Shanghai Stock Exchange Composite Index (SHSECI) to prove its universality.