Using a sentiment analysis for the examination of tourism blogs – a step by step methodological reflection process
Abstract The article at hand is driven by a methodological interest in the opportunities and challenges of applying an automated text mining approach, particularly a sentiment analysis on various tourism blogs at the same time. The study aims to answer the question to what extent advanced computational methods can improve the data acquisition and analysis of unstructured data sets stemming from various blogs and forums. Furthermore, the authors intend to explore to what extent the sentiment analysis is able to objectify the qualitative results identified by an earlier analysis by the authors using content analysis done by thematic coding. For the purpose of the specific tourism research question in this paper a new approach is proposed, which consists of a combination of sentiment analyses, supervised learning, and dimensionality reduction in order to identify terms that strongly load on specific emotions. The contribution indicates on the one hand, that advanced computational methods have their own specific constraints, but on the other hand, are able to provide a richer and deeper analysis following a quantitative approach. Several issues have to be taken into account, such as data protection constraints, the need for data cleaning, such as word stemming, dimension reduction, such as removal of custom stop words, and the development of descent ontologies. On the other hand, the quantitative method also provides, due to its standardised procedure, a less subjective insight in the given content, but is not less time consuming than traditional content analysis.