Text Mining with RapidMiner

July 14, 2014
45 Views

Download > Text Mining with RapidMiner

Download SUPPLEMENT Data > TripAdvisor Dataset

Parameters for the operators within the Process Documents from Files operator

Parameters for the operators within the Process Documents from Files operator

 

The goal of this chapter is to introduce the text mining capabilities of RAPIDMINER through a use case. The use case involves mining reviews for hotels at TripAdvisor.com, a popular web portal. We will be demonstrating basic text mining in RAPIDMINER using the text mining extension. We will present two different RAPIDMINER processes, namely Process01 andProcess02, which respectively describe how text mining can be combined with association mining and cluster modeling. While it is possible to construct each of these processes from scratch by inserting the appropriate operators into the process view, we will instead import these two processes readily from existing model files. Throughout the chapter, we will at times deliberately instruct the reader to take erroneous steps that result in undesired outcomes. We believe that this is a very realistic way of learning to use RAPIDMINER, since in practice, the modeling process frequently involves such steps that are later corrected.

Ertek, G., Tapucu, D., and Arın, I., 2013. Text Mining with RapidMiner. In: Markus Hofmann, Ralf Klinkenberg (Eds.) RapidMiner: Data Mining Use Cases and Business Analytics Applications. Chapman & Hall/CRC Data Mining and Knowledge Discovery Series. Chapman and Hall/CRC.

Note: This is the final draft version of this paper. Please cite this paper (or this final draft) as above.

Download > Text Mining with RapidMiner

Download SUPPLEMENT Data > TripAdvisor Dataset

Dr. Gürdal Ertek recommends the following related book:

Dr. Gürdal Ertek @ Social Web:

Dr. Gürdal Ertek @ TwitterDr. Gürdal Ertek @ LinkedIn

You may be interested

Wind Turbine Accidents: A Data Mining Study
Data Science
191 views
Data Science
191 views

Wind Turbine Accidents: A Data Mining Study

admin - December 12, 2016

[caption id="attachment_575" align="alignnone" width="960"] Fig. 1. The cause-effect relationship and stages where an accident occurs.[/caption] download the final draft of…

Perception gap and its impact on supply chain performance
Data Envelopment Analysis (DEA)
162 views
Data Envelopment Analysis (DEA)
162 views

Perception gap and its impact on supply chain performance

admin - January 10, 2016

Figure 4. SEM modelling for relationship between performance gaps and performance shortfall download: lu_ertek_2015_perception_gap.pdf The main purpose of this paper…

New knowledge in strategic management through visually mining semantic networks
Data Science
93 views
Data Science
93 views

New knowledge in strategic management through visually mining semantic networks

admin - January 10, 2016

[caption id="attachment_567" align="alignnone" width="598"] Fig. 5. Outlier objects.[/caption] download: ertek_et_al_2015_strategic_management.pdf Today’s highly competitive business world requires that managers be able…