Text Mining with RapidMiner

October 31, 2017
116 Views

Perception of the respondents at the suppliers and the buyer regarding the performance measures
Perception of the respondents at the suppliers and the buyer regarding the performance measures

The goal of this chapter is to introduce the text mining capabilities of RAPIDMINER through a use case. The use case involves mining reviews for hotels at TripAdvisor.com, a popular web portal. We will be demonstrating basic text mining in RAPIDMINER using the text mining extension. We will present two different RAPIDMINER processes, namely Process01 andProcess02, which respectively describe how text mining can be combined with association mining and cluster modeling. While it is possible to construct each of these processes from scratch by inserting the appropriate operators into the process view, we will instead import these two processes readily from existing model files. Throughout the chapter, we will at times deliberately instruct the reader to take erroneous steps that result in undesired outcomes. We believe that this is a very realistic way of learning to use RAPIDMINER, since in practice, the modeling process frequently involves such steps that are later corrected.

Ertek, G., Tapucu, D., and Arın, I., 2013. Text Mining with RapidMiner. In: Markus Hofmann, Ralf Klinkenberg (Eds.) RapidMiner: Data Mining Use Cases and Business Analytics Applications. Chapman & Hall/CRC Data Mining and Knowledge Discovery Series. Chapman and Hall/CRC.

Note: This is the final draft version of this paper. Please cite this paper (or this final draft) as above.

Download
Text Mining With Rapidminer

view PDF

Download SUPPLEMENT Data
TripAdvisor Dataset

Dr. Gürdal Ertek @ Social Web:

Dr. Gürdal Ertek @ TwitterDr. Gürdal Ertek @ LinkedIn

You may be interested

Learning and Personal Attributes of University Students in Predicting and Classifying the Learning Styles:
Uncategorized
29 views
Uncategorized
29 views

Learning and Personal Attributes of University Students in Predicting and Classifying the Learning Styles:

Dr. Gurdal Ertek - December 8, 2017

[caption id="attachment_1093" align="alignnone" width="707"] Fig. 1. The nine-region learning style grid[/caption] Learning and Personal Attributes of University Students in Predicting…

A Framework for Mining RFID Data From Schedule-Based Systems
Uncategorized
27 views
Uncategorized
27 views

A Framework for Mining RFID Data From Schedule-Based Systems

Dr. Gurdal Ertek - November 24, 2017

[caption id="attachment_1082" align="alignnone" width="1019"] Fig. 1. A schedule-based system where entities entering and exiting the system are tracked with RFID.[/caption]…

Wind Turbine Accidents: A Data Mining Study
Data Science
470 views
Data Science
470 views

Wind Turbine Accidents: A Data Mining Study

Dr. Gurdal Ertek - November 12, 2017

Fig. 1. The cause-effect relationship and stages where an accident occurs. Wind Turbine Accidents: A Data Mining Study While the…