| SardiniaHotels {VGAMdata} | R Documentation |
Data from hotels in Sardinia, Italy
Description
This data set contains information and satisfaction scores appearing on the TripAdvisor website between the years 2008 and 2016 regarding hotels in Sardinia, Italy.
The satisfaction data refer to the reputation of hotels located along the Sardinian coasts, as expressed by clients with respect to different services (e.g., breakfast, restaurant, swimming pool) offered by the hotel.
Usage
data(SardiniaHotels)
Format
A data frame with 518 rows and 43 columns (variables). Each row refers to a single hotel.
The following variables are included in the dataset:
municipality: a factor, the municipality where the hotel is located.
stars: an ordered factor with levels:
    1OR2stars: 1 star or 2 star hotels,
    3stars: 3 star hotels,
    residence,
    4stars: 4 star hotels,
    5starsORresort: 5 star hotels or resorts.
area: a factor with levels related to the area of the Sardinian coast where each hotel is located: AlgheroSassari, CagliariVillasimius, CostaSmeralda, DorgaliOrosei, Gallura, NurraAnglona, Ogliastra, Olbia, OristanoBosa, PulaChia, Sarrabus, Sulcis.
seaLocation: a factor with levels yes (if the hotel is located close to the sea) and no (otherwise).
excellent: a numeric vector, the number of people that expressed the highest level of satisfaction.
good: a numeric vector, the number of people that expressed a good level of satisfaction.
average: a numeric vector, the number of people that expressed an average level of satisfaction.
bad: a numeric vector, the number of people that expressed a bad level of satisfaction.
poor: a numeric vector, the number of people that expressed the lowest level of satisfaction.
family: a numeric vector, the number of people travelling with family.
couple: a numeric vector, the number of people travelling with their partner.
single: a numeric vector, the number of people travelling alone.
business: a numeric vector, the number of people travelling for work.
MarMay: a numeric vector, the number of people travelling during the period March to May.
JunAug: a numeric vector, the number of people travelling during the period June to August.
SepNov: a numeric vector, the number of people travelling during the period September to November.
DecFeb: a numeric vector, the number of people travelling during the period December to February.
location: a numeric vector, the satisfaction score expressed by tourists towards the location.
sleepQuality: a numeric vector, the satisfaction score expressed by tourists towards the sleep quality.
room: a numeric vector, the satisfaction score expressed by tourists towards the comfort and quality of the room.
services: a numeric vector, the satisfaction score expressed by tourists towards the quality of the services.
priceQualityRate: a numeric vector, the satisfaction score expressed by tourists towards the ratio between price and quality.
cleaning: a numeric vector, the satisfaction score expressed by tourists towards the level of room and hotel cleaning.
bt1: a factor with levels breakfast, cleaning, location, overall, price, restaurant, room, services, staff, structure and Wi-Fi. It expresses the 1st most used word in reviews for a hotel.
ratebt1: a factor with levels -1 (if the satisfaction score expressed in bt1 is predominantly negative) and 1 (if the satisfaction score expressed in bt1 is predominantly positive).
bt2: a factor with levels breakfast, cleaning, location, overall, price, restaurant, room, services, staff, structure and Wi-Fi. It expresses the 2nd most used word in reviews for a hotel.
ratebt2: a factor with levels -1 (if the satisfaction score expressed in bt2 is predominantly negative) and 1 (if the satisfaction score expressed in bt2 is predominantly positive).
bt3: similar to bt1 and bt2, but for the 3rd most used word.
bt4: similar to bt1 and bt2, but for the 4th most used word.
bt5: similar to bt1 and bt2, but for the 5th most used word.
bt6: similar to bt1 and bt2, but for the 6th most used word.
bt7: similar to bt1 and bt2, but for the 7th most used word.
bt8: similar to bt1 and bt2, but for the 8th most used word.
bt9: similar to bt1 and bt2, but for the 9th most used word.
bt10: similar to bt1 and bt2, but for the 10th most used word.
ratebt3: similar to ratebt1 and ratebt2, but for bt3.
ratebt4: similar to ratebt1 and ratebt2, but for bt4.
ratebt5: similar to ratebt1 and ratebt2, but for bt5.
ratebt6: similar to ratebt1 and ratebt2, but for bt6.
ratebt7: similar to ratebt1 and ratebt2, but for bt7.
ratebt8: similar to ratebt1 and ratebt2, but for bt8.
ratebt9: similar to ratebt1 and ratebt2, but for bt9.
ratebt10: similar to ratebt1 and ratebt2, but for bt10.
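The five satisfaction counts (excellent to poor) are easier to compare across hotels once they are converted to within-hotel shares. The following is a minimal sketch of that conversion. It uses a small made-up data frame, toy, whose values are invented for illustration and are not taken from the data set, so that it runs without VGAMdata installed. The 5-to-1 scoring used for mean_rating is also an assumption, not something the documentation prescribes.

```r
# Toy stand-in for SardiniaHotels: the column names follow the Format
# section, but the values are invented purely for illustration.
toy <- data.frame(
  excellent = c(10, 0, 4),
  good      = c(5, 2, 4),
  average   = c(3, 2, 1),
  bad       = c(1, 4, 1),
  poor      = c(1, 2, 0)
)

rating_cols <- c("excellent", "good", "average", "bad", "poor")
counts <- as.matrix(toy[, rating_cols])

# Total number of ratings per hotel
n_ratings <- rowSums(counts)

# Share of each satisfaction level within a hotel (each row sums to 1)
rating_share <- prop.table(counts, margin = 1)

# Mean rating per hotel, scoring excellent = 5 down to poor = 1
# (this scoring scheme is an assumption made for the sketch)
mean_rating <- as.vector(counts %*% 5:1) / n_ratings
```

With the real data, toy would be replaced by SardiniaHotels after data(SardiniaHotels). If a hotel had no ratings at all, its shares and mean rating would come out as NaN and would need separate handling.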
Details
These data were manually collected during March–June 2016 by students of the class of "Statistics for Tourism" at the University of Cagliari, Italy (Bachelor's degree in Tourism Economics and Management), under the supervision of Prof. Claudio Conversano and Dr. Giulia Contu.
Many of the variables fall into several natural groups, e.g.,
[municipality, stars, area, seaLocation];
[excellent, good, average, bad, poor];
[MarMay, JunAug, SepNov, DecFeb];
[family, couple, single, business];
[location, ..., cleaning];
[bt1, ..., bt10];
[ratebt1, ..., ratebt10].
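One way to work with these groups is to store the variable names in a named list and select columns by group. The sketch below assumes the data frame uses exactly the names given in the Format section. The group labels (hotel, ratings, season, and so on) are hypothetical and chosen only for this example.

```r
# Variable groups as named character vectors. The labels on the left
# are hypothetical; the column names come from the Format section.
groups <- list(
  hotel     = c("municipality", "stars", "area", "seaLocation"),
  ratings   = c("excellent", "good", "average", "bad", "poor"),
  season    = c("MarMay", "JunAug", "SepNov", "DecFeb"),
  traveller = c("family", "couple", "single", "business"),
  scores    = c("location", "sleepQuality", "room", "services",
                "priceQualityRate", "cleaning"),
  words     = paste0("bt", 1:10),      # bt1, ..., bt10
  wordRates = paste0("ratebt", 1:10)   # ratebt1, ..., ratebt10
)

# Number of variables in each group, and in total
lengths(groups)
sum(lengths(groups))

# With the data loaded, a whole group can be extracted by name, e.g.
# data(SardiniaHotels, package = "VGAMdata")
# summary(SardiniaHotels[, groups$ratings])
```

The total should match the 43 columns stated under Format, which is a quick check that no variable has been left out of the grouping.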
Source
TripAdvisor, https://www.tripadvisor.it/.
Examples
data(SardiniaHotels)
summary(SardiniaHotels)