Sentiment Analysis in TripAdvisor

Page created by Ruby Gibson
 
CONTINUE READING
Sentiment Analysis in TripAdvisor
AFFECTIVE COMPUTING AND SENTIMENT ANALYSIS
                     Editor: Erik Cambria, Nanyang Technological University, Singapore, cambria@ntu.edu.sg

                     Sentiment Analysis
                     in TripAdvisor

                     Ana Valdivia, M. Victoria Luzón, and Francisco Herrera, University of Granada

       T
            he number of Web 2.0 websites has recently              product, and so on is generally positive, negative,
            experienced significant growth. These web-              or neutral. The interest in sentiment analysis has
                                                                    increased significantly over the last few years due
       sites emerged as an evolution of Web 1.0, or static,         to the large amount of stored text in Web 2.0 ap-
       websites. In Web 2.0, users aren’t only content              plications and the importance of online customer
       consumers, they can also generate content and col-           opinions. As a result, more than 1 million research
       laborate with other users. In this way, users take           papers contain the term “sentiment analysis,” and
       an active role and create a virtual community.               various start-ups have been created to analyze
       Web 2.0 websites include blogs (Blogger or Word-             sentiments in social media companies.
       press), media content (Prezi, YouTube, Flickr), wi-             Multiple studies on TripAdvisor exist, but there
       kis (Wikipedia, WikiSpace), collaboration (Drop-             is no complete analysis from the sentiment analy-
       box, Google Docs), and social networks (Twitter,             sis viewpoint. This article proposes TripAdvisor
       Facebook, Google+).1 The burgeoning informa-                 as a source of data for sentiment analysis tasks.
       tion explosion offered through Web 2.0 suggests              We develop an analysis for studying the match-
       that customers often check other users’ opin-                ing between users’ sentiments and automatic sen-
       ions in forums, blogs, and social networks be-               timent-detection algorithms. Finally, we discuss
       fore buying a product or contracting a service.              some of the challenges regarding sentiment analy-
         TripAdvisor emerged in 2004 as a Web 2.0 appli-            sis and TripAdvisor, and conclude with some final
       cation for the tourism domain. This user-generated           remarks.
       content website offers a plethora of reviews de-
       tailing travelers’ experiences with hotels, res-             TripAdvisor and Sentiment Analysis
       taurants, and tourist spots. TripAdvisor has                 According to Wikipedia, TripAdvisor is an Amer-
       since been ranked as the most popular site for               ican travel website company providing reviews
       trip planning, with millions of tourists visit-              from travelers about their experiences in hotels,
       ing the site when arranging their holidays (see              restaurants, and monuments. Stephen Kaufer
       Figure 1).                                                   and Langley Steinert, along with others, founded
         Sentiment analysis is a natural language pro-              TripAdvisor in February 2000 as a site listing in-
       cessing tool that is useful for monitoring Web 2.0           formation from guidebooks, newspapers, and mag-
       applications, as it can reveal public opinion about          azines. InterActiveCorp purchased the site in 2004,
       numerous issues without requiring satisfaction               and one year later, spun off its business travel
       enquiries. 2,3 According to the Oxford diction-              group, Expedia. After that, the website turned to
       ary, sentiment analysis is the process of compu-             user-generated content. It has since become the
       tationally identifying and categorizing opinions             largest travel community, reaching 390 million
       expressed in a piece of text to determine whether            unique visitors each month and listing 465 million
       the writer’s attitude toward a particular topic,             reviews and opinions about more than 7 million

72		                                      1541-1672/17/$33.00 © 2017 IEEE                           IEEE INTELLIGENT SYSTEMS
                                       Published by the IEEE Computer Society
Sentiment Analysis in TripAdvisor
accommodations, restaurants, and
attractions in 49 markets worldwide.
Figure 2 shows the Google search rate
for TripAdvisor, illustrating its popu-
larity around the world.
   Because it has so much data, Trip­
Advisor has become extremely popu-
lar with both tourists and managers.
Tourists can read the accumulated
opinions of millions of everyday tour-
ists. They can also check the popular-
ity index, which is computed using
an algorithm that accounts for user         Figure 1. Granada (Paseo de los Tristes). TripAdvisor is the most popular site for
reviews and other published sources         planning a trip.
such as guidebooks and newspaper
articles. This index runs from num-
ber 1 to the overall total number of
restaurants, hotels, or other attrac-
tions within the city. Travelers can
find the most interesting visitor at-
traction or most popular restaurant.
Linked to this is the bubble rating
(user rating), a 1–5 scale where one
bubble represents a terrible experi-
ence and five bubbles an excellent
experience. All reviewers are asked
to use this scale to summarize their
feedback. Together with this rating,
users include their opinions, which
can cover the performance of a res-
taurant, hotel, or tourist spot. There-
fore, reading and analyzing reviews         Figure 2. Map of the popularity of TripAdvisor searches in Google over the last
can help develop a business.                five years.
   The World Travel & Tourism Coun-
cil report shows that tourism gener-
ates 9.8 percent of the wider gross         data. Sentiment classification, the          detect sentiments from documents,
domestic product and supports 248           best-known sentiment analysis task,          sentences, or words. A large variety
million jobs.4 These numbers suggest        aims to detect sentiments within a           of SAMs address the different catego-
that the tourism industry is the most       document, a sentence, or an aspect.          ries of texts (blogs, reviews, tweets,
important economic driver of many           This task can be divided into three          and so on). However, the analysis of
economies. Therefore, it’s important        steps: polarity detection (label the         feelings is not a perfect science, espe-
to understand the main drivers of the       sentiment of the text as positive, neg-      cially when applied to the unstruc-
tourist flow as well as tourists’opinions   ative, or neutral), aspect selection/        tured texts that predominate in social
about a city’s restaurants, hotels, and     extraction (obtain the features for          networks.9 Human language is com-
tourist attractions.                        structuring the text), and classifica-       plex, so teaching a machine to detect
   TripAdvisor has enough standing          tion (apply machine learning or lexi-        different grammatical nuances, cul-
to be used as a text source, 5 storing      con approaches to classify the text).        tural variations, jargon, and misspell-
numerous reviews of tourist busi-              Sentiment analysis methods (SAMs),        ings in messages on the network is a
nesses around the world. Sentiment          which are trained for sentiment po-          difficult process, and it is even more
analysis extracts insights from this        larity detection,6–8 can automatically       difficult to automatically understand

JULY/AUGUST 2017                                    www.computer.org/intelligent                                                    73
Sentiment Analysis in TripAdvisor
Alhambra               Mezquita Córdoba                 Sagrada Familia                                                                       Alhambra               Mezquita Córdoba                  Sagrada Familia
                                  100%

                                                                                                                                            of total SentiStrength
                                                                                                                                                                                 60%

                                                                                                                                                    Percent
                                  90%                                                                                                                                            40%

                                                                                                                                                                                 20%
                                  80%                                                                                                                                            0%

                                                                                                                                                      Percent of total Bing
                                  70%                                                                                                                                            40%
   Percent of total user rating

                                  60%                                                                                                                                            20%

                                                                                                                                                                                  0%
                                  50%                                                                                                                                            80%

                                                                                                                                                      Percent of total Syuzhet
                                                                                                                                                                                 60%
                                  40%
                                                                                                                                                                                 40%

                                                                                                                                                                                 20%
                                  30%
                                                                                                                                                                                 0%

                                  20%
                                                                                                                                            Percent of total

                                                                                                                                                                                 40%
                                                                                                                                               CoreNLP

                                  10%
                                                                                                                                                                                 20%

                                   0%                                                                                                                                            0%
                                         Negative

                                                      Neutral

                                                                                                                                                                                                                         Negative

                                                                                                                                                                                                                                    Neutral

                                                                                                                                                                                                                                                           Negative

                                                                                                                                                                                                                                                                      Neutral
                                                                           Negative

                                                                                      Neutral

                                                                                                           Negative

                                                                                                                       Neutral

                                                                                                                                                                                       Negative

                                                                                                                                                                                                    Neutral
                                                                Positive

                                                                                                                                                                                                                                                Positive

                                                                                                                                                                                                                                                                                Positive
                                                                                                Positive

                                                                                                                                 Positive

                                                                                                                                                                                                              Positive

 Figure 3. Distribution of sentiments between TripAdvisor users (bubble ratings) and four sentiment analysis methods (SAMs):
 SentiStrength, Bing, Syuzhet, and CoreNLP. Red indicates negative sentiments, orange neutral, and green positive.

 how the context can affect the mes-                                                                   social networks, such as TripAdvisor for                                                                   Figure 3 shows the results. We ob-
 sage’s tone. Because humans can ap-                                                                   analysis.12 Similarly, other research-                                                                  serve that the distributions of polari-
 ply a contextual understanding, we                                                                    ers developed a tool to analyze tour-                                                                   ties are different from user ratings. The
 can intuitively interpret the intention-                                                              ists’ opinions of restaurants as well as                                                                user ratings bar plot shows that us-
 ality of any writing. Computers, how-                                                                 hotels from a region in Chile.13                                                                        ers tend to rate their visits to the three
 ever, have difficulty understanding the                                                                                                                                                                       monuments positively, with more than
 context in which a phrase is expressed                                                                A Study for Calibrating                                                                                 90 percent of ratings having four or
 and detecting whether a person is be-                                                                 User’s Polarity                                                                                         five bubbles. SentiStrength and Syuzhet
 ing sarcastic or not.                                                                                 We scrape TripAdvisor webpages                                                                          methods reach a similar distribution
    A few sentiment analysis studies                                                                   on three well-known monuments in                                                                        to the user rating. However, Bing and
 set TripAdvisor as a data source. For                                                                 Spain: Alhambra, Mezquita Cór-                                                                          CoreNLP detect more negativity in the
 example, researchers have collected                                                                   doba, and Sagrada Familia. We con-                                                                      TripAdvisor opinions. In all cases, the
 reviews about the TripAdvisor app in                                                                  sider user ratings of one and two                                                                       number of neutral polarities is higher
 Google Play Store for extracting app                                                                  bubbles as negative, three as neutral,                                                                  than the neutral ratings (three bubbles).
 features to help developers.10 Others                                                                 and four and five as a positive sen-                                                                       Next, we studied the distribution of
 analyzed TripAdvisor’s hotel reviews                                                                  timent. We then apply four SAMs                                                                         bubble ratings over the negative SAM
 for classifying good and bad customer                                                                 (SentiStrength,14 Bing,15 Syuzhet, and                                                                  polarities. We thus analyzed the behav-
 opinions.11 Still others propose a system                                                             CoreNLP6) and extract the overall                                                                       ior of user feedback against the SAM
 to summarize comments from travel                                                                     polarity on each opinion.                                                                               evaluations. Figure 4 presents 12 bar

74		                                                                                                                  www.computer.org/intelligent                                                                                            IEEE INTELLIGENT SYSTEMS
Sentiment Analysis in TripAdvisor
Alhambra                                Mezquita Córdoba                              Saqrada Familia
                    100%
    SentiStrength

                           57.84%
                    50%             40.22%                                         42.86%
                                                                                                                       32.24%
                                                                                            22.22%                              23.72%
                                              19.41%                                                                                     16.26%
                                                                          14.29%                     10.56%                                       11.89%
                                                        9.94%     6.43%                                       6.60%                                        9.90%
                      0%
                    100% 91.18%                                                    92.86%
                                    78.26%
                                              60.37%                      57.14%                                       55.19%
    Bing

                    50%                                 45.35%                                                                  47.71%
                                                                                            38.89%                                       38.64%
                                                                 33.27%                                                                           33.95%   31.64%
                                                                                                     27.83% 26.56%

                      0%
                    100%
    Syuzhet

                    50% 45.10%
                                    36.96%                                         35.71%
                                              26.06%                      28.57%                                       27.32%
                                                                                                                                20.49% 16.43%
                                                        14.14% 10.52%                       16.67%                                            13.32%       12.15%
                                                                                                     10.94%   9.98%
                     0%
                    100% 94.12% 92.39%
                                                                                   85.71%
                                              76.60%                      71.43%
    Core NLP

                                                                                            58.33%                     57.38%   59.03%
                                                        54.16%
                                                                                                                                         48.02%
                    50%
                                                                 35.68%                              32.44%                                       35.96%
                                                                                                              25.41%                                       29.26%

                     0%
                             *       **        ***      ****      *****     *       **       ***     ****     *****       *      **       ***     ****     *****

Figure 4. Distribution of SAMs’ negative polarity by user rating in TripAdvisor.

plots (four SAMs for each of three                               we suggest not setting the user rate as               TripAdvisor-based opinions. Although
monuments) containing the shares of                              a label sentiment for the whole review                some related topics have been exten-
all negative SAM polarity over the bub-                          and analyzing the opinions in depth.                  sively studied in the literature, their
ble evaluation (percent over the origi-                            This study clearly shows the need                   adaptation to the context of TripAdvi-
nal user ratings). Analyzing this data,                          to analyze opinions beyond user rat-                  sor opinions requires revisiting them.
we observe that Senti­    Strength and                           ings. As a practical methodology, we                     Aspect-based sentiment analysis (
Syuzhet detect at best 57.48 and 45.10                           propose following three steps: handle                 ABSA) is an important sentiment anal-
percent of negative reviews. However,                            the negative opinions identified with                 ysis task.16 An aspect refers to an at-
they misclassify on average 20 percent                           SAMs via learning models, get a good                  tribute of the entity, for example, hotel
of positive user reviews (three and four                         clusterization according to consensus                 room cleanliness, the staff at a tour-
bubbles). On the other hand, Bing and                            degrees among SAMs, and discover                      ist spot, or the service at a restaurant.
CoreNLP methods detect as negative                               relationships among common aspects                    ABSA aims to identify the sentiment
more negative user ratings, but misclas-                         to characterize the cause behind the                  toward an aspect and extract fine-
sify over 30 percent of positive reviews.                        negative comments.                                    grained information about specific
Bing and CoreNLP tend to highlight                                                                                     Trip­Advisor-based opinions (hotels,
the negative opinions.                                           Challenges                                            monuments, restaurants, and so on).
   In general, we observe that users tend                        Several challenges arise when Trip­                   Recent relevant studies are based on
to write negative sentences on positive                          Advisor uses sentiment analy-                         deep learning,17 which should be ana-
user ratings, and vice versa. Therefore,                         sis, due to the specific content in                   lyzed in the TripAdvisor context.

JULY/AUGUST 2017                                                          www.computer.org/intelligent                                                              75
Sentiment Analysis in TripAdvisor
ABSA is helpful to business man-        applications in the tourist domain,                   Sentiment Analysis Methods,” EPJ Data
 agers because it allows for the ex-        sentiment analysis has great potential                Science, vol. 5, no. 1, 2016, pp. 1–29.
 traction of transparent customer           to directly influence quality improve-           7.   J. Serrano-Guerrero et al., “Sentiment
 opinions. Discovery knowledge tech-        ment in tourism.                                      Analysis: A Review and Comparative
 niques such as subgroup discovery18           Because of inconsistencies between                 Analysis of Web Services,” Information
 can be applied to discover relation-       user ratings and SAM evaluations,                     Science, vol. 311, Aug. 2015, pp. 18–38.
 ships among common aspects and get         with users often writing negative sen-           8.   E. Cambria and A. Hussain, Sentic
 aspect associations for both positive      tences in positive opinions and vice                  Computing: A Common-Sense-Based
 and negative opinions.                     versa, we need new approaches to fix                  Framework for Concept-Level Senti-
    The detection of irony and sarcasm      the positive, negative, and neutrality                ment Analysis, Springer, 2015.
 is a complex sentiment analysis task.      via consensus among SAMs, as well                9.   F.H. Khan, S. Bashir, and U. Qamar,
 The detection of ironic expressions        as design models to discover rela-                    “TOM: Twitter Opinion Mining
 in TripAdvisor reviews is an open          tionships among common aspects to                     Framework Using Hybrid Classification
 problem that could help to extract         characterize the reasons behind nega-                 Scheme,” Decision Support Systems,
 more valuable information about the        tive comments.                                        vol. 57, Jan. 2014, pp. 245–257.
 study’s subject.19 Spam is another                                                         10.   E. Guzman and W. Maalej, “How Do
 sentiment analysis-related concern.                                                              Users Like This Feature? AFine Grained
 Some authors have developed studies        Acknowledgments                                       Sentiment Analysis of App Reviews,”
 to measure the credibility of TripAd-      This work has been supported by FEDER                 Proc. IEEE 22nd Int’l Conf. Require-
 visor with satisfactory results. 20        (Fondo Europeo de DEsarrollo Regional)                ments Eng., 2014, pp. 153–162.
                                            and the Spanish National Research Project
    A novel approach is the extraction                                                      11.   D. Gräbner et al., “Classification of
                                            TIN2014-57251-P.
 of aspects/features from opinions                                                                Customer Reviews based on Sentiment
 to raise the issue as a bag of feature                                                           Analysis,” Information and Comm.
 vectors, considering the problem as        References                                            Technologies in Tourism, Springer,
 multi-instance learning. 21 This might      1. P. Andersen, What Is Web 2.0? Ideas,              2012, pp. 460–470.
 provide a robust approach from the             Technologies and Implications for Edu-      12.   S. Palakvangsa-Na-Ayudhya et al.,
 classification viewpoint.                      cation, tech. report, JISC Technology &           “Nebular: A Sentiment Classification
                                                Standards Watch, 2007; www.ictliteracy            System for the Tourism Business,” Proc.

 S
                                                .info/rf.pdf/Web2.0_research.pdf.                 8th Int’l Joint Conf. Computer Science
      entiment analysis is an incipi-        2. E. Cambria, “Affective Computing                  and Software Eng., 2011, pp. 293–298.
 ent research field. It is difficult to         and Sentiment Analysis,” IEEE Intel-        13.   E. Marrese-Taylor, J.D. Velásquez, and
 determine how it will evolve in the            ligent Systems, vol. 31, no. 2, 2016,             F. Bravo-Marquez, “A Novel Determin-
 future, although there is a general            pp. 102–107.                                      istic Approach for Aspect-Based Opinion
 belief that this analysis needs to go       3. B. Liu, Sentiment Analysis: Mining                Mining in Tourism Products Reviews,”
 beyond a simple classification of              Opinions, Sentiments, and Emotions,               Expert Systems with Applications, vol.
 texts on a positive and negative one-          Cambridge Univ. Press, 2015.                      41, Dec. 2014, pp. 7764–7775.
 dimensional scale. Over the last few        4. World Travel and Tourism Council,           14.   M. Thelwall, “Heart and Soul: Senti-
 years, the list of sentiment analysis-         Travel & Tourism Economic Impart                  ment Strength Detection in the Social
 related challenges has grown (subjec-          2016 World, 2016; www.wttc.org/-                  Web with SentiStrength,” Cyberemo-
 tivity classification, opinion summa-          /media/files/reports/economic%20                  tions: Collective Emotions in Cyber-
 rization, opinion retrieval, and so on).       impact%20research/regions%202016                  space, Springer, 2017, pp. 119–134.
    Through Web platforms such as               /world2016.pdf.                             15.   M. Hu and B. Liu, “Mining and
 TripAdvisor, tourists can openly de-        5. P. O’Connor, “User-Generated                      Summarizing Customer Reviews,”
 scribe their experiences and thus af-          Content and Travel: A Case Study on               Proc. 10th ACM SIGKDD Int’l Conf.
 fect a business’s viability. Therefore,        TripAdvisor.com,” Information and                 Knowledge Discovery and Data Min-
 the implementation of sentiment                Comm. Technologies in Tourism,                    ing, 2004, pp. 168–177.
 analysis techniques to mine sources            P. O’Connor, W. Höpken, and U. Gret-        16.   K. Shouten and F. Frasincar, “Survey on
 of opinion is crucial to understand-           zel, eds., Springer, 2008, pp. 47–48.             Aspect-Level Sentiment Analysis,” IEEE
 ing the faults and assets of a tourist      6. F.N. Ribeiro et al., “SentiBench: ABench-         Trans. Knowledge and Data Eng., vol.
 service. Given the large number of             mark Comparison of State-of-the-Practice          28, March 2016, pp. 813–830.

76		                                                www.computer.org/intelligent                             IEEE INTELLIGENT SYSTEMS
Sentiment Analysis in TripAdvisor
17. S. Poria, E. Cambria, and A. Gelbukh,                                                                                                                              Influence on Recommendation                      at University of Granada. Her research
    “Aspect Extraction for Opinion Min-                                                                                                                                Adoption and Word of Mouth,” Tour-               interests include sentiment analysis, arti-
    ing with a Deep Convolutional Neural                                                                                                                               ism Management, vol. 51, Dec. 2015,              ficial intelligence, and computer graphics.
    Network,” Knowledge-Based Systems,                                                                                                                                 pp. 174–185.                                     Luzón has a PhD in industrial engineering
    vol. 108, Sep. 2016, pp. 42–49.                                                                                                                                21. F. Herrera et al., Multiple Instance             from the University of Vigo. Contact her at
18. M. Atzmueller, “Subgroup Discovery,”                                                                                                                               Learning: Foundations and Algo-                  luzon@ugr.es.
    Wiley Interdisciplinary Reviews: Data                                                                                                                              rithms, Springer, 2016.
    Mining and Knowledge Discovery,                                                                                                                                                                                     Francisco Herrera is a full professor in
    vol. 5, no. 1, 2015, pp. 35-49.                                                                                                                                Ana Valdivia is a research fellow in the             the Computer Science and Artificial Intel-
19. S. Poria et al., “A Deeper Look                                                                                                                                Computer Science and Artificial Intelligence         ligence Department at the University of
    into Sarcastic Tweets Using Deep                                                                                                                               Department at the University of Granada.             Granada. and head of the Soft Comput-
    Convolutional Neural Networks,”                                                                                                                                Her research interests include big data, so-         ing and Intelligent Information Systems
    Proc. 26th Int’l Conf. Computational                                                                                                                           cial media analysis, and artificial intelli-         research group. His research interests in-
    Linguistics (COLING16), 2016,                                                                                                                                  gence. Valdivia has an MsC in data science           clude data science, data preprocessing,
    pp. 1601–1612.                                                                                                                                                 from the University of Granada. Contact              big data, sentiment analysis, and compu-
20. R. Filieri, S. Alguezaui, and F. McLeay,                                                                                                                       her at avaldivia@ugr.es.                             tational intelligence. Herrera has a PhD
    “Why Do Travelers Trust TripAdvi-                                                                                                                                                                                   in mathematics from the University of
    sor? Antecedents of Trust Towards                                                                                                                              M. Victoria Luzón is an associate profes-            Granada. Contact him at herrera@decsai
    Consumer-Generated Media and Its                                                                                                                               sor in the Software Engineering Department           .ugr.es.

                             JULY/AUGUST 2016
        IEEE SOFTWARE
        July/August 2016

                               SOFTWARE                                                                                         JANUARY/FEBRUARY 2016                                                         IEEE Software offers
                                QUALITY                                                                                                                                                                       pioneering ideas,
                                                                                           IEEE SOFTWARE

                                                                                                                                                                               WWW.COMPUTER.ORG/SOFTWARE
        SOFTWARE QUALITY

                                                                                                                                                                                                              expert analyses, and
                                                                                            January/February 2016

                                                                                                                                                                                                              thoughtful insights for
                                                                              ARCHITECTURAL DESIGN PRINCIPLES // 15

                                                                                                                                                                                                              software professionals
                                                                                   WHY LARGE IT PROJECTS FAIL // 117
        Volume 33 Number 4

                                                                                                                                                                                                              who need to keep up
                                                                                           THE FUTURE OF SOFTWARE ENGINEERING

                                                           MARCH/APRIL 2016
                                                                                     WWW.COMPUTER.ORG/SOFTWARE

                                                                                                                                                                                                              with rapid technology
                                      IEEE SOFTWARE

                                                                                                                                           WWW.COMPUTER.ORG/SOFTWARE
                                                                                                                                                                             TINY PROGRAMMING TOOLS // 24

                                                                                                                                                                             REQUIREMENTS
                                                                                                                                                                             & SOCIAL RESPONSIBILITY // 109

                                                                                                                                                                                                              change. It’s the authority
                                      March/April 2016

                                                                                                                                                        CODE CLARITY // 22
                                                                                                                                                                                                              on translating software
                                                                                                                                                                                                              theory into practice.
                                                                                                                                                        SOFTWARE
                                                                                           Volume 33 Number 1

                                                                                                                                                        ON A COMET // 81
                                      BIG DATA

                                                                                                                                                                                                              www.computer.org/
                                                                                                                                                                                                              software/subscribe
                                      Volume 33 Number 2

JULY/AUGUST 2017                                                                                                                                                                   www.computer.org/intelligent                                                       77
You can also read