Divided We Rule: Influencer Polarization on Twitter During Political Crises in India

Page created by Rose Padilla
 
CONTINUE READING
Divided We Rule: Influencer Polarization on Twitter During Political Crises in India
Divided We Rule: Influencer Polarization on Twitter During Political Crises in
                                                                               India
                                                               Saloni Dash, 1 Dibyendu Mishra, 1 Gazal Shekhawat,1 Joyojeet Pal 1
                                                                                                    1
                                                                                           Microsoft Research
                                                                                           Bengaluru, India
                                                t-sadash@microsoft.com, v-dibmis@microsoft.com, t-gshekhawat@microsoft.com, joyojeet.pal@microsoft.com
arXiv:2105.08361v1 [cs.SI] 18 May 2021

                                                                     Abstract                                handles of politicians from two major national parties in In-
                                                                                                             dia – the national incumbent BJP (Bharatiya Janata Party)
                                           Influencers are key to the nature and networks of information     and the opposition party INC (Indian National Congress).
                                           propagation on social media. Influencers are particularly im-
                                           portant in political discourse through their engagement with
                                                                                                             We do this by examining influencer engagement on four is-
                                           issues, and may derive their legitimacy either solely or partly   sues that have dominated the news and seen divisive political
                                           through online operation, or have an offline sphere of exper-     activism during 2019-2021. In the order of their onset, they
                                           tise such as entertainers, journalists etc. To quantify influ-    are as follows –
                                           encers’ political engagement and polarity, we use Google’s        • Abrogation of Article 370: The central government’s
                                           Universal Sentence Encoder (USE) to encode the tweets of
                                                                                                               unilateral abrogation of article 370 of the Indian Con-
                                           6k influencers and 26k Indian politicians during political
                                           crises in India. We then obtain aggregate vector represen-          stitution, which had secured autonomy for India’s only
                                           tations of the influencers based on their tweet embeddings,         Muslim majority state, Jammu & Kashmir. The region,
                                           which alongside retweet graphs help compute the stance and          which is also part of a major international dispute, was
                                           polarity of these influencers with respect to the political is-     broken into two union territories, with no consultation
                                           sues. We find that while on COVID-19 there is a confluence          with the state’s residents. Hotly debated actions following
                                           of influencers on the side of the government, on three other        the abrogation included cutting of communication lines in
                                           contentious issues around citizenship, Kashmir’s statehood,         Kashmir, imprisoning politicians, imposing a curfew, and
                                           and farmers’ protests, it is mainly government-aligned fan          increasing armed forces on the ground.
                                           accounts that amplify the incumbent’s positions. We propose       • Citizenship Amendment Act (CAA): The Act, which
                                           that this method offers insight into the political schisms in
                                                                                                               disqualified Musims from qualifying for refugee status
                                           present-day India, but also offers a means to study influencers
                                           and polarization in other contexts.                                 when entering from India’s neighboring countries, was
                                                                                                               enacted alongside the government’s plans to conduct a na-
                                                                                                               tionwide exercise to identify “illegal foreigners” living in
                                                                 Introduction                                  India by requiring historical documentation of residency.
                                         As social media raises concerns of polarizing politics,               Since such an exercise would automatically exclude peo-
                                         worldwide, questions arise on the role of public figures              ple of certain faiths from scrutiny and intensify attention
                                         who wield the ability to influence the political discourse. In        towards others, it stoked fears of lost citizenship among
                                         parts of the west, there is an inherent tension between what          India’s Muslims and led to widespread protests all around
                                         are largely seen as liberal mainstream celebrities (Demaine           the country.
                                         2009) and an emerging group of hyper-partisan micro-                • COVID-19: We studied the tweets surrounding the ongo-
                                         celebrities (Lewis 2020). Meanwhile in India, the role of in-         ing COVID-19 pandemic in India up until March 2021,
                                         fluencers has been complex, especially among mainstream               before the second wave of the crisis began. The first wave
                                         offline celebrities who have leaned more towards the nation-          period was marked by the government announcing a na-
                                         alist incumbent, which has a much better organized online             tionwide lockdown, which triggered an exodus of low-
                                         social media presence (Lalani, Kommiya Mothilal, and Pal              income migrant workers from Indian cities to hinterlands,
                                         2019), while public figures speaking against the government           often on foot, since transport lines were closed. The mi-
                                         have faced online harassment, trolling, and attacks to their          grant crisis and economic impacts, as well as the health
                                         livelihoods (Singh 2020; Menon 2020).                                 concerns and practices were part of the contentious issues
                                            In this paper, we study the role of influencers in ampli-          in these conversations.
                                         fying political polarization in India, by quantifying the par-      • Farmers’ Protests: The Indian Farm Bills which were
                                         tisan engagement of Twitter handles of Indian entertainers,           passed in parliament in September 2020, targeted pric-
                                         sportspersons, journalists and other public figures alongside         ing and subsidies and allowed the entry of corporates into
                                                                                                               certain domains of crop trading. This led to protests by
                                         Copyright © 2021, Association for the Advancement of Artificial       farmers, particularly in states of North India where the
                                         Intelligence (www.aaai.org). All rights reserved.                     new configurations of crops and purchase prices brought
Divided We Rule: Influencer Polarization on Twitter During Political Crises in India
threats to livelihoods.                                        platform celebrities (those with no other known sources of
    Our analysis builds on past work about Twitter-driven po-      renown other than for their social media presence) or media
litical outreach strategies in India, particularly by the rul-     houses rather than mainstream journalists, entertainers etc.
ing party BJP and its leader, Prime Minister Narendra Modi         (refer to Data for the categories of influencers in this study).
(Pal 2015; Sinha 2017; Neyazi, Kumar, and Semetko 2016).
The four events we study here bear elements of communi-                                  Related Work
cation and outreach both for and against the government’s
position, while three events are related to responses on spe-      In their formative study of political polarization, Conover
cific government initiatives (Article 370, CAA, Farm Bills).       et al. (2011) explain the homophily on social media through
The COVID-19 related discourse, while marked by a few              partisan user behavior in the US. The authors particularly
key events, is an ongoing issue with longer term sustained         find the retweet network of their sample to be highly po-
social media communication.                                        larized, with limited interaction between the two camps. A
                                                                   body of work also suggests that influencers exacerbate po-
                                                                   larization on social media by endorsing and spreading parti-
                                                                   san content. In an experiment with Democrats and Republi-
                                                                   cans, Becker, Porter, and Centola (2019) demonstrated that
                                                                   polarization does not spread in egalitarian networks where
                                                                   all individuals have equal influence throughout the network.
                                                                   They argue that partisan bias is amplified on social media
                                                                   platforms like Twitter because these networks are organised
                                                                   around few key influencers (Centola 2020) and the ability
                                                                   to obtain central positions could allow extremists to gener-
                                                                   ate and sustain belief polarization in social media networks
                                                                   (Becker, Porter, and Centola 2019). Additionally, Garibay
                                                                   et al. (2019) found through their simulations that polariza-
                                                                   tion in social media helps influencers gain more influence.
                                                                      Moreover, studying the elections in the US and Spain,
Figure 1: Influencers’ Timeline of Retweeting BJP (Top)            Barberá and Rivero (2015) find that ideologically motivated
and INC (Bottom) Politicians.                                      Twitter users create disproportionately more content than
                                                                   moderate ones. In India, studies of Modi’s political cam-
   Figure 1 shows us a weekly timeline (January 2019 -             paigning have particularly noted the use of celebrities on
March 2021) of influencers retweeting politicians, with the        Twitter for image building (Pal 2015). By interacting with
peaks corresponding to key events in India. We see that            key influencers, Modi was able to successfully attach his
the four events studied here all see peaks, and that in gen-       brand with that of leading entertainers and sportspersons.
eral, influencer tweets on both sides tend to spike around         Neyazi, Kumar, and Semetko (2016) find Twitter networks
the same events with the exception of a few events, such as        in India to be highly polarizing, especially in BJP’s political
”Howdy Trump” when the ruling party mobilized more in-             campaigning. This polarization also affects traditional com-
fluencers, while influencers on the opposition side mostly ig-     munication platforms due to the increasingly hybrid nature
nored it. Table 1 offers some insight into influencers retweet-    of media systems with digital platforms acting in conjunc-
                                                                   tion with traditional media in politicians’ campaigns (Chad-
                           Retweet Range                           wick 2017).
  Party                                                    Total
          [1,50]   (50,100] (100,150]       (150,max]                 In their study of the Brazillian presidential election that
  BJP      3265      214        99       272 (max-10053)   3850    brought far-right politician Jair Bolsonaro to power, Re-
  INC      3144      185        62       151 (max-54301)   3542
                                                                   cuero, Soares, and Gruzd (2020) find a strong connection
Table 1: Distribution of Influencers Retweeting BJP and            between polarization and disinformation spread on Twitter.
INC Politicians.                                                   This degree of polarization also has parallels in offline de-
                                                                   velopments of the region. Here, the authors observed that the
                                                                   pro-Bolsonaro camp was asymmetrically polarized and note
ing politicians. We see that most individual influencers have      that the escalation of ‘hyperpartisanship’ also led to the pe-
generally engaged with politicians in small measure (1-50          ripheralization of traditional media. In such scenarios, par-
times). The major difference we see is among the influ-            tisan media houses, that defy traditional notions of balance
encers who have engaged with politicians’ content more             and detachment from the contested subject, gain prominence
than 150 times. We see a larger number of influencers en-          (Faris et al. 2017). The degree of polarization also varies on
gaging with BJP – 272, compared with 151 influencers who           the nature of the crises, and politics is especially distinct in
have retweeted INC politicians, though the overall maxi-           its contribution (Barberá and Rivero 2015).
mum engagement is of an influencer retweeting INC’s con-
tent 54k times. This concentration of influencers on the high
end underlines the importance of a loyal set of accounts that                                Method
actively promote a party’s political content. We also find that    We now outline our data collection and polarity score com-
the influencers who most engage with politicians tend to be        putation methodology.
Divided We Rule: Influencer Polarization on Twitter During Political Crises in India
Data                                                                 prominent members of nonprofits, affiliates of religious or
We use a publicly available dataset (Panda et al. 2020) of           quasi-religious organizations who present themselves as ac-
Indian politicians and their twitter handles. The dataset also       tivists, or any who self-identify with the label. The Business
has party annotations for each of the politicians, which we          category includes accounts of commercial establishments,
use to obtain the accounts of 14,094 BJP politicians and             their founders, and key employees. The Entertainment seg-
12,341 INC politicians. We then collect tweets 1 from these          ment refers to people engaged in films, television, music,
26,435 accounts between January 2019 - March 2021.                   and fashion. Fan Accounts have to do with profiles that
   To build the set of influencers we iteratively collect the        distinctly avow to support public figures or organizations,
friends of the filtered politicians. From the resultant list,        and includes unofficial accounts dedicated to actors, politi-
we remove all the politicians, non-Indian global figures             cians and political parties. Journalist refers to individuals
etc., such that we are left with Twitter accounts that are           who work in the reporting and production of news, features
highly followed by Indian politicians. Our process being             and commentary. The Law & Policy category is for lawyers,
built ground up from the friends of politicians means that           judges, and public policy practitioners. Media Houses are
our definition of influencer biases towards those accounts           platforms, both digital and conventional that produce news,
that are of interest to politicians - ie for instance, if our seed   entertainment or other content, and thus this category also
set had started with sportspersons, it is likely we would have       includes newspapers, websites, and TV channels. Like Fan
ended up with several highly followed sports management              Accounts, the Platform Celebrity category captures a digital
companies or commentators in our set, or likewise for other          phenomenon of accounts whose popularity stems from their
domains. In essence, a journalist with 3000 followers, of            online presence and content creation, rather than the offline
which 200 followers are politicians would be included in             work of the individuals involved. The Social Work category
our set, as opposed to say a PR agent with 300k followers,           refers to the non-profit sector and includes accounts of its
but with few politicians following them.                             employees. Sports captures both athletes, as well as the ac-
   The iterative process helps mitigate this bias, and our           counts of teams, tournaments, and governing bodies. Writer
team’s contextual knowledge of India is used to ensure that          is a category for published authors of fiction and non-fiction,
known public figures are included in the sample as far as            it does not include columnists who are employed by news
possible. Thus, we are confident that the vast majority of ’A        platforms.
List’ public figures such as film stars, sportspersons, busi-
nesspersons etc. are included in our sample, but as Table 2          Tweet Classification
shows, we capture a relatively high number of journalists            In order to capture event specific tweets, we use a Word2Vec
and media houses, since their work is of obvious importance          (Mikolov et al. 2013) bag of words based technique to clas-
to politicians. We categorized every influencer account into         sify the collected tweets (Vijayaraghavan, Vosoughi, and
only one category, whichever they are primarily known by,            Roy 2016). We first define a set of high precision keywords
thus a major sportsperson with business interests is nonethe-        that are indicative of the event. We then filter tweets that
less categorized under sports.                                       contain at least one of the keywords, and consequently train
                                                                     a Word2Vec model on the filtered tweets. After obtaining a
                   Category              Count                       vector representation of all the words, we expand each of the
                   Academia              172                         keywords from the seed set, by computing the cosine simi-
                   Activist              81                          larity of pairs of words for all words in the vocabulary. We
                   Business              208                         then iteratively add words to the seed set according to the
                   Entertainment         1251                        cosine similarity based criteria defined in Vijayaraghavan,
                   Fan Account           36                          Vosoughi, and Roy (2016). Some of the keywords identified
                   Journalist            3551                        by the Word2Vec model are showed in Table 3.
                   Law & Policy          141
                   Media House           550                                                Keywords & Hashtags
                   Platform Celebrity    126                          Article 370           #jammuandkashmir, #article370, 35a, abro-
                   Social Work           63                                                 gation, kashmiri
                   Sports                348                          CAA/NRC               #caanrc,        #citizenshipamendmentact,
                                                                                            #caanrcnpr, #anticaa, #caanrcprotest
                   Writer                99
                                                                      COVID-19              covid2019, #covidpandemic, #coronapan-
                  Total                  6626                                               demic, #coronacrisis
                                                                      Farmers’ Protest      #farmersprotests, #farmersagitation, #del-
               Table 2: Influencers by Category                                             hichalo, #farmersdelhiprotest

                                                                           Table 3: Top 5 Keywords/Hashtags by Event.
   The final set of influencers is divided into a 12 cate-
gories. Academia refers to accounts that largely work with
higher education institutes and conduct research. The Ac-               In order to obtain a rough estimate of the performance of
tivist category includes grassroots organizers, leaders or           our classification pipeline, we randomly select a represen-
                                                                     tative sample (n = 100) and manually annotate whether the
   1
       Using Tweepy - https://www.tweepy.org/                        tweet is related to the particular event or not. We report pre-
Divided We Rule: Influencer Polarization on Twitter During Political Crises in India
cision values of 87.0%, 98.0%, 100% and 97% for Article             Content Polarity We adopt the cultural axis generation
370, CAA/NRC, COVID-19 and Farmer’s Protests respec-                procedure described by Waller and Anderson (2020). We
tively. Overall, we obtain a combined precision of 95.6%.           compute a partisan axis in the high dimensional user embed-
While these precision values are noisy estimates of the true        ding space, by taking the difference of vector embeddings of
precision values, it provides a sanity check for the classifi-      the top n (=10) INC and BJP politicians, sorted by retweet
cation methodology.                                                 polarity scores. Briefly, let x̃ and ỹ be the high dimension
                                                                    embedding vectors representing INC and BJP politicians re-
User Embeddings                                                     spectively. For any user u, with a high dimension embedding
                                                                    vector ũ, the content polarity score c(u) is defined as the co-
Given a set of accounts and their tweets for an event, we
                                                                    sine similarity of the user with the partisan axis:
obtain user embeddings based on aggregate tweet embed-
dings (Rashed et al. 2020). We use Google’s Universal Sen-                                 ũ · (x̃ − ỹ)
tence Encoder (USE) (Cer et al. 2018) to obtain high di-                         c(u) =                      ∈ (−1, 1)          (2)
                                                                                          kũk kx̃ − ỹk
mensional vector representations of the tweets by a user for
a given event, and average out all the tweet embeddings to          where a score close to 1 indicates a user’s stance on the par-
represent a single user. The user embeddings are projected          ticular issue is highly pro-INC, whereas a score close to -1
to a two dimensional space using UMAP (McInnes, Healy,              signifies that the user’s stance on the issue is highly pro-BJP.
and Melville 2020) and clustered using hierarchical density
based clustering (HDB-SCAN) (McInnes and Healy 2017).                                      Experiments
According to Rashed et al. (2020), this method results in           We capture event specific tweets based on the classifica-
clusters indicative of users’ stance on the specific event.         tion pipeline discussed previously. The number of tweets for

User Polarity                                                                       Topic           Users      Tweets
                                                                                  Article 370       14,050     346,197
We then use the clusters to compute polarity scores of users.                     CAA/NRC           13,744     527,702
Let V be the set of all users who have tweeted about a partic-                    COVID-19          20,871    3,069,851
ular event. We partition V into two groups, X and Y based                       Farmers’ Protest    12,224     269,742
on the clusters returned by HDB-SCAN. We partition V in
a way such that partitions X and Y predominantly consist                        Table 4: Users and Tweets by Event
of pro-INC users and pro-BJP users respectively.
                                                                    each event, including the number of users who have tweeted
Retweet Polarity We construct an undirected retweet
                                                                    about that particular event are enumerated in Table 4.
graph G with users in V as vertices. An edge eij exists be-
tween users vi and vj ∈ V, if vi and vj have at least two           Stance Detection With Polarized Clusters
retweets between them, i.e. if either vi has retweeted vj at
least twice, or vi has retweeted vj at least once and vice          To compute the stance of user for a given event, we use the
versa or vj has retweeted vi at least twice, ∀i, j = 1...|V|.       user’s tweets related to that event to obtain user embeddings.
We choose 2 retweets as our threshold based on the experi-          We do this for all the users who have tweeted about the
ments by Garimella et al. (2018) to avoid unreliable results.       event, and then cluster their embeddings using HDB-SCAN.
We then use the Random Walk Controversy (RWC) score                 As shown in Figure 2, the users are clustered according to
(Garimella et al. 2018) to define a retweet based user po-          their stance. We see two clusters across all the events, one
larity score.                                                       for each of the stances (pro-INC and pro-BJP). We see that
   Let X∗ and Y∗ be the set of top k (=100) high degree             INC and BJP politicians lie in separate clusters across all the
vertices, i.e. the top k users with the most number of retweet      events, which reaffirms the polarizing nature of the events.
edges, from partitions X and Y respectively. Let luX as de-         Moreover, we observe that majority of the influencers (INF)
fined in Garimella et al. (2018) be the expected hitting time       also lie in either of the two clusters, confirming our hypoth-
for a vertex u ∈ V, i.e. the expected number of steps for           esis that the influencers engage with these issues in a par-
a random walk starting with vertex u and ending in a high           tisan manner. We observe that for all of the events except
degree vertex in X∗ . Therefore p(u)X is defined as the frac-       COVID-19, majority of the influencers are in the pro-INC
tion of vertices v ∈ V for which the expected hitting time is       cluster, whereas for COVID-19 majority of the influencers
more than that of vertex u, i.e. luX < lvX . Therefore a score of   are in the pro-BJP cluster.
r(u)X ≈ 1 indicates that the vertex u is close to a high de-           To understand this pattern further, we manually studied
gree vertex in X∗ . Subsequently, r(u)Y is defined similarly,       the individual tweets. The overall clustering of influencers
leading to retweet polarity score r(u) being defined as:            towards the INC is explained by the engagement of jour-
                                                                    nalistic content by the opposition party, and the reliance of
            r(u) = r(u)X − r(u)Y         ∈ (−1, 1)           (1)    the ruling party on relatively more influential, but less nu-
                                                                    merous platform celebrities, or mainstream celebrities. Thus
where a score close to 1 indicates that a user is more likely       while the overall number of influencers seem to cluster much
to be retweeted by another user with a pro-INC stance, while        higher on the INC side, this finding needs to be seen along-
a score close to -1 indicates that the user is more likely to be    side the retweet polarity scores (Figure 8) which help un-
retweeted by a user with a pro-BJP stance.                          derstand the size and scope of influencer engagement. We
observed also that the relatively larger clustering on the INC      Quantifying Polarization Using Clusters
side is a result its politicians’ tendency to engage with influ-    We further use the clusters to quantify two aspects of polar-
encers on account of their criticism of the BJP – this does not     ization – retweet polarity which captures which “side” of the
mean those individuals are politically aligned with the INC,        issue is more likely to endorses the influencer, and content
simply that they are critical of the ruling party. The BJP’s        polarity which signifies the extent to which the influencers’
engagements and the resultant clusters, on the other hand,          stance resembles that of either parties on a particular issue.
are closest to accounts who advocate for the party.                    To compute retweet polarity scores, we first construct the
   These patterns speak to the larger state of politics in India,   retweet graph for all the events in Figure 4. Each node repre-
where political opposition on a national level is weak, and         sents users who are either politicians or influencers and the
civil society members and journalists, rather than the INC,         edges represent retweets between them. The graph is color
have emerged as popular critics of the government. Our sec-         partitioned as INC in blue, BJP in orange and influencers in
tion on categorical analysis further explains the distinctive-      pink. The graph is visualised using the Force Atlas 2 layout
ness of the COVID-19 clustering.                                    in Gephi (Bastian, Heymann, and Jacomy 2009), where we
                                                                    set gravity rules for node attraction using their edge strength
                                                                    to further clarify the clusters. The size of the nodes indi-
                                                                    cates their degree centrality. Overall, similar to the clusters
                                                                    above, we observe that for all events with the exception of
                                                                    COVID-19, most influencers engage with INC and a small
                                                                    fraction of them engage with BJP. Whereas, in the case of
                                                                    COVID-19, maximum engagement of influencers is with the
                                                                    incumbent party, BJP. While top leaders from either parties
                                                                    including Members of Parliament and Members of Legisla-
         (a) Article 370                 (b) CAA/NRC                tive Assemblies have high degree centrality in all events, IT
                                                                    cell presidents wield significant influence as well. Amongst
                                                                    the influencers, we see journalists, fan accounts and media
                                                                    houses comprising some of the larger nodes.
                                                                       Based on the clusters and retweet graphs from above, we
                                                                    calculate retweet and content polarity scores for all the influ-
                                                                    encers for a particular event. The distributions of the polarity
                                                                    scores in Figure 5 show that a number of influencers highly
                                                                    side with the narratives of a certain party for each topic. For
                                                                    COVID19 the median scores for influencers is closer to the
         (c) COVID-19                 (d) Farmers’ Protests         median of BJP politicians, while for Article 370, CAA/NRC
                                                                    and Farmers’ Protest it is closer to the median of INC politi-
                Figure 2: Clusters by Event.                        cians.

   Figure 3 shows the wordclouds, based on prominence                            Analysis of Polarity Scores
scores (Rashed et al. 2020), for each of the stances and helps
underscore the differences. On Article 370 and CAA/NRC,             In this section we use the computed polarity scores to gain
the INC side focuses on the act being “undemocratic” while          insights into the partisan engagement of influencers across
the pro-BJP tweets celebrate them as “historic”, having cit-        all four events. We first explore whether the polarity scores
izen support, and highlighting Hindu or refugee-related is-         display meaningful patterns with respect to the number of
sues. In essence, we see here shades of populism, where             followers and median retweet rates for an influencer. We
one side attempts to speak of the issue in legalistic terms,        then show how the retweet and content polarity scores vary
while another focuses on the partisan appeal of the act. On         according to the area of expertise of the influencers across
COVID-19, a less divisive issue (at the time), INC sympa-           all the events.
thizing influencers engage on “unemployment” and “fail-
ure”, while on the side of the ruling party, the engagements        Aggregate Trends
are more utilitarian, focusing on the “fight” against the Coro-     Here we use the absolute values of the polarity scores be-
navirus. With the Farm Bills, we see a different form of pop-       cause we are interested in how the magnitude of the scores
ulist engagement, in which the INC side uses terms such as          vary with attributes like number of followers etc. We assign
“draconian”, and “anti-farmer” while the BJP side highlights        an aggregate polarity score to an influencer by taking the
farmers as heroic using brand terms like “aatmanirbhar”             median of the absolute polarity scores across all events.
(self-reliant), “hardworking” etc. Unlike with CAA/NRC                 By Followers Count To analyse the polarity scores by
and 370, where there is a clear nativist appeal in creating         the number of followers, we categorize the influencers by
an “other” in those that oppose the act, the discourse with         their number of followers. We categorize an influencer as
the farm bill is a bit different, since the ruling party would      Very Low if the number of followers lies in the first quar-
rather not demonize farmers, and must toe a careful line in         tile of the followers count distribution across all influencers,
countering opposition online.                                       Low if it lies in the second quartile, Medium if it lies in the
(a) Article 370                                                 (b) CAA/NRC

                          (c) COVID-19                                                 (d) Farmers’ Protests

               Figure 3: WordClouds of BJP (Right, in Orange) and INC (Left, in Green) Clusters by Event

third quartile and High in the fourth quartile. We then plot          We also plot the median of content polarity scores for each
the median retweet polarity scores for each of the categories.     category, where we observe that Very Low and Low influ-
Overall, we see that the retweet polarity score increases with     encers have the highest content polarity. A Welsch’s t-test
the number of followers. However, we see a sharp decrease          between Low and Very Low gives a p-value > 0.1 indicating
in the polarity scores between Medium influencers and High         that the two categories are not statistically different, whereas
influencers. A Welsch’s t-test between the scores of these         t-tests between the scores of Low and Medium and Low and
two categories results in a p-value of 0.0684 which is a sta-      High give p-values of < 0.0001 indicating that the scores for
tistically significant difference with a threshold of 0.1. The     Very Low and Low are statistically different from Medium
p-values for scores between Very Low, Medium and Low,              and High.
Medium are 0.0000 and 0.0081 respectively indicating that             Overall we observe that there is a twilight zone for in-
the distribution of retweet polarity scores for Medium influ-      fluencer polarization with respect to number of followers,
encers is statistically different from all the other categories.   wherein influencers with a certain range of followers are
                                                                   significantly more polarizing than other groups. In terms of
                                                                   retweet polarity, which quantifies how likely an influencer’s
                                                                   tweet is to be retweeted by either side, influencers with
                                                                   Medium followers have the highest polarity scores. Whereas
                                                                   in terms of content polarity, which signifies how extreme an
                                                                   influencer’s stance is, those with Very Low and Low follow-
                                                                   ers seem to have the highest scores.
                                                                      Moreover, we find that from 20 of the influencers with
                                                                   the highest number of followers - mostly entertainers and
                                                                   sportspersons - only two were retweeted by the INC. This
        (a) Article 370                     (b) CAA/NRC            confirms suggestions in past work that the ruling party has
                                                                   a much stronger grip on the social media output of A-list
                                                                   celebrities (Lalani, Kommiya Mothilal, and Pal 2019).
                                                                      By Retweet Count We fit a regression model with retweet
                                                                   polarity and content polarity as the independent variables
                                                                   and the logarithm of median retweet count as the dependent
                                                                   variable. Regression results are displayed in Table 5. Over-
                                                                   all, we observe that the median retweet rate is higher for
                                                                   influencers with higher retweet polarity scores, potentially
                                                                   implying that polarized influencers are retweeted more in
         (c) COVID-19                (d) Farmers’ Protests         comparison to less polarizing ones. To explore this further,
                                                                   we filter out influencers whose retweet polarity scores lie in
           Figure 4: Retweet Graph by Events.                      the fourth quartile. We then calculate the median retweet rate
(a)                              (b)                          (c)                               (d)

               (e)                              (f)                          (g)                               (h)

Figure 5: Distribution of Polarity Scores by Event. Subfigures (a) - (d) depict retweet polarity scores and (e) - (h) depict
content polarity scores for Article 370, CAA/NRC, COVID-19 and Farmers’ Protests respectively

                                                                  higher retweets as compared to other tweets.

                                                                                           log rt count      R2      pvalue
                                                                        rt polarity       2.1450 (0.157)   0.196     0.1

                                                                  Table 5: Regression Analysis. Log of Median Retweet
                                                                  Count (log rt count) is the dependent variable. The regres-
                                                                  sion coefficient along with the standard error, R2 values and
                                                                  p-values are displayed. Statistically significant results are
                                                                  highlighted in bold.
Figure 6: Average Retweet Polarity Scores By Followers
Category. We observe that influencers with Medium number
of followers have the highest median retweet polarity score.      Categorical Analysis
                                                                  Retweet Polarity For each of the four events, we show in
                                                                  Figure 8 how the retweet polarity scores vary with the area
                                                                  of expertise of influencers. A close examination of the top
                                                                  20 polarized accounts towards either side in each category
                                                                  reveals that, barring COVID-19, the INC side overwhelm-
                                                                  ingly retweets journalists on the three political issues. For
                                                                  instance, in the activity after the abrogation of 370, 18 of the
                                                                  top 20 accounts with polarity towards the party are journal-
                                                                  ists. The BJP’s most proximate accounts have varied profiles
                                                                  in comparison, including platform celebrities, businesses,
                                                                  fan accounts, sports, and journalists. Unlike other issues in
                                                                  which voices on the ground of journalists, academics etc
Figure 7: Average Content Polarity Scores By Followers            are key to the political discourse, on the 370 issue, there
Category. Influencers with Very Low and Low number of             is no representation of Muslim Kashmiris from the Valley
followers have the highest content polarity and those with        on-ground among top accounts on either side. From the top
High number of followers have the lowest content polarity.        pro-INC accounts, only one is from Kashmir. Among the
                                                                  pro-BJP ones, the discussions forwarded include three Kash-
                                                                  miri Pandits, a Hindu community whose exodus from the
for their polarizing tweets and median retweet rate of the rest   valley in the early 1990s has been cited as a justification
of their tweets. We observe that 84% of the influencers have      for the need to ‘integrate’ the region within the nation-state.
a higher median retweet rate for tweets related to polarizing     The absence of on-ground accounts can partly be attributed
events as compared to their other tweets, potentially imply-      to the shutdown of internet services and jailing of Kashmiri
ing that polarized tweets are more likely to be rewarded with     politicians, but nonetheless points to the lack of representa-
tive voices in the discourse.                                      low scores, the top accounts in terms of content polarity, are
   Except journalists and activists, other categories of influ-    journalists and academics critical of the move. That the high
encers are less likely to take stances critical of the govern-     score on the government’s side is driven by fan accounts and
ment. We speculate that certain professions are more hesi-         platform celebrities calls to question the role of hyperparti-
tant to be critical and are thus disproportionately retweeted      san voices, who may not have proven credentials of on the
by the BJP side. Here, de-politicized fields such as sports,       ground knowledge of the topic.
businesses and entertainment have also shown tilts towards            CAA/NRC protests score high on content polarity in com-
the ruling government. The entertainment category polar-           parison. Of the top 20 polarizing accounts, an overwhelm-
izes towards INC only during the CAA protests, which is            ing proportion has to do with journalism, including reporters
in step with the on-ground developments in the movement            and media houses, largely those critical of the government.
when film industry figures spoke out against the Modi gov-         We see the presence of Indian public service broadcasters
ernment’s for the first time in significant numbers.               (e.g @DDNational, @DD Bharati, @PBNS India), which
   The lower ratio of journalists in the BJP’s top polarity        in principle, are meant remain autonomous and neutral. Con-
scorers is an interesting trend that has connections to the na-    tent from these accounts have relayed the official line and are
ture of Modi’s political campaign. As Chakravartty and Roy         thus engaged significantly by BJP politicians.
(2015) note, in Modi’s populist rhetoric, liberal media has           COVID-19 has had the most mixed reaction between the
remained as a prominent antagonistic signifier, ever since         categories in terms of the content, possibly because on the
the press covered his role in the 2002 Gujarat riots where         whole it is a less polarizing event. The relatively polarized
Muslims died in disproportionate numbers. This aversion            accounts here are often explicitly aligned with one of an-
also translates to the lack of press conferences and inter-        other party.
views given by the leader, and as our study shows, perme-             On the farmers’ protests, the top 20 accounts are di-
ates to the ruling party’s social media strategy after coming      vided between supporters and opponents of the government
to power. Of the top media-related accounts the party cen-         with a cross-category representation. Here too, government-
tres, many are linked to partisan media houses, especially         aligned accounts, in the form of public broadcasters and in-
online portals that have gained prominence as vocal defen-         dustrial consortia are prominent. Somewhat exceptionally,
dants of right-wing ideology. Our finding is relevant with the     businesses, who protesters see as the real beneficiaries of the
Recuero, Soares, and Gruzd (2020) study that observed the          policy, have a high polarity score on this issue. Our analysis
distancing of hyperpartisan Twitter clusters in Brazil from        of tweets from the highest-scoring accounts in this category
mainstream media. With INC, we observe that the top ac-            shows how business houses, while avoiding explicit politi-
counts are not as much supportive of the party as they are         cal keywords, have used a mediated, corporate vocabulary to
critical of the ruling government.                                 convey the benefits of the farm bills, which intersects with
   The dissimilar polarity scores on the COVID-19 crisis           a lot of the official language on the issue. Sports-persons
also indicate a unique set of responses it set off among Twit-     have also only had higher polarity scores on this topic. En-
ter influencers. We attribute the shift in polarity scores to-     tertainers have scored mildly on all issues except for the
wards the BJP, to the change in the type of the crisis, as pan-    CAA. Note that we use the absolute values of content polar-
demic foremost raised concerns about public health and pre-        ity scores for this categorical analysis, as the median signed
paredness, which is a bipartisan issue. Here it is likely, that    scores for each category did not yield interpretable results in
rather than being critical actors that invite accusations of be-   the experiments.
ing dispiriting, influencers were engaged in raising morale as
part of government’s campaigning and disseminating front-                                   Discussion
line information and resources. We also see journalists and        In this work, we present an important, non-western case of
media houses disproportionately amplified by the BJP, indi-        partisan engagement in a polarized political environment.
cating a trend of relative cooperation with the government in      While our findings are immediately of relevance in under-
the face of a global health emergency.                             standing the political economy of social media in India, they
   Content Polarity A category-wide content polarity anal-         are likewise relevant to approaching the role of influencers
ysis in Figure 9 for the four events produces distinct results.    of various kinds in political engagement.
Of the four issues, we see that Article 370 draws some of             Our findings show that the differences in clustering
the most uniform scores between categories, that are lower         around topics are themselves insightful in understanding the
on content polarity. Kashmir as a territory is considered          larger patterns in the discourse. Focusing our examination
central to India’s narratives of national security and territo-    around key events, helps understand the periods of elevated
rial claims. Historically, conversations about the region are      activity, i.e. those in which political parties may resort to en-
sensitive and regulated, attracting legal prosecution (Bose        gaging the various outreach resources at their behest, includ-
2009). This context seems relevant in understanding these          ing celebrities. In this work, we have shown that the ruling
scores. The overall lower scores can imply multiple strands        party has unique advantages over the opposition in some as-
at play. While political voices opposed to the national gov-       pects, but that the opposition gains tailwind from the work
ernment’s move were censored due to the communications             of journalists and commentators, who on the whole, tend to
shutdown, or deliberately ignored by the mainstream par-           dissent with the government. We also find that there is sig-
ties as taboo, what we see here also raises questions about        nificant difference between the four events, and the kinds of
a chilling effect of self-censorship. Within these relatively      influencers that can engage in each of those.
(a) Article 370                                              (b) CAA/NRC

                       (c) COVID-19                                              (d) Farmers’ Protests

Figure 8: Median Retweet Polarity Scores by Influencer Category. Length of stem indicates retweet polarity score and
bubble size indicates median retweet count.

                       (a) Article 370                                              (b) CAA/NRC

                       (c) COVID-19                                              (d) Farmers’ Protests

Figure 9: Median Content Polarity Scores by Influencer Category.Length of stem indicates absolute content polarity score
and bubble size indicates median retweet count.
These findings lay the ground for future work. There is a      the 2019 on Computer Supported Cooperative Work and So-
need to study various kinds of influencers, their relationship   cial Computing, 267–271.
with the government, and what that does to their online be-      Lewis, R. 2020. “This is what the news Won’t show you”:
havior. While we studied the intersection of influencer and      YouTube creators and the reactionary politics of micro-
politician behavior during crises, it is equally important to    celebrity. Television & New Media 21(2): 201–217.
extend such work into its typical daily patterns.
                                                                 McInnes, L.; and Healy, J. 2017. Accelerated hierarchi-
                       References                                cal density based clustering. In 2017 IEEE International
                                                                 Conference on Data Mining Workshops (ICDMW), 33–42.
Barberá, P.; and Rivero, G. 2015. Understanding the politi-     IEEE.
cal representativeness of Twitter users. Social Science Com-
puter Review 33(6): 712–729.                                     McInnes, L.; Healy, J.; and Melville, J. 2020. UMAP: Uni-
                                                                 form Manifold Approximation and Projection for Dimen-
Bastian, M.; Heymann, S.; and Jacomy, M. 2009. Gephi: An         sion Reduction.
Open Source Software for Exploring and Manipulating Net-
works. URL http://www.aaai.org/ocs/index.php/ICWSM/              Menon, N. 2020. Hindu Rashtra and Bollywood: A New
09/paper/view/154.                                               Front in the Battle for Cultural Hegemony. South Asia Mul-
                                                                 tidisciplinary Academic Journal (24/25).
Becker, J.; Porter, E.; and Centola, D. 2019. The wisdom
of partisan crowds. Proceedings of the National Academy of       Mikolov, T.; Chen, K.; Corrado, G.; and Dean, J. 2013. Ef-
Sciences 116(22): 10717–10722.                                   ficient estimation of word representations in vector space.
                                                                 arXiv preprint arXiv:1301.3781 .
Bose, S. 2009. Kashmir: Roots of conflict, paths to peace.
Harvard University Press.                                        Neyazi, T. A.; Kumar, A.; and Semetko, H. A. 2016. Cam-
                                                                 paigns, digital media, and mobilization in India. The Inter-
Centola, D. 2020.           Why Social Media Makes               national Journal of Press/Politics 21(3): 398–416.
Us More Polarized and How to Fix It.                 URL
https://www.scientificamerican.com/article/why-social-           Pal, J. 2015. Banalities turned viral: Narendra Modi and the
media-makes-us-more-polarized-and-how-to-fix-it/.                political tweet. Television & New Media 16(4): 378–387.
Cer, D.; Yang, Y.; Kong, S.-y.; Hua, N.; Limtiaco, N.; John,     Panda, A.; Gonawela, A.; Acharyya, S.; Mishra, D.; Moha-
R. S.; Constant, N.; Guajardo-Céspedes, M.; Yuan, S.; Tar,      patra, M.; Chandrasekaran, R.; and Pal, J. 2020. NivaDuck -
C.; et al. 2018. Universal sentence encoder. arXiv preprint      A Scalable Pipeline to Build a Database of Political Twitter
arXiv:1803.11175 .                                               Handles for India and the United States. In International
                                                                 Conference on Social Media and Society, SMSociety’20,
Chadwick, A. 2017. The hybrid media system: Politics and         200–209. New York, NY, USA: Association for Computing
power. Oxford University Press.                                  Machinery. ISBN 9781450376884. doi:10.1145/3400806.
Chakravartty, P.; and Roy, S. 2015. Mr. Modi goes to Delhi:      3400830. URL https://doi.org/10.1145/3400806.3400830.
Mediated populism and the 2014 Indian elections. Television      Rashed, A.; Kutlu, M.; Darwish, K.; Elsayed, T.; and
& New Media 16(4): 311–322.                                      Bayrak, C. 2020. Embeddings-Based Clustering for Target
Conover, M.; Ratkiewicz, J.; Francisco, M.; Gonçalves, B.;      Specific Stances: The Case of a Polarized Turkey. arXiv
Menczer, F.; and Flammini, A. 2011. Political polarization       preprint arXiv:2005.09649 .
on twitter. In Proceedings of the International AAAI Con-        Recuero, R.; Soares, F. B.; and Gruzd, A. 2020. Hyperparti-
ference on Web and Social Media, volume 5.                       sanship, disinformation and political conversations on Twit-
Demaine, L. J. 2009. Navigating policy by the stars: The         ter: The Brazilian presidential election of 2018. In Proceed-
influence of celebrity entertainers on federal lawmaking. JL     ings of the international AAAI conference on Web and social
& Pol. 25: 83.                                                   media, volume 14, 569–578.
Faris, R.; Roberts, H.; Etling, B.; Bourassa, N.; Zuckerman,     Singh, R. 2020. The Politics Of Offense: The Phenomena Of
E.; and Benkler, Y. 2017. Partisanship, propaganda, and          Censorship In The Contemporary Discourse Of Literature
disinformation: Online media and the 2016 US presidential        And Media. Literary Herald 6(3): 145–153.
election. Berkman Klein Center Research Publication 6.           Sinha, S. 2017. Fragile hegemony: Modi, social media and
Garibay, I.; Mantzaris, A. V.; Rajabi, A.; and Taylor, C. E.     competitive electoral populism in India. International Jour-
2019. Polarization in social media assists influencers to be-    nal of Communication 11(2017): 4158–4180.
come more influential: analysis and two inoculation strate-      Vijayaraghavan, P.; Vosoughi, S.; and Roy, D. 2016. Au-
gies. Scientific reports 9(1): 1–9.                              tomatic detection and categorization of election-related
Garimella, K.; Morales, G. D. F.; Gionis, A.; and Math-          tweets. In Proceedings of the International AAAI Confer-
ioudakis, M. 2018. Quantifying controversy on social me-         ence on Web and Social Media, volume 10.
dia. ACM Transactions on Social Computing 1: 1–27.               Waller, I.; and Anderson, A. 2020. Community embeddings
Lalani, F. M.; Kommiya Mothilal, R.; and Pal, J. 2019. The       reveal large-scale cultural organization of online platforms.
Appeal of Influencers to the Social Media Outreach of In-        arXiv preprint arXiv:2010.00590 .
dian Politicians. In Conference Companion Publication of
You can also read