Predicting the 1998 Indian parliamentary election - Amazon ...

Page created by Randall Willis
 
CONTINUE READING
Electoral Studies 21 (2002) 69–89
                                                                          www.elsevier.com/locate/electstud

        Predicting the 1998 Indian parliamentary
                         election
  Rajeeva L. Karandikar a, Clive Payne                              b,*
                                                                          , Yogendra Yadav             c

                                  a
                                    Indian Statistical Institute, Delhi, India
                           b
                            Nuffield College, New Road, Oxford, OX1 1NF, UK
                      c
                          Centre for the Study of Developing Societies, Delhi, India

Abstract

   An account is given of the methods used to predict the results of the 1998 Indian parliamen-
tary election by a team from the Centre for the Study of Developing Societies, Delhi. The
special features of the Indian party and electoral system and the election-night context as they
relate to the problem of election forecasting are discussed. The methods developed for the
polling of the Indian electorate and for forecasting the composition of the parliament from the
sequence of results declared are presented.  2001 Elsevier Science Ltd. All rights reserved.

Keywords: Indian general election; Prediction; Opinion polls

1. Introduction

   Forecasting the composition of parliament from opinion polls and on election night
when results are declared, are integral parts of the media coverage of elections in
all the major democracies. In the last fifteen years, the Indian media has also taken
to serious election forecasting. But the Indian electoral and party system and its
‘election-night’ context has many particular features which present the forecaster
with both special problems and opportunities. We first give a brief description of
the Indian context and review the particular problems of election forecasting in the
Indian context. We then describe the forecasting methods developed by a team from
the Centre for the Study of Developing Societies, Delhi (CSDS — two of the authors
were members and the third author was a consultant), for the opinion poll conducted

  * Corresponding author. Tel.: +44-1865-278713; fax: +44-1865-278725.
    E-mail address: clive.payne@nuf.ox.ac.uk (C. Payne).

0261-3794/01/$ - see front matter  2001 Elsevier Science Ltd. All rights reserved.
PII: S 0 2 6 1 - 3 7 9 4 ( 0 0 ) 0 0 0 4 2 - 1
70                  R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89

for India Today — a leading news weekly in India — and for the live television
coverage of the results of the Indian general election in 1998 on Doordarshan, India’s
public terrestrial channel. Special attention is given here to comparing the Indian
forecasting problems with those encountered in other plurality systems, particularly
those of the UK. We conclude with a description of a new method developed for
‘election-night’ forecasting which performed well in the live TV programme.

2. Election forecasting in plurality systems

   The main aim of election forecasting is to predict the number of seats won by
each party in the parliament; a subsidiary aim is to also predict the vote shares for
the parties. Election forecasting has three main stages, each of which uses different
types of data source and presents different methodological problems:
   (1) Prediction before the election using voter-intention polls — ‘poll projection’.
Here the task is to convert predicted vote shares from a national opinion poll into
predicted seats in the legislature. This can be a challenging problem in some plurality
systems. A standard approach is to estimate changes in vote share for each party
from the poll and apply these uniformly in constituencies to predict the winning
party. In Section 4 we discuss opinion polling in India while Section 5 sets out the
method developed for converting vote shares into seats which is specially designed
to cope with particular features of the Indian elections, especially the complex and
changing party system within a federal structure. Section 6.1 discusses the perform-
ance of the CSDS–India Today projections based on a pre-election poll.
   (2) Prediction immediately after the polling stations have closed — a ‘prior fore-
cast’. A frequently used approach is to conduct an exit poll of voters as they leave
polling stations to estimate changes in vote shares. Often the exit polling data is
supplemented with data obtained from opinion polls conducted as close as possible
to polling day. Another possibility is a ‘post-poll’ where a poll is conducted on the
day or the days after the election but before results are declared, asking voters how
they had actually voted. Such polls are feasible in India where there is a gap between
voting and the declaration of results with the staggered elections that take place over
a period of two weeks. Section 6.2 describes the conduct and performance of the
post poll conducted by CSDS for the 1998 election results TV programme.
   (3) Predictions made during the sequence of election results as they are declared —
the ‘results-based forecast’. This may involve fitting statistical models for the
changes in party shares using the subset of results declared and then using the coef-
ficients so estimated to produce predicted changes in vote share, and hence winning
parties, in the undeclared constituencies. An alternative and simpler approach is to
define types of constituency and use declared results for each type to estimate the
mean changes in vote share which are then applied to undeclared seats of the relevant
type to predict the winning party. This result-based prediction may also include prior
information obtained from exit and opinion polls. The special issue of the New Zea-
land Statistician (Morton, 1990), gives descriptions of election-night forecasting in
other countries while Brown et al. (1999) deal with election-night forecasting in the
R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89      71

1997 UK general election in particular. Section 5.4 describes the method developed
for the Indian ‘election-night’ context and Section 6.3 presents its performance dur-
ing the results sequence. The method and data sources used for these tasks are
determined by features of the electoral and party system to which we now turn.

3. The Indian context

   The Indian Parliament consists of two houses — the Upper House or the Rajya
Sabha (council of states) and the Lower House or Lok Sabha (house of the people).
It follows the standard conventions of a parliamentary system where the party or a
coalition enjoying the support of a majority of members in the lower house, the Lok
Sabha, forms the government and its leader holds the office of Prime Minister and
forms the Cabinet, which runs the national government. The term of the Lok Sabha
is five years but it can be dissolved earlier on the advice of the Prime Minister. This
happened in 1997, when the Lok Sabha was dissolved after only 18 months, thus
paving the way for the mid-term elections in 1998. The following features of the
parliamentary elections have a direct bearing on the challenge of forecasting.

3.1. The electorate

   In the 1998 elections the total electorate was a little above 600 million, the largest
in any democratic election in the world. The range of social, ethnic, cultural,
religious, linguistic and regional diversity that characterises this electorate is
unmatched within any single nation-state in modern times. The Indian Constitution
recognises 15 languages as official languages, while the last Census had counted
more than 700 dialects. The last grand anthropological exercise of enumerating the
social communities in terms of caste/tribe produced a count of more than 4000
(Singh, 1992).
   To make matters worse for the pollsters, the social cleavages relevant to the elec-
toral politics do not follow any standard pattern all over the country. More often
than not the politically salient division is that of caste or a cluster of castes brought
together by local context. But in many cases the religious, class or regional-linguistic
cleavage cuts across and over-rides the caste cleavage. These patterns vary so much
across the different states of the Indian union that one cannot apply any nation-wide
social classification for purposes of sampling, weighting for differential non-response
and projections.
   These features present survey designers with the challenge of choosing a sample
which does justice to such a complex pattern of diversity. The sampling problem is
actually more difficult than it appears, for reliable information about most of these
social attributes is either not available at all or is not available at the level of the
sampling units. The regular decennial Census of India does not record caste except
in the case of Scheduled Castes and Tribes. While the religion of the ‘head of the
household’ is recorded, the religious composition of the population is not disclosed
at the level of a locality, let alone for households. Carrying out an opinion poll or
72                  R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89

any other survey itself is a gigantic task. An average constituency covers more than
6000 km2 (compared to about 350 in Britain). It is not unusual for a field researcher
to take a full day to travel from one location to another within the same parliamentary
constituency. The vast geographical spread and poor transport communication makes
a door-to-door survey very time consuming and expensive. However, the households
with telephones represent a very small proportion (about 3–4%) of the population,
and are mostly confined to the urban elite. Therefore a survey based on randomly
generated telephone numbers is completely ruled out. Besides, a large number of
the electors are still illiterate or near illiterate, thus ruling out a certain range of
questions and some techniques of interviewing.

3.2. The FPTP electoral system

   India follows the first-past-the-post or plurality system of elections which makes
seats prediction more difficult than in any system based on proportional represen-
tation. The Lok Sabha has 543 elected members each representing a constituency
(or ‘seat’) where they have to obtain a plurality of the valid votes cast. The consti-
tution provides for a regular redrawing of the constituency boundaries after every
decade to maintain constituencies with similar sizes of electorate. However, a consti-
tutional amendment resulted in freezing the delimitation from 1990 until 2001. The
migration of population to big cities and towns and the differential rates of population
growth in this period have resulted by now in a considerable variation in the size
of constituencies (McMillan, 2000; Sivaramakrishnan, 1997). In 1998, while the
average number of electors per constituency was 1.12 million, there were 12 constitu-
encies with less than 0.5 million electors and 21 with more than 1.5 million.

3.3. The state as the effective political arena

   One of the most striking developments of what has been described as India’s ‘third
electoral system’ in the last decade has been the emergence of the state as the effec-
tive unit of political choice and electoral competition (Yadav, 1999). While India
has always been a federal polity — a ‘union of states’ as the constitution describes
it — it is only recently that the states have become such a decisive political unit
rendering the national election into no more than a simultaneous election held in all
the states. The country is divided into 25 States and 6 ‘Union Territories’ directly
administered by the central government. There is a huge variation in the size of the
states. Some of the states are very big (like Uttar Pradesh, which accounts for 16%
of seats in the Lok Sabha) while others have only one member in the Parliament.
   Each state has a legislative Assembly, whose members are elected directly, also
by the plurality system. The assembly constituency (or ‘segment’) is a basic unit in
the Indian electoral system and several (usually between 5 and 8) assembly segments
together make one Lok Sabha constituency. Initially, in 1952 and 1957, the elections
to all the state legislative assemblies and the Lok Sabha were held concurrently.
Since then most state assembly elections have been held at a time different from the
Lok Sabha elections. Unlike the UK, this hierarchical structure of state assembly
R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89       73

segments contained within parliamentary constituencies, means that assembly-level
voting data can be used to provide good estimates of recent voting behaviour at the
parliamentary level and, in particular, can provide a useful basis for sample selection
for polling exercises.

3.4. The counting process

   There are three major differences between the counting process followed in India
and the typical ‘election night’ of the UK. First, the counting usually begins 36 hours
after the voting is over, so as to allow one day for transportation of the ballot boxes
from remote areas. So, strictly speaking, there is no Indian equivalent of the UK
‘election night’. The counting begins on the morning of the day after voting and
then continues uninterrupted until it is over. Secondly, the entire counting process
takes much longer. The completion of the count can take from 12 hours in the smaller
constituencies to up to 48 hours in some of the large constituencies where counting
has been traditionally a slow process. Thirdly, because the completion takes so long,
partial counts are officially declared. All the ballot boxes from the various polling
stations within a given assembly segment are collected at one place and the ballot
papers are mixed (after verifying the number of ballots from each polling station
against the record filed by the election officer of the polling station) so as to avoid
detection of the voting pattern within each polling district. Subsequently, they are
randomly divided into 10 to 12 lots and each lot is taken up in turn for counting.
The results of a lot (or ‘round’) are announced and then the next lot is counted and
so on. This procedure is followed simultaneously for each assembly segment of every
parliamentary seat.

3.5. The party system

   On the face of it, India in the 1990s looks like an extreme example of multi-party
fragmentation. No less than 40 parties were represented in the Lok Sabha after the
1998 elections. But the value of the Effective Number of Political Parties index was
6.9 in the same year. The figure drops sharply to 3.0 if we look at the state level
average in the same election and to 2.7 at the level of the parliamentary constituency
(Heath, 1999). The system is best described as ‘multiple bipolarities’: the party sys-
tem in most of the states has evolved into a two-party competition, but the pairing
varies from state to state. There are, to be sure, exceptions to the biparty norm. The
largest state of Uttar Pradesh witnessed effective three-way and often four-way con-
tests in the 1998 elections. West Bengal witnessed a four-way contest. There were
many states like Karnataka, Kerala, Tamil Nadu and Orissa where there were more
than two major political parties in the electoral fray, but they formed a coalition.
Looked at from the vantage point of the national election, it appears to be like a
multi-party system. No single party exists as a viable electoral force across all the
states. To make matters worse for the psephologists, these parties split, merge and
split again producing a very confusing picture at times. Between the 1996 and 1998
Lok Sabha elections major splits took place in the states of Bihar, Karnataka, Orissa
74                  R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89

and West Bengal which altered the electoral equation fundamentally. These parties
form various kinds of alliances — ranging from enduring coalitions to temporary
and informal ‘seats adjustments’ — with one another. The AIADMK, a regional
party in Tamil Nadu formed an alliance first with the Congress in 1996, then with
the BJP in 1998 and then back with the Congress in the 1999 elections. Furthermore,
the same party can form an alliance with a major party in one state (or even part
of a state) while contesting on its own in other states (or in the remaining part of
the state). In Bihar, the Congress and the RJD had an alliance in the northern and
the central region but it did not extend to southern Bihar.
   Party loyalty and identification is highly fluid in India. The high degree of voters’
alignment that characterises the stable party systems of Western democracies is larg-
ely missing in India. This results in an unusually high rate of aggregate and individual
volatility across elections. Between 1977 and 1998, all the parliamentary elections
had witnessed a minimum of 10 per cent aggregate volatility. Between two parlia-
mentary elections, a major political party often retains only between 50 to 65 per
cent of the votes it secured last time (Yadav, 1999). Some of this sharp fluctuation
can take place very close to the elections, as happened in the 1998 elections. This
constant state of flux renders past electoral results, except the most recent one, rather
unhelpful to the task of prediction. It also makes it difficult to say which of the
constituencies are ‘marginal’.

3.6. Structure of party competition in the 1998 elections

   An analysis of the three main blocs fighting the 1998 election serves to illustrate
the nature of the party system and the challenge it poses for the task of election pre-
diction.
   (1) The centrist Congress party, or the INC, which won the first five elections for
the Lok Sabha and ruled the country for all but four years between 1952 and 1996,
constituted the first bloc largely on its own. The party has undergone many splits,
but each split has left only one major survivor. The main inheritor of the Congress
legacy is currently called The Indian National Congress (I), the ‘I’ standing for Indira
Gandhi. It was led in the 1998 election campaign by Sonia Gandhi, widow of Rajiv
Gandhi who was Prime Minister from 1984 to 1989 and was assassinated in 1991
during the Lok Sabha elections. It had forged an alliance with two parties in the
State of Maharashtra, the Republican Party of India (RPI) and Samajwadi Party (SP).
In Bihar, in about half the constituencies, it had an alliance with Rashtriya Janata
Dal (RJD). In the State of Kerala it continued its old alliance, the UDF, with two
regional parties, the Muslim League and the Kerala Congress.
   (2) The other major bloc was led by the right-wing Bhartiya Janata Party (literally,
Indian Peoples Party), or the BJP, a party that advocates a pronounced Hindu majorit-
arian ideology. The BJP emerged as the single largest party in the Lok Sabha elec-
tions in 1998 and led the coalition government with Atal Bihari Vajpayee as the
Prime Minister, before it lost a confidence vote in April 1999 and bounced back to
power in the mid-term elections held in September 1999. While in the past the BJP
had few allies and had no presence in the south and the east, in 1998 it forged
R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89       75

alliances with regional parties and breakaway groups in almost every state. In 1998
its partners were: Akali Dal (in Punjab), Shiv Sena (in Maharashtra), Lok Shakti
(breakaway faction of Janata Dal led by Hegde in Karnataka), AIADMK (and its
allies- MDMK, PMK, TRC and JP in Tamil Nadu), Biju Janata Dal (in Orissa),
Telagu Desam Party (the ‘NTR’ faction led by Laxmi Parvati in Andhra Pradesh),
Trinamool Congress (breakaway Congress group led by Mamata Banerjee in West
Bengal), Samata Party (in Bihar), Haryana Vikas Party (in Haryana). The alliance
carried no name then and was christened the ‘National Democratic Alliance’ in 1999.
   (3) Several left-of-centre parties, including the two communist parties and the
Janata Dal (which was once a major force having had three Prime Ministers from
its ranks) along with some regional parties contested the 1998 parliamentary election
under the banner of the ‘United Front’. These parties had formed the United Front
as a coalition after the 1996 parliamentary election and had formed the government,
with the support from Congress party. The government lasted about 18 months.
Besides Janata Dal and the Left Front (itself a coalition of four parties: CPI, CPI
(Marxist), RSP and Forward Bloc), this decidedly more heterogeneous alliance
included the following regional parties: Samajwadi Party (Uttar Pradesh), the ‘Naidu’
faction of Telagu Desam Party (in Andhra Pradesh), Assam Gana Parishad (Assam)
and the TMC and DMK (in Tamil Nadu).
   Table 1 shows the state-by-state outcome for the major alliances in the 1998 Lok
Sabha election.

3.7. The Indian forecasting challenge

  To summarise, making a seats forecast in an Indian election presents four kinds
of challenge specific to the Indian situation:

1. The conceptual challenge of converting votes into seats in any plurality system but
   with the added problems of the system of ‘multiple-bipolarities’ and the practice of
   shifting alliances;
2. The methodological challenge of designing a representative, efficient and feasible
   survey for the electorate of the kind described above;
3. The operational challenge of data collection and data transmission, given the large
   distances and the difficult transport conditions involved; and
4. The statistical challenge of devising a model for results-based forecasting that
   takes into account the Indian system of partial counts.

We first look at how earlier attempts to make election forecasts in India had
responded to some of these challenges and then turn to our own methods used in
the 1998 elections.

4. Past election forecasts in India

  Viewed from the perspective of the enormous challenge which faces any election
forecaster in India, the performance of Indian pollsters has been very good, at least
76                         R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89

Table 1
Seats won by major alliances in the 1998 electiona

State                                          Seats        BJP+        INC+            UF   Other

Andhra Pradesh                                  42            4           22            15    1
Arunchal Pradesh                                 2            –            –             –    2
Assam                                           14            1           10             –    3
Bihar                                           54           29            5             1   19
Goa                                              2            –            2             –    –
Gujarat                                         26           19            7             –    –
Haryana                                         10            2            3             –    5
Himachal Pradesh                                 4            2            1             –    1
Jammu & Kashmir                                  6            1            1             –    4
Karnataka                                       28           16            9             3    –
Kerala                                          20            –          11b             9    –
Madhya Pradesh                                  40           30           10             –    –
Maharashtra                                     48           10          37c             –    1
Manipur                                          2            –            –             1    1
Meghalaya                                        2            –            2             –    –
Mizoram                                          1            –            –             –    1
Nagaland                                         1            –            1             –    –
Orissa                                          21           16            5             –    –
Punjab                                          13           11            –             1    1
Rajasthan                                       25            5           18             –    2
Sikkim                                           1            –            –             –    1
Tamil Nadu                                      39           30            –             9
Tripura                                          2            –            –             2    –
Uttar Pradesh                                   85           59            –            20    6
West Bengal                                     42            8            1            31    2
A & N Isls.                                      1            –            1             –    –
Chandigarh                                       1            1            –             –    –
D & N Havelli                                    1            1            –             –    –
Daman & Diu                                      1            1            –             –    –
Delhi                                            7            6            1             –    –
Lakshadweep                                      1            –            1             –    –
Pondicherry                                      1            –            –             1    –
Total                                          543          252          148            93   50

     a
         Source: CSDS Data Unit.
     b
         Includes 3 seats won by independent candidates.
     c
         Includes 4 RPI seats.

in the last decade or so. And this despite the fact that nation-wide opinion polls and
election forecasts based on these are of recent origin in India. The huge size of the
country and lack of funds required to conduct such an extensive polling exercise has
mitigated against the regular use of polls in the past. After some unsuccessful
attempts in the late 1960s and early 1970s, election forecasting in India came of age
in the parliamentary elections held in 1980 and 1984 when the English-language
magazine India Today backed the efforts of Ashok Lahiri and Prannoy Roy to under-
take a sophisticated analysis and forecast of the Indian elections. They abandoned
R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89      77

the earlier attempts to use crude methods like the cube law or to discover a stable
vote–seat conversion ratio. David Butler joined the team as they went on to innovate
methodologically. Their main innovations were the development of a simplified ver-
sion of swing (denoting percentage change in vote for the then dominant party,
Congress) to measure shift in votes and a new Index of Opposition Unity (proportion
of the total votes for non-dominant parties obtained by the largest non-dominant
party candidate) to measure the extent of the split of non-Congress votes (Butler et
al., 1995). They also teamed up with MARG (Marketing and Research Group) to
come up with innovations in the construction of a sampling frame and field research
for large sample surveys to provide the first set of robust predictions of the Indian
elections. The selection of constituencies was based on demarcation of ‘swing zones’
from which a representative constituency was selected to reflect the electoral shifts
in that zone. Their election forecasts of the 1980 and the 1984 elections might appear
in retrospect to be off the mark, but they did establish the superiority of survey-
based predictions over the speculative modes which dominated earlier.
    As compared to their first two efforts, the forecasts made by the same team in
the 1989 and the 1991 parliamentary elections proved more accurate, at least in
predicting the seats for the then dominant Congress party. They were much less
successful in predicting the seats for other parties. A weakness was built into their
method which divided the entire party system into Congress versus its opponents.
The method also proved rather unsuccessful in predicting the seats for the state-level
elections in three-cornered contests. The model became less useful as the Congress
slipped from its position as the single dominant party around which political compe-
tition was organised. To be fair, the method worked if the Congress was replaced
by another dominant party — such as the BJP — against which its opposition was
divided. But increasingly there was no party that occupied this position in most of
the states. For this and for other various other reasons, this pioneering team withdrew
from election forecasting after the 1996 elections. We summarise in Tables 2–5 some
of the predictions made by various agencies for the national parliament in the last
20 years.
The details of the survey methods used by the various agencies have not been dis-
closed and no post mortems were published so it has been impossible to get any
indication of the causes of any forecasting errors. Most of the market research organ-
isations which conduct opinion polls in India use quota sampling. The sample is
selected by prescribing to the interviewer a target sample profile (in terms of
rural/urban locations and male/female composition etc.). Alternatively, some agenc-
ies use the rule of thumb approach (anyone from the nth house on your left) for
randomising the choice of respondents. For this reason market research agencies
have been reluctant to provide details of the social profiles of their achieved samples
and how these compare with population characteristics.
While these pioneering predictions proved that Indian elections could be predicted
with a fair degree of accuracy, they left open the challenge of methodological
improvement and of developing a model suited to the special features of the Indian
party system in the 1990s. As for the results-based forecast during the early stages
of counting, this was never attempted in India.
78                         R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89

Table 2
Lok Sabha seat predictions, 1984–98a, b

Year                    Party                                      Forecast seats        Actual seats

1984                    Congress                                   366                   415
1989                    Congress                                   195                   195
1991                    Congress                                   233                   232
                        BJP+                                       155                   124
                        LF/NF                                      105                   115
1996                    Congress+                                  168–187               143
                        BJP+                                       178–197               176
                        LF/NF                                      103–117               151
1998                    Congress+                                  134                   148
                        BJP+                                       252                   252
                        UF                                         109                    93

     a
     Source: India Today-MARG opinion polls.
     b
     Notes: 1996 Congress+ includes UDF in Kerela, BJP+ includes Shiv Sena in Maharashtra, NF/LF
includes JD, SP, DMK, TDP(N), AGP and CPM, CPI, RSP, FBL. 1991 BJP+ includes Shiv Sena, NF/LF
includes JD,DMK,AGP,TDP,CPM, CPI, RSP, FBL.

Table 3
1996 Exit poll vote share predictionsa, b

                                INC+                    BJP+                        LF/NF

Exit poll                       31                      30                          24
Actual                          30                      24                          24

     a
   Source: CSDS Data Unit.
   Notes: INC+ = INC, UDF, AIADMK. BJP+ = BJP, Shiv Sena, Samta. LF = JD, SP, DMK, TDP(N),
     b

AGP. NF = CPIM, CPI, RSP, FBL.

Table 4
1998 Seat predictionsa

Agency                  Source               INC+                  BJP+                  UF

CSDS                    India Today          164                   214                   127
ORG-MARG                TVI Network          165                   208                   146
AC Nielson              Outlook I            149                   238                   123
AC Nielson              Outlook II           172                   231                   118
CMS                     Frontline            150                   230                   125
CMS                     Asianet              150                   235                   115
DRS                     TOI                  155                   249                   102
C-Voter                 Pioneer              142                   268                   114
Actual                  –                    148                   252                    93

     a
         Source: The Pioneer, 17/3/1998.
R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89        79

Table 5
1998 Vote share predictionsa

Agency                   Source                      INC+              BJP+          UF

CSDS                     India Today                 32                31            21
AC Nielson               Outlook II                  30                31            23
CMS                      Frontline                   32                35            20
DRS                      TOI                         27                34            21
C-Voter                  Pioneer                     24                36            21
Actual                   –                           25                36            22

  a
      Source: The Pioneer, 17/3/1998.

5. The CSDS method and model

   Ideally, one would conduct a voter intention poll in each of the 543 constituencies
and predict each seat based on the sample from that constituency. However, even a
moderate sample size of 625 (required to estimate the true proportion of votes within
4% at a 95% confidence level) would require a total sample size of 340 000. This
is not feasible, as the cost would be prohibitive and it would be virtually impossible
to carry out. Even getting reliable interviewers to undertake a door-to-door survey
at this level would be unthinkable.
   Thus, one has to resort to sampling parts of a subset of constituencies. In order
to make predictions in constituencies where no sampling has been done, one needs
to use the past voting data and a prediction model as set out below.

5.1. Sampling

   An insistence on random sampling distinguished our approach from those used
by all other market research agencies. We decided to aim at a national representative
sample rather than follow the earlier practice of sampling only those constituencies
which were considered representatives of various ‘swing zones’. As mentioned
above, the concepts of a swing zone and representative constituencies assumed a
degree of crystallisation of voting patterns which is not obtained in the fluid political
situation in most parts of India today. We followed the principles of multi-stage
stratified random sampling for selecting the survey locations.
   At the first stage we chose 20% of the parliamentary constituencies by a circular
sampling method (i.e. 1 in 5 from a list of constituencies ordered by state), with
probability proportionate to the size of the electorate (PPS). These 108 constituencies
were thus spread across all the major states in the same proportion as the number
of Lok Sabha seats in each state. Some of the small states were aggregated together
for this purpose. Since the official list of seats follows a geographical sequence,
circular sampling based on it ensured regional representation within each state. Once
the constituencies were selected, we chose two assembly segments within that con-
stituency, also by circular sampling, again with probability proportionate to the size
of the electorate in that segment. Then the list of polling stations for the sampled
80                         R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89

segments was obtained. Again, two out of an average of one hundred polling stations
were chosen from each assembly segment in the sample by circular sampling (at
this level, the PPS procedure was not feasible as the information about the electorate
size could not be obtained at the time of sampling).
   The same principle was rigorously applied to the selection of the respondents as
well. The target sample of 15 500 was distributed among various states as per their
proportion of the national electorate, so as to adjust for the variations in the numbers
of electors per seat in different states. The state sample was then divided equally
among the sampled polling booths within the state. The list of electors for these
chosen booths was obtained and from that the target sample was chosen, again via
circular sampling with a random starting point. A team of specially trained investi-
gators conducted face-to-face interviews with the selected respondents. No substi-
tution was allowed when prospective respondents could not be contacted. The CSDS
had conducted a voting intention poll in 1996 following the above method. We
decided to target the same respondents again and to add a proportionate sample of
the new voters whose names were added to the electoral register subsequently.
   The final profile of the achieved sample and its comparison with the known attri-
butes of the population is reported in Table 6. Since the two match fairly closely,
there is a good reason to believe that we had obtained a national representative
sample, at any rate a better sample than used by other pollsters. We did not adjust
for any differential non-response — this is not really feasible in the Indian context
given the lack of detailed socio-demographic information about the electorate.
   The same target sample was approached twice, once before the elections for the
pre-poll forecast and once in a post-poll survey held the morning after voting.

5.2. The model for state level forecasting

    Given the nature of the Indian party system and the permanent political flux, the
methods used in UK election prediction, which rely on the fact that each of the two
major parties there, Conservative and Labour, has a large committed vote and that
it is mainly a contest between these two parties, cannot be used in the Indian context.

Table 6
CSDS sample and population profilea

                                      Census                              1998 Sample

Female                                48                                  49
Rural                                 74                                  75
Hindu                                 82                                  83
Muslim                                12                                  11
Christian                              2                                   4
Sikh                                   2                                   2
Scheduled Castes                      17                                  17
Scheduled Tribes                       8                                   8

     a
         Source: 1991 Census and 1998 NES post-poll survey.
R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89       81

Thus, election results data that is more than a few years old has, in our opinion,
very little information that is of use for forecasting the result of a forthcoming elec-
tion. Each state has its own unique political history. Because of this, it has been
observed that contiguous areas falling in different states but with similar socio-econ-
omic profiles can have very different voting patterns. Thus, as far as predicting vote
percentages in a given constituency is concerned, only the most recent results from
that constituency and the others in the same state and opinion poll results from the
same state are relevant. We cannot work with an assumption of a homogenous
national swing that cuts across the state boundaries. A state is a more appropriate
unit for the purposes of prediction than any analytically carved out ‘swing zone’.
   Therefore, we decided to predict the seats at the state level and then aggregate
them to yield a national prediction. Some states are very small, with less than 5
constituencies. Here we use subjective projections of changes in shares of the vote
from the most recent election which were provided by local political experts. Thus
in effect, an election forecast in India amounts to 20 separate state opinion polls
conducted simultaneously, yielding 20 separate forecasts that need to be added up
to give a national forecast.
   We now discuss predicting seats within a state. From the opinion poll we can
predict the percentage votes that a major party is to get in the election. However,
to convert this to a seat prediction, we need to estimate the vote shares for each of
the major parties contesting the election. Here we operate with the uniform regional
change model that stipulates that the change of vote for a given party from the results
of the previous election is uniform over a state, or within a contiguous group of
states. Thus, under this model, all we need to do is to estimate the percentage of
vote for each of the major parties over the state and take its difference from the
average vote in the previous election. Applying the change to the actual vote share
in the previous election would give the predicted vote share for each party in each
constituency. The uniform regional change model is only an approximation — but
one that helps us estimate the vote share for a party in each constituency. Once we
have the predicted vote share for each of the major parties in each seat, we can
estimate the seats to be won by each party by the method described below.

5.3. Dealing with shifting alliances and splits

   The existence of a stable coalition or alliance of parties does not pose a major
difficulty for this method, for we can estimate the vote percentage and seats for the
alliance instead of the particular party. We face a special problem, however, where
a major party has split or the composition of alliances has changed, for the result
from the previous election cannot then serve as a starting point for measuring change.
We need, first of all, comparable results from a previous election that can form the
basis for measuring the changes and superimposing the change as described above.
Between 1996 and 1998, major parties in several states had split: Congress (I) in
West Bengal, Janata Dal in Bihar, Orissa and Karnataka and BJP in Gujarat. Some
older alliances were broken and new ones were forged.
   How do we take into account mergers or splits of parties or changes in alliances
82                  R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89

in the uniform regional change model? For this purpose, we constructed simulated
or ‘notional’ election results for the previous election. This was meant to capture
what the results would have been if the voters had voted as in the last election but
with the party competition and alliances as in the current one. Then this simulated
election result is used instead of the previous election result as the initial position
from which the change is estimated and applied.
   New alliances of parties that had fought the previous elections separately are sim-
pler to handle, for their votes could simply be aggregated. Assessing the effect of
a recent split, however, required the help of the local political experts. In the case
of a split of a party, we could allocate its votes in some ratio across the state based
on subjective input from the experts. For example in Bihar, the Janata Dal split to
give birth to a new party called Rashtriya Janata Dal (RJD) which in the opinion of
the experts was the dominant faction. It was felt that RJD would get most of the
Janata Dal votes and for the purpose of constructing the simulated results, we gave
it 80% of the Janata Dal votes. If the sphere of influence of the splinter faction varies
from region to region in the state, this could be taken into account. For example, it
was felt that in West Bengal, the breakaway faction of the Congress (I), called the
Trinamool Congress (TC), had a lot of support in the urban area surrounding Cal-
cutta, but much less outside it. Similarly, in Karnataka the experts’ judgement was
that the Lok Shakti that broke away from the Janata Dal had more influence in
Northern Karnataka.
   When it comes to the formation of new alliances or the break-up of old alliances,
the starting point is again subjective assessment, for there is no clear record of the
independent electoral strength of each of the alliance partners. When an existing
alliance breaks, we have to subjectively assess the relative strength of the constituent
parts and split the votes of the alliance in the previous election in the same way as
we dealt with splits in parties. In Tamil Nadu, the 1996 Parliament election was
contested by an alliance of AIAMDK and Congress (I). In 1998, they were no longer
allies. In fact, AIADMK had forged an alliance with BJP and several other regional
parties. It was perceived that most of the votes of this alliance in 1996 were indeed
AIADMK votes. Hence, we assigned 80% of the votes of the AIADMK and the
Congress (I) candidates in the 1996 actual results to AIADMK. When two or more
parties come together, the vote for the coalition could be less than the sum of their
votes. This could be the case if the social base of the two parties was judged to be
incompatible (due to the political history of the state). The sum of their votes was
suitably adjusted to account for this, by multiplying the votes of the minor party by
a factor called the ‘friction’ factor, and then added to the major party votes. The
friction factor itself was assigned a value (between 0 and 1) subjectively, based on
local experts’ opinion.
5.4. The seats forecasting method
  To summarise, the steps required to make a seat prediction under the uniform
regional change model are:
1. To construct the simulated initial position, i.e. what the results would have been
R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89       83

   at the previous election if that had been fought by the pattern of parties and
   alliances fighting at the current election;
2. To conduct a voter intention poll and estimate the vote share for each of the major
   parties/alliances in each state;
3. To translate predicted votes in each constituency into a seat prediction for each
   of the major parties/alliances in each state and to aggregate these to give a
   national prediction

We have already discussed the first and the second step. Let us now examine the
third step. The simplest method is to treat the predicted votes as actual votes and
count off the winners — the party/alliance predicted to get the highest vote share
in a constituency is predicted as the winner. However, one can see that there is a
difference between two seats where in one the forecast winner is well ahead of the
runner-up (say 46% to 34%) to another where the first and second predicted parties
are forecast to be very close (say 40% to 39%). This consideration leads to the idea
of producing probabilities that each party will win the seat. Thus in the former case
we might produce a probability of 1 for the winner, and 0 for all other parties, while
in the latter case giving probabilities close to 0.5 for the two competing parties would
seem to be a more realistic and accurate assessment of the parties’ chances of win-
ning. This probabilistic method has been used in other countries such as the UK
(Payne, 1992).
   Suppose that the standard error of the estimated difference of vote percentage
between the first two parties is a. Label the party with the highest predicted vote
share as party X and the party with the second highest as party Y. We wish to assign
probabilities of winning to the two parties X and Y.
   Assuming that party Y is actually going to get (U%) more votes than X, let us
calculate the probability that the survey produces a result as extreme as the observed
result, i.e. party X getting d% or more votes than party Y in the survey. This is
     P( Z ⬎ (U⫹d)/ a)

where Z has the standard Normal distribution. The best case scenario for party Y
then is when U is small, say nearly zero, when party Y will win the seat by a very
small margin. The probability of this is
     P(Z⬎ d / a)

So for each constituency in the country, we compute the observed difference between
the predicted vote shares for the two leading parties (d%) and assign probabilities

     P(Z⬍d/a) and P(Z⬎d/a)
respectively to the leading party and the party coming second. Parties predicted to
be in third place or lower are assigned a probability of winning of zero. Then the
sum of probabilities of winning for a given party over all constituencies would give
its expected (predicted) number of seats in the parliament.
84                 R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89

5.5. The model for results-based forecasting

   In the past, the media have never attempted to forecast the composition of the
new parliament until partial counts were available from nearly all the seats. But the
procedure for declaration of results in India makes it possible to forecast using a
few partial counts. As mentioned above, a special feature of the Indian elections is
that partial counts or ‘trends’ are declared in chunks of about 10% for each assembly
segment. To make matters helpful for the forecasters, these chunks are a random
sample of the total votes cast in that segment, thanks to the procedure of ‘mixing’
votes from different boxes so as to maintain the secrecy of booth-wide voting pat-
terns. That placed us in an advantageous position of having large random samples.
But we still have the usual problem that the constituencies that report their trends
early may form an unrepresentative sample. It is known that some states tend to
count faster than others.
   The solution to this problem was to combine the results-based forecast with a
postpoll-based prior forecast for the undeclared seats. In the beginning, before coun-
ting started, we had state-wise predicted vote shares for the major alliances based
on the post-poll survey (which we call the prior predicted share). As counting trends
become available we calculate the predicted vote shares for alliances in each state
based on the counting trends (called the posterior predicted share). If the number
of seats where trends were available was large enough, these were regarded as good
predictions on their own. Thus, the predicted vote share at any given time was a
convex combination of the prior and the posterior vote shares, i.e.
     predicted vote share ⫽ P * prior ⫹ (1-P)* posterior
where P=1 if no partial count-based trend is available and P=0 if trends are available
from a large proportion of seats. We had a heuristic formula to give values of
between 0 and 1 for the in-between stages when some trends were available but not
enough to discount the prior. Once we have predicted vote shares, we could convert
these to seats just as in the case of opinion polls.

6. Review of forecasts and actual results

6.1. The opinion poll based forecast

  The CSDS carried out a survey between 4th February and 8th February 1998 for
a target sample of 15 500 voters of which 8938 were interviewed. It needs to be
remembered that at the time of this opinion poll, the actual election dates were
between 8 to 20 days away. The actual voting took place mainly in three phases on
16th, 22nd and 28th February. Furthermore 36% of the respondents said that they
had not made up their mind at that time. Moreover, even those who had made up their
mind could later change their mind (and perhaps did, as the post poll results showed).
  Table 7 compares the seats predicted in the CSDS-India Today opinion poll and
the actual results. While there are some large discrepancies between the predicted
R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89                     85

Table 7
Comparison of the opinion poll based forecast and the actual resulta, b

Alliance                             Predicted seats                     Actual seats

BJP and allies                       214                                  251
Congress(I) and allies               164                                  144
United Front                         127                                   93

   a
     The actual result reported here and in the following tables is different from the final result reported
in Table 1 as it excludes the seats that did not go to polls at the time of counting.
   b
     The definition of the Congress allies excludes the RJD with which the Congress had a partial
seats adjustment.

and actual seats, these can perhaps be explained by the gap between the date of the
survey (finished on February 8, 1998) and the dates of the actual voting, which were
February 16, February 22 and February 28 with about a third of the constituencies
voting on each day.

6.2. Post-poll based prior forecast

   The CSDS also carried out a post-poll survey, when the same target respondents
contacted in the pre-poll survey were interviewed the day after they had voted. This
time we could interview 8133 respondents from the target sample. Given the large
time gap between polling and counting, the results of the post-poll survey were
available to us before the counting began. This survey formed the basis of a prior
forecast which could not be published because of the Election Commission’s ban
on forecasting before the last round of voting was completed. In any case, the main
aim of this post-poll survey was to serve as an input to the result-based forecasting
model described below.
   Table 8 gives the seat prediction based on the post-poll and the actual outcome.
This prediction is quite close to the final outcome. We note that the difference
between the pre-poll and post-poll figures was most significant in the state of Tamil
Nadu, the most surprising set of results of the 1998 elections.

Table 8
Comparison of post-poll based forecast and actual results

Alliance                             Predicted seats                     Actual seats

BJP and allies                       252                                  251
Congress (I) and allies              134                                  144
United Front                         109                                   93
86                     R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89

6.3. Results-based forecast

   The counting of votes all over the country began at 8 am on March 2 (day 1).
The first significant trends (based on a partial count of about 10% of votes) started
coming in from a few constituencies around 1430 hours on day 1. It was not before
another 40 hours of actual counting that all the final results were declared. The first
prediction was made at about 1750 hours on day 1. Although we had our first predic-
tion ready within two hours after the first counting trends were available, we waited
for three hours for the figures to stabilise before actually going on air with the fore-
cast. Table 9 presents the results of these early projections.
   It can be seen that initially there is a strong effect of the ‘prior’ on this forecast:
the predicted seats are nearly the same as for the post-poll based forecast. It is close
to the final outcome for BJP and allies but is an overestimate for United Front and
an underestimate for Congress and allies.This error was corrected after some more
trends were available. By 11.00 pm on the first day of counting (about 8 hours from
the time first trends came in), we had forecast a picture that was very close to the
final outcome for all the three alliances.

6.4. Detailed analysis of forecast discrepancies

  As has been reported earlier, the opinion poll was designed to give an all-India
prediction of seats and though our method involved predicting seats at the state level,
we refrained from publishing them as the sample size at the state level was too small

Table 9
Results-based forecast by time

Day                 Time                 BJP and allies        Congress (I) and     United Front
                                                               allies

1                   1750                 252                   130                  105
1                   1814                 255                   128                  101
1                   1839                 256                   127                   99
1                   1908                 244                   132                  103
1                   2114                 239                   147                  102
1                   2210                 238                   150                   99
1                   2302                 246                   150                   98
1                   2350                 245                   147                  100
2                   0025                 249                   147                   95
2                   0205                 249                   145                   98
2                   0605                 256                   143                   95
2                   0756                 251                   144                   97
2                   0853                 252                   143                   95
2                   0945                 251                   142                   96
2                   1015                 251                   142                   96
2                   1720                 252                   140                   92
–                   –                    –                     –                      –
Final                                    251                   144                   93
R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89             87

Table 10
Some state-level predictions

                     BJP+          BJP+          INC+           INC+          UF+      UF+
                     votes         seats         votes          seats         votes    seats

ANDHRA PRADESH: Sample size: 618
Pre-poll      11.68          2                    40.93         14            41.16    25
Post-poll     13.1           1                    40.72         15            40.56    26
Final         19.53          4                    38.46         22            38.37    15

GUJRAT: Sample size: 477
Pre-poll          44.95            14             34.68         12              5.29    0
Post-poll         46.58            16             32.89         10              2.28    0
Final             48.37            19             36.49          7              3.56    0

KARNATAKA: Sample size: 484
Pre-poll      28.62         11                    40.84         15            24.15     2
Post-poll     38.51         20                    32.01          6            24.67     2
Final         38.52         16                    36.23          9            21.82     3

MADHYA PRADESH: Sample size: 567
Pre-poll     45.43         36                     40.81          4              3.03    0
Post-poll    46.61         31                     42.66          9              1.13    0
Final        46.62         30                     39.40         10              2.64    0

MAHARASHTRA:       Sample size: 980
Pre-poll            35.33         10              50.93         38              5.18    0
Post-poll           36.89         17              44.76         31              3.43    0
Final               42.14         10              50.41         37              3.21    1

ORISSA: Sample size: 305
Pre-poll           32.22            8             41.63         13            10.72     0
Post-poll          30.16            7             45.24         14             7.94     0
Final              48.99           16             41.04          5             6.42     3

RAJASTHAN: Sample size: 429
Pre-poll        41.62              10             49.82         10              0.88    0
Post-poll       50.88              18             44.01          7              0.20    0
Final           41.98               5             44.45         18              4.66    0

TAMIL NADU: Sample size: 626
Pre-poll       21.38          8                   18.93          3            36.55    29
Post-poll      34.90         23                   12.68          0            41.94    16
Final          45.08         30                    4.78          0            43.42     8

UTTAR PRADESH:       Sample size: 1254
Pre-poll             41.36         64             11.69          7            26.32    10
Post-poll            40.32         63              8.94          5            26.81    13
Final                37.34         59              6.02          0            29.57    20

WEST BENGAL: Sample size: 765
Pre-poll       17.26           0                  22.61          1            53.13    41
Post-poll      27.34          13                  15.98          0            50.69    29
Final          34.64           8                   6.02          1            41.41    33
88                      R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89

to have any measure of confidence in the predictions. Only when these were aggre-
gated at the national level do they give reasonable predictions. In Table 10 we give
the predicted vote shares and seat predictions for some of the major states. As we
can see, while in some states the predictions were quite close, in others they are
quite off the mark — this is not surprising as the predictions are based on a very
small sample size at the state level.

7. Concluding observations

   The post-poll based predictions were very close to the actual results, although the
sample size was not large enough for us to be confident of our forecasts in individual
states. We consider ourselves lucky to be so close to the final aggregate outcome.
The entire exercise depends crucially on the assumption that the political picture has
not undergone major change since the previous election — at least in a major part
of the country. As Indian general elections are currently occurring at frequent inter-
vals this assumption is likely to hold for the time being at least.
   But we cannot infer from the successful results-based forecasts produced that our
methodology for converting votes into seats is optimal and will always work well
in future elections. Furthermore it is impossible to disentangle the various compo-
nents of the prediction scheme and assess their relative contribution to the success
(or failure) of the forecasts in the Indian context. The main components in this exer-
cise were: the prior data obtained form the post-poll, the declaration order for the
partial counts, the veracity of the experts’ construction of the notional results for the
previous election and the method for converting votes into seats. For this election
we were fortunate to have a good post-poll, at least in the aggregate, and the declar-
ation order did not contain any significant biases. But the notional results are a
counterfactual — it is impossible to judge their accuracy. Furthermore it is quite
possible to get a good forecast when there are compensating errors in some of the
components — in the UK context there have been good forecasts where errors in
the poll-based prediction of vote shares have been compensated for by a poor method
of converting votes into seats. And most election forecasters could tell you that if
they rerun the forecast making this change here and that change there they would
have got a very accurate forecast. So it is what is actually produced with the last
poll before the election and in the hectic context of ‘election night’ itself that counts.
   Thus, like all election forecasters we were at the mercy of Lady Luck. Were we
just lucky or have we developed a sound methodology for Indian election fore-
casting? Only with repeated application over a series of elections, with varying polit-
ical scenarios, could this question be answered.

References

Brown, P., Firth, D., Payne, C., 1999. Forecasting on British election night 1997. J. R. Statist. Soc. A.
   162, 211–226.
R.L. Karandikar et al. / Electoral Studies 21 (2002) 69–89                    89

Butler, D., Lahiri, A., Roy, P., 1995. India Decides: Elections 1952–1995. Books and Things, Delhi.
Heath, O., 1999. The fractionalisation of Indian parties. Seminar 480, 66–71.
McMillan, A., 2000. Delimitation, democracy and the end of constitutional freeze, Economic and Political
   Weekly, April 8 2000, 1271–1276.
Morton, H. (Ed), 1990. Election night forecasting. The New Zealand Statistician 25, 36–84.
Payne, C., 1992. Statistical methods for election forecasting in the United Kingdom, 1970–90. In: Norris,
   P., Crewe, I., Denver, D., Broughton, D. (Eds.) British Elections and Parties Handbook 1992. Harvester
   Wheatsheaf, Hemel Hempstead, pp. 138–158.
Singh, K., 1992. People of India, vol 1. Anthropological Survey of India, Delhi.
Sivaramakrishnan. K., 1997. Under-franchise in urban areas: freeze on delimitation of constituencies and
   resultant disparities. Economic and Political Weekly, Dec 20 1997, 3275–3281.
Yadav, Y., 1999. Electoral politics in the times of change: India’s third electoral system. Economic and
   Political Weekly, Aug 21 1999, 2393–2399.
You can also read