CSIR Elections Forecasting - 2016 Local Government Elections Zaid Kimmie 28 October 2016 - iccssa

Page created by Jamie Patton
 
CONTINUE READING
CSIR Elections Forecasting - 2016 Local Government Elections Zaid Kimmie 28 October 2016 - iccssa
CSIR Elections Forecasting
 2016 Local Government Elections

          Zaid Kimmie
        28 October 2016
CSIR Elections Forecasting - 2016 Local Government Elections Zaid Kimmie 28 October 2016 - iccssa
Overview

    1. Team Members
    2. Some History
    3. Why Forecast?
    4. Methods: Clustering
    5. Methods: Predictions
    6. Model Performance
    7. What Next?

1
The CSIR Team

    Statisticians, computer scientists, and programmers . . .
    • Peter Schmitz, Jenny Holloway, Nontembeko Dudeni-Thlone
    • Brenwen Ntlangu, Tyrone Naidoo
    • Zaid Kimmie, Ndumiso Cingo and Luyanda Vappie
    • Paul Mokilane, Quintin van Heerden, Sumarie Meintjes
    • Hans Ittmann, Jan Greben, Renee Koen

2
Some History

    • Worked with IEC for 1999 national 2000 municipal elec-
      tions
      – Checking inconsistencies in voting patterns
      – Forecasts a “by-product” of this methodology
    • Worked with SABC for all elections since 2004
      – Produce a forecast of the final results

3
Why Forecast?

    • Election results are released by Voting District (VD)
       – Some 22,500 VDs in total

    • If the VDs reported in a random order there would be noth-
       ing much to do
       – The final result would become clear relatively quickly
       – E.g. 5% of VDs have reported the tabulated results are within
         a couple of percentage points of their final values

4
Why Forecast?

    • Fortunately (for us) VDs do not report randomly
       – There is in fact a systematic bias in the reporting order
       – The difference, in the early stages, between the “live” and final
         results are often substantial
       – These differences may persist . . .
    • People (including political analysts and the curious member
      of the public) looking at the “scoreboard” will find that it gives
      them very little useful information

5
ANC – Johannesburg Metro
         48
              Final ANC%

         46

         44
% Vote

         42

         40

         38
              4am     9am   2pm   7pm   12am   10am   3pm   8pm   1am   6am   11am   5pm

                                               Time
ANC – Johannesburg Metro
         48
              Final ANC%

         46

         44
% Vote

         42

         40

         38
              10%          20%    30%   40%    50%    60%   70%   80%   90%

                                              % VDs
Why Forecast?

    • In this “window of opportunity” election forecasts can pro-
       vide useful insights
       – What do the the initial results really mean?
       – Identify interesting patterns that are emerging

    • The combination of pre-election polling data and exit polls
       can get it wrong . . .
       – Brexit, UK 2015 general election

8
Why Forecast?

    • In this “window of opportunity” election forecasts can pro-
       vide useful insights
       – What do the the initial results really mean?
       – Identify interesting patterns that are emerging

    • The combination of pre-election polling data and exit polls
       can get it wrong . . .
       – Brexit, UK 2015 general election

    • It can make you look smarter than you actually are . . .

9
Why Forecast?

10
Forecasting Model: Basics

     • Method published by Greben, Elphinstone & Holloway, 2006
       in ORiON: The Journal of ORSSA

11
Forecasting Model: Basics

     • Method published by Greben, Elphinstone & Holloway, 2006
        in ORiON: The Journal of ORSSA

     There are a couple of basic principles:
     1. Voters do not randomly allocate their electoral preferences – they
        are influenced by political, socio-economic and demographic fac-
        tors, as well as past voting history;
     2. Changes in voting behaviour between one election and the next
        are also not random, but are correlated with past voting behaviour,
        demographic and socio-economic factors.

12
Example

     Suppose our area of interest consists of 200 VDs, and that in
     the previous election party A has obtained 70% of the vote in
     the area, with relatively small variation between VDs
     • When the first VD reports . . .
     • When 10 VDs have reported . . .
     • When 30 VDs have reported . . .

13
Methods: Clustering

     The first step is to create clusters of VDs based on previous
     voting results.
     • Fuzzy-c-means
     • Fixed number of clusters, c
     • Fuzzy clustering performs better than other methods (k-means,
       k-means with discriminant analysis) – smallest prediction error
     • How many clusters?

14
Methods: Predictions

     Two-step process:
     • Estimate turnout for outstanding VDs
     • Assign fuzzy-cluster estimates to VDs

15
Methods: 2016 Predictions

     • Metro predictions based on provincial clusters
     • This method allowed us to (accurately) predict eThekwini
        when no results had been released
     • But this setup can let us down when inter-provincial varia-
        tions do occur, as was the case with Tshwane

16
Model Performance

     There are two aspects of model performance – the technical
     performance of the model and our ability to communicate the
     model output to the general public

17
Model Performance

     • Assuming that IT snafus have not rendered us mute . . .
     • Early on the Thursday morning after election day – some-
        where between 5am and 9am, when only about 10% of all
        VDs have reported – we forecast the final results
     • We continually update our forecasts, but the numbers do
        not change all that much, and the level of interest in the
        forecast declines as the “scoreboard” starts to match the
        final score

18
How did we do?

     • Pretty well!
     • By 5am on Thursday we identified the major trends well
       before they could be inferred just by looking at the data
       – That the DA would be the largest party in NMB, but not achieve
         a majority
       – That the ANC would lose its majorities in all the Gauteng metros
       – That the DA would increase its majority in Cape Town
       – That the ANC would continue to hold a majority in Buffalo City,
         Mangaung and eThekwini

19
How did we do?

     • We were able to predict that the ANC’s share of the na-
        tional vote would fall to 54%
     • We did not get the final result in Tshwane right – our model
        predicted (and continued to predict until quite late into the
        reporting) that the ANC would be the largest party
     • In general we were able to get within 1.5 percentage points
        of the final result for the larger parties, and in most cases
        within 0.5 percentage points

20
Forecasts

      Metro          Party   Predicted 5am   Final   Actual 5am

      Johannesburg   ANC         44.5        44.9       39.5

                     DA          38.9        38.4       45.3

                     EFF         10.7        10.9       9.8

      Tshwane        DA          41.5        43.1       47.0

                     ANC         42.8        41.5       41.0

                     EFF         10.7        11.6       7.8

      Ekurhuleni     ANC         47.8        48.9       38.8

                     DA          35.8        34.2       50.0

                     EFF         10.7        11.1       7.8

21
Forecasts

     Metro            Party   Predicted 5am   Final   Actual 5am

     Cape Town        DA          65.7        66.8       72.5

                      ANC         25.1        24.5       18.8

                      EFF          2.9         3.1       3.1

     Nelson Mandela   DA          48.3        46.6       58.6

                      ANC         42.2        41.5       32.9

                      EFF          3.9         5.0       3.8

     eThekwini        ANC         58.8         60         0

                      DA          27.7        27.5        0

                      IFP          4.0         4.3        0

22
Johannesburg

                                         ANC – Johannesburg Metro
              48
                   Prediction Thursday 5am

              46

              44
     % Vote

              42

              40

              38
                   4am      9am       2pm    7pm   12am   10am   3pm   8pm   1am   6am   11am   5pm

                                                          Time

23
Johannesburg

                                             DA – Johannesburg Metro
              48
                   Prediction Thursday 5am

              46

              44
     % Vote

              42

              40

              38

              36
                   4am      9am       2pm      7pm   12am   10am   3pm   8pm   1am   6am   11am   5pm

                                                            Time

24
Ekurhuleni

                                               ANC – Ekurhuleni Metro
                   Prediction Thursday 5am

              50

              46
     % Vote

              42

              38

              34

              30
                   3am        8am            1pm   6pm   11pm   8am   1pm   6pm   11pm   4am

                                                            Time

25
Ekurhuleni

     • The difference between the predicted and actual ANC vote
      count in Ekurhuleni was less than 9,000 votes – just under
      900,000 people voted in this metro.

26
Ekurhuleni

                                                   DA – Ekurhuleni Metro
                   Prediction Thursday 5am

              50

              46
     % Vote

              42

              38

              34

              30
                   3am        8am            1pm     6pm   11pm   8am   1pm   6pm   11pm   4am

                                                              Time

27
Ekurhuleni

                                               EFF – Ekurhuleni Metro
              20
                   Prediction Thursday 5am

              16

              12
     % Vote

               8

               4

               0
                   3am        8am            1pm   6pm   11pm   8am   1pm   6pm   11pm   4am

                                                            Time

28
Nelson Mandela Bay

                                              DA – Nelson Mandela Bay
                    Prediction Thursday 5am

              65

              60
     % Vote

              55

              50

              45

              40
                   12am         3am           6am   9am    12pm   3pm   6pm   9pm   12am

                                                          Time

29
Nelson Mandela Bay

                                          ANC – Nelson Mandela Bay
              50
                    Prediction Thursday 5am

              45

              40
     % Vote

              35

              30

                   12am         3am           6am   9am    12pm   3pm   6pm   9pm   12am

                                                          Time

30
Nelson Mandela Bay

                                          EFF – Nelson Mandela Bay
              16
                    Prediction Thursday 5am

              12
     % Vote

               8

               4

               0
                   12am         3am           6am   9am    12pm   3pm   6pm   9pm   12am

                                                          Time

31
Cape Town

                                              DA – Cape Town
                   Prediction Thursday 5am

              75

              70
     % Vote

              65

              60

              55
                     12am               4am    8am          12pm   4pm   8pm   1am

                                                     Time

32
Ethekwini

                                                 ANC – eThekwini
              70
                     Prediction Thursday 5am

              65

              60
     % Vote

              55

              50

               3pm            6pm              9pm     9am     12pm   3pm   6pm

                                                       Time

33
Tshwane

                                                       ANC – Tshwane
                      Prediction Thursday 5am
              50          Prediction Friday 8am

              45
     % Vote

              40

              35

                   12am     4am       8am     12pm   4pm   8pm   12am   10am   2pm   6pm   10pm   2am   6am   11am

                                                                  Time

34
Tshwane

                                                           DA – Tshwane
                      Prediction Thursday 5am
              50          Prediction Friday 8am

              45
     % Vote

              40

              35

                   12am     4am       8am     12pm   4pm   8pm   12am   10am   2pm   6pm   10pm   2am   6am   11am

                                                                  Time

35
Cluster Comparisons

36
Cluster Comparisons

37
What Next?

     • Running multiple models with different clustering options
     • Improved diagnostics
     • 2019 SA National Election
     • 2016 US Election . . .
     • 2020 UK Election

38
You can also read