Measuring Word of Mouth's Impact on Theatrical Movie Admissions

Page created by Raymond Rose
 
CONTINUE READING
Measuring Word of Mouth’s Impact
            on Theatrical Movie Admissions
                                       CHARLES C. MOUL
                       Washington University Department of Economics
                                      Campus Box 1208
                                 St. Louis, MO 63130-4899
                                   moul@artsci.wustl.edu

Information transmission among consumers (i.e., word of mouth) has received
little empirical examination. I offer a technique that can identify and measure
the impact of word of mouth, and apply it to data from U.S. theatrical movie
admissions. While variables and movie fixed effects comprise the bulk of observed
variation, the variance attributable to word of mouth is statistically significant.
Results indicate approximately 10% of the variation in consumer expectations
of movies can be directly or indirectly attributed to information transmission.
Information appears to affect consumer behavior quickly, with the length of a
movie’s run mattering more than the number of prior admissions.

                                      1. Introduction
Despite its widespread theoretical implications in environments of
incomplete information, information transmission among consumers
(a.k.a. word of mouth) has received relatively little empirical support. In
this paper, I show that an existing method of detecting word of mouth is
overly broad, in that its empirical prediction of autocorrelated growth
can also be generated by a simple model of saturation in demand. I
instead offer a model of demand that can accommodate both saturation
and word of mouth, and then consider its implications within an error
components framework. Applying this technique to U.S. theatrical
admissions, my estimates suggest that word of mouth is statistically
and economically significant and that information travels quickly to the
average consumer in commonly observed situations. Simulations using
these estimates confirm that word-of-mouth can have large impacts on
how movies play out in theaters.

I thank the editor and two anonymous referees for comments on an earlier draft that
greatly improved the paper. Seminar participants at Washington University in St. Louis
and the DeSantis Center’s Business and Economics Scholars Workshop also provided
helpful feedback. The usual caveat applies.
C 2007, The Author(s)
Journal Compilation C 2007 Blackwell Publishing
Journal of Economics & Management Strategy, Volume 16, Number 4, Winter 2007, 859–892
860                                 Journal of Economics & Management Strategy

      In a world of incomplete information, consumers sharing informa-
tion about experience goods can play a critical role in moving economic
outcomes closer to the full information ideal. This intuitive insight has
been formalized by Ellison and Fudenberg (1993, 1995) who present
mechanisms for and implications of what they refer to as social learning.
Furthermore, either word of mouth or repeat purchases are essential in
the theoretical literature explaining how advertising can be used as a
signal of quality in a separating equilibrium (Nelson, 1974; Milgrom
and Roberts, 1986). The speed and manner of information transmission
among consumers, however, are inherently empirical questions, and it is
there that I will concentrate my efforts. Given the presumed importance
of word of mouth in the theatrical sector of the movie industry, these
results also potentially have bearing on the best responses to information
transmission among consumers.
      I discern information transmission by interpreting my model’s
residuals, and thus, while I will refer throughout to this transmission as
“word of mouth,” I am unable to distinguish between consumers shar-
ing information among themselves and information that is exogenously
revealed after a movie’s release (e.g., late movie reviews, published box
office announcements, etc.). With that caveat in mind, the general idea
of my approach is that word of mouth will be revealed in a specific
and well-defined manner. Products presumably have unique differences
between consumer expectations and realizations (the information that
is relayed by word of mouth). All products, however, will begin their
commercial lifespans in the absence of such information. If a product
stays available for a long enough time and enough consumers purchase
it and share the product’s true quality, then the original consumer
expectations will be supplanted by the conveyed realized quality.1 This
systematic spread of information has implications for serial correlation
and heteroskedasticity, specifically that both will increase over a movie’s
theatrical run. My use of movie fixed effects in estimation alters this
prediction, in that heteroskedasticity and serial correlation based upon
these residuals will be nonmonotonic (specifically U-shaped) over each
movie’s run. The speed of information can then be inferred from the
nature and importance of asymmetry in these U-shaped relationships.

    1. Herding therefore does not arise in this context, as signals (in the form of word of
mouth) as well as actions are observable. See Bikhichandi et al. (1992). This framework
also implicitly assumes that consumers are aware of all products’ existence. Relaxing this
assumption and allowing word of mouth to affect the likelihood that a consumer knows
of a product’s existence is a potential extension, though a computationally burdensome
one. This burden arises as observed quantities are the weighted averages of all possible
combinations of consumer awareness. At the 50 weekly movies observed in my sample,
this would require the calculation of 250 − 1 probabilities of each potential combination
of movies and 50 ∗ 249 informationally conditional quantities.
Theatrical Movie Admissions                                            861

This proposed error components approach differs substantially from the
prior literature which either exploits additional information regarding
the sign and extent of the information gap (e.g., Chevalier and Mayzlin,
2006) or takes advantage of detailed microlevel data (e.g., Foster and
Rosenzweig, 1995).
      This paper is not the first to use movie admissions in an attempt to
document word of mouth. De Vany and Walls (1996) show that informa-
tion transmission among consumers will cause autocorrelated growth
rates among movies. They then document rejections of a Pareto Law
relationship in favor of positively autocorrelated growth, and interpret
this as evidence of word of mouth. This paper makes two improvements
upon that work. First, I show that positively autocorrelated growth can
also be generated by a simple model in which consumers typically buy
a product at most once (as seems likely in the theatrical movie industry
and many others). Second, my approach can not only detect the presence
of word of mouth, but can also provide measures of the importance of
that information transmission.
      The movie industry has been the subject of several other studies
in recent years. Moul (2001) finds that, from the late 1920s to early
1940s, studio revenues rose with experience using synchronous sound
recording technology and interprets this as evidence of qualitative
learning-by-doing. Einav (2007) decomposes the seasonal demand for
theatrical movies into an underlying component and the amplification
that arises from endogenous industry practices. Corts (1998) looks at
the issue of theatrical release timing in finding that a studio’s divisions
largely behave as an integrated whole. On the exhibition side, Davis
(2006) uses the location and prices of theaters to measure how cross-
price elasticities change with distance, and, on the production side,
Goettler and Leslie (2004) look at causes and consequences of two
studios cofinancing a movie. Finally, Ravid (1999) examines measures
of profitability over a movie’s entire commercial run rather than simply
revenues in a particular sector.
      This paper makes some contributions to this literature beyond its
conclusion regarding the importance of word of mouth in theatrical
admissions. My initial estimates of demand are generally plausible and
conform well with implications of other studies. As expected, ignoring
and imperfectly addressing saturation in demand substantially distort
the predicted impacts of Oscar nominations, awards, and abbreviated
weeks. Estimates of the full model suggest that Oscar nominations have
a substantial impact on admissions in equilibrium, but indicate no such
comparable bump from winning the award. Prior commercial perfor-
mance of both starring cast and director have significant impacts on
consumer expectations in equilibrium. There is also strong evidence of
862                           Journal of Economics & Management Strategy

heterogeneity among consumers in how they substitute between movies
and other goods, as well as between family and nonfamily movies.
      My error components approach reveals that information travels
quickly to the average consumer, though this is obviously dependent
upon how many people have seen the movie. For example, a movie that
has been seen by 200,000 persons by its fourth week can expect that
consumers behave as if they have incorporated 50% of the difference
between the movie’s true and originally expected quality. This level
of incorporation exceeds 90% for movies of comparable age that have
been seen by 1 million persons. Estimates of the importance of word of
mouth are also informative and robust. About 10% of the variance in the
model’s implied consumer expectations is attributable to information
transmission. Of the unobserved disturbances, roughly 38% of the
variance can be attributed to word of mouth. Word of mouth is less
critical in explaining serial correlation of unobserved demand shocks,
explaining about 32% of that phenomenon. Simulations from these
estimates indicate that a movie of average expectations but with good
word of mouth will gross $30 M more than a movie with the same
expectations and bad word of mouth. I conclude with a discussion of
implications for research on movies and other similar industries.

                             2. Industry
While the length and importance of the theatrical window of a movie’s
commercial run have recently diminished relative to the video market, it
still offers a unique advantage over other periods of a movie’s lifespan: at
release, consumers have only their expectations upon which to base their
decisions. These expectations will depend upon all information (e.g.,
trailers, movie reviews, etc.) that is available prior to the movie’s release.
As noted in De Vany and Walls (1996), the sector has institutionally
evolved to exploit favorable word of mouth. Moul and Shugan (2005)
further argue that the current strategy of wide release that replaced the
multi-run structure in the 1970s is at least in part an attempt to limit the
adverse effects of negative word of mouth. In the original structure of
theaters sequentially showing older prints, only movies with favorable
word of mouth could be expected to have long and lucrative runs.
The new structure offers the chance for movies that generate negative
word of mouth to yield at least some returns in the initial wide release,
even though the movie’s commercial run will be short as disappointed
consumers share their information.
       Contractual details between movie distributors (usually studios)
and exhibitors further suggest that information transmission and the
Theatrical Movie Admissions                                                               863

risks therein are paramount. Contracts during my sample centered
around the rental rate, a percentage of exhibitor box office receipts that
is ceded to the distributor.2 This percentage typically declines with the
length of time since a movie’s release; a common nonblockbuster rental
schedule is 60% the opening week, followed by 50%, 40%, 35%, and
30%. This declining rate attempts to compensate an exhibitor for not
switching to a newer movie.3 Other contractual details also suggest the
potential importance of information. Contracts during my data often
specified a 4-week timeline, but clauses required movies to be held
over if they did especially well. Conversely, exhibitors showing movies
that performed especially poorly were often prematurely excused from
their contractual obligations or, in less extreme circumstances, given
a split (allowed to show another of the distributor’s movies instead
of the underperforming title at some showtimes). One of the critical
determinants of demand, the number of exhibiting theaters, is thus
able to adjust in response to information transmission. The distributor
can also unilaterally adjust a movie’s advertising to react to word
of mouth. Advertising and promotion is typically the largest cost of
distribution, with printing the reels of film being the other primary
variable cost. While general advertising budgets are often set during
a movie’s production, it is relatively common to see television and
newspaper ads that reference positive word of mouth.

                                     3. Theory
3.1 Autocorrelated Growth and Its Causes
De Vany and Walls (1996) offer a detailed model of movie-goer behavior
and claim that word of mouth is the best explanation for the substantial
autocorrelation of growth in demand that they document. Their primary
empirical tests reject the linear relation between log(Revenues) and
log(Rank) implied by Pareto’s Law in favor of positively autocorrelated
growth.4 The method by which word of mouth can generate autocorre-
lated growth in demand is sufficiently intuitive that I will dispense with
a formal model in the following explanation.
      A subset of potential consumers sees a movie at its opening. These
consumers then share their opinions (i.e., how the movie compared to
their expectations) with their acquaintances, and these acquaintances

   2. Filson et al. (2005) offer evidence that the best explanation of this contract is risk-
sharing rather than a correction for a principal-agent problem.
   3. Given the shortening theatrical window, an increasing number of movies are
replacing this traditional declining rate with a fixed rate (e.g., 50%) of cumulative revenues.
   4. An application of Chesher’s (1979) more transparent approach yields the same
conclusion.
864                                   Journal of Economics & Management Strategy

then make their decision the following period. When this second
generation of viewers share their information, the source of the au-
tocorrelation in demand becomes apparent. Movies with realizations
that far exceeded consumer expectations will benefit from positive
word of mouth in both weeks, while the converse occurs when a
movie’s expectations exceed its realization. Consequently, growth rates
in demand will be positively correlated. While offering an explanation
of this autocorrelation, the approach taken by De Vany and Walls cannot
provide any sense of the magnitude of word of mouth.
      The presence of autocorrelated growth is a sufficient indicator of
word of mouth only if there are no other explanations. It is straightfor-
ward, however, to show that products that consumers usually purchase
only once will generally exhibit such autocorrelated growth. The model
of saturation that I consider can be characterized as

Qt, j = (Mj − PastAdmt, j )πt, j ,                                                    (1)

where Qt,j denotes weekly sales, Mj the potential consumer population
at the product’s launch, PastAdmt,j the cumulative admissions preced-
ing that period, and πt,j the probability of purchase. A finite pool of
consumers is gradually exhausted as people purchase the product. For
simplicity, consider the case where purchase-probability is exogenously
determined.5 I specify the purchase-probability as depending upon a
product-specific term and an idiosyncratic disturbance:

πt, j = θ j + εt, j , E(πt, j ) = θ j .                                               (2)

Growth (saturation) rates in sales then take the form
           πt, j
gt, j =           − πt, j − 1.                                                        (3)
          πt−1, j

Because the movie-specific component in the probability appears di-
rectly in the “growth rate,” high saturation rates last week are likely to be
followed by high saturation rates this week, and positive autocorrelated
growth results.
      An admittedly strong restriction on the above saturation model
has an additional implication that I will exploit, namely that there is
a direct transformation to relating log(Q) to a product’s age. Define a
movie’s Age as the number of full weeks since its release and assume

   5. It is straightforward to show that purchase-probabilities that include endogenous
variables such as screening intensity and advertising will amplify this autocorrelation.
Allowing for such purchase-probabilities to decline over time exogenously, as would
occur if consumers with the highest expectations see the movie first, has the same impact.
Theatrical Movie Admissions                                                        865

that πt,j = π for all movies and weeks.6 A movie in its opening week
therefore has Age = 0. Quantities then take the form
Qt, j = Mj (1 − π )Aget, j π                                                        (4)
and log-quantities
ln(Qt, j ) = ln(Mj ) + Aget, j ln(1 − π) + ln(π ).                                  (5)
Given that the discrete-choice model which supports my later work
effectively uses a variation of log-quantities as the dependent variable,
it is reassuring to know that there is some theoretical support for using
Age as a way to capture the saturation process.7

3.2 A Model of Demand
The model of demand that I consider here is very similar to that used
by Einav (2007), and one particular aspect of that application war-
rants discussion before considering the formal model. Several factors
which presumably affect consumer expectations are endogenously de-
termined. The number of theaters exhibiting the movie and the amount
of advertising for that movie are obvious examples. If word of mouth is
important, these endogenously determined variables will presumably
reflect (and amplify) the underlying word of mouth. “Sleeper hits”
(e.g., My Big Fat Greek Wedding) gradually expand their advertising
and number of showcases before eventually tapering off. Conversely,
high expectation “bombs” (e.g., The Hulk) see especially fast decays in
both advertising and screening intensity. Because I want to consider the
impacts of word of mouth on screening and advertising, I consider a
reduced form expression of demand (like Einav) in which the number
of exhibiting theaters and the amount of advertising have been “solved
out.” While a structural approach in which exhibiting theaters and
advertising are explicitly included is possible, it would necessarily limit
word of mouth to its direct (and presumably much smaller) effect. For
this initial application, I improve my chances of discerning word of
mouth by using the reduced form, and thus the estimated impact of
variables on demand in this paper should be interpreted as equilibrium
effects. For example, the estimated effect of an Oscar nomination is
the sum of both the immediate effect on consumer behavior and the
feedback effect of consumer responses to increased advertising and

    6. This simplifying assumption of course removes the source of the autocorrelated
growth.
    7. This specification has been the standard in the marketing literature using weekly
data. There it is interpreted to capture both saturation and consumer preferences for
“fresh” products.
866                                Journal of Economics & Management Strategy

screening intensity. The same pertains to the estimated impact of word
of mouth.
      Much of recent empirical industrial organization has made use
of the discrete choice model of demand, and I follow in this literature
introduced by Berry (1994). Let consumer i’s belief about the utility from
seeing movie j in week t take the form
Ui,t, j = δt, j + εi,t, j                                                             (6)
so that δ is the mean value of consumer utility and ε is the individual
deviation from that average utility. A consumer’s choice set also includes
the outside option of seeing no movie, and the utility of that outside
option is normalized to zero. Throughout, I will assume that mean
consumer utility takes the form
δt, j = Xt, j β +  j + Wt γ + f (Aget, j , α) + ξt, j .                              (7)
Variables that vary across both movies and weeks are included in X.
The three variables that I consider here are an indicator for whether the
movie’s week was abbreviated by a non-Friday release, and indicators
for whether the movie had been nominated or won a major Academy
Award prior to that week. Rather than estimate the impact of movie
characteristics upon consumer expectations, I instead use movie-specific
fixed effects  for movies observed for at least four weeks.8 Seasonality
variables (e.g., month and holiday indicators) are included in W. I
then consider the reduced form impact of a movie’s age on demand
(both directly and through screening intensity and advertising) with the
function f (•). I assume that all explanatory variables are uncorrelated
with the disturbance ξ .
       Consumers choose at most one movie from among their various
options to maximize utility. Under these conditions, Berry (1994) shows
that there exists a unique one-to-one mapping from observed quantities
and market size to products’ mean utilities (δ). If ε is drawn from the
logit distribution (Type 1 extreme value), then a closed-form solution for
these predicted quantities exists and this mapping is especially tractable:
                                    
                             
δt, j = ln(Qt, j ) − ln Mt −     Qt,k ,                                 (8)
                                k∈(t)

where M denotes the market size (i.e., all potential consumers), (t)
denotes the set of movies available in week t, and Q again denotes

   8. Movies that I observe 3 or fewer weeks have Xj β + χj included in their mean utility
specification. This matches the second-stage estimation where I use estimates of  as
dependent variables.
Theatrical Movie Admissions                                                        867

weekly quantities.9 The demand for the entire set of products is thus
characterized by the parametric specification of products’ mean utilities.
       The logit assumption, however, comes with the well-known price
of unreasonable substitution patterns. In this context, it may be an
onerous restriction to impose that all consumers are equally likely
to substitute to the option of seeing no movie should their favorite
movie become unavailable. I therefore separately use the nested logit
framework to allow consumer heterogeneity along three dimensions:
substitution between movies and nonmovies, between action and non-
action movies, and between family and nonfamily movies.10 In each
case the parameter µ equals zero if such heterogeneity is unimportant
and is bounded above by one if there is total segmentation. Applying
the assumptions appropriate for such a nesting, the transformation from
observed quantities to mean consumer utilities is now
                                   
                                             
δt, j = ln(Qt, j ) − ln Mt −    Qt,k − µ ln st,Nj ,                    (9)
                                           k∈(t)

                              Q
where st,Nj =        t, j
                                 and N(j) is the set of available movies that share
                 N(k)=N( j) Qt,k
movie j’s characteristic along dimension N. The general regression is
then
                                      
                            
ln(Qt, j ) − ln Mt −              Qt,k
                                  k∈(t)
                       
   = µ ln       st,Nj       + Xt, j β +  j + Wt γ + f (Aget, j , α) + ξt, j .     (10)
The disturbance ξ appears in contemporary quantities, and thus sN    t,j ,
by construction. Least squares estimation will therefore bias each µ
upwards, overstating the impact of segmentation, and instrumental
variable techniques are needed to consistently estimate µ.
     As pointed out by Berry (1994), an advantage of the discrete
choice framework beyond its parsimonious parameterization is the
provision of instruments based upon a product’s competitive envi-
ronment. Intuitively, the characteristics of rival movies will affect a
movie’s admissions, but those characteristics are themselves plausibly
uncorrelated with the disturbance ξ and excluded from the mean

    9. For notational convenience, I have dropped the population size that often appears
in the denominators of both terms and thereby transforms quantities into unconditional
market shares.
    10. Bresnahan et al. (1997) introduced the Principles of Differentiation Generalized
Extreme Value to address multiple types of consumer heterogeneity simultaneously. My
attempts to apply it in this context were prevented by excessive collinearity.
868                                Journal of Economics & Management Strategy

utility specification.11 I discuss the specific movie characteristics and
instruments that I employ in the data section below.

3.3 Word of Mouth
After the above parameters are estimated, I can turn to the composition
of ξ . Fixed effects for most of the movies absorb what would presumably
be the biggest such component, namely the average effect on consumer
expectations of movie characteristics that I cannot observe or measure.
Letting Dj be a binary variable indicating whether movie j has a fixed
effect (i.e., is observed at least four times) and PastAdmt,j be cumulative
admissions for movie j in week t, four other components of ξ plausibly
remain:
ξt, j = υt + ωt, j + (1 − D j )χ j + φ j P(PastAdmt, j , Aget, j , λ).              (11)
The first disturbance υt is the impact of week-specific variables that are
not captured by my variables in W. The second ωt,j is an idiosyncratic
disturbance that is specific to a particular movie in a particular week.
The mean valuation of a movie j’s unobserved characteristics if it does
not have a fixed effect is χj . Last is the word of mouth disturbance. The
gap between movie j’s true quality (j ) and its expected quality (j )
is represented by the parameter φj = j − j , so that φj > 0 indicates
that movie j has good word of mouth. Consistent with the reduced
form estimation of demand, I assume that consumers Bayesian update
their beliefs from the information provided by those who have seen
the movie. Given my lack of microlevel data, I assume that consumers
have identical priors and receive identical information, which together
imply that consumers have identical posteriors. That is, as information
about the movie spreads, consumers smoothly revise their beliefs about
a movie. Pt,j (•) then denotes the share of movie j’s word of mouth gap
that is incorporated into consumer posteriors as of week t.12
      The effective share of a movie’s word of mouth (henceforth WOM
share) will presumably depend upon both the length of a movie’s
theatrical run and the number of people who have seen the movie in
prior weeks. The relative importance of each of these variables will
hinge upon how consumers share information. Consumers telling a
fixed number of friends about the movie’s true quality each week when a
movie is in theatres will place special emphasis on the length of a movie’s
run. The opposite story of consumers telling a fixed number of friends

   11. As in Einav (2007), the identifying assumption within this reduced form context is
that the portions of equilibrium advertising and screening intensity that are uncaptured
by the exogenous regressors are not affected by rival movie characteristics.
   12. I am grateful to the referee who suggested this application of Bayesian updating.
Theatrical Movie Admissions                                                        869

regardless of a movie’s run length will instead focus more upon the
number of past admissions. Estimates using a flexible functional form
for this share and its implied responsiveness to the two variables should
be able to distinguish which of these explanations better describe the
data. Along these lines, my empirics begin with a simple interaction
of the two inputs (which imposes identical elasticities for age and
past admissions) and then examine a polynomial specification of these
interactions that allows for these implied elasticities to differ.
       Generally, P = 0 at a movie’s release, and λ are parameters that
capture the speed with which information travels. As P → 1, consumer
utilities are based upon a movie’s true quality j rather than its expected
quality j . Rational expectations among consumers ensure that the aver-
age word of mouth gap across movies is zero, E(φj ) = 0.13 For tractability
I assume that each disturbance is uncorrelated with the other three.
The variance of each component’s disturbance is assumed constant and
denoted by subscript: Var(υt ) = συ2 , Var(ωt, j ) = σω2 , Var(χ j ) = σχ2 , and
Var(φj ) = σφ2 . The magnitude of σφ2 and information’s speed captured by
λ relative to the variance of consumer mean expectations will describe
the importance of the word of mouth dynamic.
       This structure for the aggregate disturbance means that movie
fixed effects will not strictly capture a movie’s expectation  but will
instead reflect a weighted average of the expectation and the cumulative
effect of word of mouth. Specifically, each movie’s fixed effect will be
ˆ j =  j + φ j P• j , where P•j denotes the average WOM share over that
movie’s observed lifetime. This mixed estimate then implies that the
observed residuals are actually characterized as
ξt, j = υt + ωt, j + (1 − D j )χ j + φ j (Pt, j − D j P• j ).                      (12)
      There are numerous ways that observed residuals could be inter-
acted to match correlations with these parameters. I specify first-order
autoregressive patterns for week-specific and movie-and-week-specific
disturbances and exploit both the heteroskedasticity and the first-order
serial correlation implied by the above formulation. Taking the squared
residuals as the dependent variable generates the regression
ξt,2 j = συ2 + σω2 + (1 − D j )σχ2 + σφ2 (Pt, j − D j P• j )2 + u1t, j .           (13)
Intuitively, if word of mouth is present, my results should reveal a
nonmonotonic relationship between a movie’s age and the extent of

   13. Note that this restriction is over the set of movies, not all observations. It is
therefore not inconsistent with the expectation that movies with good word of mouth will
have longer theatrical runs than movies with bad word of mouth, thereby comprising a
disproportionately large share of observations.
870                                 Journal of Economics & Management Strategy

its heteroskedasticity. The speed of information and the relevant shape
of the WOM share function are revealed by the extent of asymmetry in
(Pt,j − Dj P•j )2 . If WOM shares rise evenly over a movie’s run, then the
relationship is symmetric, but especially fast information diffusion early
(late) in the run will generate steeper slopes on the left (right) side. The
separate identification of σφ2 and λ is provided by the quadratic form
and the nonlinearity of P(•).
       The basis of serial correlation can likewise be shown by interacting
a movie’s residual from one week with its residual from the prior week.
ξt, j ξt−1, j = ρυ συ2 + ρω σω2 + (1 − D j )σχ2
              + σφ2 (Pt, j − D j P• j )(Pt−1, j − D j P• j ) + u2t, j .             (14)
The empirical task is then to disentangle the serial correlation that
comes from the word of mouth process from the serial correlation that
stems from autocorrelation of weekly disturbances and idiosyncratic
disturbances. The same intuition regarding the identification of infor-
mation speed applies. Common parameters across equations suggest
joint estimation, and results show that freely estimated parameters are
quite similar to their restricted counterparts.
       Estimates of σφ2 and λ that are significantly greater than zero
then indicate the presence of word of mouth. The importance of word
of mouth in variance, however, is better captured by the ratio of
σφ2 E((Pt,j (λ) − P•j (λ))2 ) to Var(δt,j ). This measure captures the percentage
of variation in consumer expectations that the model attributes to word
of mouth and its supply-side responses. Other instructive measures are
the percentages of the disturbance’s variance Var(ξt,j ) or the extent of
serial correlation E(ξt,j ξt−1,j ) that can be traced back to word of mouth.

                                       4. Data
I gather the data with which I will estimate the above equations from
Variety, the motion picture industry’s trade magazine. Variety publishes
the revenues (REV t,j ) of the fifty highest grossing movies in the U.S. and
Canada each week, where weeks run from Friday to Thursday. This Top
50 listing is fairly exhaustive; the typical 50th ranked movie grossed
only $50,500. My data set begins in August 1990 and continues through
December 1996, spanning a total of 332 weeks. This sample then consists
of 16,600 observations of 1602 unique movies, 1252 of which I observe
for at least 4 weeks.14 Because admissions prices rose by almost 20%
over this time, analysis based upon revenues may be misleading. Using

   14. The empirical measure of autocorrelation naturally has fewer observations (14494).
Theatrical Movie Admissions                                                          871

an annual average admissions price (Price),15 I linearly extrapolate and
construct implied quantities (Qt,j = REV t,j /Pricet ).16 Consumer popula-
tion M is then the combined population of the U.S. and Canada.17 This
population measure is almost assuredly an overstatement of the number
of consumers who might see a movie in a given week, and I will consider
a fraction of this population in robustness checks.
      Even though I am exploiting movie fixed effects, movie-specific
characteristics are useful to get a sense of what drives these fixed effects
and are essential to create sufficiently powerful instruments. The ap-
pendix describes in detail my measures of cast and director appeal, but
both essentially make use of the box office history of movies within the
prior five years (in billions of dollars). Demand estimation using the
nested logit also requires some measure of a movie’s genre. I define
the action genre as any movie that is listed as Action or Adventure by
the Internet Movie Database. A movie falls within the family genre if it
is rated either G or PG.
      I also consider several variables that vary across both movies
and weeks. For both the demand and word-of-mouth regressions, I
define a movie’s Age as the number of weeks that a movie has already
spent on Variety’s Top 50, so that a movie in its opening week faces
Age = 0. This differs from the number of weeks since a movie’s release
because movies are sometimes removed from theatrical circulation
and then reintroduced or alternatively fall from the Top 50 and then
return. PastAdm is defined as the cumulative admissions of a movie
prior to a week (in millions), so that a movie has PastAdm = 0 at its
opening. The announcement of Academy Award nominations and the
Oscar awards themselves have received some attention in the movie
economics literature (Nelson et al., 2001). Academy Award nominations
are traditionally announced on a Tuesday morning, and (during my
data’s time frame) the Oscar ceremony was held on a Monday night. To
this end, I consider a OscNom?tj binary variable that equals one for all
weeks following (but not including the week of) the announcement that
movie j has received a nomination in any of the six major categories.18 I
define OscWin?tj analogously, so that its estimated effect should indicate
the additional impact in equilibrium beyond the necessary nomination.
      As mentioned in the discussion of the exogenous regressors,
movies are sometimes released on days other than Friday, with
    15. National Association of Theater Owners (2004).
    16. Admissions prices tend to be fixed over time and across movies at a given theater.
While the rigidity continues to trouble economists and lawyers (Orbach and Einav, 2007),
this idiosyncrasy of the exhibition sector greatly facilitates modeling.
    17. U.S. Census Bureau, Statistics Canada respectively.
    18. Best Picture, Best Director, Best Actor, Best Actress, Best Supporting Actor, and
Best Supporting Actress.
872                               Journal of Economics & Management Strategy

Wednesdays being the most common non-Friday release day. Those
movies therefore have an abbreviated week over which to accumulate
their admissions, a situation I denote by setting the binary variable
AbbWk? equal to one, zero otherwise. For instance, the first week admis-
sions for movies released on Wednesday are limited to sales on Wednes-
day and Thursday. Institutional knowledge offers an opportunity to
bound some of these parameter values in advance. Recent daily data
(Davis, 2006; Switzer, 2004) indicate that weekends make up between
66% and 72% of a nonholiday week’s admissions.19 Figures derived from
revenues are comparable. Other days of the week are roughly similar
and average between 7.2% and 8.4% of a week’s admissions.
      Using these figures, the normal daily breakdown of admissions
over the week suggests that a movie with such a Wednesday release prior
to an ordinary weekend would garner about 16% of the admissions that
it would have received with a counterfactual release on the prior Friday.
Most of these non-Friday releases, however, precede holiday weeks,
with Thanksgiving being a recurring example. In that case, the observed
week essentially replaces its Wednesday with a second Friday and its
Thursday with a second Saturday. If the typical weekend days accurately
capture the demand at these holiday weekdays, then a movie with a
Wednesday release prior to a holiday weekend would have 36%–39%
of the admissions that it would have received with a release the Friday
before. Assuming that Thanksgiving (or any other weekday holiday) is
no more convenient to see a movie than a normal Saturday then offers an
                   Q (AbbWk?=1)
upper bound on Q jj (AbbWk?=0) ≤ 0.4. Given that some Wednesday releases
occur before ordinary weekends and Thanksgiving (in particular) is
arguably less conducive to theatrical movies than an ordinary Saturday,
the average of these predicted ratios being substantially less than 0.4 is
likely.
      Week-specific regressors include linear extrapolations of the an-
nual average admissions price and the monthly U.S. unemployment rate
in my week-specific variables. Given the important role of seasonality
in demand (Einav, 2007), I also include dummy variables for each
month (excluding January) and eleven major holidays.20 In the demand
estimation, I estimate movie fixed effects only for those movies for which
I have at least four observations and include cast appeal, director appeal,

    19. Davis (2006) exploits a national survey from a single week (June 21–27, 1996).
Switzer (2004) considers a comprehensive data set from September 2001 to June 2002 in
St. Louis, MO.
    20. In chronological order, these holidays are New Year’s Day, Martin Luther King,
Jr. Day, Presidents’ Day, Easter, Memorial Day, Independence Day, Labor Day, Columbus
Day, Veteran’s Day, Thanksgiving, and Christmas.
Theatrical Movie Admissions                                                     873

                                   Table I.
                      Variable Definitions
Rev            Weekly revenues
Price          Average admissions price
Q              Weekly admissions (Rev/Price)
AbbWk?         Indicator of non-Friday release (abbreviated week)
OscNom?        Indicator of Oscar nomination prior to given week
OscWin?        Indicator of Oscar award prior to given week
Age            Number of prior weeks spent in Variety Top 50
PastAdm        Cumulative admissions (in millions)
CastApp        Total prior 5-year revenue history of starring cast/# of opportunities
DirApp         Total prior 5-year revenue history of director
AC?            Indicator of whether movie is action/adventure
FA?            Indicator of whether movie is family (G or PG rating)
sMovie         Conditional market share (Quantity/Total quantity sold that week)
sAc            Quantity/Total quantity of movies that share Action status that week
sFa            Quantity/Total quantity of movies that share Family status that week

action and family dummies, and an intercept for movies with fewer than
four observations.
       Table I displays variable definitions and sources, and Table II
displays summary statistics. The use of movie fixed effects does not
spare me from the necessity of finding some explanatory variables of
sufficient power in the case of the nested logit. As results from demand
will show, a movie’s cast appeal and director appeal (determined prior
to release and defined in the Appendix) are both positively correlated
with a movie’s fixed effect estimate. I therefore consider age-weighted
measures of competing movies’ appeal as instruments. Specifically, I
utilize these variables for competing movies that have been released
within the last 4 weeks (see Table III for instrument definitions). Thus,
this measure of a movie’s competitive environment is higher when
high appeal rival movies are younger. The validity of these instruments
obviously hinges upon the discrete choice model’s assumptions, but
their power can be illustrated with the data. In first stage regressions
(not reported) with ln (sN ) as the dependent variable, all six proposed
instruments have t-statistics (in absolute value) that exceed six and the
t-statistics of five instruments exceed eight. IV regressions therefore
should not suffer from the weak instrument problem.

                             5. Evidence
5.1 Demand
Least squares estimates of the logit model of demand (µ imposed to
be zero) are shown in Table IV. Asymptotic standard errors make use
874                                                Journal of Economics & Management Strategy

                                                         Table II.
                                       Summary Statistics
                                                   Mean      Median        Min.               Max.        Std. Dev.

By observation (N = 16,600)
  Rev (in millions)                             1.8781        0.3595     0.0025               79.2175        4.3208
  Q (in millions)                               0.4419        0.0849     0.0006               19.1208        1.0135
  Q/M                                           0.0015        0.0003     2.13E-06              0.0668        0.0035
  δ logit = ln(Q) − ln(M − Q)                 −7.8021       −8.0516   −12.9180               −2.5833        1.6553
  sMovie                                        0.0200        0.0042     1.60E-05              0.5903        0.0399
  ln(sMovie )                                  −5.2640       −5.4774   −11.0441               −0.5271        1.6793
  sAc                                           0.0400        0.0084     2.56E-05              0.9154        0.0788
  ln(sAc )                                     −4.6013       −4.7777   −10.5901               −0.0884        1.7182
  sFa                                           0.0400        0.0079      5.64E-05             0.9189        0.0823
  ln(sFa )                                     −4.6688       −4.8354     −9.7838              −0.0846        1.7687
  Age                                              8.0008     6            0                  70            7.8242
  PastAdm (in Ms)                                  6.8014     2.8936       0                  83.9052      10.7196
  AbbWk?                                           0.0049     0            0                   1            0.0697
  OscNom?                                          0.0502     0            0                   1            0.2183
  OscWin?                                          0.0120     0            0                   1            0.1088
By movie (n = 1602)
  AbbWk?                                           0.0506     0            0                   1             0.2192
  CastApp                                          0.0161     0            0                   0.1419        0.0223
  DirApp                                           0.0325     0            0                   0.7175        0.0672
  Action?                                          0.2541     0            0                   1             0.4355
  Family?                                          0.2010     0            0                   1             0.4009

                                                         Table III.
                           Definitions of Instruments
A(j, t) = {other movies available in week t that share Action or non-Action status
           with movie j}
F(j, t) = {other movies available in week t that share Family or non-Family status
           with movie j}

                  τ =t            Ca st App τ,k                                      τ =t           Dir App τ,k
MCa st t, j =       k = j,τ =t−3    Age τ,k +1                         MDir t, j =       k = j,τ =t−3 Age τ,k +1
                  τ =t                 Ca st App τ,k                                 τ =t              Dir App τ,k
ACa st t, j =       k=A( j,t),τ =t−3     Age τ,k +1                    ADir t, j =     k=A( j,t),τ =t−3 Age τ,k +1
                  τ =t                  Ca st App τ,k                                τ =t              Dir App τ,k
 F Ca st t, j =     k=F ( j,t),τ =t−3     Age τ,k +1                   F Dir t, j =    k=F ( j,t),τ =t−3 Age τ,k +1

of the Newey-West (1994) covariance matrix, allowing for arbitrary het-
eroskedasticity and incorporating 3 weeks of lagged residuals to account
for serial correlation. As the later residual analysis will make heavy use
of the age regressor, it is important to ensure that the impact of age on the
underlying demand is sufficiently robust. Consequently, I estimate six
Table IV.
                                                Least Squares Logit Estimates of Demand
                                          I                           II                          III                            IV                         V                          IV
                                   b          a.s.e.          b            a.s.e.          b             a.s.e.           b           a.s.e.         b          a.s.e.           b           a.s.e.

                                                              1st stage: n = 16,600, DV = ln(Qtj ) − ln(Mt − Qtk )
Age                               —                         −0.16      0.01      −0.27       0.01          —                                        —                          —
Age2                              —                          —                     0.0037    0.0002        —                                        —                          —
ln(Age + 1)                       —                          —                      —                     −1.28                       0.02         −0.30         0.05          —
ln(Age + 1)2                      —                          —                      —                      —                                       −0.33         0.02          —
                                                                                                                                                                                                       Theatrical Movie Admissions

α Age                             —                          —                      —                      —                                        —                          0.076         0.004
exp(−α Age Age)                   —                          —                      —                      —                                        —                          5.06          0.15
AbbWk?                            0.50         0.14          0.57      0.12      −1.02       0.12         −1.70                       0.13         −1.20         0.13         −1.24          0.12
OscNom?                          −1.22         0.12          1.04      0.13        0.91      0.11          0.41                       0.11          0.92         0.11          0.80          0.11
OscWin?                          −0.70         0.18          0.75      0.19        0.00      0.18         −0.22                       0.17          0.18         0.16         −0.08          0.16
R2 (ln(sj /s0 ) as DV)            0.42                       0.67                  0.73                    0.71                                     0.73                       0.74
R2 (Qj /M as DV)                  0.07                       0.39                  0.58                    0.53                                     0.58                       0.61
Ab%                                1.65                        0.57                       0.36                            0.19                       0.30                       0.29
OscNom%                            0.30                        2.82                       2.48                            1.51                       2.50                       2.21
OscWin%                            0.50                        2.11                       1.00                            0.80                       1.20                       0.92
                                                                              2nd stage: n = 1252, DV = 
CastAppeal                        15.56        1.43         16.45          1.58      17.62       1.64     17.42                       1.57         17.39         1.64        17.67           1.65
DirectorAppeal                     3.36        0.45          4.32          0.49       4.60       0.47      4.39                       0.45          4.53         0.48         4.57           0.47
Action?                            0.33        0.07          0.40          0.07       0.41       0.07      0.40                       0.07          0.41         0.07         0.41           0.07
Family?                            0.21        0.08          0.50          0.08       0.55       0.09      0.51                       0.08          0.55         0.09         0.55           0.09
Constant                          23.54        0.05         −2.76          0.05      −8.13       0.05     −3.68                       0.05         −7.65         0.05       −13.23           0.05
R2 (j as DV)                     0.20                         0.25                       0.26                            0.26                       0.26                       0.26
R2 (Qj /M as DV)                 −0.04                         0.11                       0.18                            0.21                       0.20                       0.21
                                                                                                                                                                                                       875

Notes: All estimates use movie fixed effects for movies which are observed for at least 4 weeks. All first-stage a.s.e. are based upon Newey-West covariance structure with 3 week lag. Second stage
estimation uses OLS with White-corrected a.s.e. Estimated coefficients of weekly variables, movie characteristics for movies without fixed effects, and movie fixed effects not shown.
876                            Journal of Economics & Management Strategy

regressions reflecting different specifications of the function f (Aget,j , α).
I then regress the estimated movie fixed effects on my cast and director
appeal variables, genre dummies, and a constant. Besides offering some
empirical support for my instruments, these second-stage results can
shed additional light in the movie economics literature on what pre-
release variables have explanatory power. They also offer an opportu-
nity to consider how much the industry’s variation in quantities can be
explained in the usual absence of such estimated movie fixed effects.
       The differences between results that ignore even first-order im-
pacts of age and those that include Age as a regressor are intuitive (Table
IV (I and II). Excluding Age forces the model to inflate the estimated effect
of an abbreviated week (because AbbWk? necessarily coincides with the
youngest movies), suggesting that abbreviated weeks actually increase
demand over the hypothetical full week. Likewise, ignoring saturation
dramatically decreases the estimated impact of the Oscar variables
(because both only occur late in a movie’s commercial run). Measures
of fit to the dependent variable δ and the more interesting purchase-
probability (Qt,j /Mt ) both rise appreciably when the Age regressor is
included. The results of the second-stage regression do not differ much
across the two specifications. While cast appeal has often been found
to affect demand, these results further confirm the less widely noted
significant impact of a movie’s director on consumer expectations as
found by Chen, Mitra, and Shugan (2006).
       The age quadratic (Table 4 (III)) extends this trend for the estimated
impact of an abbreviated week. This specification yields an average
estimated ratio of actual revenues to hypothetical revenues of about 36%
(shown below first stage measures of fit as Ab%), within the bounds indi-
cated by both Davis and Switzer [0.15, 0.40]. The estimated impacts of the
Oscar variables, however, change markedly. While including Age alone
indicates that both nominations and awards have statistically significant
impacts on consumer expectation, the quadratic specification shows that
the nomination result is somewhat inflated and that the award result is
entirely spurious. At these point estimates, a few observations (184 of
16,600) have such high ages that the quadratic specification suggests
that mean utility is increasing in Age. I therefore also consider three
specifications where such nonmonotonicity is impossible or less likely.
       Table IV (IV and V) display results when ln (Age + 1) is used alone
and when it is supplemented with ln (Age + 1)2 . Using ln (Age + 1) alone
appears to overstate the initial impact of a movie’s age on its demand,
but the quadratic specification generates results comparable to those of
Table 4(III) without the unappealing implication of mean utility increas-
ing for especially long-lived movies. As a last functional form, Table IV
(VI) shows NLLS estimates in which I use an exponential function as the
Theatrical Movie Admissions                                                           877

age specification: f (Aget,j , α) = α1 exp(−α0 Aget,j ). The logit fit improves
only marginally with the nonlinear specification, but there is a greater
improvement in the fit to purchase-probabilities (Q/M). In other results,
the impacts of an Oscar nomination and an abbreviated week are both
further diminished. Given these (slight) improvements, I will use this
exponential form for the remaining results. Going forward, it seems
that one can conclude from the second-stage estimates that both action
and family movies gross more than their nonaction and non-family
counterparts and that this simple model can explain about 20% of the
industry’s variation in admissions.
       While the logit results are useful for expressing correlations be-
tween the regressors and quantities, the imposed restriction that con-
sumers are homogeneous in their perceived substitutability between
movies of different genres or between movies and the nonmovie option
is likely untenable. I therefore reestimate the unrestricted demand using
the Generalized Method of Moments (Hansen, 1982) and the afore-
mentioned competitive environment instruments.21 Table V displays
results, with the earlier logit results shown in Table V(I) for comparison.
In all regressions, estimates indicate that segmentation is statistically
significant at conventional levels, and neither the movie or family
segmentation models are rejected by the implied J-statistic. Potential
segmentation between action and nonaction movies is estimated to be
the weakest, and it could be argued that such segmentation is not eco-
nomically important. Consumer heterogeneity regarding substitution
between movies and the nonmovie option and between family and
nonfamily movies, however, are estimated to be quite important. The
conclusion regarding the latter (Table V(IV)) speaks to Ravid’s (1999)
finding that G-rated movies are historically quite profitable. While
the average consumer preference may be a dislike for Family movies
(evidenced by the negative coefficient in the second stage), there exists
a population (e.g., parents with small children) that will only consider G
and PG rated movies. If competition for these consumers is light, such a
movie could be lucrative, especially given the typically low production
costs of such movies.
       Implications of the demand estimates are stable across these
different specifications. In all, receiving at least one Oscar nomination
in a major category boosts demand about 120%, but winning a com-
parable Oscar has no significant impact. Movies facing abbreviated
weeks receive about 30% of the demand from their counterfactual
week, again within the bounds suggested by other data. The variables

    21. In each case, a Hausman statistic clearly rejects the hypothesis that least squares
estimation of the nested logit specification is consistent.
Table V.
                                                                                                                                                                                                878
                                                     Logit and Nested Logit Demand Estimates
                                  I                        II                            III                         IV                           V                            VI
                           b          a.s.e.         b            a.s.e.           b           a.s.e.          b            a.s.e.          b            a.s.e.          b            a.s.e.

                                                                   1st stage: n = 16,600, DV = ln(Qtj ) − ln(Mt − Qtk ))
α0                        0.076         0.004       0.076           0.004        0.076           0.004        0.076           0.004        0.076           0.004         0.076         0.004
exp(−α 0 Age)             5.06          0.15        1.92            0.18         4.40            0.26         2.76            0.17         2.49            0.24          3.01          0.19
ln(sMovie )               —                         0.62            0.03         —                            —                            0.51            0.04          —
ln(sAc )                  —                         —                            0.14            0.05         —                            —                             —
ln(sFa )                  —                         —                            —                            0.47            0.03         —                             0.42          0.03
AbbWk?                   −1.24          0.12      −0.44             0.07       −1.06             0.12       −0.64             0.08       −0.56             0.09        −0.68           0.09
OscNom?                   0.80          0.11        0.30            0.05         0.69            0.10         0.43            0.07         0.39            0.07          0.47          0.07
OscWin?                  −0.08          0.16      −0.04             0.06       −0.06             0.14         0.00            0.09       −0.04             0.08        −0.01           0.10
Mkt Size                 Pop                       Pop                          Pop                          Pop                       0.25∗Pop                      0.25∗Pop
method                   NLLS                     GMM                          GMM                          GMM                          GMM                           GMM
IVs                      None                   MCast, MDir                  ACast, ADir                  FCast, FDir                 MCast, MDir                   FCast, FDir
Ab%                       0.29                      0.32                         0.29                         0.31                         0.33                          0.30
OscNom%                   2.21                      2.21                         2.22                         2.21                         2.21                          2.12
Var(δ(µ))                 2.74                      0.45                         2.05                         0.89                         0.78                          1.09
R2 (ln(sj /s0 ) as DV)    0.74                      0.96                         0.80                         0.92                         0.93                          0.90
R2 (δ j (µ) as DV)        0.74                      0.75                         0.74                         0.74                         0.75                          0.74
R2 (Qj /M as DV)          0.61                      0.69                         0.63                         0.70                         0.70                          0.64
Pr(Don’t reject)          —                         0.24                         0.05                         0.17                         0.11                          0.05
                                                                                  2nd: n = 1252, DV = 
CastAppeal                17.67         1.65        6.82            0.62         15.35           1.42         9.54            0.88         8.87            0.81        10.51           0.96
DirectorAppeal             4.57         0.47        1.75            0.18          3.94           0.41         2.47            0.26         2.27            0.23         2.71           0.28
Action?                    0.41         0.07        0.16            0.03          0.29           0.06         0.22            0.04         0.20            0.04         0.24           0.04
Family?                    0.55         0.09        0.22            0.03          0.48           0.07        −0.23            0.05         0.29            0.04        −0.14           0.05
Constant                 −13.23         0.05       −7.84            0.02        −11.69           0.05        −9.74            0.03        −7.88            0.03        −9.20           0.03
R2 (j as DV)              0.26                     0.27                          0.25                        0.25                         0.27                         0.25
R2 (Qj /M as DV)           0.21                     0.29                          0.23                        0.29                         0.29                         0.26

Notes: All estimates use movie fixed effects for movies observed for at least 4 weeks. GMM weights and all first-stage a.s.e. are based upon Newey-West covariance structure with 3 week lag.
Second stage estimation uses OLS with White-corrected a.s.e. Some coefficients not reported.
                                                                                                                                                                                                Journal of Economics & Management Strategy
Theatrical Movie Admissions                                             879

and fixed effects can explain about 75% of the variation in mean utili-
ties, and predicted purchase-probabilities using estimated fixed effects
capture between 60% and 70% of the variation in observed purchase-
probabilities. Predicted purchase-probabilities using the second-stage
coefficients instead of the estimated fixed effects explain about 29% of
the variation, as opposed to 21% for the straight logit. In all, the model
(especially when making use of the fixed effects) appears to do a fair job
of matching the data.
       As mentioned in Section 3, all these estimates are based upon
the pool of potential consumers being the combined population of
the United States and Canada. Demand estimates using the discrete-
choice model are typically robust to one’s choice of market size, but my
model’s emphasis on the impact of word of mouth on mean consumer
expectation could generate difficulties when estimating the primary
parameters of interest. To see whether my information transmission
results are robust to choice of market size, I therefore reestimate the
more successful nested logit models (segmentation by movie and by
family) using an assumed market size of 25% of the full population.
Table V(V and VI) displays these results. The only differences by market
size appear to be a lower estimation of the nested logit parameter µ
(leading to an upward rescaling of all other multiplicative coefficients)
and a higher probability of rejecting the validity of the exclusion restric-
tions. Segmentation between inside and outside options is often used to
minimize the importance of the choice of market size, so this is at some
level unsurprising. The primary robustness concern, however, pertains
to the word of mouth parameters, and so I will return later to apply
these quarter-population estimates.

5.2 Analysis of Error Components
This analysis hopes to exploit heteroskedasticity and serial correlation
across disturbances (all residuals are taken from the nested logit using
ln (sMovie )). A logical first step is to document such features of the data
before applying additional structure. The evidence of serial correlation,
albeit of undetermined source, is strong. Using residuals e = δ − Xβ̂
from the 14,494 observations in which I see the same movie in consecu-
tive weeks in a standard AR(1) regression yields
e t, j = 0.662e t−1, j + t, j
                                                                       (15)
         0.005.
The standard error is beneath the point estimate, and the fit is reasonable
(R2 = 0.55). The presence of heteroskedasticity is also clear (Var(e2tj ) =
0.05 vs. E(e2tj ) = 0.11), but again the source is uncertain. Given that the
880                                 Journal of Economics & Management Strategy

model’s predictions are regarding how these measures change over a
movie’s run, these findings are necessary but far from sufficient for
identifying word of mouth.
        It is straightforward to plot a time series of heteroskedasticity and
serial correlation for any particular movie, but cross-movie comparison
is complicated by the varying length of movies’ theatrical runs. The
general prediction, however, pertains to the location of observations
relative to the total movie run. I therefore consider only movies that
I observe at least four times (the same criterion for the application of
movie fixed effects) and for the entire theatrical run, focusing upon how
the residual relationships change over fractions of the movie’s run. These
restrictions reduce my number of movies for the heteroskedasticity
graph to 1190. I then assign the opening and closing week’s values to
endpoints and linearly extrapolate between intermediate points. My
approach is essentially the same for the serial correlation graph, except
that I further restrict the movies so that only movies observed for 2
consecutive weeks at least twice are included, reducing the sample
to 1186 movies. The following graphs are then simple averages across
movies at 101 points from 0 to 1 inclusive in 0.01 increments.
        Figure 1 shows a graph of the average imputed e2 over the set
of movies for different fractions of the total theatrical run. This het-
eroskedasticity graph reaches its minimum at about 0.4, and around
this minimum there is a strongly asymmetric and nonmonotonic shape.
Both the minimum being to the left of the midpoint and the steep
drop-off early in movie runs suggest that information is well dispersed
quickly (i.e., increases in the WOM share are dramatic early in the run
and modest later in the run). The same features are evident for serial
correlation in Figure 2, which shows a graph of the average imputed
et et−1 over the appropriate set of movies. Both figures then suggest that
the residuals exhibit the underlying features for the following analysis
to discern word of mouth and the speed of information.
        I begin with a exponential functional form for the WOM share P:
Ptj = 1 − exp(−λAge t, j Pa st Admt, j ).                                        (16)
Table VI(I) displays NLLS results of this joint estimation of equations
(12) and (13) from Section 3:
                                                               2
      e t,2 j = συ2 + σω2 + (1 − D j )σχ2 + σφ2 Pt, j − D j P• j + u1t, j (17)
e t, j e t−1, j = ρυ συ2 + ρω σω2 + (1 − D j )σχ2
                                                        
                  + σφ2 Pt, j − D j P• j Pt−1, j − D j P• j + u2t, j .           (18)

Identified parameters are λ, (συ2 + σω2 ), σχ2 , σφ2 , and (ρυ συ2 + ρω σω2 ).
You can also read