Seat Value Based Revenue Implications for Baseball

Page created by Clinton Moss
 
CONTINUE READING
Seat Value Based Revenue Implications for Baseball

                          Senthil Veeraraghavan∗• Ramnath Vaidyanathan
            OPIM Department, Wharton School, 3730 Walnut St, Philadelphia, PA 19104, USA

                         senthilv@wharton.upenn.edu • ramnathv@wharton.upenn.edu

                                                 March 2008

                                                   Abstract

      We study how patrons attending a baseball game value different seats in a professional league
      stadium based on their location and view. Most surprisingly, we find that customers seated in
      symmetric seats on left and right fields might derive very different valuations from the game.
      Thus, commonly followed symmetric pricing mechanisms in baseball stadiums might be inef-
      ficient from a revenue management perspective. Furthermore, customers perceive higher net
      valuations at distantly located upper deck seats and some outfield seats, possibly because of
      easy availability and low prices. Consistent with the notion of baseball being an experience
      good, more frequent customers to the baseball stadium experience lower variance in their val-
      uations of the game compared to first time visitors. Thus, we quantify the impact of repeated
      visits on customers’ learning of the valuation a seat might provide. Younger customers have
      higher variance in their valuations compared to customers who are older. Our research on cus-
      tomer valuations quantifies the significant influence of the seat locations on customers’ ex-post
      valuations of the game. The findings provide a novel opportunity for the teams to price tickets
      differentially and asymmetrically to accommodate varying seat valuations, using seat value mea-
      sures in different sections of the stadium. Keywords: Ordinal Logit Models, Customer Behavior,
      Revenue Management, Seat Valuations, Baseball, Experience goods.
   ∗
     Corresponding author. The authors would like to thank the NPB franchise and Yuta Namiki for the survey. We
would like to thank Ken Shropshire, Scott Rosner and the Wharton Sports Business Initiative for their support of
the project. Special thanks to Eric Bradlow, Gerard Cachon, Serguei Netessine, Devin Pope, and the participants at
WSBI seminar for their thoughtful comments.

                                                        1
1    Introduction

There is very limited research on how consumers value different seats in baseball stadiums. Despite

the paucity of research in this area, seat location discussions have been the focus of several articles

in the popular press, especially recently. In 2006, the Oakland Athletics decided to reduce the

capacity of McAfee Coliseum (where their home games are played) by covering several of their

seats with tarpaulin sheets, thus reducing the stadium capacity from 44,000 seats to about 34,077

seats (Urban 2005). The Oakland A’s announced that the decision was made in order to provide an

“intimate” experience to those in attendance, in a smaller field. In fact, when the team moves to a

newer field for the 2008 season, they will play in a stadium that has lesser planned capacity (32,000)

than the currently-used, tarpaulin-covered stadium. Bnet.com quoted “...the fans who are feeling

slighted most are the lower-income brackets who feel the third deck was their last affordable large-

scale refuge for a seat behind home plate, even one so high.”. The team management contended

that people liked the third deck mostly because of availability and price, and perhaps not so much

because of the view (Steward 2006). One article on Slate Magazine criticized the move, stating

“Some of us want to sit far away” (Craggs 2006). Thus, the valuation received by consumers seated

at the upper deck was not only unclear, but also varied among different fans. So is it true that the

consumers seated in the upper deck valued those seats highly? Are the net valuations perceived

by the consumers different across seats? Our research addresses such questions using data from a

professional league stadium in Japan.

    Employing statistical analysis of player performance in making drafting and payroll decisions,

and signing free-agents, is a common practice in present-day professional baseball. By re-evaluating

the strategies which produce wins on the field, the Oakland Athletics (A’s) were competitive with

several larger market teams, with approximately less than half the payroll of leading professional

teams. Managing such a high performance team with lower operating expenses has been discussed

in Moneyball (Lewis 2003), and “Sabrematics” popularized in the book is now a part of baseball

folklore. In the wake of the popularization of Sabrematics, several other professional baseball

franchises have employed full time analysts at the front office, thereby diminishing the competitive

edge enjoyed by the A’s. Consequently, several franchises have been examining new levers through

which they can continue to stay competitive while minimizing their expenses and/or increasing

                                                  2
revenues.

      Professional clubs (major leagues) generally have three sources of income: (1) Local income, (2)

Shared Broadcasting and Licensing deals, and (3) Revenue sharing across teams (Source: mlb.com).

Local income includes ticket sales, local broadcasting rights fees, parking fees, and local team

sponsorships; these generally form a significant portion of the revenues for any professional franchise.

According to CFO.com, local revenue is the biggest factor that explains disparity between teams.

However, both theory and application of revenue management in non-travel related industries,

especially in the sports business, has been very limited (See Talluri and van Ryzin (2004) for a

survey of applications of revenue management across various industry profiles). In this work, we

examine new revenue opportunities that would improve customer valuations and stadium revenues.

Through this paper, we seek to provide insights on how customers perceive different seats, which

in turn provides some opportunities for teams to improve customer traction and revenues.

      A careful analysis of heterogeneity in customer seat valuations can boost ticketing revenues,

and also increase additional revenues accrued through concession stands and parking fees. In a

league that has no salary cap structure,1 generation of local revenues can assist a team in freeing

up some space for payroll. However, revenue management schemes in the baseball industry are not

employed in full scale, unlike in the airline industry.

      This is possibly because there are many challenges in the implementation of revenue management

strategies in baseball stadiums. One primary difference between the baseball and airline industries

is that while airline seats are well defined products, baseball seats are not. Although there are

differences between aisle seats and middle seats, most seats in the same (business or economy) class

provide comparable valuations for consumers. Usually, an airline seat is considered simply as a

conduit for transporting a person from an origin to a destination. Therefore, for the most part, the

price of a ticket in economy class indicates how much a person values the trip, more than how much

he values the seat itself. However, (view from) baseball seats might be thought of as experience

goods. It is unclear how customers’ valuations are distributed across different attributes. The

location of a seat might affect the valuation realized by a consumer from the game. In addition,

there could be a number of factors that might affect how a patron values a seat. For instance, the

nature of the opposing team, the age of the patron, or whether the patron is a regular visitor or
  1
      The existence of luxury tax might create some ceiling.

                                                           3
an infrequent visitor, might affect her valuation from the seat. For most stadiums, understanding

heterogeneity in customer valuations is the key to increase local revenues. A clear understanding

of the seat valuations, would lead to the creation of better “fences” that would provide baseball

franchises an opportunity to improve and manage their stadium revenues. Our paper sheds more

light on the key factors influencing customers’ valuations in a baseball stadium.

   Based on a study of reported consumer seat valuations in a baseball stadium of a professional

franchise, we provide some insights and measures by which firms (baseball franchises) can improve

customer satisfaction through better handling of ticket pricing, seat rationing, and stadium design

decisions. Our methodology is applicable across a wide variety of products such as performances

in theaters and opera houses, and games in stadiums. We make the following theoretical and

managerial insights:

  1. To our knowledge, ours is among the first papers to study the distribution of consumer valu-

     ations in a professional baseball stadium, based on various attributes. We set up an empirical

     model to study consumer valuations from seats/products to characterize the heterogeneity

     in valuations. We find that baseball seats can be thought of as experience goods, whose

     valuation uncertainty decreases with every successive visit. Revenue management practice

     hinges on the ability to price-discriminate which is possible only if there is heterogeneity in

     valuations. We find clear evidence for heterogeneity in valuations in baseball stadiums, and

     quantify the heterogeneity based on customer attributes and seat locations of the customers.

  2. Repeated visits to the ballpark help consumers “learn” the “true value” of the seats. As

     consumers (fans) visit more often, the uncertainty in their valuation of the game, from a

     particular seat location, is diminished. This is consistent with the notion that baseball games

     are experience goods (with residual uncertainty). Furthermore, we estimate the change in

     perceived seat valuation and reduction in uncertainty from repeated visits. In particular,

     a customer who has visited the ball park five times has 29% lower variance in his realized

     valuation as compared to a first timer.

  3. Most surprisingly, we show that the valuations received by consumers in a baseball field are

     asymmetric. Consumers seated on the third base side (left field) are less likely to have extreme

     valuations, as compared to those seated on the first base side (right field). In particular, the

                                                 4
consumers on the third base side are 10% less likely to report the net valuations of their seat

      location as low. One likely cause was the location of the home-team dugout on the left field

      side of the stadium.

    4. We find that the consumers seated at the upper deck have higher average ex-post valuations

      than those in the lower deck seats. This has two possible implications that are important from

      the franchise’s perspective. First, consumers might be responding positively to the availability

      of lower priced upper deck seats. Second, even though the upper deck seats are located far

      off, they could still be providing comparatively higher ex-post valuations to consumers who

      sat there. We find that consumers seated in the upper deck also had higher heterogeneity in

      their valuations, with a 52% higher variance in realization, as compared to those in the lower

      deck.

    5. Age matters. Older customers have less uncertain valuations than younger customers seated

      at the same seats controlling for all other factors.

In the following section, we position our paper with respect to the existing literature in §2. In

Section 3, we discuss the research design and description of data. We discuss the main research

issues, and frame our main hypotheses. In §4, we briefly analyze the data, develop our model based

on the data, and summarize the empirical results. Finally, we conclude the paper in section §5 by

discussing the main insights derived in light of our empirical findings.

2     Literature Positioning

To our knowledge, our paper is among the first to analyze seat value perception and its implications

for stadium revenues of sports businesses. Most of the literature in the sports business has been

about secondary markets and ticket pricing in scalping markets (See Courty 2000, for a compre-

hensive survey). There is a stream of literature that focuses on labor market aspects in baseball

(Scully 1974, Rottenberg 1956). In contrast, we focus on seat valuations perceived by consumers,

which can potentially be used to improve traction for the team among the fans, and increase local

revenues for the franchise.

    Our paper also contributes to an emerging literature (Shen and Su 2007) on consumer behavior

                                                   5
in Revenue Management. Su (2007) finds that heterogeneity in consumer valuations, along with

waiting time behavior, influences pricing policies of a monopolist. Dana (1998) shows that advance

purchase discounts can be employed effectively in competitive markets, if consumers’ uncertain

demand for a good is not resolved before the purchase of the good. Shugan and Xie (2000) show that

advanced selling mechanisms can be used effectively to improve firm profits as long as consumers

have to purchase a product ahead of their consumption, and their post-consumption valuation is

uncertain. Due to the inherent heterogeneity in valuations realized by the consumers, we discover

that devising an advance selling mechanism is useful for the team. While this literature is mostly

analytical, ours is one of the first empirical papers analyzing customer valuations and revenue

implications in the sports business.

   Methodologically, our paper is related to the literature employing ordinal models to study the

antecedents and the drivers of customer satisfaction in Operations. Kekre et al. (1995) study the

drivers of customer satisfaction for software products by employing an ordinal probit model to

analyze a survey of customer responses. Gray et al. (2008) use an ordinal logit model to empirically

analyze the quality-risk associated with outsourcing to contract manufacturers. They find that

contrary to the conventional wisdom favoring specialization, contract manufacturing plants have

a higher quality-risk than internal plants. We use an ordinal model similar to the aforementioned

papers, but consider extended models that account for heterogeneity in reporting (across customers)

and heterogeneity in the distribution of ex-post valuations (across seats). For a detailed analysis

of customer satisfaction using ordinal Bayesian models, refer to Bradlow (1994).

   Anderson and Sullivan (1993) note that relatively few studies investigate the antecedents of

satisfaction, though the issue of post-satisfaction behaviors is treated extensively. They note that

disconfirmation of expected valuation causes lower satisfaction and affects future consumptions.

While previous considerations about a product might affect the experience realized from consuming

it, we mainly focus on how product attributes such as seat location and distance, and personal

attributes such as gender, age and frequency affect customer valuations.

   Ittner and Larcker (1998) provide empirical evidence that financial performance measures are

positively associated with customer satisfaction. We use customer valuation measures reported in a

consumer survey to recommend changes that would help the franchise achieve a chosen“service level”

objective on customer satisfaction. We believe this would lead to a better long-run performance of

                                                 6
the franchise.

3        Data Description and Research Issues

3.1        Research Description

This research is based on a survey conducted by a professional league baseball franchise (equivalent

of Major League Baseball) in Japan. The franchise is located in a mid-small city and hence could

not rely on conventional streams of revenue such as broadcasting, merchandising and advertising.

The franchise management decided to focus on ticket sales as it was one revenue source for which

the league’s intervention was minimal. In order to understand the revenue potential from ticket

sales, the management explored how consumers valued different seats in the stadium. The team

management saw an upside potential in considering improvements in pricing and seating layouts.

        As the team was a recently established franchise, the management intended to survey its cus-

tomers to better understand the traction for the team among its fans. The survey discussed in

the paper was conducted by the team based on inputs from various departments and team exec-

utives in the franchise. In the execution phase, the survey was administered to a random sample

of consumers at the stadium on a weeknight game. Only one response was obtained from each

consumer.

3.2        Research Issues

The focus of our research is to understand how the net valuation derived by a consumer from

a baseball game varies based on her seat location and frequency of visits. A large volume of

consumer choice literature focuses on how consumers choose between various options. In our case,

it would mean modeling the consumer’s decision problem of whether to go see a game and how

to choose between the different seats available. However, we only observe the respondents who

purchased a ticket, and hence are not privy to the underlying trade-offs made by the consumer

while arriving at the purchase and seat choice decisions. Consequently, we do not model the

customer’s revealed preferences (with respect to the seats chosen). Instead, we model the ex-post2

net valuations realized, to understand how they differ based on seat location, frequency of visits,
    2
        The results were collected during the last two innings.

                                                              7
and other attributes. In the survey, respondents were asked to report their perception of the seats

that they sat in as expensive, fair or cheap. We define Seat Value Index (SVI) as a measure of a

respondent’s ex-post valuation of her experience net of price. SVI takes the values {Low, Medium,

High} corresponding to the responses expensive, fair, and cheap respectively.

       To derive sharper insights, we assume that consumers are forward-looking and have rational

expectations, i.e. their beliefs about valuations are consistent with realizations, and that they do

not make systematic forecasting errors. The rational expectations assumption is widely employed

in empirical research in economics (Muth 1961, Lucas and Sargent 1981, Hansen and Sargent 1991)

and marketing literature (for example, Sun et al. 2003).3 Every consumer has some belief on the

distribution of possible valuations that she could realize, conditional on her covariates. For a rational

consumer, the ex-ante distribution of valuations is identical to the distribution of valuations realized

by all consumers with identical covariates. Note that rational expectations does not imply that

consumers are perfectly informed about their true valuations. The rational expectations approach

provides a parsimonious way to exclude ad hoc forecasting rules that some customers in the system

might adopt.

       We now focus on three key hypotheses of interest on seat valuations. The first two hypotheses

pertain to the effect of seat location while the third hypothesis considers the effect of repeat pur-

chases or interactions with the service provided. We describe the rest of our conclusions in section

§5.

       Unlike airline tickets, where the seat location is secondary to the value realized from being

transported to the destination, the location of the seat may play a critical role in the value a

consumer derives from a baseball game. One of the key factors through which the location of a seat

influences the game experience is the view offered; the better the view, the better the experience,

and the higher the ex-post valuation. However, the seats offering a better experience are also priced

higher. For example, backnet seats are always priced higher than the bleacher seats or upper deck

seats. Hence, it is not clear, in general, whether ex-post net valuations would be higher or lower

for seats with a better view.

       Consequently, we focus our attention on seat location attributes. We examine those seat lo-
   3
    In our paper, the rational expectations assumption does not affect any of the empirical results we derive from
the model. It allows a better interpretation of how the consumers’ seat valuations might influence selling decisions.

                                                         8
cations where there is an underlying supporting rationale for similar, higher or lower seat value

indices. To preserve focus, we examine two important seat location attributes: namely ‘horizontal’

symmetric locations (left vs right), and ‘vertical’ distance (lower deck vs. upper deck).4

H1: Customers seated on the left (3rd Base) side have identical SVI to those seated on the right

(1st Base) side.

H2: Customers seated at the Upper Deck have higher SVI as compared to those seated at the Lower

Deck.

      Our first hypothesis (H1) compares the valuations between two symmetrically located seats on

the 1st Base side (left side of the stadium) and the 3rd Base side (right side of the stadium). They

are priced the same (all professional baseball teams we examined offered identical prices) and offer

similar views. Hence, the expected net valuation of consumers should be identical across the 1st

Base and 3rd Base, once we have controlled for all other variations.

      In our second hypothesis (H2), we consider the valuation of consumers located in the far-placed

upper deck seats. Upper deck tickets are usually priced very low (None of the professional major

league teams priced them higher than 30% of the infield seats in the lower deck and for certain teams

uppermost decks were priced as low as 5% of the lower deck seats. A large fraction of the upper deck

seats sell closer to game day to the perceived common availability). While the view is inferior to

the lower deck seats, we believe that the relatively cheap price often more than compensates for the

lower utility a customer may obtain at these seats. Hence, our second hypothesis is that customers

seated in the upper deck derive higher valuations on average as compared to the customers seated

at the lower deck.

H3: More frequent customers are less likely to report extreme SVIs.

      Finally, our third hypothesis pertains to the notion that the attendance and viewership at

baseball games are perceived as experience goods. In the traditional literature on experience goods,

customers learn the true valuation of a good by consuming it. We consider baseball games to be

experience goods, whose residual uncertainty of quality decreases with the number of visits made

by a patron. Therefore, a customer who is a regular at all games is expected to have a narrower

distribution of his valuations as compared to a customer who is a first-timer. The consequence of

such learning can push the mean valuation in either direction, causing the more frequent customers
  4
      See Figure 1.

                                                   9
to report higher or lower average SVI.

       Equipped with the above hypotheses, we examine our data using SVI as a dependent variable

and seat locations and frequency, among other factors, as our covariates.

3.3       Data Definitions

Seat Value Index (SVI): Variable of interest defined earlier.

Age: In order to understand how different age groups valued their ticket, the survey required

respondents to indicate the age group they belonged to, from the buckets 0 − 9, 10 − 19, 20 − 29,

30 − 39, 40 − 49, 50 − 59, and 60 − 69, in which the management was interested. We code Age in

“buckets” as a continuous variable taking the values {1, 2, 3, . . . , 7}, where 1 refers to the age group

0 − 9, 2 represents the age group 10 − 19 and so on.5

Gender: The management felt that male fans valued the experience differently from the female

fans. Understanding the difference was critical to direct selling efforts. In addition, it was essential

to control for gender effects while making inferences about the effect of seat location and frequency of

visits on SVI. Hence, each respondent was asked to indicate their gender in the survey questionnaire.

Hometown: A positive perception from local fans is an important factor for franchise’s long-

term sustenance. Hence, the team was interested in measuring valuations of fans based on their

hometown. In order to characterize a respondent as a local or an outsider, the respondents were

asked to indicate their hometown as {City, Prefecture, Outside} based on whether they live in the

city where the team is based, the prefecture (district) excluding the city where the team is based,

or outside the prefecture, respectively.

Seat: The experience and the resulting valuation are highly dependent on the seat from which

a respondent watched the game. Hence, it was necessary to capture this information by asking

respondents to indicate the location of their seat from twelve possible choices as shown in Figure 1.

We further decomposed each seat into its location attributes Side={1st Base, 3rd Base, Backnet,

Field, Grass}, InOut={Infield, Outfield} and Deck={Upper, Lower}.

Frequency: It was important for the team to understand how their regular customers differed in

their valuations from those who were infrequent customers in order to improve their season tickets

program. Hence, respondents were asked to report how often they attended the team’s baseball
   5
       We could represent each age-group by its mid-point and the results would still stay the same.

                                                           10
Figure 1: Stadium Design and Seating Layout

games during the season. The number of visits was reported in frequency buckets: {First Time,

Once, Thrice, Five Times, All Games}.

Visiting Team: Some of the major league teams in the US price individual tickets differently based

on the strength or popularity of the visiting team. The franchise management was interested in

adopting this idea by understanding customer preferences for visiting teams. Hence, the respondents

were asked to indicate the visiting teams they would like to see, from a list of five major teams

{Team1, Team2, Team3, Team4, Team5} in the league that the home team plays against, labeled

in rank order of their finishes in the last two seasons combined.

4     Models and Data Analysis

4.1   Preliminary Data Analysis

From a total of 1397 respondents, 259 responses were dropped due to missing information. A

preliminary analysis revealed that the frequency distribution of seat value index was skewed towards

the right, as shown in Figure 2. This implies that a higher proportion of people reported a low seat

                                                 11
valuation index.6

      Figure 2 also reveals several interesting insights. The seat value index reported by older respon-

dents seem to be more homogeneous. Customers seated in Grass seats report higher SVI, while

respondents seated at Backnet seem to have a lower SVI. Infield and Lower Deck seats seem to

have a higher proportion of respondents reporting a Low SVI as compared to Outfield and Upper

Deck seats. Finally, the season regulars attending all games seem to have more homogeneous SVIs

as compared to the first-timers.

                                                                   843                                              71 119 137         324        235       150 102
                                 800

                                                                                                     0.0 0.4 0.8
                                 400

                                                216
                                                                                   79
                                 0

                                                Low            Medium             High                             0−9       20−29               40−49           >60

                                                           Seat Value Index                                                              Age

                                                     479                    659                                     124        313                   701
                                 0.0 0.4 0.8

                                                                                                     0.0 0.4 0.8

                                                Female                     Male                                    Outside                           City

                                                               Gender                                                                  Hometown

                                               257           483          198      92 108                                     653                        485
                                 0.0 0.4 0.8

                                                                                                     0.0 0.4 0.8

                                               1st           3rd         Backnet        Grass                                Infield                  Outfield

                                                                Side                                                                     Field

                                                      623                     515                                     221        326           306       166 119
                                 0.0 0.4 0.8

                                                                                                     0.0 0.4 0.8

                                                     Lower                    Upper                                First Time Once       Three Times         All Games

                                                                Deck                                                                   Frequency

Figure 2: Distribution of Respondents by the covariates and their Seat Value Index. The width of
the histogram signifies frequency. The number of respondents is indicated on the top for each value
of the covariate.

      We tested for the usual symptoms of multi-collinearity (Greene 2003): (1) high standard errors,

(2) incorrect sign or implausible magnitude of parameter estimates, and (3) sensitivity of estimates

to marginal changes in data. We found no evidence of these symptoms. We computed the Vari-

ance Inflation Factors (VIF) for every covariate and found all of them to be less than two, i.e.
  6
      The result of the game might have influenced the skewness observed.

                                                                                                12
(max(V IF ) < 2), which again suggests that multi-collinearity is not an issue. We also introduced

random perturbations in the data and found that our estimates were robust to such changes, thus

allaying the possible multi-collinearity related issues.

4.2     Model Analysis

The dependent variable of interest is the Seat Value Index (SVI). It is reported on a three point scale

and is a rank-ordered ordinal variable, as there is a well-understood ordering in the respondent’s

mind.

   We could treat SVI as continuous and estimate a multiple regression model using ordinary

least squares (OLS). However, this approach is flawed as (1) the OLS estimates are inefficient and

the predictions cannot be restricted to the interval [1, 3] (Kmenta 1986), and (2) the regression

estimates will roughly correspond to the correct ordered model only if differences in the valuations

between the two consecutive indices are identical. For additional discussion on the limitations of

OLS regressions, see Judge et al. (1980).

   Alternately, we could treat the dependent variable as categorical and employ a multinomial

logit or probit model. This overcomes the limitations of OLS regression, but is still inefficient as

it throws away valuable information by ignoring the ordinal nature of SVI. Hence, the appropriate

model for our purpose is an ordinal regression model that takes into account the categorical nature

of the data as well as the ordering information contained.

   According to the ordinal regression model, the values of the observed ordinal dependent variable,

Vi ∈ {1, 2, . . . , J}, are determined by collapsing a continuous unobserved (latent) variable, Vi∗ ,

into categories defined by the boundaries (thresholds) {τi0 , τi1 , ..., τiJ }, where it is understood that

τi0 = −∞ and τi0 = +∞ for purposes of identification. Mathematically, Vi = j, if and only if

τij−1 < Vi∗ ≤ τij , for j = 1, 2, . . . , J. The model specification is completed by assuming that

Vi∗ = xTi β + i , where xi is a vector of covariates (excluding a constant), β is the associated vector

of parameters, and i is the stochastic error term. Based on the distribution of i being normal or

logistic, we get the ordinal probit model (McKelvey and Zavoina 1975), or the ordinal logit model

(McCullagh 1980). Also see Zaslavsky and Bradlow (1999) for a detailed discussion for ordinal

models.

   We found very little difference between the results obtained from the probit and logit models.

                                                    13
McCullagh (1980) shows that the ordinal probit and logit models are qualitatively similar and that

the fits are indistinguishable for any given data set; hence the selection of an appropriate distribution

should be primarily based on ease of interpretation. We chose the logistic error distribution as it

allowed us to interpret the regression coefficients in terms of log-odds.

4.2.1   Ordinal Logit Model

The standard ordinal logit model assumes that the thresholds are identical for all customers, i.e.

τij = τ j , ∀i, and that the error terms are independent identically distributed logistic random vari-

ables with mean zero. As Vi∗ is not observed, the variance of the error terms are not identified and
               π2
hence set to   3 ,   the variance of the standard logistic distribution. Given these assumptions, the

cumulative probability distribution of Vi can be written as

                       Pr(Vi ≤ j | xi ) = Pr(Vi∗ ≤ τ j | xi ) = Pr(xTi β + i ≤ τ j | xi )
                                                                                 j −xT β)
                                                                           e(τ       i
                                         = Pr(i ≤ τ j − xTi β) =                  j   T
                                                               1 + e(τ −xi β)
                                              j   T
                                         = Λ(τ − xi β), j = 1, 2, . . . , J − 1                      (1)

Note that when J = 2, this reduces to a logistic regression model. The conditional probability of

Vi = j is given by:

                              Pr(Vi = j | Xi ) = Λ(τ j − xTi β) − Λ(τ j−1 − xTi β)                   (2)

For the probabilities to be well-defined, the threshold parameters need to satisfy the condition

                                               τ 1 < τ 2 < . . . < τ J−1                             (3)

   We can now express the log-likelihood function for the standard ordinal logit model as

                                         i=N
                                         X
                                               I(Vi = j) Λ(τ j − xTi β) − Λ(τ j−1 − xTi β) ,
                                                        
                     LL(β, τ | V, X) =                                                               (4)
                                         i=1

where I is the indicator function. The parameters of the model are estimated by maximizing the

log-likelihood in equation (4) subject to the constraints in condition (3).

   The standard ordinal logit model is equivalent to estimating J − 1 logistic regressions of the

                                                          14
form Pr(Vi ≤ j | Xi ) = Λ(τ j − xTi β j ), with the assumption that the slope coefficients, β j , are

identical across all equations, i.e. β j = β, j = 1, 2, . . . J − 1. Then, we can rewrite Equation (1) in

terms of the log-odds of {Vi ≤ j} to obtain log (Odds(Vi ≤ j | Xi )) = τ j − xTi β. This implies that

the ratio of odds for two different levels j1 and j2 ,

                                                          Odds(Vi ≤ j1 | x1 )
                                                                              = exp(τ j1 − τ j2 ),                          (5)
                                                          Odds(Vi ≤ j2 | x1 )

is independent of the covariate x1 . For this reason, the standard ordinal logit model is also referred

to as the Proportional Odds Model (POM).

   In our data set, Vi∗ represents respondent i’s underlying valuation of his seat, net of the price

paid, while Vi is the reported SVI that can take the rank-ordered values j = 1, 2, 3 corresponding

to Low, Medium and High, respectively. The mapping between a respondent’s underlying valuation

and reported SVI is illustrated in Figure 3.

                                                                               Distribution of V*
                           Probability Density f( V*i )

                                                                     SVIi =1          SVIi =2          SVIi =3

                                                          τ0 = − ∞              τ1               τ2              τ3 = + ∞

                                                                                Net Valuation (V*i )

   Figure 3: Mapping between Net Valuation Vi∗ (latent) and Seat Value Index SV Ii (observed)

   The vector of covariates xi consists of Age, Gender, Hometown, Side, Deck, InOut, Frequency

                                                                                     15
and Team 1 7 . We can express the linear predictor as:

   xTi β = β1 Agei + β2 Malei + β3 Cityi + β4 Prefecturei + β5 3rdBasei + β6 Backneti +

                β7 Fieldi + β8 Grassi + β9 UpperDecki + β10 Outfieldi + β11 Frequencyi + β12 T eam1i

We use the OLOGIT routine in STATA 10.0 to estimate the parameters of the proportional odds

model. The results are summarized in Table 1. We can reject the null model consisting only of the

intercepts τ j in favor of the proportional odds model, as the model χ2 = 149.02 is significant at the

p < 0.0001 significance level. A standard measure of fit for logit models is the McFadden pseudo-R2
                             LLP OM
which is defined as 1 −      LLN U LL .   It indicates the improvement in likelihood due to the explanatory

variables over the naive (null) model. For the proportional odds model we find the pseudo-R2 to

be 9.06%. This value needs to be interpreted with caution as it is not directly comparable to the

R2 obtained in OLS. Moreover, it is not uncommon to obtain low values for the pseudo-R2 even

when the explanatory power of the model is good (Gray et al. 2008).

    Before we make any inferences based on this model, we must validate the proportional odds

property stated in (5), which implies that all respondents have the same ratio of odds of reporting a

low SVI to odds of not reporting a high SVI. While it might be reasonable to assume that customers

sitting in different seats might inherently have the same propensity to find higher (or lower) value,

one would expect that customers ‘learn’ their valuation through repeated visits to the stadium, and

hence would have different odds ratios based on the number of visits. Hence, we need to investigate

the validity of the implicit proportional odds assumption made by the standard ordinal logit model.

    The standard approach to test the proportional odds assumption is to fit two models: a reduced

and a full model, where the former is the proportional odds model and the latter is an expanded

non-proportional odds model (NPOM) that allows the βs to depend on j, and perform a likeli-

hood ratio test. The null hypothesis being tested is H0 : β j = β, j = 1, 2, . . . , J − 1. The test

statistic −2 {ln(P OM ) − ln(N P OM )} has a χ2(k) distribution, where k is the number of additional

parameters in the full model. Applying the likelihood ratio test to our data set, we find that the

proportional odds model is strongly rejected in favor of the full model (χ2(12) = 46.74, p < 0.0001).
   7
     We considered all visiting teams in our specification, but retained only the statistically significant ones in order
to keep the model parsimonious. Our results and conclusions do not change significantly even if we include all visiting
teams as predictors.

                                                           16
Variable             Standard               Generalized                 Heteroskedastic
                                          Ordinal Logit          Ordinal Logit               Ordinal Logit
                                          j = 1, 2                j=1        j=2             j = 1, 2
  Threshold: Low-Medium            τ1     -1.215***                   -0.761**               -0.748***
  Threshold: Medium-High           τ2      3.387***                   2.071***                2.067***
                        Age        β1j     0.048                  0.127**     -0.172**        0.034
                       Male        β2j    -0.019                  -0.026       -0.026        -0.034
         City (vs. Outside)        β3j     0.083                   0.029        0.029         0.011
   Prefecture (vs. Outside)        β4j     0.192                   0.166        0.166         0.102
    3rd Base (vs. 1st Base)        β5j     0.428**               0.873*** -0.727**            0.145
     Backnet (vs. 1st Base)        β6j    -0.730***             -0.678*** -0.678***          -0.440***
        Field (vs. 1st Base)       β7j    -0.893***             -0.824*** -0.824***          -0.509***
       Grass (vs. 1st Base)        β8j     1.816***              1.206*** 1.206***            0.919***
                    Outfield       β9j     0.215                    0.211        0.211        0.171
                                     j
                Upper Deck         β10     0.246                    0.066     0.947***        0.263**
                                     j
                  Frequency        β11    -0.126**                -0.093      -0.234**       -0.081**
                                     j
                     Team 1        β12     0.249*                  0.250*       0.250*        0.185**
                        Age        γ1                          -NA-                          -0.075***
    3rd Base (vs. 1st Base)        γ5                          -NA-                          -0.324***
                Upper Deck         γ10                         -NA-                           0.208***
                  Frequency        γ11                         -NA-                          -0.057*
             Log Likelihood        LL    -748.12                        -727.18              -726.27
       Likelihood Ratio χ2         LR     149.02                        190. 90               192.72
         No. of Parameters               12                               16                 16
      McFadden Pseudo R2                 9.06%                          11.60%               11.71%
  *** p < 0.01, ** p < 0.05,      * p < 0.1

                               Table 1: Parameter Estimates for All Models

The likelihood ratio test is an omnibus test that the βs across threshold levels are equal for all the

covariates simultaneously. However, it is possible that the proportional odds assumption is violated

only for a subset of the covariates.

       A Wald test developed by Brant (Brant 1990) allows us to test the proportional odds assumption

for each covariate individually. The key idea of the Brant test is to fit separate logistic regressions

Pr(Vi ≤ j | xi ) = Λ(τ j − xTi β j ) for each of the J − 1 threshold levels, and test the equality of the βs

by constructing a test statistic based on the estimated coefficients and the asymptotic covariance

matrix. Conducting the Brant test on our data-set, we find that the proportional odds assumption

is violated8 for the coefficients β1 , β5 and β9 . A likelihood ratio test confirms that a partially
   8
    We applied the Brant test to ordinal regression models with different link functions such as probit, log-log and
complementary log-log to ensure that the violations are not on account of a misspecified link. We find that the
proportional odds assumption is violated for the same coefficients β1 , β5 and β9 in all cases.

                                                        17
constrained model that allows only for β1 , β5 and β9 to depend on j cannot be rejected in favor of

an unconstrained model that allows all the β’s to depend on j (χ2(9) = 6.33, p = 0.71).

    In order to validate our hypotheses, we now consider two different modifications to the standard

ordinal logit model that do not suffer from the proportional odds restriction. The first model is

a generalized threshold model that relaxes the assumption that the category boundaries, τij , are

identical for all respondents, while the second model is a heteroskedastic model that allows for the

variance of the error term, i , to systematically vary across respondents.

    The generalized thresholds model discussed in §4.2.2 addresses the different thresholds that

customers might use in reporting their responses, and usually pertain to the individual attributes

of customers that affect their response thresholds. The heteroskedastic model, discussed in §4.2.3

usually addresses the inherent differences in the distribution of product valuations as perceived by

the customers based on a particular covariate.

4.2.2    Generalized Threshold Ordinal Logit Models

The standard ordinal logit model assumes that all respondents use the same category boundaries in

reporting their underlying valuations. However, it is not uncommon for respondents of a survey to

use different thresholds in reporting their responses. For example, older respondents tend to assess

their health as better, as compared to younger respondents with similar characteristics (Groot 2000;

van Doorslaer and Gerdtham 2003). This has been referred to as response category threshold shift,

reporting heterogeneity, state-dependent reporting bias (Lindeboom and van Doorslaer (2004)), or

scale-usage heterogeneity (Rossi et al. 2001). This is the central motivation for the generalized

threshold model.

    The generalized threshold ordinal logit model retains the idea that respondents have underlying
                                                                           2
valuations drawn from a common distribution, Vi∗ ∼ Λ(xTi β, π3 ), but allows respondents to use

different thresholds, τij , while collapsing them into categories. A common approach to model

generalized thresholds is to make the threshold parameters linear functions (Maddala 1983, Peterson

and Harrell 1990), or polynomial functions (Rossi et al. 2001) of the covariates. We follow Maddala

(1983) and let τij = τ̃ j + xTi δ j , where xi is the set of covariates and δ j , j = 1, 2, . . . J − 1 are vectors

of the associated parameters that capture the effect of the covariates in shifting the thresholds. For
          2
example, δ3rdBase captures the shift in the threshold τ̃ 2 for respondents seated along the 3rd Base

                                                        18
side relative to those seated along the 1st Base side. Substituting the expression for τij in place of

τij in Equation (1), we can write the defining equation of the generalized ordinal logit model as

                                     Pr(Vi ≤ j | xi ) = Λ(τ̃ j − xTi β j ),                                    (6)

where β j = β − δ j and it is understood that τ̃ 0 = −∞ and τ̃ J = ∞ as before, for the purposes of

identification. The parameters of the model are estimated by maximizing the likelihood subject to

the constraints τ̃ j−1 − xTi δ j−1 ≤ τ̃ j − xTi δ j , j = 1, 2, . . . , J, which are required for the probabilities

to be well-defined.

    The parameter β captures the real effect of the covariates on the valuation. According to the

generalized threshold model, the βs manifest as significantly different β j s in the expression for

log-odds, on account of the different thresholds used by the respondents. The β j ’s can hence be

interpreted as the net of the real effect (β) and the threshold-shifting effect (δ j ) of the covariates

on the log-odds. However, the two effects cannot be separately identified in this model.

    The generalized threshold model is very flexible. When the β j ’s are allowed to differ across

threshold levels for all covariates, it is referred to as the Unconstrained Ordinal Logit Model. The

generalized threshold model nests the standard ordinal logit model under the restriction that β j =

β, j = 1, 2, 3, . . . , J − 1. It is also possible to constrain the coefficients for a subset of covariates

zi ⊂ xi , by restricting βkj = βk , j = 1, 2, 3, . . . , J − 1, k ∈ zi , to obtain a Partially Constrained

Ordinal Logit Model.

    Given that the covariates Age, 3rd Base and Upper Deck violated the Brant test, we include

them in zi . In addition, we include the covariate Frequency in zi , as we believe that repeated visits

help respondents learn the true value of the game experience and would induce them to use different

thresholds. Hence, we let zi = {Age, 3rd Base, Upper Deck, Frequency}. We use the GOLOGIT2

routine (Williams 2006a) in STATA 10.0 to estimate the parameters of the generalized threshold

model. The results are summarized in Table 1.

    A likelihood ratio test comparing the generalized threshold model with the standard ordinal

logit model confirms that we can reject the latter in favor of the former (χ2(4) = 41.89, p < 0.0001).

The McFadden pseudo-R2 has also improved from 9.06% to 11.60%, which indicates a better fit.

    We observe that the predictors Side and Frequency continue to be significant. In addition, Age

                                                        19
1st Base, 3rd Base

                            Probability Density f( V*i )

                                                           τ13rd   τ11st                  τ21st τ23rd

                                                                   Net Valuation (V*i )

     Figure 4: Comparison of Thresholds for 3rd Base vs. 1st Base, when β3rdBase = β1stBase

is also a significant predictor. As noted before, the net effect, βkj , cannot be decomposed into the

real effect (βk ) and the threshold-shifting effect (δkj ) for any covariate k. If we believe that the real

effect βk = 0, then the δkj ’s can be identified as δkj = −βkj , j = 1, 2. This case is illustrated for the

covariate 3rd Base in Figure 4.

   Though we cannot separately identify δ 1 and δ 2 , their difference δ 2 − δ 1 = β 1 − β 2 is identified.

We can use this to compute the effect of the threshold-shifting on the size of the threshold interval

for being in the category SVI=Medium. For example, the size of the threshold interval for a

respondent on the 1st Base side is given by τ̃ 2 − τ̃ 1 = 2.832, while the same for a respondent on

the 3rd Base side is given by τ̃ 2 + δ 2 − τ̃ 1 − δ 1 = τ̃ 2 − τ̃ 1 + β 1 − β 2 = 4.432. This implies that the

respondents on the 3rd Base side are a lot less likely to report extreme values of SVI as compared

to respondents on the 1st Base side. This is also evident from Figure 4.

   The generalized threshold model implies that two groups of customers might have identical

valuation distributions, but their distribution of seat value indices might differ because of different

reporting thresholds. This explanation might be apt for a covariate such as age, where older

customers might inherently have lower thresholds for higher valuations (high SVI) as compared to

younger patrons, although they have identical distribution of valuations. However, we find such an

explanation less plausible for the difference in SVI distributions between customers who only differ

                                                                           20
in the location of their seat (left field vs. right field).

   In the next section §4.2.3, we consider the possibility that consumers might report different SVI

despite using the same thresholds, because the valuations that they experienced are distributed

differently across different seat locations.

4.2.3   Heteroskedastic Ordinal Logit Model

In ordinal regression models, the error variances are not determined and need to be explicitly

specified for the model to be identified. The standard ordinal logit and probit models assume that
                                                                 π2
the error terms have a constant variance and set it to           3    and 1, respectively. As the choice of this

constant is arbitrary, the β parameters of the model are identified only up to a scale factor. In
                                             β
other words, we essentially estimate         σ,   and identify β by fixing σ arbitrarily.

   When the errors are homoskedastic, the relative effect of any two covariates on the outcome
                     β̂1 /σ
variable, given by          ,   is correctly identified, independent of the choice of σ. However, these com-
                     β̂2 /σ

parisons become invalid in the presence of heteroskedasticity and can lead to erroneous conclusions.

For example, if β1st Base = β3rd Base , but σ1st Base = 2σ3rd Base , then assuming equal variance would

lead us to the erroneous conclusion that β̂1st Base = 0.5β̂3rd Base .

   Yatchew and Griliches (1985) show that heteroskedasticity in binary regression models leads to

parameter estimates that are biased, inconsistent, and inefficient. This extends to ordinal regression

models and is in contrast to OLS estimates for ordinary linear regression, which remain unbiased

and consistent even in the presence of heteroskedasticity. Hence accounting for heteroskedasticity

is critical and this is the central motivation for the Heteroskedastic Ordinal Logit Model.

   The heteroskedastic ordinal logit model assumes that the error variance differs systematically

across respondent groups. For example, frequent customers might have less heterogeneous valu-

ations as compared to first-timers, as they are more likely to have learned the true valuation of

their experience through repeated visits. This dependence of the error variance on the covariates

is captured by a skedastic function h(.) that scales the iid error terms in the standard ordinal logit

model. Mathematically, we write Vi∗ = xTi β + h(zi )i , where zi is the vector of covariates upon

which the residual error variance is believed to depend. Following Harvey (1976), we parametrize

                                                          21
h(.) as an exponential skedastic function given by

                                           h(zi ) = exp(zTi γ)

   Given these assumptions, we can rewrite Equation (1) to obtain the defining equation of the

heteroskedastic ordinal logit model:

                                                              τj − xTi β
                                                                          
                                   Pr(Vi ≤ j | xi ) = Λ                                          (7)
                                                              exp(xTi γ)

The heteroskedastic ordinal logit model belongs to a larger class of models known as location-scale

models, and the reader is directed to McCullagh and Nelder (1989) for more details. The parameters

of the model are estimated using the maximum likelihood approach as before.

   The heteroskedastic ordinal logit model does not display proportional odds for the covariates

in zi . This can be seen by writing out the expression for log-odds of Vi ≤ j given xi ,

                                                                τj − xTi β
                                 log(Odds(Vi ≤ j | xi )) =                 ,
                                                                exp(zTi γ)

and observing that the effect of the covariates zi on the log-odds is no longer independent of the

threshold level j.

   The heteroskedastic ordinal logit model nests the standard ordinal logit model under the re-

striction γ = 0. Hence it is possible to compare the two models using a likelihood ratio test, and

identify the sources of heterogeneity that are statistically significant.

   Note that the different β j s manifesting in the ordinal logit model could be explained by differ-

ences in residual error variance. Hence we include in zi the covariates Age, 3rd Base and Upper

Deck, all of which violated the Brant test. In addition, we include the covariate Frequency in zi ,

as we believe that repeated visits should help respondents learn the “true value” of the game expe-

rience, and as a consequence reduce the residual variation in their valuations. Hence, we let zi =

{Age, 3rd Base, Upper Deck, Frequency} as in the generalized threshold model. We use the OGLM

routine (Williams 2006b) in STATA 10.0 to estimate the parameters of the heteroskedastic ordinal

logit model. The results are summarized in Table 1.

   A likelihood ratio test comparing the heteroskedastic ordinal logit model with the standard

                                                   22
ordinal logit model confirms that we can reject the standard ordinal logit model in favor of the

former (χ2(4) = 43.7, p < 0.0001). The McFadden pseudo-R2 has also improved from 9.06% to

11.71%, which indicates a better fit.

   We observe that the significant β coefficients correspond to the covariates Frequency, Side

(except 3rd Base) and Upper Deck. All the γ coefficients included in the variance equation are

significant. We can draw several interesting inferences from these results.

                                                                       3rd Base
                          Probability Density f( V*i )

                                                                                          1st Base

                                                           τ1                        τ2

                                                         Net Valuation (   V*i   )

        Figure 5: Net Valuation of Respondents on the 1st Base side vs. the 3rd Base side

   Controlling for heteroskedasticity, respondents at the 3rd Base seem to have the same mean

valuations as respondents at the 1st Base, as β̂5 is no longer significant. However, that(β̂10 =

0.263, p = 0.04) implies that respondents seated on the 3rd Base side have significantly less hetero-

geneous valuations (standard deviation is 1-exp(γ̂5 ) = 28% lower) as compared to those seated on

the 1st Base side. This could be due to the location of the home team dugout and/or the relative

incidence of foul balls/home runs on the left field. Figure 5 illustrates the valuations implied by

the heteroskedastic ordinal logit model for the customers in first base side vs. third base side.
                                                                      ˆ = 0.263, p = 0.04)
   Respondents seated in the upper deck have a higher mean valuation (β 10

as well as higher heterogeneity (γ̂10 = 0.208, p = 0.0408) as compared to lower deck tickets. More
                                                ˆ
frequent customers have a lower mean valuation (β 11 = −0.081, p = 0.028) and lower variance

(γ̂11 = −0.058, p = 0.074) in their valuations. This finding is consistent with the experience goods

                                                                23
literature where customers learn the value of a product through repeated interactions. Age of a

respondent does not affect the mean valuation, but older respondents tend to have less variance in

their valuations.

4.3   Robustness Checks

We made several implicit assumptions in our model specification. These assumptions might impact

the results and insights. Hence, we carried out a sequence of tests to confirm the robustness of our

models to relaxation of those assumptions. We find that our results are robust to these assumptions.

   • We treat the ordinal explanatory variables Age and Frequency as continuous in our model

      specification. While this treatment is justified for Frequency, it is possible that the effect of

      Age on SVI is non-monotonic. In such cases, specifying Age as a categorical variable might

      be apt. Hence, we estimated all models presented here with separate parameter estimates for

      each level of Age. A likelihood ratio test rejects the categorical specification in favor of the

      continuous specification used before (χ2(10) = 13.38, p = 0.22).

   • Field seats are located on both sides of the home base, and hence can be classified into 1st

      base and 3rd base seats. However, we did not have the data to perform this classification.

      Consequently, the co-variate Field, in our current model specification, captures the aggregate

      effect of being on the left field or right field, and could be the cause for the asymmetry

      observed. Hence, we dropped all Field seats and ran the models discussed above. Our

      qualitative insights did not change, confirming that the inability to classify field seats into

      left field and right field is not driving our insights.

   • We treat Hometown as a categorical variable in our model specification. However, it is also

      possible to use the distance of the hometown from the stadium as a covariate, and treat it as

      a continuous variable taking the values {0,1,2} which refer to City, Prefecture and Outside

      respectively. We estimated a model treating Hometown as a continuous variable and find

      that the results are almost identical to the earlier specification of Hometown as a categorical

      variable.

   • We found the valuation of the customers independent of the opposing team, except for one

                                                    24
team (Team1) which they preferred to see. The franchise team under consideration played

      the opposing team (Team1) frequently, and Team1 had also been the strongest in the division

      over the past two years. The higher valuation the customers experienced when they listed

      Team1 is possibly a reflection of their informed-ness about the game and the team’s season

      performance. None of the other teams were significant in their influence of the valuation

      perceived by the customers.

4.4   Marginal Probabilities from the Models

Using our models, we can measure how a change in a covariate impacts the distribution of the

response variable using marginal probabilities. If we let xil denote the value of the lth covariate for
                                                                                            ∂ Pr(Vi =j|Xi )
respondent i, then the marginal probability effect for a continuous covariate is given by        ∂xil       .

In the case of a categorical covariate like Side, the marginal probability effect is given by the change

in probability ∆ Pr(Vi = j | xi ) when compared to a reference category. While measuring the

marginal probability effects of any covariate, we fix the rest of the covariates at their mean value

(or median value for categorical covariates). We use the MFX2 routine in STATA 10.0 to estimate

the marginal probability effects based on the three models, and the results are summarized in Table

2. Table 2 indicates insignificant differences between the two non-proportional models, which allows

us to use threshold interpretation for consumer attributes, and heterogeneity interpretation for seat

attributes.

5     Conclusions and Insights

We use a latent variable approach to measure post-consumption valuations of baseball seats. Con-

sumers with different descriptive attributes (age, gender, etc) might have different thresholds for

reporting their valuation as high or low. Based on the intrinsically different realizations of the value

of the experience due to product attributes (such as seat location), consumers might end up with

varying seat value indices. Hence, it is critical to account for differences in reporting thresholds

and heterogeneity in the distribution of valuations. Otherwise, the estimate of the real effects of a

covariate could be biased. For instance, the standard ordinal logit model (that ignores both of these

effects) leads us to an incorrect conclusion that the age of the fan does not affect his valuations.

                                                  25
You can also read