Coupon Design in Advertising Systems - Weiran Shen,1 Pingzhong Tang, 2 Xun Wang, 2 Yadong Xu, 2 Xiwang Yang 3 - PRELIMINARY ...

 
CONTINUE READING
Coupon Design in Advertising Systems - Weiran Shen,1 Pingzhong Tang, 2 Xun Wang, 2 Yadong Xu, 2 Xiwang Yang 3 - PRELIMINARY ...
PRELIMINARY VERSION: DO NOT CITE
                                              The AAAI Digital Library will contain the published
                                                   version some time after the conference

                                    Coupon Design in Advertising Systems
                Weiran Shen,1 Pingzhong Tang, 2 Xun Wang, 2 Yadong Xu, 2 Xiwang Yang 3
                                                     1
                                                 Renmin University of China
                                                   2
                                                     Tsinghua University
                                                        3
                                                          ByteDance
                        shenweiran@ruc.edu.cn, kenshinping@gmail.com, wxhelloworld@outlook.com,
                                 xuyd17@mails.tsinghua.edu.cn,yangxiwang@bytedance.com

                           Abstract                                        auction (Vickrey 1961; Clarke 1971; Groves et al. 1973)
                                                                           which is used by Facebook and ByteDance. The difference
  Online platforms sell advertisements via auctions (e.g., VCG             between them is that the VCG auction is truthful, i.e., ad-
  and GSP auction) and revenue maximization is one of the
                                                                           vertisers submit bids that truthfully reveal their valuations,
  most important tasks for them. Many revenue increment
  methods are proposed, like reserve pricing, boosting, coupons            while the GSP auction is not. However, it has been proved
  and so on. The novelty of coupons rests on the fact that                 that the GSP auction has a Nash equilibrium whose outcome
  coupons are optional for advertisers while the others are com-           is equivalent to that of the VCG auction. Therefore, in this
  pulsory. Recent studies on coupons have limited applications             paper, we study how to maximize the revenue in the VCG
  in advertising systems because they only focus on second                 auction.
  price auctions and do not consider the combination with other               In marketing, a coupon is a ticket or document that can
  methods. In this work, we study the coupon design problem                be redeemed for a financial discount or rebate when pur-
  for revenue maximization in the widely used VCG auction.
  Firstly, we examine the bidder strategies in the VCG auction
                                                                           chasing a product. It is effective in promoting sales and
  with coupons. Secondly, we cast the coupon design problem                attracting new customers in numerous realistic scenarios,
  into a learning framework and propose corresponding algo-                and consumers can find coupons everywhere, ranging from
  rithms using the properties of VCG auction. Then we further              the wrappings of food, to webpage banners. Coupons have
  study how to combine coupons with reserve pricing in our                 been widely studied in economics (Bester and Petrakis 1996;
  framework. Finally, extensive experiments are conducted to               Moraga-González and Petrakis 1999; Kang et al. 2006;
  demonstrate the effectiveness of our algorithms based on both            Board and Skrzypacz 2016). The most common arguments
  synthetic data and industrial data.                                      developed by economists to explain the use of coupons
                                                                           are twofold: 1) Coupons allow for price discrimination. 2)
                                                                           Coupons allow for peak-load pricing (Mckenzie 2008). In
                       Introduction                                        this work, coupons play the role of price discrimination.
Online advertising has been one of the most important in-                  The intuition of revenue increment is that low-valued ad-
dustry on the Internet. In the United States, its revenue has              vertisers would spontaneously bid higher if they are pro-
grown by 15.9 percent in 2019 compared to 2018, surpassing                 vided with coupons, which in turn forces the high-valued
124$ billion. 1 Advertising has become one of the key rev-                 advertisers to pay more when they win based on the pay-
enue sources for many Internet companies, such as Google,                  ment rule of the VCG auction. In fact, many methods can
Facebook, ByteDance. For example, more than 83.8 percent                   be used to increase revenue, such as reserve price (Hartline
of Google’s revenue comes from online advertising. And the                 and Roughgarden 2009), boosting (Golrezaei et al. 2017),
market still shows no signs of slowing down. There are var-                squashing (Lahaie and Pennock 2007a), anchoring (Lahaie
ious types of online advertising, for example, social media                and Pennock 2007b). Coupons are significantly different
advertising (e.g., Instagram and Facebook), paid search ad-                from them. As for coupons, advertisers hold the right to de-
vertising (e.g., Google and Baidu), and native advertising                 cide whether to use them. However, they cannot refuse to use
(e.g., BuzzFeed and Tiktok), and so on. These advertise-                   the other methods, e.g., advertisers always want to remove
ments are usually sold via auction mechanisms. The mecha-                  the reserve price but they can not. In fact, sometimes reserve
nisms determine which advertisements will be presented and                 prices may have a negative effect on revenue (Ostrovsky and
how much the corresponding advertisers need to pay. There                  Schwarz 2011). Higher reserve prices make the auction less
are two widely used auctions, i.e., the GSP auction (Edel-                 attractive which results in fewer participated advertisers.
man, Ostrovsky, and Schwarz 2007) (generalized second                         In this work, we generalize (Shen et al. 2020b) to the VCG
price) which is used by Google and Baidu, and the VCG                      auction, including the combination with reserve pricing. The
Copyright c 2021, Association for the Advancement of Artificial            intuition is that the VCG auction (with heterogeneous slots)
Intelligence (www.aaai.org). All rights reserved.                          can be decomposed into some tractable sub-VCG auctions
    1                                                                      (with homogeneous slots). Our contributions can be sum-
      https://www.statista.com/statistics/183816/us-online-
advertising-revenue-since-2000/                                            marized as follows.
• We generalize the application of coupons for revenue                                      Preliminaries
  maximization in advertising systems to more realistic sce-      In this section, the setting of VCG auction with coupons in
  narios (i.e., VCG auctions).                                    advertising systems will be introduced first. Then the formal
• We propose algorithms for coupon optimization using the         problem of learning to design coupons will be defined.
  properties of the VCG auction, along with the combina-
  tion with reserve pricing.                                      VCG Auction with Coupons in Advertising
• Based on both synthetic data and industrial data, exten-        Systems
  sive experiments are conducted to demonstrate the effec-        There are n bidders [n] = {1, 2, . . . , n} and m slots (usually,
  tiveness of our algorithms.                                     n  m). We will refer to ‘ad i’ as the advertisement submit-
                                                                  ted by bidder i. Each bidder can occupy at most 1 slot and
Related Works                                                     each slot can be allocated to at most 1 bidder. Let vi be the
Our work is related to the revenue-maximizing mechanism           private value which quantifies the value of a click for bidder
design. Myerson (1981) studies the optimal auction where          i. And vi is drawn from a publicly known distribution Fi .
advertisers should be ranked in descending order of their         For convenience, let v = (v1 , . . . , vn ) denote the value pro-
virtual values. If nobody has positive virtual values, the item   file, and let v−i = (v1 , . . . , vi−1 , vi+1 , . . . , vn ) denote the
would not be sold to anyone. However, Roughgarden and             value profile of all bidders except bidder i. Similarly, we can
Schrijvers (2016) argues that this setting relies heavily on      define the bid profile b and b−i . We only use αj ∈ [0, 1] to
the exact estimation of the advertisers’ value distribution and   denote the click-through-rate of ad i when it is allocated to
can be sensitive to estimation errors. Thus, simpler mecha-       slot j as (Edelman, Ostrovsky, and Schwarz 2007) do, which
nisms are applied in industry, e.g., VCG and GSP auction.         implies that the number of times a particular slot is clicked
Many methods have been proposed to increase revenue in            does not depend on the ad in this slot. The reason is that the
these auctions. Reserve pricing is one of the most widely         click-through-rate can be decomposed into position effect
studied methods: Hartline and Roughgarden (2009) presents         and ad effect while the ad effect can be taken into account in
how to use a reserve-price-based VCG auction to approxi-          the private valuation. Besides, αj ≥ αj 0 holds when j < j 0 .
mate the optimal one. Jin et al. (2019) proves a tight approx-    In response to the bid profile, VCG auction determines how
imation ratio for anonymous reserve prices. In (Zeithammer        to allocate each bidder and how much each bidder has to
2019), there is a bid level below which a winning bidder          pay according to some allocation rule and payment rule. To
has to pay according to the first-price auction instead of the    be specific, the allocation rule is a function π that maps the
second-price auction with reserve. Another method is boost-       bid profile b to an n-dimensional vector indicating the quan-
ing (Golrezaei et al. 2017) where the seller assigns a boost      tity of clicks allocated to each bidder, i.e., π : Rn 7→ [0, 1]n ;
value to each advertiser which can transform his regular bid      The payment rule is a function p : Rn 7→ Rn+ that maps the
into a boosted one. Besides, squashing (Lahaie and Pennock        bid profile b to an n-dimensional non-negative vector spec-
2007a) and anchoring (Lahaie and Pennock 2007b) are also          ifying the payment for each bidder. For simplicity, we use
widely used methods in industry (Chawla, Fu, and Karlin           π(b) and π, p(b) and p interchangeably. If bidder i is allo-
2014; Huang, Mansour, and Roughgarden 2015; Tang and              cated to slot j, then πi = αj , and if bidder i does not win
Wang 2016; Tang and Sandholm 2011).                               the auction, then πi = 0. These rules are specified as:
                                                                                                              Pn
   Our work is also related to automated mechanism de-            • Allocation rule: π(b) ∈ arg maxπ0 i=1 πi0 bi .
sign. Conitzer and Sandholm (2002) introduces the auto-
                                                                  • Payment rule: pi (b) = maxπ0 i0 6=i πi00 bi0 − i0 6=i πi0 bi0 .
                                                                                                       P                    P
mated mechanism design approach, where the mechanism
is computationally created for the specific problem instance         Coupons work as follows. Before submitting bids, each
at hand. Learning theory tools such as pseudo-dimension           bidder i will be informed that he will be provided with a
have been used to prove strong guarantees in auction set-         constant coupon ci ≥ 0. This coupon can get ci off the
tings (Morgenstern and Roughgarden 2015, 2016). In a sim-         payment when he wins. Hence bidders would change their
ilar direction, bounds on the sample complexity have also         regular bids depending on their values and coupons, i.e.,
been developed (Hsu et al. 2016). Mohri and Medina (2014)         bi = bi (vi ; ci ). Therefore, the allocation rule and payment
uses DC programming to learn anonymous reserve price              rule of VCG auction with coupons would be:
for the second-price auction. Derakhshan, Golrezaei, and                                              n
Leme (2019); Derakhshan, Pennock, and Slivkins (2020)                                                 X
                                                                              π(b) ∈ arg max                πi0 bi ,
use linear programming to compute personalized reserve                                    0  π
                                                                                                      i=1                                       (1)
prices in auction mechanism. Duetting et al. (2019) uses                                     X                     X
deep learning to design revenue-maximizing mechanisms                        pi (b) = max
                                                                                       0
                                                                                                      πi00 bi0 −            πi0 bi0 − πi ci .
                                                                                        π
and deals with the incentive constraint via regret network.                                  i0 6=i                i0 6=i
Shen, Tang, and Zuo (2019) also handles the same problem
but can always return exactly incentive compatible mecha-         The utility of bidder i is ui (b) = πi vi − pi , and the revenue
nisms. Recently, Tang (2017) uses reinforcement learning to       of platform would be:
design and optimize mechanisms in dynamic industrial en-                                                        n
vironments and has achieved success in industry (Cai et al.
                                                                                                                X
                                                                                         Rev(v, c) =                   pi .                     (2)
2018a,b; Shen et al. 2020a).                                                                                    i=1
In what follows, let (i) denote the order of bidders satisfying                   Definition 1 (No-feature case). In this case, features are
if i < i0 , then [b(i) > b(i0 ) ] ∨ [(b(i) = b(i0 ) ) ∧ (c(i) ≤ c(i0 ) )],        not considered, and the coupon optimization problem is to
that is, (i) denotes the bidder with the i-th highest bid (if                     minimize the following empirical loss:
there are more than one, then choose the bidder with the                                                       T
least coupon).                                                                                              1X
                                                                                                 LS (c) = −       Rev(v t ; c),               (5)
                                                                                                            T t=1
Learning Formulation                                                              where c is an n−dimensional vector specifying coupons
There are two sets of data – the training data and the testing                    which need to be optimized.
data, containing features and bids. Both sets of data are gen-                    Proposition 2. In the no-feature case, the optimization of
erated by VCG auction without coupons. And our goal is to                         coupons in VCG auction (i.e., Equation (5)) is NP-hard.
train a model which can provide coupons for bidders in each
                                                                                     Proposition 2 points out coupon optimization is hard,
auction and maximize the revenue for platform. Since VCG
                                                                                  so we use the coordination descent method to optimize
auction is a truthful mechanism, in these sets of data, we can
                                                                                  coupons. In this method, instead of optimizing coupons si-
regard bids as bidders’ values.
                                                                                  multaneously, we optimize one of them while fixing the oth-
   Let us define the problem formally. We consider a generic                      ers at each time step.
feature space X with the label space B = Rn+ consisting
of the value profile v. We regard (xt , v t ) as a data instance                  Comparison with Related Work
where xt , v t denote the feature vector and value profile in                     As proposition 1 demonstrated, although coupons are op-
auction t, respectively. The hypothesis function h : X →                          tional for advertisers, they still accept and use all coupons
Rn+ is used to set coupons ct = h(xt ). Thus, the objective is                    in equilibrium. We will compare the difference between
to select a hypothesis function h out of some hypothesis set                      coupons and other involuntary discount mechanism, like
H to minimize the corresponding empirical loss:                                   boosting (Golrezaei et al. 2017) and bidding credits in
                                 T                                                squashing (Lahaie and Pennock 2007a). Boosting and bid-
                              1X                                                  ding credits both multiply the submitted bids with a factor,
               LS (h) = −           Rev(v t ; h(xt )).                  (3)
                              T t=1                                               i.e., ai ∗ bi . As for boosting, there is no restriction on ai .
                                                                                  While for bidding credits, ai is between 0 and 1. Then in the
where S = ((x1 , v 1 ), . . . , (xT , v T )). Let αjt , pti and πit de-           VCG auction, there are two differences.
note the corresponding notations in auction t. The goal is to                     • When 0 ≤ ai < 1, advertiser i prefer not to accept boost-
learn the predictor which can work for any advertiser if cor-                        ing or bidding credits. However, as for coupons, advertis-
responding features can be provided. For the sake of descrip-                        ers’ dominant strategy would always be accepting them.
tion, we use a new hypothesis function hi to set the coupon
                                                                                  • Coupons can increase the revenue in some cases where
for bidder i as cti = hi (xti ), where xti denotes the feature for
                                                                                     boosting can not. For example, consider one slot and two
bidder i in auction t. For convenience, in what follows we
                                                                                     bidders whose values are 0 and 1. Then, boosting can not
will omit the superscript t when there is no confusion.
                                                                                     increase the revenue since ai multiplies 0 still equals 0.
   It is worth noting that all proofs can be found in supple-                        While for coupons, this would not be a problem.
mentary materials due to limited space.
                                                                                     Comparing with the second-price setting, several non-
                                                                                  trivial technical issues are raised in the VCG setting. For
                       Problem Analysis                                           example, Shen et al. (2020b) relies heavily on the fact that
In this section, we will introduce the property of coupons                        given value profile v and coupons profile c−i , Rev(v; c) (as
and show the difficulty of coupon optimization first. Then,                       a function of ci ) has at most one discontinuous point. Hence
the comparison with related work would be presented.                              the surrogate loss function can be decomposed as a differ-
                                                                                  ence of two convex functions and DC programming can be
Proposition 1. In VCG auction with coupons, the dominant                          applied. However, in the VCG setting, the revenue function
strategy of bidder i is using coupon ci and bidding vi + ci .                     is more complicated as Proposition 3 denotes.
   Proposition 1 denotes the dominant strategy of bidders in                      Proposition 3. In VCG auction with coupons, given value
VCG auction with coupons. Therefore, we can use vi + ci to                        profile v and the coupon profile c−i , Rev(v; c) (as a func-
denote the bid of bidder i when he is provided with coupon                        tion of ci ) is not continuous, and is neither convex nor con-
ci in experiments. Thus we can define the revenue of VCG                          cave. Actually, there can be m discontinuous points.
auction with coupons as Equation (4).                                                We use Example 1 to demonstrate Proposition 3. As illus-
                 m X
                 X  m                       m
                                            X                                    trated in Figure 1, Rev(v; c) (as a function of ci ) is a piece-
Rev(v; c) =                   αi b(i+1) −           αi b(i) − αj c(j)             wise function. There can be multiple discontinuous points.
                 j=1    i=j                 i=j+1                                 They divide the function into multiple line segments, which
                 m h                                                          i   can have different slopes.
                 X
             =         j · (αj − αj+1 )(v(j+1) + c(j+1) ) − αj c(j)               Example 1. Let n = 6, m = 4, v = (5, 4, 3, 2, 1, 0) and
                 j=1                                                              α = (1.0, 0.4, 0.3, 0.1). When c−6 = 0, the value of revenue
                                                                        (4)       with respect to c6 is demonstrated in Figure 1.
have many local minima (Mohri and Medina 2014). We ex-
                                  5                                              tend the technique in (Mohri and Medina 2014; Shen et al.
                                                                                 2020b) to tackle the special case:
           The value of revenue
                                  4
                                                                                1. We smooth Equation (6) to derive a surrogate loss func-
                                  3                                                 tion Lγ (y; v, c−i ), which is training-friendly and has a
                                  2                                                 good approximation of Equation (6) theoretically.
                                  1                                             2. We decompose Lγ as the difference of two convex func-
                                                                                    tions, i.e., Lγ = g1 −g2 . Then we apply DC programming
                                      0   1   2          3          4   5   6       and coordination descent method to optimize the coupons.
                                                  The value of c6
                                                                                 Due to limited space, detailed formulas of Lγ and algorithms
           Figure 1: The value of revenue w.r.t. c6                              are presented in the supplementary materials.

                                                                                           Solution for the General Case
   Furthermore, when reserve price is taken into considera-                     In this section, we will discuss how to extend the afore-
tion, the revenue function can be much more complicated,                        mentioned special case into the general case, i.e., the click-
making the coupon optimization harder.                                          through-rates of different slots are also different. We decom-
                                                                                pose each VCG auction into m sub-VCG auctions which
                                              Warm up                           belong to the special case, and complete the coupon opti-
In this section, we will introduce a special case of VCG auc-                   mization based on the decomposition. Then we study how
tion with coupons. As Proposition 4 denotes, we can extend                      to combine coupons with reserve prices in our framework.
the technique in (Mohri and Medina 2014; Shen et al. 2020b)
to tackle the special case.                                                     Constructing m sub-VCG Auctions
Definition 2 (Special case). In this case of VCG auction
with coupons, the slots are homogeneous, i.e., α1 = α2 =                        Algorithm 1 Constructing m sub-VCG auctions
· · · = αm = α.                                                                  1: Initialize Q = ∅.
   Let subscript (j) only represent the order of bids. We                        2: for j = 1 to m do
use b−i,(j) = v−i,(j) + c−i,(j) to denote the j-th high-                         3:    Let dj = αj − αj+1 , and construct a sub-VCG auc-
est bid in bid profile b−i , that is, if j < l, then the order                        tion Aj as: there are n bidders, j slots, and each slot
[b−i,(j) > b−i,(l) ] ∨ [(b−i,(j) = b−i,(l) ) ∧ (c−i,(j) ≤ c−i,(l) )]                  has the same click-through-rates dj .
is ensured. Let L(y; v, c−i ) = −Rev(v; (y, c−i )) be the                        4:   Q = Q ∪ {Aj }.
negative number of the revenue as a function of ci = y,                          5: end for
which can be rewritten as:
 L(y; v, c−i )                                                                     The decomposition process is demonstrated in Algo-
                    Pm                                                         rithm 1. There are m steps. At the j-th step, we construct
 −mαb−i,(m+1) + α
 
                      m
                        j=1 c−i,(j) , y ≤ b−i,(m+1) − vi                        the j-th sub-VCG auction, where there are j slots and n
                                                                                bidders, and all slots share the same click-through-rates dj .
                  P
 [−mαb−i,(m) + α j=1 c−i,(j)
 
 
= −α(c−i,(m) − y)+ ],                 y = b−i,(m) − vi                          Thus, the j-th sub-VCG auction belongs to the special case.
                      Pm−1                                                      Furthermore, Theorem 1 denotes that given value profile v
   αy − mαb−i,(m) + α j=1 c−i,(j) , y > b−i,(m) − vi
 
 
                                                                                and coupon profile c, the results of these m sub-VCG auc-
 
 
                     Pm
   −mαy − mαvi + α j=1 c−i,(j) ,      otherwise
 
                                                                                tions are equivalent to those results of the original VCG auc-
                                                  (6)                           tion for both platform (in terms of revenue) and each bidder
                                                                                (in terms of allocation and payment).
Here x+ means max {x, 0}. As Proposition 4 states,
                                                                                Theorem 1. Given value profile v and coupon profile c, the
L(y; v, c−i ) has at most one discontinuous point, which is
                                                                                original VCG auction and these sub-VCG auctions would
a reduction of Proposition 3.
                                                                                receive the same bid profile b. Besides, for bidder i, his total
Proposition 4. In the special case, given the value pro-                        payment and allocation (the sum of obtained click-through-
file v and the coupon profile c−i , L(y; v, c−i ) has at most                   rates) within these sub-VCG auctions are equivalent to those
one discontinuous point at y = b−i,(m) − vi . When y <                          in the original VCG auction.
b−i,(m) − vi , L(y; v, c−i ) is non-increasing and concave.                        As a result, we present Theorem 2 to denote that by com-
When y > b−i,(m) − vi , L(y; v, c−i ) is an increasing lin-                     puting the optimal coupon for bidder i in sub-VCG auctions,
ear function. What’s more, for all the cases, L(y; v, c−i )                     the optimal coupon for bidder i in the original VCG auction
achieves its minimum at y = (b−i,(m) − vi )+ .                                  can also be obtained.
   However, L(y; v, c−i ) is still not a good choice for opti-                  Theorem 2. Given v and c−i , the optimal coupon c∗i for
mization since L(y; v, c−i ) is neither convex nor concave,                     bidder i in the original VCG auction belongs to the follow-
but quasi-convex in fact. And a sum of quasi-convex func-                       ing sets
tions does not maintain the quasi-convex property and can                                       {0} ∪ {(b−i,(j) − vi )+ }j∈[m] ,
where (b−i,(j) − vi )+ is the optimal coupon for bidder i in           Algorithm 3 Algorithm for the general case
the j-th sub-VCG auction Aj .                                           1: Run Algorithm 1 to construct mT sub-VCG auctions.
  Combine with the coordinate descent method, we propose                2: Initialize ω ← 0 and kout = 0, and generate a random
Algorithm 2 to optimize coupon for the no-feature case.                      permutation of 1 to n as N̄ .
Here λ in line 7 denotes the learning rate,  and Kout in               3: repeat
                                                                        4:   Set ωp ← ω and kout ← kout + 1.
                                                                        5:   for i = N̄1 to N̄n do
Algorithm 2 Algorithm for the no-feature case
                                                                        6:     Use ω to initialize ωi0 , and calculate coupons for
 1: Run Algorithm 1 to construct mT sub-VCG auctions.                           bidders except i via ctj = ω · xtj + c0 .
 2: Initialize c ← 0 and kout = 0, and generate a random                7:      for k = 1 to K do
      permutation of 1 to n as N̄ .                                     8:        ωik ← DCA(ωik−1 ).
 3: repeat                                                              9:      end for
 4:   Set cp ← c and kout ← kout + 1.                                  10:      Use λωiK + (1 − λ)ω to update ω.
 5:   for i = N̄1 to N̄n do                                            11:   end for
 6:     Fix c−i , calculate the optimal coupon via evaluat-            12:   Set Ψ = {0.1 · (10∆/kωk)0.1j |j = 0, 1, . . . , 10},
           ing Equation (5) at the following O(mT ) points                   compute η ∗ ∈ arg minη∈Ψ LS (η · ω) and use η ∗ · ω
           and returning the best y ∗ .                                      to update ω.
                                                       t∈[T ]
                                                                       13:   Generate a new random permutation N̄ if LS (ω) ≥
                       {0} ∪ {(bt−i,(j) − vit )+ }j∈[m]                      LS (ωp ).
                                                                       14: until kω − ωp k ≤  or kout = Kout
 7:      Use λy ∗ + (1 − λ)ci to update ci .
 8:   end for
 9:   Generate a new random permutation N̄ if LS (c) ≥                 following optimization problem (Equation (8)),
      LS (cp ).
                                                                                       T X
                                                                                         m
10: until kc − cp k ≤  or kout = Kout                                                 X
                                                                          min                    stj − δG2 (ωik−1 ) · ωi         s.t. ∀j ∈ [m]
                                                                       ||ωi ||≤Λ,s
                                                                                       t=1 j=1
line 10 are used to determine when the algorithm terminates.            t
                                                                        ci = ωi · xti + c0 ,                                    t ∈ [T ]
                                                                       
                                                                          t    t    t       t
                                                                                                     Pj−1 t                 j     j    j
                                                                         s  /d   ≥ c  −  jb        +        c       , t ∈ C   ∪ C 3 ∪ C4
                                                                       
                                                                       
Algorithm Design for the General Case
                                                                       
                                                                         j    j    i       −i,(j)       l=1 −i,(l)        1
                                                                                                     t,j
                                                                                                                                 t ∈ C2j
                                                                       
                                                                          t    t       t        t
                                                                       sj /dj ≥ −jci − jvi + fi,1 ,
                                                                       
                                                                       
                                                                       
The hypothesis set H consists of linear functions whose un-                                   et,j
biased term is bounded, i.e., H = {h : xi 7→ ω · xi +                     stj /dtj ≥ (1 + γdit,j )(cti − dt,j          t             t,j
                                                                                                           i,1 ) − jb−i,(j) + fi,1 , t ∈ C2
                                                                                                                                                     j
                                                                                            i,1
c0 |kωk ≤ ∆} and c0 is a positive constant. Besides, ∆
                                                                        
                                                                                       et,j
                                                                        
                                                                        stj /dtj ≥ ( it,j − j)(cti − dt,j             t             t,j
                                                                                                           i,1 ) − jb−i,(j) + fi,2 , t ∈ C3
                                                                                                                                                    j
                                                                        
                                                                        
is chosen to satisfy ∆ ≤ maxckx0
                                  ik
                                     , which guarantees that            
                                                                                     γdi,1
                                                                                     φt,j
                                                                        
                                                                          sj /dj ≥ αit (cti − dt,j                         t,j
                                                                                                                                           t ∈ C4j
                                                                        
                                                                         t t                                t
coupons are non-negative because                                                                  i,1 ) − jb−i,(j) + fi,2 ,
                                                                                                                                                (8)
         ω · xi + c0 ≥ c0 − |ω · xi | ≥ c0 − kωkkxi k
                                                                       where G2 (ωik−1 ) =                      k−1
                                                                                                                     · xti + c0 ; v t , ct−i , Atj )
                                                                                                  P
                     ≥ c0 − ∆ · max kxi k ≥ 0.                                                       t,j g2 (ωi
                                                                                     k−1
                                                                       and δG2 (ωi ) denotes an arbitrary element of the sub-
It is worth noting that max kxi k can be bounded once we
                                                                       gradient ∂G2 (ωik−1 ). Besides, Ckj , dt,j          t,j    t,j    t,j
                                                                                                                    i,1 , di,2 , ei , fi,1 , fi,2
                                                                                                                                                 t,j
normalize the feature space.
   Note that Theorem 1 states that the results of the m sub-           and φt,ji are similar to the definitions in the special case with
VCG auctions are equivalent to those of the original VCG               just a substitution from m to j as Equation (9).
auction, which holds over the T VCG auctions, thus the em-             dt,j    t          t     t,j     t          t    t,j       t  t
                                                                        i,1 = b−i,(j) − vi , di,2 = b−i,(j+1) − vi , ei = v−i,(j) − vi ,
pirical surrogate loss can be written as                                      Xj                          Xj−1
                                                                        t,j                        t,j
                                                                       fi,1 =         ct−i,(l) , fi,2  =       ct−i,(l) + dt,j
                                                                                                                            i,1 ,
            T X
            X m                                                                      l=1                          l=1
                       Lγ (ωi · xti + c0 ; v t , ct−i , Atj ),   (7)   φt,j  t    t,j       t           t           t,j
                                                                        i = dj /γdi,1 · (j(b−i,(j+1) − b−i,(j) ) + ei ),
             t=1 j=1
                                                                       C1j = {t|vit ≥ bt−i,(j) }, C2j = {t|vit < bt−i,(j) , vit ≤ v−i,(j)
                                                                                                                                   t
                                                                                                                                          },
where Atj denotes the j-th constructed sub-VCG auction of              C3j = {t|vit < bt−i,(j) , vit > v−i,(j)
                                                                                                        t
                                                                                                               , (1 − γ)dt,j    t,j
                                                                                                                         i,1 ≥ di,2 },
the t-th VCG auction by executing Algorithm 1, and the cor-
responding click-through-rate is dtj = αjt − αj+1
                                              t
                                                  . It is worth        C4j = {t|vit < bt−i,(j) , vit > v−i,(j)
                                                                                                        t
                                                                                                               , (1 − γ)dt,j    t,j
                                                                                                                         i,1 < di,2 }.
             t
noting that Aj belongs to the special case. Follow the work                                                                                      (9)
in (Mohri and Medina 2014; Shen et al. 2020b), we also use
the technique DC programming for optimization. Therefore,              Combination with Reserve Price
we complete the algorithm design for the general case as               We combine reserve prices with coupons to further improve
Algorithm 3. Here DCA(ωik−1 ) (line 8) means to solve the              the revenue by eliminating the negative payments caused by
large coupons. Let ri + ci denote the eager reserve price for       of features for each bidder in each auction is 178. Besides,
bidder i where ri is independent of ci , the eager reserve price    all the bids are transformed into the interval (0, 10), and
works as below.                                                     features are normalized to satisfy kxi k ≤ 10. We choose
1. The auction first eliminates the bidder who does not clear       c0 = 10 so that ∆ = 1.
   his reserve price, then the remaining bidders form the can-
                                                                    Surrogate Loss vs. Real Loss
   didate set S = {i|bi ≥ ri + ci }.
                                                                    We use synthetic data to show the difference between Lγ and
2. Bidders in the candidate set S are allocated according to        L. For a target bidder, we randomly select related T = 50
   the allocation rule of VCG auction.                              value profiles and generate others’ coupons. Without loss of
3. The payment rule can be modified as follows. If bidder i         generality, the click-through-rates are all 1. As Figure 2(a)
   gets the j-th slot, then he need to pay                          show, LγS is a lower bound for LS , and the difference be-
      m
                                                                    tween two functions is relatively small. Besides, Figure 2(b)
      X                                                             denotes that we can approach the real loss function as we
            (αk − αk+1 ) max{b(k+1) , ri + ci } − αj ci ,   (10)
                                                                    select sufficiently small γ. Similar results are also shown
      k=j
                                                                    on the industrial data (see in supplementary materials). Al-
   where b(k+1) = 0 holds for k = |S|, . . . , m if |S| ≤ m.        though small γ is necessary to guarantee the precision accu-
                                                                    racy, too small γ could make the surrogate loss function tend
It is worth noting that the payment (Equation (10)) must            to be discontinuous. γ is chosen to be 0.1 in the experiments.
be non-negative. Furthermore, we use Proposition 5 to
show the dominant strategy of bidders in VCG auction with
coupons when we combine reserve prices. Thus we can still
use vi + ci to denote the bid of bidder i when he is provided
with coupon ci in experiments.
Proposition 5. In VCG auction with coupons {ci } and re-
serve prices {ri + ci }, the dominant strategy of bidder i is
using coupon ci and bidding vi + ci .
   Note that ri is independent of ci , we can naturally inte-                     (a)                           (b)
grate the reserve prices into the framework of VCG auction
with coupons once we optimize the coupon for each bidder.           Figure 2: Plot of the difference between LS and LγS on the
For example, if we simply select ri = 0 for bidder i, then          synthetic data. (a) LS and LγS as the function of coupon y
the reserve price for bidder i is ci . Bidder i will always clear   when γ = 0.09, where x-axis denotes the coupon y while
his reserve price, but his payment can never be negative.           y-axis denotes the value of loss. (b) average difference be-
                                                                    tween LS and LγS as the function of γ, where x-axis denotes
                         Experiment                                 the choice of γ while y-axis denotes the difference.
In this section, we first verify the validity of the design of
surrogate loss function by comparing it with the real loss          Demonstration of Training Phase
function. Then we examine the impact of hyper-parameters            We implement the algorithms for coupon optimiza-
of our algorithms and verify the properties of coupons. Fi-         tion, where Algorithm 3 is implemented using Gurobi
nally, we conduct some experiments to show the effective-           9.0 (Gurobi Optimization 2020). We first examine the im-
ness of our algorithms against state-of-the-art algorithms.         pact of learning rate λ in terms of revenue. The results
                                                                    are shown as Figure 3(a), where Alg-2 and Alg-3 repre-
Data Description                                                    sent Algorithm 2 and 3 respectively and λ is selected from
We use both synthetic data and industrial data to demon-            {0.01, 0.05, 0.1, 0.5}. We can see that Alg-3 always yields
strate the results of our experiments.
   As for synthetic data, we choose three different types of
distribution to sample the value data, i.e., the uniform dis-
tribution, the Pareto distribution and the lognormal distri-
bution. To be specific, we sample T numbers from a stan-
dard distribution with fixed parameters for each bidder (i.e.,
X ∼ U (0, 1) or ln X ∼ N (0, 1)), and then each bidder
is associated with a random scale size to multiply these T                       (a)                            (b)
numbers as his values in T auctions.
   As for industrial data, it comes from one of the biggest         Figure 3: Plot of training process. (a) revenue curve, where
short-form mobile video community in the world. We ran-             x-axis denotes the iteration while y-axis denotes the corre-
domly extract 177 million auctions from the log. Each ad-           sponding revenue gain. (b) ratio curve, where x-axis denotes
vertiser instance contains many features, including labels,         the iteration while y-axis denotes the corresponding ratio of
industry category, pricing type (i.e., cost per click, cost per     negative payments.
mille or cost per action), budget and so on, and the number         better performance than Alg-2 since it can utilize features.
Algorithms may not converge within 20 iterations when λ             Table 1: Performance (i.e. ρa ) on synthetic data when m =
is small, i.e., λ ≤ 0.1 in Alg-2 and λ = 0.01 in Alg-3.             4. Numbers in the brackets denote standard deviations.
As for Alg-2, larger λ can achieve better performance, thus
we choose λ = 0.5 in Alg-2 to guarantee convergence and               Method        Uniform          Pareto        Lognormal
achieve better performance. While for Alg-3, although larger           ARP         2.58(1.05)      8.88(2.38)      2.92(3.24)
λ can have higher revenue in some iterations, it is unstable           SRP        12.16(5.98)      9.80(3.36)     21.86(9.44)
during the training phase. Hence λ is set to 0.05 in Alg-3            BVCG         7.51(3.32)      1.83(1.27)      6.94(2.80)
to maintain robustness and obtain comparable performance.             Alg-2       13.91(5.77)      3.18(2.57)     10.36(4.77)
Besides, in the remaining experiments, we use  = 0.001               Alg-2a      13.94(5.75)      3.28(2.57)     10.41(4.75)
and Kout = 20 in both algorithms.                                     Alg-2b      15.75(6.07)      9.94(3.36)     22.54(9.15)
   We further verify the property that the payment can be
negative for some bidder when the coupon provided for him
is too large. We calculate the ratio of negative payments after     erson 1981) via producing similar allocation (see in sup-
                       #{(i,t)|pti ≤0}
each iterations, i.e.,      mT         . As shown in Figure 3(b),   plementary materials). For other distributions, Alg-2 always
the payment of some bidder can be negative, but the ratio           has better performance than BVCG. Thirdly, results show
of negative payments is small (< 0.5%). Besides, the ratio          the effectiveness of combining reserve prices with coupons.
increases to improve the revenue in the beginning, but then         Alg-2(a) outperforms Alg-2 by eliminating the negative pay-
tends to decrease. What’s more, the final ratio of negative         ments and Alg-2(b) performs the best among above methods
payments in Alg-3 is smaller than that in Alg-2.                    in all datasets by setting bidder-specific reserve prices. Sim-
                                                                    ilar conclusions can also be seen in the results when m 6= 4,
Revenue Comparison                                                  we demonstrate them in the supplementary materials.
We conduct some experiments to show the effectiveness                  As for industrial data, We first divide 177 million into 59
of coupons and our algorithms. The performance metric is            groups. Then in each group, auctions are divided into train-
         a −Rev0
ρa = RevRev      × 100%, where Reva represents the rev-             ing data and testing data. After training, we summarize the
            0
                                                                    average results on these 59 groups as Table 2. On the indus-
enue achieved through the corresponding algorithm a and
Rev0 denotes the revenue obtained without these methods
in VCG auction. The following methods are compared:                     Table 2: Performance (i.e. ρa ) on the industrial data.

• (ARP) This method sets anonymous reserve price in VCG                       Method      training data   testing data
  auction (Mohri and Medina 2014).                                             ARP         0.00(0.00)      0.00(0.00)
• (SRP) This method sets bidder-specific reserve price in                      SRP        −1.05(1.13)     −2.86(4.01)
  VCG auction via maximizing vi (1−Fi (vi )) (Hartline and                    BVCG         1.19(0.36)      0.13(0.75)
  Roughgarden 2009).                                                          Alg-2        1.33(0.57)      0.32(0.29)
• (BVCG) This method assigns a boost value for each bid-                      Alg-2a                       0.36(0.28)
  der in VCG auction (Golrezaei et al. 2017).                                 Alg-3       4.74(3.95)       4.86(3.81)
                                                                              Alg-3a                      4.87(3.81)
• (Alg-2a/Alg-3a) This method combines Algorithm 2 or 3
  with reserve prices ri + ci where ri = 0.
                                                                    trial data, Alg-3(a) outperforms others by a large margin,
• (Alg-2b/Alg-3b) This method combines Algorithm 2                  demonstrating the effectiveness of our mechanisms. More
  or 3 with reserve prices ri + ci where ri belongs to              details can be found in supplementary materials.
  arg maxvi vi (1 − Fi (vi )).
Note that (Mohri and Medina 2014; Golrezaei et al. 2017)                                   Conclusion
only study the second price setting, we extend their methods
                                                                    In this paper, we study how to design coupons in VCG auc-
to VCG auction as ARP and BVCG (details are provided in
                                                                    tion via a learning framework. Firstly, we derive the dom-
supplementary materials).
                                                                    inant strategy of each bidder and characterize the coupons.
   As for synthetic data, given the type of distribution and        Then we start from a special case where slots are homoge-
the number of slots m, we randomly sample the value data            neous and extend this special case to the general one via auc-
and divide the whole dataset into training data and testing         tion decomposition and complete the algorithm design. We
data, with training data makes up about 70%. Then we run            also combine coupons with reserve prices in our framework.
different algorithms on the training data and calculate ρa on       Finally, extensive experiments are conducted to demonstrate
the testing data. This procedure is repeated for 20 times and       the effectiveness of the algorithms.
the results when m = 4 are summarized as Table 1 shows.
   We can draw the following conclusions from Table 1.
Firstly, coupons work well on these synthetic datasets as                              Acknowledgments
Alg-2 can increase the revenue by a large margin. Sec-              This work is is supported by Beijing Academy of Artifi-
ondly, Alg-2 always outperforms ARP, SRP and BVCG on                cial Intelligence (BAAI), Science and Technology Innova-
the datasets sampled from uniform distribution. This is be-         tion 2030—“New Generation Artificial Intelligence” Major
cause coupons can approximate the optimal auction in (My-           Project No. 2018AAA0100904, 2018AAA0101103.
References                               R., eds., Proceedings of the 36th International Conference
Bester, H.; and Petrakis, E. 1996. Coupons and oligopolis-      on Machine Learning, ICML 2019, 9-15 June 2019, Long
tic price discrimination. International Journal of Industrial   Beach, California, USA, volume 97 of Proceedings of Ma-
Organization 14(2): 227–242.                                    chine Learning Research, 1706–1715. PMLR. URL http:
                                                                //proceedings.mlr.press/v97/duetting19a.html.
Board, S.; and Skrzypacz, A. 2016. Revenue management
with forward-looking buyers. Journal of Political Economy       Edelman, B.; Ostrovsky, M.; and Schwarz, M. 2007. In-
124(4): 1046–1087.                                              ternet advertising and the generalized second-price auction:
                                                                Selling billions of dollars worth of keywords. American eco-
Cai, Q.; Filos-Ratsikas, A.; Tang, P.; and Zhang, Y. 2018a.
                                                                nomic review 97(1): 242–259.
Reinforcement Mechanism Design for e-commerce. In
Champin, P.; Gandon, F. L.; Lalmas, M.; and Ipeirotis, P. G.,   Golrezaei, N.; Lin, M.; Mirrokni, V.; and Nazerzadeh, H.
eds., Proceedings of the 2018 World Wide Web Conference         2017. Boosted Second Price Auctions: Revenue Optimiza-
on World Wide Web, WWW 2018, Lyon, France, April 23-27,         tion for Heterogeneous Bidders. Available at SSRN 3016465
2018, 1339–1348. ACM. doi:10.1145/3178876.3186039.              .
URL https://doi.org/10.1145/3178876.3186039.                    Groves, T.; et al. 1973. Incentives in teams. Econometrica
Cai, Q.; Filos-Ratsikas, A.; Tang, P.; and Zhang, Y.            41(4): 617–631.
2018b. Reinforcement Mechanism Design for Fraudu-               Gurobi Optimization, L. 2020. Gurobi Optimizer Reference
lent Behaviour in e-Commerce.          In McIlraith, S. A.;
                                                                Manual. URL http://www.gurobi.com.
and Weinberger, K. Q., eds., Proceedings of the Thirty-
Second AAAI Conference on Artificial Intelligence, (AAAI-       Hartline, J. D.; and Roughgarden, T. 2009. Simple versus
18), the 30th innovative Applications of Artificial Intel-      optimal mechanisms. In Chuang, J.; Fortnow, L.; and Pu,
ligence (IAAI-18), and the 8th AAAI Symposium on Ed-            P., eds., Proceedings 10th ACM Conference on Electronic
ucational Advances in Artificial Intelligence (EAAI-18),        Commerce (EC-2009), Stanford, California, USA, July 6–
New Orleans, Louisiana, USA, February 2-7, 2018, 957–           10, 2009, 225–234. ACM. doi:10.1145/1566374.1566407.
964. AAAI Press. URL https://www.aaai.org/ocs/index.php/        URL https://doi.org/10.1145/1566374.1566407.
AAAI/AAAI18/paper/view/16650.                                   Hsu, J.; Morgenstern, J.; Rogers, R. M.; Roth, A.; and
Chawla, S.; Fu, H.; and Karlin, A. R. 2014. Approximate         Vohra, R. 2016. Do prices coordinate markets? In Wichs,
revenue maximization in interdependent value settings. In       D.; and Mansour, Y., eds., Proceedings of the 48th Annual
Babaioff, M.; Conitzer, V.; and Easley, D. A., eds., ACM        ACM SIGACT Symposium on Theory of Computing, STOC
Conference on Economics and Computation, EC ’14, Stan-          2016, Cambridge, MA, USA, June 18-21, 2016, 440–453.
ford , CA, USA, June 8-12, 2014, 277–294. ACM. doi:             ACM. doi:10.1145/2897518.2897559. URL https://doi.org/
10.1145/2600057.2602858. URL https://doi.org/10.1145/           10.1145/2897518.2897559.
2600057.2602858.                                                Huang, Z.; Mansour, Y.; and Roughgarden, T. 2015. Making
Clarke, E. H. 1971. Multipart pricing of public goods. Pub-     the Most of Your Samples. In Roughgarden, T.; Feldman,
lic choice 17–33.                                               M.; and Schwarz, M., eds., Proceedings of the Sixteenth
Conitzer, V.; and Sandholm, T. 2002.              Complexity    ACM Conference on Economics and Computation, EC ’15,
of Mechanism Design.           In Darwiche, A.; and Fried-      Portland, OR, USA, June 15-19, 2015, 45–60. ACM. doi:
man, N., eds., UAI ’02, Proceedings of the 18th                 10.1145/2764468.2764475. URL https://doi.org/10.1145/
Conference in Uncertainty in Artificial Intelligence,           2764468.2764475.
University of Alberta, Edmonton, Alberta, Canada, Au-           Jin, Y.; Lu, P.; Qi, Q.; Tang, Z. G.; and Xiao, T. 2019. Tight
gust 1-4, 2002, 103–110. Morgan Kaufmann.              URL      approximation ratio of anonymous pricing. In Charikar, M.;
https://dslpitt.org/uai/displayArticleDetails.jsp?mmnu=1\       and Cohen, E., eds., Proceedings of the 51st Annual ACM
&smnu=2\&article\ id=850\&proceeding\ id=18.                    SIGACT Symposium on Theory of Computing, STOC 2019,
Derakhshan, M.; Golrezaei, N.; and Leme, R. P. 2019. LP-        Phoenix, AZ, USA, June 23-26, 2019, 674–685. ACM. doi:
based Approximation for Personalized Reserve Prices. In         10.1145/3313276.3316331. URL https://doi.org/10.1145/
Karlin, A.; Immorlica, N.; and Johari, R., eds., Proceed-       3313276.3316331.
ings of the 2019 ACM Conference on Economics and Com-           Kang, H.; Hahn, M.; Fortin, D. R.; Hyun, Y. J.; and Eom,
putation, EC 2019, Phoenix, AZ, USA, June 24-28, 2019,          Y. 2006. Effects of perceived behavioral control on the con-
589. ACM. doi:10.1145/3328526.3329594. URL https:               sumer usage intention of e-coupons. Psychology & Market-
//doi.org/10.1145/3328526.3329594.                              ing 23(10): 841–864.
Derakhshan, M.; Pennock, D. M.; and Slivkins, A. 2020.          Lahaie, S.; and Pennock, D. M. 2007a. Revenue analy-
Beating Greedy For Approximating Reserve Prices in Multi-       sis of a family of ranking rules for keyword auctions. In
Unit VCG Auctions. CoRR abs/2007.12653. URL https:              MacKie-Mason, J. K.; Parkes, D. C.; and Resnick, P., eds.,
//arxiv.org/abs/2007.12653.                                     Proceedings 8th ACM Conference on Electronic Commerce
Duetting, P.; Feng, Z.; Narasimhan, H.; Parkes, D. C.;          (EC-2007), San Diego, California, USA, June 11-15, 2007,
and Ravindranath, S. S. 2019. Optimal Auctions through          50–56. ACM. doi:10.1145/1250910.1250918. URL https:
Deep Learning. In Chaudhuri, K.; and Salakhutdinov,             //doi.org/10.1145/1250910.1250918.
Lahaie, S.; and Pennock, D. M. 2007b. Revenue analy-           2020, The Thirty-Second Innovative Applications of Artifi-
sis of a family of ranking rules for keyword auctions. In      cial Intelligence Conference, IAAI 2020, The Tenth AAAI
MacKie-Mason, J. K.; Parkes, D. C.; and Resnick, P., eds.,     Symposium on Educational Advances in Artificial Intelli-
Proceedings 8th ACM Conference on Electronic Commerce          gence, EAAI 2020, New York, NY, USA, February 7-12,
(EC-2007), San Diego, California, USA, June 11-15, 2007,       2020, 2236–2243. AAAI Press. URL https://aaai.org/ojs/
50–56. ACM. doi:10.1145/1250910.1250918. URL https:            index.php/AAAI/article/view/5600.
//doi.org/10.1145/1250910.1250918.                             Shen, W.; Tang, P.; Wang, X.; Xu, Y.; and Yang, X. 2020b.
Mckenzie, R. D. 2008. Why Popcorn Costs So Much at the         Learning to Design Coupons in Online Advertising Mar-
Movies: And Other Pricing Puzzles. Springer New York.          kets. In Seghrouchni, A. E. F.; Sukthankar, G.; An, B.;
                                                               and Yorke-Smith, N., eds., Proceedings of the 19th Inter-
Mohri, M.; and Medina, A. M. 2014. Learning Theory and         national Conference on Autonomous Agents and Multiagent
Algorithms for revenue optimization in second price auc-       Systems, AAMAS ’20, Auckland, New Zealand, May 9-13,
tions with reserve. In Proceedings of the 31th Interna-        2020, 1242–1250. International Foundation for Autonomous
tional Conference on Machine Learning, ICML 2014, Bei-         Agents and Multiagent Systems. URL https://dl.acm.org/
jing, China, 21-26 June 2014, volume 32 of JMLR Workshop       doi/abs/10.5555/3398761.3398905.
and Conference Proceedings, 262–270. JMLR.org. URL
http://proceedings.mlr.press/v32/mohri14.html.                 Shen, W.; Tang, P.; and Zuo, S. 2019. Automated Mecha-
                                                               nism Design via Neural Networks. In Elkind, E.; Veloso,
Moraga-González, J. L.; and Petrakis, E. 1999. Coupon ad-     M.; Agmon, N.; and Taylor, M. E., eds., Proceedings of the
vertising under imperfect price information. Journal of Eco-   18th International Conference on Autonomous Agents and
nomics & Management Strategy 8(4): 523–544.                    MultiAgent Systems, AAMAS ’19, Montreal, QC, Canada,
Morgenstern, J.; and Roughgarden, T. 2015. On the Pseudo-      May 13-17, 2019, 215–223. International Foundation for
Dimension of Nearly Optimal Auctions. In Cortes, C.;           Autonomous Agents and Multiagent Systems. URL http:
Lawrence, N. D.; Lee, D. D.; Sugiyama, M.; and Garnett, R.,    //dl.acm.org/citation.cfm?id=3331696.
eds., Advances in Neural Information Processing Systems        Tang, P. 2017. Reinforcement mechanism design. In
28: Annual Conference on Neural Information Processing         Sierra, C., ed., Proceedings of the Twenty-Sixth Interna-
Systems 2015, December 7-12, 2015, Montreal, Quebec,           tional Joint Conference on Artificial Intelligence, IJCAI
Canada, 136–144. URL http://papers.nips.cc/paper/5766-         2017, Melbourne, Australia, August 19-25, 2017, 5146–
on-the-pseudo-dimension-of-nearly-optimal-auctions.            5150. ijcai.org. doi:10.24963/ijcai.2017/739. URL https:
Morgenstern, J.; and Roughgarden, T. 2016. Learning Sim-       //doi.org/10.24963/ijcai.2017/739.
ple Auctions. In Feldman, V.; Rakhlin, A.; and Shamir, O.,     Tang, P.; and Sandholm, T. 2011. Approximating Optimal
eds., Proceedings of the 29th Conference on Learning The-      Combinatorial Auctions for Complements Using Restricted
ory, COLT 2016, New York, USA, June 23-26, 2016, vol-          Welfare Maximization. In Walsh, T., ed., IJCAI 2011, Pro-
ume 49 of JMLR Workshop and Conference Proceedings,            ceedings of the 22nd International Joint Conference on Arti-
1298–1318. JMLR.org. URL http://proceedings.mlr.press/         ficial Intelligence, Barcelona, Catalonia, Spain, July 16-22,
v49/morgenstern16.html.                                        2011, 379–385. IJCAI/AAAI. doi:10.5591/978-1-57735-
Myerson, R. B. 1981. Optimal Auction Design. Math. Oper.       516-8/IJCAI11-072. URL https://doi.org/10.5591/978-1-
Res. 6(1): 58–73. doi:10.1287/moor.6.1.58. URL https://        57735-516-8/IJCAI11-072.
doi.org/10.1287/moor.6.1.58.                                   Tang, P.; and Wang, Z. 2016. Optimal Auctions for Neg-
                                                               atively Correlated Items. In Conitzer, V.; Bergemann, D.;
Ostrovsky, M.; and Schwarz, M. 2011. Reserve prices in in-
                                                               and Chen, Y., eds., Proceedings of the 2016 ACM Confer-
ternet advertising auctions: a field experiment. In Shoham,
                                                               ence on Economics and Computation, EC ’16, Maastricht,
Y.; Chen, Y.; and Roughgarden, T., eds., Proceedings 12th
                                                               The Netherlands, July 24-28, 2016, 103–120. ACM. doi:
ACM Conference on Electronic Commerce (EC-2011), San
                                                               10.1145/2940716.2940766. URL https://doi.org/10.1145/
Jose, CA, USA, June 5-9, 2011, 59–60. ACM. doi:10.1145/
                                                               2940716.2940766.
1993574.1993585. URL https://doi.org/10.1145/1993574.
1993585.                                                       Vickrey, W. 1961. Counterspeculation, auctions, and com-
                                                               petitive sealed tenders. The Journal of finance 16(1): 8–37.
Roughgarden, T.; and Schrijvers, O. 2016. Ironing in the
Dark. In Conitzer, V.; Bergemann, D.; and Chen, Y.,            Zeithammer, R. 2019. Soft Floors in Auctions. Manag.
eds., Proceedings of the 2016 ACM Conference on Eco-           Sci. 65(9): 4204–4221. doi:10.1287/mnsc.2018.3164. URL
nomics and Computation, EC ’16, Maastricht, The Nether-        https://doi.org/10.1287/mnsc.2018.3164.
lands, July 24-28, 2016, 1–18. ACM. doi:10.1145/2940716.
2940723. URL https://doi.org/10.1145/2940716.2940723.
Shen, W.; Peng, B.; Liu, H.; Zhang, M.; Qian, R.; Hong,
Y.; Guo, Z.; Ding, Z.; Lu, P.; and Tang, P. 2020a. Re-
inforcement Mechanism Design: With Applications to Dy-
namic Pricing in Sponsored Search Auctions. In The Thirty-
Fourth AAAI Conference on Artificial Intelligence, AAAI
You can also read