Maximizing Clicks of Sponsored Search by Integer Programming
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
Maximizing Clicks of Sponsored Search by Integer Programming Ahmad Zainal-Abidin and Jun Wang Department of Computer Science University College London Gower St, London WC1E 6BT United Kingdom {a.zainalabidin, jun.wang}@cs.ucl.ac.uk ABSTRACT commonly used greedy methods. Sponsored search enables advertisers to promote their prod- ucts to targeted users based on the interests expressed in Categories and Subject Descriptors the users’ web search queries. A typical search-based ad- H.3.3 [Information Storage and Retrieval]: Information vertisement campaign requires an advertiser to select one or Search and Retrieval multiple phrases (query terms) for each ad (e.g. textual data with links to landing pages), and separately place bids for the selected phrases in an attempt to win display slots for General Terms the ads. A click on each ad costs the advertiser an amount Algorithms, Economics, Performance set by a search engine through an auction and rewards the search engine its advertising revenue. Although advertisers and web search providers have con- Keywords sidered the number of clicks as a practical measure of the Sponsored search, Online advertising, Optimization effectiveness, their optimization units are, however, not nec- essarily the same - the search providers generally focus on clicks per query, while the advertisers are interested in clicks 1. INTRODUCTION per ad. In this paper, we address the clicks maximization Online advertising has been growing rapidly. The large problem from the latter point of view. To the advertiser, all volume of ad data has brought challenging problems such ads in the campaign have potential future returns, e.g. clicks, as understanding query intent [7], personalized ad delivery conversions, revenues, or profits, and all phrases associated [4], ad ranking [8], and optimization [21, 9]. Search-based with the ads collectively contribute towards the overall re- advertising, display advertising, social network advertising, turns. Additionally, each phrase has uncertain impression and email marketing are under the umbrella of online ad- rate, click-through rate and cost per click. Due to these vertising. Search-based advertising itself, is a multi-billion uncertainties, using a budget effectively to generate max- dollar business. It was reported that search engines’ rev- imum possible clicks is a non-trivial task. In this paper, enues from search advertising in 2006 reached $9.4 billion, a to consider the overall effectiveness of the selected phrases, remarkable increase of approximately $3.65 billion from the clicks maximization is formulated as a constrained integer revenues earned in the previous year [20]. Typically, three programming problem. parties are involved in search-based advertising: search en- In the formulation, an overall budget and a maximum av- gines, advertisers, and users. erage cost per click the advertiser is willing to spend are Through their advertising services, the search engines such modeled as the constraints. Moreover, in certain practi- as Google and Yahoo! provide the mechanism to enable cal situations, it is more useful to formulate the problem the advertisers promote their products to targeted groups of by minimizing advertising cost given the number of clicks users. The search engines act as auctioneers selling keywords the advertiser intends to achieve. We show that the click to the advertisers. Examples of the advertising services in- maximization and the cost minimization formulations are clude Google AdWords1 and Yahoo! Sponsored Search2 . mathematically equivalent. In our simulation, the proposed The advertisers require slots to place their marketing mes- methods have yielded promising results when compared with sages (ads) on the Web. Joachims et al. [17] argue that for each ad, the position and the result page number of the ad have a significant influence on its click-through rate (CTR). In the sponsored search framework, bid prices and the rel- evancies between bid phrases and user queries influence the Permission to make digital or hard copies of all or part of this work for slot position to be awarded. However, the bids placed by personal or classroom use is granted without fee provided that copies are other advertisers on similar phrases are unknown. Either not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to each bid will end up winning a slot is uncertain. Motivated republish, to post on servers or to redistribute to lists, requires prior specific by the notion that the ads displayed at higher positions are permission and/or a fee. 1 ADKDD’10, July 25, 2010, Washington D.C., USA. http://www.google.com/adwords 2 Copyright 2010 ACM 978-1-4503-0221-0 ...$10.00. http://searchmarketing.yahoo.com/srch/index.php
likely clicked on, the advertisers typically compete to bid for age CPC. Both the click maximization and cost minimiza- phrases that they believe to be relevant to user queries to tion problems are posed as integer programming problems. increase the chances that their ads will be placed at higher Mathematically, the formulations for the two problems are positions. The CTRs are arranged in a non-increasing or- equivalent and our simulation confirms this insight. der with ads placed at higher positions have higher CTRs. The paper is organized as follows. We discuss related work The minimum bid for an ad to stay in a certain position in Section 2, present our problem formulation and theoreti- is therefore influenced by its CTR. Ideally, the CTR is a cal framework in Section 3, provide performance evaluation random variable, resembling a practical scenario that its ac- in Section 4, and conclude our study in Section 5. For sim- tual value is uncertain. However, it can be estimated by plicity, keyword(s) and phrase(s) are interchangably used. investing a small amount of money on keywords to experi- ment the traffic that they possibly attract. Search engines 2. RELATED WORK companies, through their advertising services, provide infor- In the last few years, many researches from the perspective mation for keyword performance prediction. An ad cost per of auctioneers (search engines) [26, 14, 5, 10] and advertisers click (CPC) is determined through an auction [14]. Cur- [15, 27, 13, 16, 28, 6, 25] have been conducted on auction and rently, for each ad, each advertiser is required to choose a bidding optimization. The views of auctioneers and adver- set of keywords, determine the bid price for each of the cho- tisers on sponsored search are different. For example, while sen keywords, set a daily budget, and specify the duration the auctioneers aim to increase revenues generated from user of participation in the ad campaign before taking part in the clicks on ads after issuing query terms, the advertisers are auction. interested in the return on investment (ROI) on their ads. The users issue ad-hoc topics to express their information Papers on keyword modeling, ad ranking positions, and needs. In organic search, the relevance between a search clicks have been discussed in [12, 3, 2, 18, 10]. Aggarwal, topic (query) and documents on the Web is used to retrieve Goel and Motwani [5] design an auction called laddered auc- relevant documents. However, in search-based advertising, tion. Unlike the next-price auction, they argue that their ads are not retrieved purely based on relevance. The match auction to be a truthful auction. Their approach is that, between the search topics and the advertisers’ keywords, the given a weight and a bid, the laddered auction assigns a bid prices and CTRs for the keywords are among factors in score, a ranking position and a price for the slot on auction. deciding which ads are eventually given slots, although the The assigned price is lower than the bid but not similar exact method is unique from a search engine to another. to the price assigned by the next-price auction. Edelman, The selected ads will be displayed alongside organic search Ostrovsky and Schwarz [14] study Internet advertising and results. For a pay-per-click model, the advertisers will be the generalized second-price (GSP) auction that has become charged only if there are clicks on their displayed ads. In successful. They compare GSP with Vickrey-Clarke-Groves this paper, we aim at improving advertisers’ experience by (VCG) mechanism that is truth-telling and has an equilib- maximizing the overall impact of their ad campaigns. Some rium dominant strategies. common notions to measure the impact are profits, revenues, Borgs et al. [10] study multi-unit auctions with budget- conversions, and clicks. We use the number of clicks, a simi- constrained bidders. Each bidder has a budget and a private lar measure assumed in [16]. Very recently, researchers have valuation. Their method is designed so that the revenue started looking at the direction to improve the advertiser’s for the auctioneer be maximized. Our proposed study also experience in the search-based advertising ecosystem, mod- considers budget and private valuation of a bidder, but the eling the keyword selection as an optimization problem, con- aim is to optimize clicks. Our focus is also different as we strained by the budgets [23, 27, 25, 15, 16, 6]. consider the budget constrained problem from a portfolio of In a budget constrained bidding, the advertisers specify phrases point of view. their advertising budgets. It is important for them to de- In [23], Rusmevichientong and Williamson discussed CTRs liver their marketing messages in a cost effective manner. for various keywords over time and how to select the key- As ads are perishable, one of the criteria that the auctioneer words effectively. In their study, keywords with high profit- awards display slots is when the advertisers have positive to-cost ratio are given priorities in the selection of profitable budgets. In a typical ad campaign, advertisers have many keywords. Although our study involves keywords (phrases), ads with each has multiple associated keywords. With an we do not address the choice of keywords. We assume the assumption that all ads have future returns, all keywords keywords selected by advertisers are the ones they are most associated with the ads potentially contribute towards the interested in. Another study on CTR was done by Richard- overall clicks. With a lot of ads to manage, setting a bid son, Dominowska and Ragno [22] to predict CTR for new for each keyword can be a daunting task. This motivates ads in the advertising system. They argue that the explo- us to view keywords as a portfolio as there has been little ration and exploitation of new ads are crucial to enable new previous research on how advertisers manage their ads in a ads to have opportunities to be clicked on. Unlike ads that portfolio of phrases mode to maximize the overall impact of have established themselves in the advertising system, new their campaigns. To maximize clicks, we propose a method ads lack historical data. They propose that CTRs for new whereby an advertiser specifies (i) an overall budget and (ii) ads rely on the information about the ads itself. Our work a maximum average CPC for an ad the advertiser is willing uses CTR for clicks estimation. Ideally, CTRs are random to spend. We argue that specifying the maximum average variables that are obtained at the time the bidding takes CPC for each ad eliminates the need to specify a bid for place. Accurate CTRs provide more meaningful informa- each keyword associated with an ad without sacrificing the tion on how likely an ad will be clicked on. overall total clicks that the ad can generate. We also for- In online advertising framework, the work using linear pro- mulate cost minimization constrained by (i) minimum clicks gramming has been discussed in [19] by Nakamura and Abe. an advertiser wishes to generate and (ii) a maximum aver- However, their work focuses on the banner and pay-per-
impression. Specifically, their interest leans towards schedul- ing ad impressions. Our approach considers integer pro- gramming but focuses on a more recent model in online ad- vertising which is cost-per-click advertising model. The click maximization is clearly a variant of optimization problem. Some papers relevant to search-based advertising discussing bid optimization from the perspective of advertis- ers can be found in [15, 27, 16]. In their paper, Feldman et al. [16] propose bidding technique on keywords. They apply uniform bidding on all keywords. They claim the strategy is able to generate at least 1 - 1/e fraction of the maxi- mum clicks possible. Similar to their objective to maximize clicks for a given budget, our focus is different. Our algo- rithm does not bid uniformly for all keywords. Rather, the algorithm uses the specified average CPC which is within the advertiser’s control as the limit to bid on each phrase in an attempt to maximize overall clicks. In [27], Zhou, Figure 1: An example of combinations of cost-click Chakrabarty and Lukose model the budget-constrained op- data timization problem as an online multiple-choice knapsack problem. Their objective is to maximize revenues of an ad- vertiser without the knowledge of other bidders’ bid prices budget β of 4000.00, and her valuation of maximum average and CTRs. The problem that we address is similar in the CPC ϕ is 3.00. A greedy approach to solve the problem sense that for each ad, an advertiser is awarded at most one will bid in such a way to be able to win the 3rd, 5th, and slot per ad per user query. Eyal et al. [15] address a prob- 4th positions3 from Table 1(a), Table 1(b) and and Table lem in bid optimization. Although our study is a variant of 1(c), respectively. The total number of clicks is 945; 500 for bid optimization, their focus is more on bid optimization for phrase X, 200 for phrase Y , and 245 for phrase Z. In this broad match auctions. example, the average CPC using greedy method is 2.68. However, the above solution is not optimal. If the adver- tiser A is willing to take more risks by spending more on 3. PROPOSED METHOD bids, the overall number of clicks will likely to increase. The We describe a phrase as a single keyword (e.g. under- optimal number of clicks, given the two constraints β and ϕ, ground ) or multiple keywords (e.g. London underground ). will be at positions 2nd, 4th, and 6th (700 + 250 + 210 = While it is not compulsory to associate an ad with multi- 1160 clicks) for phrases X, Y , and Z respectively. The total ple phrases, the additional phrases increase the likelihood advertising cost is 3452 which is less than β. The average of the ad to be clicked on. In this paper, the overall gen- CPC is 2.98 which is less than ϕ. Given the two constraints erated clicks will be the primary objective. As an illustra- and the data in Table 1, 2.98 is the closest to 3.00 it can get. tion, consider a static model in which an advertiser A has For a small number of ad such as in the example, it is easy an ad represented by phrases X, Y , and Z (Table 1). As- to find the optimal solution, given the advertiser’s budget sume further that there are seven display slots available. In β and the advertiser’s valuation of average CPC ϕ. With- practice, the total number of slots awarded per user query out ϕ the advertiser may be assigned to a more expensive varies with different search engines implement their algo- CPC, hence the budget β becomes exhausted sooner. For rithms differently. Table 1 provides some statistics on X, Y a large number of phrases, manually determining optimized and Z. In this example, an ad placed lower than 7th posi- combination can be a difficult task. Some might argue why tions on X, Y , and Z will not get displayed. In practice, would the algorithm choose to bid closer to the advertiser’s search engines also will not display ads beyond kth posi- maximum valuation of CPC when in fact the advertiser can tion. In the table, CP C denotes the cost per click, and save more if opting for lower bids. In practice, winning a CT R the clickthrough rate at the given position. Assume display slot is highly uncertain as other advertisers’ bids are that the total impressions I for phrases X, Y , and Z are unknown. Using our approach, all prices are considered as 10000, 5000 and 3500, respectively. The CTRs are in non- long as the maximum average CPC is not exceeded. We are increasing orders in which top positions have higher CTRs. now ready to formulate the problem. It has been a practice Generally, for n ranking positions their corresponding CTRs that search engine companies never award multiple positions are (CT R1 ≥ CT R2 ≥ ... ≥ CT Rn ). The impression I is to an ad per user query. So, we assume that an ad is given counted based on the number of displays associated with a at most one position at a time. Clicks and total costs are corresponding phrase. To illustrate, the possible click-cost defined as follows: combinations are shown in Figure 1. In the figure, given a budget as a constraint, our method will adaptively select λij = αij ∗ γi (1) points on the optimal curve. With the inclusion of the adver- where λij is the number of clicks at position j for phrase i, tiser’s valuation of maximum average CPC the advertiser is αij is the clickthrough rate at position j for phrase i, and willing to spend, the number of clicks generated for the same γi is the impression for phrase i; budget amount is expected to be no more than the number of clicks generated if budget is the only constraint. The τij = λij ∗ ρij (2) maximum average CPC constraint is used as a mechanism to control the budget exhaustion from happening sooner. 3 An advertiser bids for phrase(s). The display slot position is decided Consider an example. Suppose that the advertiser A has a normally based on the bid price and the quality (relevance) of the ad
Table 1: Examples on statistics for phrases X, Y and Z (a) Phrase X, I=10000 (b) Phrase Y, I=5000 (c) Phrase Z, I=3500 Position CPC CTR Position CPC CTR Position CPC CTR 1 3.75 0.10 1 5.15 0.09 1 4.55 0.12 2 3.15 0.07 2 4.75 0.07 2 3.65 0.10 3 2.61 0.05 3 3.95 0.06 3 3.33 0.09 4 1.59 0.03 4 3.35 0.05 4 2.75 0.07 5 1.21 0.02 5 2.75 0.04 5 2.41 0.07 6 0.98 0.02 6 2.35 0.03 6 1.95 0.06 7 0.59 0.02 7 1.65 0.03 7 1.55 0.05 where τij is the cost at position j for phrase i, and ρij is the ClickMax variant #1 is expected to perform better as it has CPC at position j for phrase i. no limit for average CPC. The second solution is relevant to an advertiser that has a budget but would like to have some 3.1 Click maximization control on the average CPC upper limit. We propose two variants of click maximization in which the first variant takes a budget β as a constraint while the 3.2 Cost minimization other takes β and a maximum average CPC ϕ that an ad- Minimizing advertising cost can be one of business objec- vertiser is willing to spend. tives. We propose two variants: the first variant takes mini- mum required clicks χ as a constraint, while the other takes 3.1.1 ClickMax variant #1 χ and ϕ. The goal of these solutions is to have an insight Given the advertiser’s budget β for the phrases in her whether the cost-click curves for ClickMax and CostMin are portfolio, the problem is formulated as follows: identical. Our evaluations on ClickMax and CostMin are maximize: provided in Section 4. Xm X n xij ∗ λij (3) 3.2.1 CostMin variant #1 i=1 j=1 Given the minimum clicks χ the advertiser expects from subject to: the phrases she will bid, the objective function is as ex- m X X n pressed in Eq. 8. The constraint reflecting the inclusion of xij ∗ τij ≤ β, (4) χ is expressed in Eq. 9. The constraints defined in Eq. 5 i=1 j=1 and Eq. 6 are also included but their definitions are not n repeated. minimize X xij ≤ 1, (5) m X n j=1 X xij ∗ τij (8) ∀i, i = 1, ..., m, i=1 j=1 xij ∈ {0, 1} (6) subject to ∀i, i = 1, ..., m and ∀j, j = 1, ..., n m X n In Eq. 3 and Eq. 4, xij is either 0 or 1, reflecting at most X 1 position per phrase will be awarded. The above solution xij ∗ λij ≥ χ (9) maximizes the number of clicks with the CPC selected varies i=1 j=1 from one phrase to another. As the only constraint is β, the average CPC may be more than what the advertiser is will- 3.2.2 CostMin variant #2 ing to spend as the algorithm adaptively suggests the bid Given χ and ϕ, the objective function is as defined in according to the available budget. Therefore, another con- Eq. 8. The constraints defined in Eq. 7 and Eq. 9 are straint that should be included is the advertiser’s valuation used to reflect the inclusion of ϕ and χ, respectively. Other of maximum average CPC ϕ in which the actual average constraints defined in Eq. 5 and Eq. 6 are also used. For CPC should not exceed ϕ. simplicity, the definitions of Eq. 5 through Eq. 9 are not repeated. 3.1.2 ClickMax variant #2 An alternate solution requires an advertiser to specify two 4. PERFORMANCE EVALUATION parameters: β, and ϕ. The objective function defined in Eq. 3 and the constraints in Eq. 4, Eq. 5, and Eq. 6 are also 4.1 Experimental set-up used. For simplicity, the definitions of those equations (Eq.3- 6) are not repeated. The additional constraint reflecting the The objective of the simulation is to measure the per- inclusion of ϕ is expressed as follows: formance effectiveness of the proposed methods. In the simulation, some Yahoo! Search Marketing Advertiser Bid- Pm Pn Impression-Click [1] data are used to construct clickthrough xij ∗ τij i=1 Pm Pnj=1 ≤ϕ (7) and impression rates. The data set schema is as follows: i=1 j=1 xij ∗ λij • day: the day in which an advertiser places a bid, ex- With the assumption that within the same phrase, there pressed as an integer (e.g. 10, indicating the bid hap- are different display slots with different CPCs, the alternate pened on the 10th day) solution is able to select the combination of phrases to op- • anonymized account id : the anonymized account of an timize the objective function. Comparing the two solutions, advertiser, expressed in alphanumerics.
• rank : the ranking position of the phrase(s) bid by the phrase must be ≤ ϕ. The method then attempts to maxi- advertiser mize the number of clicks for each of the phrases. In Section • anonymized keyphrase: the phrase(s) bid by the ad- 3.2.1 and 3.2.2, the objective is to minimize costs. We use vertiser, expressed in alphanumerics. greedy methods as our baseline to measure the performance of our methods. The greedy solution corresponding to Cost- • avg bid : the average bid for the bid phrase(s). Min variant #1 is defined as Greedy method variant #3 as • impressions: the number of times the phrase(s) appear follows: for p number of phrases, the greedy method vari- at the search engine (Yahoo!) result page ant #3 divides the minimum required clicks χ with p as the • clicks: the number of times the advertiser’s landing minimum number of clicks to be generated by each phrase. page associated with the bid phrase(s) have been clicked It then attempts to minimize the overall costs. In Section on 3.2.2 the advertiser’s valuation of maximum average CPC ϕ is required. The greedy solution corresponding to CostMin 4.2 Evaluation methodology variant #2 is defined as Greedy method variant #4 as fol- We selected one of the longest bidding entries associated lows: for p number of phrases, the minimum required clicks with an advertiser. Due to the anonymity of the phrases χ is divided with p as the minimum number of clicks for per entry, we assumed that a group of phrases with similar each phrase. The max CPC for each phrase must be ≤ ϕ. ids for all phrases within the group as a unique phrase. In The method then attempts to minimize the overall costs. total, there were six unique phrases but we considered only five because entries in one of the unique phrases were too 4.4 Results few. We analyzed two-week bidding data that the advertiser had participated in. These data were used to estimate the 4.4.1 Maximizing clicks CTRs for different positions of each unique phrase. In our Using our proposed methods, our results show increases observation, a phrase ranked beyond position 10th did not in the number of clicks as they are able to redistribute the generate clicks. The CPC was not included in the data set, overall budget β on different phrases based on the statis- so we estimated the price for each position. The price esti- tics of the different phrases. In practice, this is particularly mated was later used for this simulation only. We observed true because different phrases are more clickable among Web that the distribution of clicks was sparse. So, we used the users. maximum CTR that we calculated in each phrase as a refer- Figure 2(a) shows that ClickMax variant #1 outperforms ence to estimate the CTRs for the remaining positions. As Greedy method variant #1 in terms of the number of clicks a guideline, the frequencies selected for each position were when β is increased by 5000 on each run. In our observation, computed based on Yahoo!/Overture sponsored search ad- the clicks generated by ClickMax variant #1 for each β level vertisement results available in the Atlas report [11]. Since are consistently greater than the number of clicks generated the objective of the simulation was to maximize the overall by Greedy method variant #1. ClickMax variant #1 adap- clicks, we applied a multiplier to balance clicks distribution tively bids to maximize the objective function as long as the for each unique phrase. β constraint is met. There are cases whereby not all phrases The limitation of the study is that detailed data sets spec- have been selected to gain more clicks offered by different ifying real phrases, bid prices, clicks, revenues and profits phrases. are difficult to have access to. At this stage, optimizing In term of budget used, Figure 2(b), ClickMax variant ROI based on the data that we use in this sudy cannot be #1’s percentage of budget used is within the range of 80- done. In our evaluation, the CTRs and CPCs are not readily 100% on each run. The Greedy method variant #1, uses available. Nevertheless, the objective of the study is to eval- beween 40-65% of the budget on each run. uate our proposed methods. The data inaccuracies due to The increase in click percentage for ClickMax variant #1 the estimations have little impact on the performance of the over Greedy method variant #1 is shown in Figure 2(c). proposed methods. Currently, the study does not include The increase in click percentage is between 24 and 71% on keyword interaction as the assumption is that advertisers each run. With the inclusion of an additional parameter bid for specific phrases. Focused phrases target the ”long ϕ, the advertiser has the flexibility to set her valuation of tail” distribution and have less likelihood of overlapping be- maximum average CPC. tween keywords. Figure 3(a) shows the total clicks generated by ClickMax variant #2 and Greedy method variant #2. ClickMax vari- 4.3 Evaluation baseline ant #2 outperforms the Greedy method variant #2. Click- Recall in section 3.1.1 and section 3.1.2, the objective is to Max variant #2 is able to adaptively bid to earn itself higher maximize clicks. We use greedy methods as our baseline to display position. The only difference with the previous eval- measure the performance. The greedy solution correspond- uation is that with the addition of ϕ, the range of maximum ing to ClickMax variant #1 is defined as Greedy method average CPC is set by the advertiser. variant #1 as follows: for p number of phrases, the greedy Figure 3(b) shows the budget used by ClickMax variant method variant #1 divides the overall budget β with p as the #2 is between 90-100%. Figure 3(c) shows the increase budget for each phrase. It then attempts to maximize the in click percentage for ClickMax variant #2 over Greedy number of clicks for each of the phrases. In Section 3.1.2, method variant #2. The percentages in click increase for β the advertiser’s valuation of maximum average CPC ϕ is less than 4K are not shown as they are well beyond 300%. required. The greedy solution corresponding to ClickMax For β ≥ 4K, the click increase is between 30-85% on each variant #2 is defined as Greedy method #variant 2 as fol- run. lows: for p number of phrases, the overall budget β is divided with p as the budget for each phrase. The max CPC for each 4.4.2 Minimizing costs
(a) (b) (c) Figure 2: ClickMax variant #1 vs Greedy method variant #1 when β is the constraint. β is increased by 5000 on each run: (a) the effect on generated clicks, (b) the percentage of β used, and (c) the percentage increase in clicks of ClickMax variant #1 over Greedy method variant #1 (a) (b) (c) Figure 3: ClickMax variant #2 vs Greedy method variant #2 when β and ϕ are the constraints. β is increased by 1000 and ϕ is set to be 500 on each run: (a) the effect on generated clicks, (b) the percentage of β used, and (c) the percentage increase in clicks of ClickMax variant #2 over Greedy method variant #2 The evaluation on minimizing costs with required mini- lutions that meet the constraint (e.g. the minimum required mum clicks χ as a constraint shows that CostMin variant clicks), CostMin will always choose the solution whose cost #1 outperforms Greedy method variant #3 (Figure 4(a)). is the least. In fact, on each run, the click-cost point generated by our so- lution is optimal. Any point above the optimal curve meets 5. CONCLUSIONS the constraint criterion but is less cost effective. Points be- low the curve are ineffecient due to the required minimum From an advertiser point of view, a good estimation of the clicks are not met. total clicks that an ad can generate is essential. Although When repeated with additional constraint ϕ, similar pat- the ultimate goals is to optimize ROI, optimizing the total tern (Figure 4(b)) shown in the previous evaluation to mini- number of clicks is a step towards that direction. The ideal mize costs persists. CostMin variant #2 outperforms Greedy case is to generate as many conversions as possible out of the method variant #4. The only difference is that when χ is 60, generated clicks. Later, out of the conversions, some users no solution is found for CostMin variant #2 due to the the interact with the advertiser’s website to purchase items. average CPC becomes too high (> 500) so the evaluation In this study we presented a theoretical insight to opti- stops when χ is 55. mize clicks and to minimize advertising cost. We show that Another evaluation is to have an insight whether cost-click the formulations of these two problems are mathematically curves generated by ClickMax and CostMin are identical. equivalent. Our simulation shows that our approach (Click- The click-cost points in ClickMax variant #1 and CostMin Max variant #1) outperforms the greedy method in maxi- variant #1 are shown in Figure 5(a). mizing clicks. Our algorithm bids for a phrase to gain more The cost-click points in ClickMax variant #2 and Cost- overall clicks as long as the average CPC is within the adver- Min variant #2 are shown in Figure 5(b). Both curves are tiser’s budget. The greedy method, on the other hand, at- almost identical. The difference is that for the same num- tempts to maximize clicks for individual phrases rather than ber of clicks, the costs in CostMin (variant #1 and variant treating the phrases as a portfolio. Our solution resembles #2) are slightly lower compared to the costs in ClickMax the real problem situation in which different phrases have (variant #1 and variant #2). Since the objective function different statistics as some phrases are more popular and in CostMin is to minimize costs, when there are multiple so- clickable among Web users. When an advertiser’s valuation of maximum average CPC is included as a constraint, our
(a) (b) Figure 4: CostMin vs Greedy method: (a)the effects on costs on CostMin variant #1 and greedy method variant #3 when minimum χ is increased by 5 on each run, and (b)the effects on costs on CostMin variant #2 and greedy method variant #4 when minimum χ is increased by 5 and ϕ is set to 500 on each run (a) (b) Figure 5: ClickMax Vs CostMin: (a)the cost-click combinations using ClickMax variant #1 and CostMin variant #1 when β is the constraint. β is increased by 5000 on each run, and (b)the cost-click combinations using ClickMax variant #2 and CostMin variant #2 when β and ϕ are the constraints. β is increased by 1000 and ϕ is set to 500 on each run method (ClickMax variant #2) allows the advertiser to pro- http://research.yahoo.com/Academic Relations. vide maximum average CPC that the advertiser is willing to [2] Z. Abrams. Revenue maximization when bidders have spend per click. This flexibility offers the advertiser to set budgets. In ACM-SIAM Symposium on Discrete the risk level the advertiser is willing to take in anticipating Algorithms, pages 1074–1082, 2006. for gained clicks as a reward. [3] Z. Abrams, O. Mendelevitch, and J. Tomlin. Optimal In our current study, we compute and approximate CTRs delivery of sponsored search advertisements subject to and CPCs based on historical data. In the future, working budget constraints. In ACM Conference on Electronic with live data is desirable with CTRs and CPCs are regarded Commerce, pages 272–278, 2007. as random variables. Another direction is to study the risk [4] Z. Abrams and E. Vee. Personalized ad delivery when of each click. Schulley el al. in [24] describes bounce rate ads fatigue: An approximation algorithm. In WINE, as a fraction of clicks that do not end up with conversions pages 535–540, 2007. whereby users leave immediately after they have been redi- [5] G. Aggarwal, A. Goel, and R. Motwani. Truthful rected to the landing page. The combination of clickthrough auctions for pricing search keywords. In ACM and bounce rates can be used to measure which phrases are Conference on Electronic Commerce, pages 1–7, 2006. risky to bid. [6] K. Asdemir. Dynamics of bidding in search engine auctions: An analytical investigation. In 2nd 6. REFERENCES Workshop on Sponsored Search Auctions, in [1] Yahoo! search marketing advertiser conjunction with the ACM Conference on Electronic bid-impression-click data, version 1.0.
Commerce, 2006. Predicting bounce rates in sponsored search [7] A. Ashkan, C. L. A. Clarke, E. Agichtein, and Q. Guo. advertisements. In ACM Conference on Knowledge Classifying and characterizing query intent. In ECIR, Discovery and Data Mining, pages 1325–1333, 2009. pages 578–586, 2009. [25] B. K. Szymanski and J.-S. Lee. Impact of roi on [8] R. Balakrishnan and S. Kambhampati. Optimal ad bidding and revenue in sponsored search ranking for profit maximization. In 11th International advertisement auctions. In 2nd Workshop on Workshop on Web and Databases (WebDB), 2008. Sponsored Search Auctions, in conjunction with the [9] C. Borgs, J. T. Chayes, N. Immorlica, K. Jain, ACM Conference on Electronic Commerce, 2006. O. Etesami, and M. Mahdian. Dynamics of bid [26] H. R. Varian. Position auctions. International Journal optimization in online advertisement auctions. In of Industrial Organization, 25:1163–1178, 2007. WWW, pages 531–540, 2007. [27] Y. Zhou, D. Chakrabarty, and R. M. Lukose. Budget [10] C. Borgs, J. T. Chayes, N. Immorlica, M. Mahdian, constrained bidding in keyword auctions and online and A. Saberi. Multi-unit auctions with knapsack problems. In WINE, pages 566–576, 2008. budget-constrained bidders. In ACM Conference on [28] Y. Zhou and R. M. Lukose. Vindictive bidding in Electronic Commerce, pages 44–51, 2005. keyword auctions. In ICEC, pages 141–146, 2007. [11] N. Brooks. The atlas rank report: How advertisement engine rank impact traffics. Technical Report, Atlas Institute, July 2004. [12] N. Buchbinder, K. Jain, and J. Naor. Online primal-dual algorithms for maximizing ad-auctions revenue. In ESA, pages 253–264, 2007. [13] B. Edelman and M. Ostrovsky. Strategic bidder behavior in sponsored search auctions. Decision Support Systems, 43(1):192–198, 2007. [14] B. Edelman, M. Ostrovsky, and M. Schwarz. Internet advertising and the generalized second price auction: Selling billions of dollars worth of keywords. American Economic Review, 97(1):242–259, March 2007. [15] E. Even-Dar, V. S. Mirrokni, S. Muthukrishnan, Y. Mansour, and U. Nadav. Bid optimization for broad match ad auctions. In WWW, pages 231–240, 2009. [16] J. Feldman, S. Muthukrishnan, M. Pál, and C. Stein. Budget optimization in search-based advertising auctions. In ACM Conference on Electronic Commerce, pages 40–49, 2007. [17] T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. Accurately interpreting clickthrough data as implicit feedback. In ACM SIGIR, pages 154–161, 2005. [18] A. Mehta, A. Saberi, U. V. Vazirani, and V. V. Vazirani. Adwords and generalized on-line matching. In IEEE Symposium on Foundations of Computer Science, pages 264–273, 2005. [19] A. Nakamura and N. Abe. Improvements to the linear programming based scheduling of web advertisements. Electronic Commerce Research, 5(1):75–98, 2005. [20] K. Newcomb. Search marketing shows strength in 2006. http://www.searchenginewatch.com, 2007. [21] F. Radlinski, A. Z. Broder, P. Ciccolo, E. Gabrilovich, V. Josifovski, and L. Riedel. Optimizing relevance and revenue in ad search: a query substitution approach. In SIGIR, pages 403–410, 2008. [22] M. Richardson, E. Dominowska, and R. Ragno. Predicting clicks: estimating the click-through rate for new ads. In WWW, pages 521–530, 2007. [23] P. Rusmevichientong and D. P. Williamson. An adaptive algorithm for selecting profitable keywords for search-based advertising services. In ACM Conference on Electronic Commerce, pages 260–269, 2006. [24] D. Sculley, R. Malkin, S. Basu, and R. J. Bayardo.
You can also read