This paper is concerned with the modeling of advertiser behaviors in sponsored search. Modeling advertiser behaviors can help search engines better serve advertisers, improve auction mechanism, and forecast future revenue. Previous works on this topic either unrealistically assume advertisers to be able to perceive the states of the sponsored search system and the private information of other advertisers or ignore the differences in advertisers' abilities to optimize their bid strategies. To tackle the problems, we propose viewing sponsored search auctions as partially observable multi-agent system with private information. Then, we employ a reinforcement learning behavior model to describe how each advertiser responds to this multi-agent system. The proposed model no longer assumes advertisers to have perfect information access, but instead assumes them to optimize their strategies only based on the partially observed states in the auctions. Furthermore, the model does not specify how the optimization is conducted, but instead uses parameters learned from data to describe different advertisers' abilities in obtaining the optimal strategies. Our experiments on real sponsored search data demonstrate that the proposed model outperforms previous models in predicting the bids and rank positions of the advertisers in the near future. In addition to the accurate prediction of these short-term behaviors, our study shows another nice property of the proposed model. That is, if all the advertisers behave according to the model, the multi-agent system of sponsored search will converge to a locally envy-free equilibrium, under certain conditions. This result establishes a connection between machine-learned behavior models and game-theoretic properties of the system. Copyright © 2016 John Wiley & Sons, Ltd.

References

1Edelman B, Ostrovsky M, Schwarz M. Internet Advertising and the Generalized Second-price Auction: Selling Billions of Dollars Worth of Keywords. American Economic Review. 2007; 97(1): 242–259.
10.1257/aer.97.1.242
Web of Science® Google Scholar
2Varian HR. Position auctions. International Journal of Industrial Organization. 2007; 25(6): 1163–1178.
10.1016/j.ijindorg.2006.10.002
Web of Science® Google Scholar
3Edelman B, Ostrovsky M. Strategic bidder behavior in sponsored search auctions. Decision Support Systems. 2007; 43(1): 192–198.
10.1016/j.dss.2006.08.008
Web of Science® Google Scholar
4Reddy SSS, Narahari Y. Bidding dynamics of rational advertisers in sponsored search auctions on the web. In ACODS'07: Proceedings of the International Conference on Advances in Control and Optimization of Dynamical Systems: Bangalore, India, 2007.
Google Scholar
5Cary M, Das A, Edelman B, Giotis I, Heimerl K, Karlin AR, Mathieu C, Schwarz M. Greedy bidding strategies for keyword auctions. In Proceedings of the 8th ACM Conference on Electronic Commerce, ACM: San Diego, CA, USA, 2007; 262–271.
Google Scholar
6Asdemir K. 2006. Bidding patterns in search engine auctions. In Second Workshop on Sponsored Search Auctions, ACM Electronic Commerce. ACM Press: Ann Arbor, MI.
Google Scholar
7Athey S, Nekipelovb D. A Structural Model of Sponsored Search Advertising Auctions. Working paper, Department of Economics, Harvard University, Boston, MA, 2009.
Google Scholar
8Pin F, Key P. Stochastic variability in sponsored search auctions: observations and models. In Proceedings of the 12th ACM Conference on Electronic Commerce: San Jose, CA, USA, 2011; 61–70.
Google Scholar
9Abramson B. Expected-outcome: a general model of static evaluation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1990; 12(2): 182–193.
10.1109/34.44404
Web of Science® Google Scholar
10Fudenberg D, Levine DK. The Theory of Learning in Games, vol. 2. The MIT press: Cambridge, MA, 1998.
Google Scholar
11Yao S, Mela CF. A dynamic model of sponsored search advertising. Marketing Science. 2011; 30(3): 447–468.
10.1287/mksc.1100.0626
CAS Web of Science® Google Scholar
12Lucier B, Paes Leme R. GSP auctions with correlated types. In Proceedings of the 12th ACM Conference on Electronic Commerce, ACM: San Jose CA, USA, 2011; 71–80.
Google Scholar
13Lagarias JC, Reeds JA, Wright MH, Wright PE. Convergence properties of the Nelder–Mead simplex method in low dimensions. SIAM Journal of Optimization. 1998; 9: 112–147.
10.1137/S1052623496303470
Web of Science® Google Scholar
14Varian HR. Online ad auctions. The American Economic Review. 2009; 99(2): 430–434.
10.1257/aer.99.2.430
Web of Science® Google Scholar
15Borkar VS. Stochastic Approximation: A Dynamical Systems Viewpoint. Cambridge University Press: Cambridge, UK, 2008.
10.1007/978-93-86279-38-5
Google Scholar

Citing Literature

Volume32, Issue3

Special Issue:Bayesian Statistics and Machine Learning in Business

May/June 2016

Pages 358-367

Reinforcement learning behaviors in sponsored search

Abstract

References

Citing Literature

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Reinforcement learning behaviors in sponsored search

Abstract

References

Citing Literature

References

Related

Information