Work‎ > ‎Research‎ > ‎

Learning Click Model via Probit Bayesian Inference

posted Oct 23, 2011, 4:24 PM by Botao Hu   [ updated Dec 4, 2011, 9:46 PM ]
Experiment Conductor
Mar 2010 - May 2010
Coauthored with Yuchen Zhang, Dong Wang.
Supervised by Gang WangWeizhu Chen.
Microsoft Research Asia, Beijing
In Proceedings of CIKM 2010

Recent advances in click models have positioned them as an effective approach to the improvement of interpreting click data, and some typical works include UBM, DBN, CCM, etc. After formulating the knowledge of user search behavior into a set of model assumptions, each click model developed an inference method to estimate its parameters. The inference method plays a critical role in terms of accuracy in interpreting clicks, and we observe that different inference methods for a click model can lead to significant accuracy differences. In this paper, we propose a novel Bayesian inference approach for click models. This approach regards click model under a unified framework.

1. This approach can be widely applied to existing click models, and we demonstrate how to infer DBN, CCM and UBM through it. This novel inference method is based on the Bayesian framework which is more exible in characterizing the uncertainty in clicks and brings higher generalization abilities. As a result, it not only excels in the inference methods originally developed in click models, but also provides a valid comparison among different models;
2. In contrast to the previous click models, which are exclusively designed for the position-bias, this approach is capable of capturing more sophisticated information such as BM25 and PageRank score into click models. This makes these models interpret click-through data more accurately. Experimental results illustrate that the click models integrated with more information can achieve significantly better performance on click perplexity and search ranking;
3. Because of the incremental nature of the Bayesian learning, this approach is scalable to process large scale and constantly growing log data.

Botao Hu,
Oct 24, 2011, 8:09 PM