Counterfactual analysis for search results without personalization

An algorithm, for search without personalization, that estimates the reward of a new search algorithm without running A/B tests. The key idea is to use the randomness already present in past results for a query to make the estimate, and to limit the estimate to the top n slots.
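A minimal sketch of how such an estimate could be computed, assuming logs of (query, ranked results, reward) tuples in which the same query was sometimes served with different rankings. The names estimate_reward and new_ranker, the log layout, and the choice of exact matching on the top n results are all hypothetical and illustrate the idea only; this is not the linked method's exact estimator.

```python
from collections import defaultdict


def estimate_reward(logs, new_ranker, n=3):
    """Estimate the average reward of new_ranker from logged impressions.

    logs: iterable of (query, ranked_results, reward) tuples (assumed layout).
    new_ranker: hypothetical callable mapping a query to a ranked result list.
    Returns (estimate, coverage); estimate is None if no query can be matched.
    """
    # stats[query][top_n_tuple] = [impressions, summed reward]
    stats = defaultdict(lambda: defaultdict(lambda: [0, 0.0]))
    query_counts = defaultdict(int)
    for query, results, reward in logs:
        cell = stats[query][tuple(results[:n])]
        cell[0] += 1
        cell[1] += reward
        query_counts[query] += 1

    total, covered = 0.0, 0
    for query, count in query_counts.items():
        target = tuple(new_ranker(query)[:n])
        if target in stats[query]:
            # The logs happened to show the new ranker's top n for this query,
            # so the mean logged reward of those impressions is the estimate.
            impressions, reward_sum = stats[query][target]
            total += count * (reward_sum / impressions)
            covered += count

    all_impressions = sum(query_counts.values())
    if covered == 0:
        return None, 0.0
    # Average over covered traffic, plus the share of traffic that was covered.
    return total / covered, covered / all_impressions


logs = [
    ("shoes", ["a", "b", "c"], 1.0),
    ("shoes", ["a", "b", "d"], 0.0),
    ("shoes", ["b", "a", "c"], 0.0),
    ("hats", ["x", "y"], 1.0),
    ("hats", ["y", "x"], 0.0),
]


def new_ranker(query):
    return {"shoes": ["a", "b", "z"], "hats": ["y", "x"]}[query]


estimate, coverage = estimate_reward(logs, new_ranker, n=2)
print(estimate, coverage)
```

Matching only on the top n slots is what makes the approach workable: the smaller n is, the more often a logged ranking agrees with the new algorithm's ranking, at the cost of ignoring differences lower on the page. The coverage value reports how much of the logged traffic contributed to the estimate.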

link