Title: P2: P0: causal inference A/B learning [2024-10-29 Tue]
Popular multi-armed bandit algorithms:
- Upper Confidence Bound (UCB) - deterministic, optimal
- Thompson Sampling - stochastic, optimal
- Epsilon Greedy - stochastic, approximate
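
To make the three selection rules concrete, here is a minimal sketch on a simulated Bernoulli bandit. Everything named here is an assumption for illustration, not from the note: the functions `epsilon_greedy`, `ucb1`, `thompson`, and `run`, the choice of the UCB1 variant, `epsilon=0.1`, the Beta(1, 1) priors, and the example conversion rates.

```python
import math
import random


def epsilon_greedy(counts, successes, t, rng, epsilon=0.1):
    """Explore a uniformly random arm with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        return rng.randrange(len(counts))
    means = [s / c if c else 0.0 for s, c in zip(successes, counts)]
    return max(range(len(counts)), key=means.__getitem__)


def ucb1(counts, successes, t, rng):
    """UCB1: deterministic given the history; pick the highest upper bound."""
    for arm, c in enumerate(counts):
        if c == 0:  # play every arm once before using the bound
            return arm
    scores = [
        s / c + math.sqrt(2 * math.log(t) / c)
        for s, c in zip(successes, counts)
    ]
    return max(range(len(counts)), key=scores.__getitem__)


def thompson(counts, successes, t, rng):
    """Thompson Sampling: draw from each Beta posterior, pick the largest draw."""
    draws = [
        rng.betavariate(1 + s, 1 + c - s)  # Beta(1, 1) prior (assumed)
        for s, c in zip(successes, counts)
    ]
    return max(range(len(counts)), key=draws.__getitem__)


def run(policy, true_rates, horizon, seed=0):
    """Simulate one run; return (pulls per arm, cumulative expected regret)."""
    rng = random.Random(seed)
    counts = [0] * len(true_rates)
    successes = [0] * len(true_rates)
    best = max(true_rates)
    regret = 0.0
    for t in range(1, horizon + 1):
        arm = policy(counts, successes, t, rng)
        reward = 1 if rng.random() < true_rates[arm] else 0
        counts[arm] += 1
        successes[arm] += reward
        regret += best - true_rates[arm]  # expected, not realized, regret
    return counts, regret


if __name__ == "__main__":
    rates = [0.1, 0.9]  # hypothetical conversion rates for variants A and B
    for policy in (epsilon_greedy, ucb1, thompson):
        counts, regret = run(policy, rates, horizon=2000)
        print(policy.__name__, counts, round(regret, 1))
```

All three policies share one signature so `run` can swap them freely. `ucb1` ignores `rng`, which is what "deterministic" means in the list above. The other two draw random numbers on every round.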

Thompson Sampling and UCB have lower asymptotic regret than Epsilon Greedy with a fixed epsilon: logarithmic rather than linear in the number of rounds.

#dailyreport #abtest #multiarmedbandit #mab