contextual 0.9.8.4

Articles

All vignettes

Demo: Basic Synthetic cMAB Policies
Demo: Offline cMAB LinUCB evaluation
Demo: MAB Replication Eckles & Kaptein (Bootstrap Thompson Sampling)
Demo: Basic Epsilon Greedy
Getting started: running simulations
Demo: MAB Policies Comparison
Demo: MovieLens 10M Dataset
Demo: Offline cMAB: CarsKit DePaul Movie Dataset
Development FAQ
Offline evaluation: Replication of Li et al 2010
Demo: Bandits, Propensity Weighting & Simpson's Paradox in R
Demo: Replication Sutton & Barto, Reinforcement Learning: An Introduction, Chapter 2
Demo: Replication of John Myles White, Bandit Algorithms for Website Optimization

Developed by Robin van Emden.

Site built with pkgdown 1.5.1.