multi-arm bandit

ADAPTIVE CDN NODE SELECTION IN DYNAMIC ICT SYSTEMS USING ONLINE CONTROLLED EXPERIMENTS AND CHANGE DETECTION MULTI ARMED BANDIT ALGORITHMS

Modern info‑communication technology (ICT) infrastructures such as content‑delivery networks (CDNs) must continuously tune low‑level parameters to deliver high performance under variable and non‑stationary network conditions. This paper investigates how online controlled experiments—including classical A/B tests and adaptive multi‑armed bandit (MAB) algorithms—can be used to optimise CDN node selection. We formalise the optimisation problem as minimising a network‑performance objective of average latency, one of key metrics used to measure network performance.