Publications and Preprints

  • Bicausal Optimal Transport for Markov Chains via Dynamic Programming.
    Vrettos Moulos,
    to appear in the IEEE International Symposium on Information Theory (ISIT), 2021
    [arXiv]

  • Finite-time Analysis of Round-Robin Kullback-Leibler Upper Confidence Bounds for Optimal Adaptive Allocation with Multiple Plays and Markovian Rewards.
    Vrettos Moulos,
    in proceedings of the 34th Conference on Neural Information Processing Systems (NeurIPS), 2020
    [arXiv]

  • A Hoeffding Inequality For Finite State Markov Chains and its Applications to Markovian Bandits.
    Vrettos Moulos,
    in proceedings of the IEEE International Symposium on Information Theory (ISIT), 2020
    [arXiv]

  • Optimal Best Markovian Arm Identification with Fixed Confidence.
    Vrettos Moulos,
    in proceedings of the 33rd Conference on Neural Information Processing Systems (NeurIPS), 2019
    [arXiv]

  • Optimal Chernoff and Hoeffding Bounds for Finite State Markov Chains.
    Vrettos Moulos and Venkat Anantharam,
    [arXiv]

  • Concentration and Sequential Decision Making in Markovian Environments.
    Vrettos Moulos,
    PhD dissertation, UC Berkeley, 2020
    [disseration]