Sequential interdiction with incomplete information and learning
Artículo
![Thumbnail](/themes/Mirage2/images/cubierta.jpg)
Open/ Download
Publication date
2019Metadata
Show full item record
Cómo citar
Borrero, Juan S.
Cómo citar
Sequential interdiction with incomplete information and learning
Author
Abstract
© 2019 INFORM. We present a framework for a class of sequential decision-making problems in the context of general interdiction problems, in which a leader and a follower repeatedly interact. At each period, the leader allocates resources to disrupt the performance of the follower (e.g., as in defender–attacker or network interdiction problems), who, in turn, minimizes some cost function over a set of activities that depends on the leader’s decision. Although the follower has complete knowledge of the follower’s problem, the leader has only partial information and needs to learn about the cost parameters, available resources, and the follower’s activities from the feedback generated by the follower’s actions. We measure policy performance in terms of its time-stability, defined as the number of periods it takes for the leader to match the actions of an oracle with complete information. In particular, we propose a class of greedy and robust policies and show that these policies are weak
Indexation
Artículo de publicación SCOPUS
Identifier
URI: https://repositorio.uchile.cl/handle/2250/171228
DOI: 10.1287/opre.2018.1773
ISSN: 15265463
0030364X
Quote Item
Operations Research, Volumen 67, Issue 1, 2019, Pages 72-89
Collections