Show simple item record
Author | dc.contributor.author | Borrero, Juan S. | |
Author | dc.contributor.author | Prokopyev, Oleg A. | |
Author | dc.contributor.author | Sauré, Denis | |
Admission date | dc.date.accessioned | 2019-10-11T17:30:01Z | |
Available date | dc.date.available | 2019-10-11T17:30:01Z | |
Publication date | dc.date.issued | 2019 | |
Cita de ítem | dc.identifier.citation | Operations Research, Volumen 67, Issue 1, 2019, Pages 72-89 | |
Identifier | dc.identifier.issn | 15265463 | |
Identifier | dc.identifier.issn | 0030364X | |
Identifier | dc.identifier.other | 10.1287/opre.2018.1773 | |
Identifier | dc.identifier.uri | https://repositorio.uchile.cl/handle/2250/171228 | |
Abstract | dc.description.abstract | © 2019 INFORM. We present a framework for a class of sequential decision-making problems in the context of general interdiction problems, in which a leader and a follower repeatedly interact. At each period, the leader allocates resources to disrupt the performance of the follower (e.g., as in defender–attacker or network interdiction problems), who, in turn, minimizes some cost function over a set of activities that depends on the leader’s decision. Although the follower has complete knowledge of the follower’s problem, the leader has only partial information and needs to learn about the cost parameters, available resources, and the follower’s activities from the feedback generated by the follower’s actions. We measure policy performance in terms of its time-stability, defined as the number of periods it takes for the leader to match the actions of an oracle with complete information. In particular, we propose a class of greedy and robust policies and show that these policies are weak | |
Lenguage | dc.language.iso | en | |
Publisher | dc.publisher | INFORMS Inst.for Operations Res.and the Management Sciences | |
Type of license | dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Chile | |
Link to License | dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/cl/ | |
Source | dc.source | Operations Research | |
Keywords | dc.subject | Attacker-defender | |
Keywords | dc.subject | Bilevel programming | |
Keywords | dc.subject | Incomplete information | |
Keywords | dc.subject | Interdiction | |
Keywords | dc.subject | Learning | |
Keywords | dc.subject | Online optimization | |
Keywords | dc.subject | Robust optimization | |
Título | dc.title | Sequential interdiction with incomplete information and learning | |
Document type | dc.type | Artículo de revista | |
Cataloguer | uchile.catalogador | SCOPUS | |
Indexation | uchile.index | Artículo de publicación SCOPUS | |
uchile.cosecha | uchile.cosecha | SI | |
Files in this item
- Name:
- item_85062060013.pdf
- Size:
- 1.972Kb
- Format:
- PDF
This item appears in the following Collection(s)
Show simple item record
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile