Sequential interdiction with incomplete information and learning

Borrero, Juan S.; Prokopyev, Oleg A.; Sauré, Denis

Author	dc.contributor.author	Borrero, Juan S.
Author	dc.contributor.author	Prokopyev, Oleg A.
Author	dc.contributor.author	Sauré, Denis
Admission date	dc.date.accessioned	2019-10-11T17:30:01Z
Available date	dc.date.available	2019-10-11T17:30:01Z
Publication date	dc.date.issued	2019
Item citation	dc.identifier.citation	Operations Research, Volume 67, Issue 1, 2019, Pages 72-89
Identifier	dc.identifier.issn	15265463
Identifier	dc.identifier.issn	0030364X
Identifier	dc.identifier.other	10.1287/opre.2018.1773
Identifier	dc.identifier.uri	https://repositorio.uchile.cl/handle/2250/171228
Abstract	dc.description.abstract	© 2019 INFORMS. We present a framework for a class of sequential decision-making problems in the context of general interdiction problems, in which a leader and a follower repeatedly interact. At each period, the leader allocates resources to disrupt the performance of the follower (e.g., as in defender–attacker or network interdiction problems), who, in turn, minimizes some cost function over a set of activities that depends on the leader’s decision. Although the follower has complete knowledge of its own problem, the leader has only partial information and needs to learn about the cost parameters, available resources, and the follower’s activities from the feedback generated by the follower’s actions. We measure policy performance in terms of its time-stability, defined as the number of periods it takes for the leader to match the actions of an oracle with complete information. In particular, we propose a class of greedy and robust policies and show that these policies are weak
Language	dc.language.iso	en
Publisher	dc.publisher	INFORMS Inst. for Operations Res. and the Management Sciences
Type of license	dc.rights	Attribution-NonCommercial-NoDerivs 3.0 Chile
Link to License	dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/cl/
Source	dc.source	Operations Research
Keywords	dc.subject	Attacker-defender
Keywords	dc.subject	Bilevel programming
Keywords	dc.subject	Incomplete information
Keywords	dc.subject	Interdiction
Keywords	dc.subject	Learning
Keywords	dc.subject	Online optimization
Keywords	dc.subject	Robust optimization
Title	dc.title	Sequential interdiction with incomplete information and learning
Document type	dc.type	Journal article
Cataloguer	uchile.catalogador	SCOPUS
Indexation	uchile.index	SCOPUS publication article
uchile.cosecha	uchile.cosecha	SI

Files in this item

Name: item_85062060013.pdf
Size: 1.972Kb
Format: PDF

This item appears in the following Collection(s)

Artículos de revistas

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile