Knowledge Tree

thesis

Random Games

Defense date:

Jan. 1, 2008

Edit

Institution:

Paris 7

Disciplines:

Computer sciences

Authors:

Florian Horn

Directors:

Anca Muscholl

Wolfgang Thomas

Abstract EN:

In this thesis, we study games as a tool for the synthesis of controllers for reactive Systems. In this setting, a game is defined by an arena, representing the System and its evolution as a graph; and a winning condition, which models the specification that the controller must ensure In each state, the choice of the outgoing transition is done by either Eve (the controller), Adam (an hostile environment), or Random (uncontrollable evolution obeying a stochastic law). This process is repeated for an infinite number of times, leading to an infinite play whose winner depends on the winning condition. Our first object of study is the simplest case of Computing the optimal values of reachability games. We present a new effective approach, based on permutations of random states, to solve this classical problem. In terms of complexity, the resulting "permutation algorithm" is orthogonal to the classical, strategy-based algorithms: it is exponential in the number of random states, rather than in the number of controlled states. We also present an improvement heuristic for this algorithm, inspired by the "strategy improvement" algorithm. We turn next to a much more general class of problems, covering ail fie cases where the winning condition is prefix-independent. We prove the existence of optimal strategies for these games, and show that our permutation algorithm can be extended into a "meta-algorithm", changing any qualitative algorithm into a quantitative algorithm. Lastly, we study the complexity of optimal strategies for regular winning conditions represented as Muller games, and especially the amount of memory that can be saved by using randomised strategies. Using the Zielonka tree, we show tight bounds on the necessary and sufficient memory needed to define randomised optimal strategies for any single Muller condition.

Abstract FR:

Dans cette thèse, nous considérons les jeux comme un outil pour la synthèse de contrôleurs dans un système ouvert. Dans ce cadre, un jeu est défini par : une arène, sous la forme d'un graphe représentant les états et l'évolution du système; et une condition de victoire, représentant la spécification que le contrôleur doit garantir. Dans chaque état, la transition sortante est choisie soit par le contrôleur, soit par un environnement hostile, soit selon une loi stochastique. Ce processus est répété un nombre infini de fois, générant une partie infinie dont le vainqueur dépend de la condition de victoire. Nous étudions tout d'abord le cas fondamental des jeux d'accessibilité. Nous présentons une nouvelle approche pour en calculer les valeurs basée sur les permutations d'états aléatoires. En termes de complexité, cet "algorithme par permutations" est orthogonal aux algorithmes basés sur les stratégies : il est exponentiel dans le nombre d'états aléatoires, et non dans le nombre d'états contrôlés. Nous présentons également une heuristique d'amélioration pour cet algorithme, inspirée par l'algorithme d'amélioration de stratégies. Nous considérons ensuite la classe de problème très générale des jeux préfixe-indépendants. Nous prouvons l'existence de stratégie optimales pour ces jeux, et nous montrons que notre algorithme par permutations peut être étendu en un "méta-algorithme", transformant n'importe quel algorithme qualitatif en un algorithme quantitatif. Enfin, nous étudions les jeux de Muller, utilisant les "arbres de Zielonka" pour calculer la quantité de mémoire nécessaire et suffisante pour synthétiser une stratégie aléatoire "presque-sûrement" gagnante.