INGI seminar - Nicolas Cardozo - Generating adaptations from system executions using reinforcement learning options

June 14, 2023

12:50-13:50

Free

Louvain La Neuve

Shannon Room Maxwell Building a.105

During this seminar, Nicolas Cardozo (Uniandes) will talk about Generating adaptations from system executions using reinforcement learning options.
 
Sandwiches will be provided. Please fill in the form below before Wednesday 14th at 09:00 to reserve a sandwich:
 
 
The seminar can also be followed on Teams.

Abstract

Self-adaptive software systems continuously adapt in response to internal and external changes in their execution environment, captured as contexts. The COP paradigm posits a technique for the development of self-adaptive systems, capturing their main characteristics with specialized programming language constructs. COP adaptations are specified as independent modules composed in and out of the base system as contexts are activated and deactivated in response to sensed circumstances from the surrounding environment. However, the definition of adaptations, their contexts and associated specialized behavior, need to be specified at design time. In complex CPS this is intractable due to new unpredicted operating conditions. We propose Auto-COP, a new technique to enable generation of adaptations at run time. Auto-COP uses RL options to build action sequences, based on the previous instances of the system execution. Options are explored in interaction with the environment, and the most suitable options for each context are used to generate adaptations exploiting COP. To validate Auto-COP, we present two case studies exhibiting different system characteristics and application domains: a driving assistant and a robot delivery system. We present examples of Auto-COP code generated at run time, to illustrate the types of circumstances (contexts) requiring adaptation, and the corresponding generated adaptations for each context. We confirm that the generated adaptations exhibit correct system behavior measured by domain-specific performance metrics, while reducing the number of required execution/actuation steps by a factor of two showing that the adaptations are regularly selected by the running system as adaptive behavior is more appropriate than the execution of primitive actions.

Biography:

After having obtained his PhD at the UCLouvain in 2013, Nicolas Cardozo is now Associate Professor in Computer Science at the Universidad de los Andes (Colombia). He is working on adaptive systems from programming language perspective, working on development (programming language design), verification (partial, and incremental techniques), and application (smart environments, CPS, and IoT) of these systems.