Objectives

Develop a novel planning algorithm that efficiently achieves a new, previously unknown objective given the current environment model of the system. The planning algorithm should account for the fact that the environment model may change over time.
Develop a novel exploration strategy for reinforcement learning (RL) that automatically and efficiently updates the environment model by selecting actions that explore parts of the environment with which the system is not yet familiar.
Develop a novel framework for task decomposition that automatically creates and evaluates tasks, discarding tasks that are not deemed useful. Each task has its own associated decision strategy.
Evaluate the novel planning and RL algorithms in two realistic scenarios: active network management for electrical distribution networks, and microgrid management. Beyond using these scenarios for evaluation, the project also aims to advance the state of the art in both applications.