Skip to content

Commit db9f234

Browse files
committed
update chair description
1 parent db559d9 commit db9f234

1 file changed

Lines changed: 4 additions & 5 deletions

File tree

reload.md

Lines changed: 4 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,10 @@
11
# Reinforcement Learning on a Diet (RELOAD) – [ANITI](https://aniti.univ-toulouse.fr) research chair
22

3-
## Statement of purpose
4-
5-
TL;DR:
3+
## TL;DR
64
The RELOAD research chair aims to develop algorithms for **frugal** **life-long** reinforcement learning.
5+
RELOAD is a research chair of the [ANITI](https://aniti.univ-toulouse.fr) AI cluster.
6+
7+
## Statement of purpose
78

89
Deep reinforcement learning (RL) – learning optimal behaviors from interaction data, using deep neural networks – is often seen as one of the next frontiers in artificial intelligence. While current RL algorithms do not escape the relentless pursuit of larger models, bigger data and more computation demands, we posit real-world impacts of RL will also stem from algorithms that are relevant in the small data regime, on reasonable computing architectures. RL is at a crossroads where one wishes to retain the versatility and representational abilities of deep neural networks, while coping with limited data and resources. Under such real world limitations, understanding how to preserve algorithmic convergence properties, robustness to uncertainties, worst case guarantees, transferable features, or behavior explanation elements is an open field. Hence, we endeavor to put RL on a diet, in order to reach a better understanding of frugal, life-long RL: its theoretical foundations, the many ways one can compensate for limited data, the sound algorithms one can design, and the practical impacts it can have on the many real world applications where, intrinsically, data is costly and resources are limited, ranging from autonomous robotics to personalized medicine.
910

@@ -21,8 +22,6 @@ The endeavor to develop frugal (both in terms of computation and data) algorithm
2122
- language (and action) models fine-tuning.
2223
The chair does not aim to tackle all these problems, and the list itself is non-exhaustive. We are happy to expand this list and collaborate with partners depending on opportunities.
2324

24-
RELOAD is a research chair funded by the [ANITI](https://aniti.univ-toulouse.fr) AI cluster.
25-
2625
## Members
2726

2827
[Emmanuel Rachelson](https://erachelson.github.io/) (Chair holder, ISAE-SUPAERO professor)

0 commit comments

Comments
 (0)