Skip to content

Commit db559d9

Browse files
committed
update chair description
1 parent 8db9b05 commit db559d9

1 file changed

Lines changed: 4 additions & 4 deletions

File tree

reload.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
# Reinforcement Learning on a Diet (RELOAD) – ANITI research chair
1+
# Reinforcement Learning on a Diet (RELOAD) – [ANITI](https://aniti.univ-toulouse.fr) research chair
22

33
## Statement of purpose
44

@@ -8,9 +8,9 @@ The RELOAD research chair aims to develop algorithms for **frugal** **life-long*
88
Deep reinforcement learning (RL) – learning optimal behaviors from interaction data, using deep neural networks – is often seen as one of the next frontiers in artificial intelligence. While current RL algorithms do not escape the relentless pursuit of larger models, bigger data and more computation demands, we posit real-world impacts of RL will also stem from algorithms that are relevant in the small data regime, on reasonable computing architectures. RL is at a crossroads where one wishes to retain the versatility and representational abilities of deep neural networks, while coping with limited data and resources. Under such real world limitations, understanding how to preserve algorithmic convergence properties, robustness to uncertainties, worst case guarantees, transferable features, or behavior explanation elements is an open field. Hence, we endeavor to put RL on a diet, in order to reach a better understanding of frugal, life-long RL: its theoretical foundations, the many ways one can compensate for limited data, the sound algorithms one can design, and the practical impacts it can have on the many real world applications where, intrinsically, data is costly and resources are limited, ranging from autonomous robotics to personalized medicine.
99

1010
The chair investigates topics in reinforcement learning, including (but not limited to):
11-
- reusable skill learning, including skills with generalization abilities, robustness guarantees, skill diversity, invariance properties,
12-
- inductive biases and priors in skill learning,
13-
- life-long and frugal (both in data and computation) skill adaptation, transfer between tasks.
11+
- **reusable skill learning**, including skills with generalization abilities, robustness guarantees, skill diversity, invariance properties,
12+
- **inductive biases and priors** in skill learning,
13+
- **life-long and frugal** (both in data and computation) skill adaptation, transfer between tasks.
1414

1515
The endeavor to develop frugal (both in terms of computation and data) algorithms for life-long RL stems from a number of inspiration problems, such as:
1616
- onboard policy adjustment (in satellites, robots, and more generally devices with limited hardware),

0 commit comments

Comments
 (0)