Quality diversity, Reinforcement learning
(Mouret, Clune 2015; Cully et al. 2015)
MAP-Elites is an example of a quality diversity (QD) algorithm. The behavior space is discretized into cells, and during exploration only the best individual (the “elite”) found for each cell is kept.
Individuals are added to the grid if they:
- fill an empty cell
- have a higher fitness than the elite currently occupying their cell
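The insertion rule above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' reference implementation: the names `map_elites`, `to_cell`, `toy_fitness` and `toy_descriptor` are hypothetical, and the uniform parent selection, Gaussian mutation, unit-interval genomes and descriptors, and all default parameter values are assumptions made only for the example.

```python
import random


def to_cell(behavior, bins):
    """Map a behavior descriptor (values assumed in [0, 1]) to a grid cell index."""
    return tuple(min(int(b * bins), bins - 1) for b in behavior)


def map_elites(fitness, descriptor, bins=10, dim=2,
               iterations=5000, n_init=100, sigma=0.1, seed=0):
    """Toy MAP-Elites loop; returns a dict: cell -> (fitness, genome)."""
    rng = random.Random(seed)
    archive = {}

    for i in range(iterations):
        if i < n_init:
            # Bootstrap the archive with random genomes (assumes n_init >= 1).
            genome = [rng.random() for _ in range(dim)]
        else:
            # Pick a random elite and mutate it (assumed variation operator).
            _, parent = rng.choice(list(archive.values()))
            genome = [min(1.0, max(0.0, x + rng.gauss(0, sigma)))
                      for x in parent]

        cell = to_cell(descriptor(genome), bins)
        f = fitness(genome)
        # Keep the candidate if its cell is empty or it beats the current elite.
        if cell not in archive or f > archive[cell][0]:
            archive[cell] = (f, genome)

    return archive


def toy_fitness(genome):
    # Hypothetical objective: prefer genomes close to the center of the unit cube.
    return -sum((x - 0.5) ** 2 for x in genome)


def toy_descriptor(genome):
    # Hypothetical behavior descriptor: the first two genome coordinates.
    return genome[:2]


if __name__ == "__main__":
    archive = map_elites(toy_fitness, toy_descriptor)
    print(f"{len(archive)} cells filled out of {10 * 10}")
```

Storing the grid as a dictionary keyed by cell index keeps empty cells implicit, so the two insertion conditions collapse into a single `if` test. A real application would replace the toy fitness and descriptor with task-specific ones.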
- Jean-Baptiste Mouret, Jeff Clune. 2015. "Illuminating Search Spaces by Mapping Elites". arXiv. http://arxiv.org/abs/1504.04909.
- Antoine Cully, Jeff Clune, Danesh Tarapore, Jean-Baptiste Mouret. 2015. "Robots That Can Adapt Like Animals". Nature 521 (7553). Nature Publishing Group:503–7. DOI.