https://github.com/tlemo/darwin

Evolutionary Algorithms Framework
https://github.com/tlemo/darwin
ai evolution evolutionary-algorithms genetic-algorithms genetic-programming ml neuroevolution
Last synced: about 2 months ago
JSON representation
Evolutionary Algorithms Framework
Host: GitHub
URL: https://github.com/tlemo/darwin
Owner: tlemo
License: apache-2.0
Created: 2018-11-11T19:46:24.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2022-02-20T01:38:36.000Z (over 3 years ago)
Last Synced: 2024-11-06T23:42:49.974Z (8 months ago)
Topics: ai, evolution, evolutionary-algorithms, genetic-algorithms, genetic-programming, ml, neuroevolution
Language: C++
Homepage:
Size: 15 MB
Stars: 108
Watchers: 11
Forks: 21
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project

README

        
![](docs/images/unicycle_demo.gif)

# Darwin Neuroevolution Framework

Darwin is a framework intended to make Neuroevolution experiments easy, quick and fun. It

provides building blocks, samples and tooling to avoid the repetitive (and potentially

complex) scaffolding required to research new ideas.

The current implementation is a combination of portable C++ (running on Linux,

Windows and macOS), augmented by a collection of Python [scripts](scripts/docs/scripts.md)

for post-processing recorded evolution traces.

Experimental [Python bindings][4] are underway.

[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/tlemo/darwin/master?urlpath=lab/tree/notebooks)

## What's New

Please see the [changelog](CHANGELOG.md) for the latest updates.

## Getting Started

> Read the setup instructions before cloning the repository

- [Setup Instructions](docs/setup.md)

- [Documentation][1]

- [Scripts](scripts/docs/scripts.md)

- [Python Bindings][4]

- [Contributing](CONTRIBUTING.md)

## Evolutionary Algorithms and Neuroevolution

[Evolutionary Algorithms][3] are a class of nature-inspired algorithms based on the idea

that a few basic mechanisms, loosely inspired by biological evolution (selection,

reproduction and mutation) can be the building blocks for efficient searches in complex

problem spaces. In particular, we can use Evolutionary Algorithms to train artificial

neural networks: [Neuroevolution][2].

Starting with a random initial _population_, we seek to evolve towards better solutions in

a iterative fashion: each iteration (_generation_) attempts to combine (_crossover_) the

most promising traits (_selection_) from the previous one, occasionally making random

tweaks (_mutation_):

![Evolve!](docs/images/evolution.svg)

At a high level, the generic structure of an evolutionary algorithm can be as simple as:

```python

    initialize_population

    while(not satisfied):

        for_each individual:

            evaluate_fitness

        next_generation:

            select_parents

            use crossover & mutation to generate children

```

This conceptual simplicity makes Evolutionary Algorithms attractive and easy to implement,

although creating interesting domain-specific fitness functions and supporting a

structured experimentation approach requires a lot of scaffolding: persisting experiment

variations and results, visualizations, reports, profiling, etc. This is where the Darwin

Framework comes in.

## Darwin Neuroevolution Framework Overview

At the highest level, the core concepts are the [Domain] and the [Population]. The former

describes what we're trying to solve, while the latter encapsulates the solution model(s)

together with the specific evolutionary algorithm(s) used to search for better solutions.

The [Domain] and [Population] interfaces intentionally decouple the two concepts: domains

don't know anything about the details of a particular population implementation, and the 

only thing a population knows about a domain is the number of inputs and outputs.

#### Domains

A [Domain] implementation defines the problem space: the "shape" of a solution (the number

of inputs & outputs) and how to assign a fitness value to a particular solution.

In our case, a solution instance is encoded as a [Genotype] and it's evaluated indirectly

through its phenotypic expression (the corresponding [Brain]).

For example, [Pong] is a domain implementation which simulates the classic 2-player 

arcade game. It defines 6 inputs + 2 outputs, and it calculates the fitness of every

genotype in the population based solely on the results of a tournament between the

population individuals themselves (so the evolved solutions don't incorporate any a priori

knowledge of what a good game play looks like)

#### Populations

A [Population] is simply a set of [Genotypes][Genotype], together with the ability to 

generate new generations (after a [Domain] implementation evaluates and assigns fitness 

values to all the individual genotypes in the population)

The [Genotype] is an encoding for a particular solution and the "recipe" to construct the

corresponding [Brain] (the "phenotype") with the number of inputs and outputs specified by

the domain selected in the active experiment.

#### Summary

Here's how all these pieces fit together:

![Key Interfaces](docs/images/darwin_overview.svg)

Using these interfaces, the general structure of the evolution driver code is illustrated

below (this evolution top loop, with a few additions, is provided by the Darwin

Framework so we don't have to re-implement it for every new experiment)

```cpp

population->createPrimordialGeneration(population_size);

while (domain->evaluatePopulation(population)) {

    population->rankGenotypes();

    population->createNextGeneration();

}

```

This is everything required to know in order to experiment with new problems (domains) or

implement new evolutionary algorithms (the populations). Everything else in the Darwin 

Framework exists to provide support for these concepts: persistance for storing experiment

results, UI, tracking and visualizing experiments, common building blocks, tools to

analyze the results, etc.

For additional information see the [full documentation][1].

## Darwin Studio

Darwin Studio is a visual integrated environment used to create, run and visualize

experiments:

![Darwin Studio](docs/images/darwin_studio.png)

Currently it's the main user-facing tool included in the Darwin Framework, although there

are plans to add additional options (for example a command line driver and/or Python

bindings). For post-processing experiment results there are Python

[scripts](scripts/docs/scripts.md) which can parse Darwin universe databases.

## Running Experiments & The Universe Database

Every instance of an experiment is persisted in a [Universe] database, which is

implemented as a single Sqlite file. The key data model concepts are:

- [Universe]: the persistent storage for a set of experiments.

- [Experiment]: loosely speaking, a [Domain] / [Population] pair.

- [Variation]: a specific set of configuration values for an experiment.

- [Trace]: the recording of a particular experiment variation run.

Normally, each [Domain] and [Population] implementation comes with a set of configuration

properties which can be edited before starting an experiment. For each set of values

there's a [Variation] associated with the [Experiment]. Every time an experiment variation

is started, a new [Trace] is created to record the history/results of the experiment.

The database schema models the structural relationships, while the actual configuration

values and results are stored as JSON strings (the fields highlighted in green):

![Darwin Data Model](docs/images/darwin_data_model.svg)

## Related projects

It's worth mentioning a few similar projects. Many of them are focused on RL rather than

EA (while some cover both and more) but they overlap in interesting ways with the Darwin 

Framework:

- [SharpNEAT](http://sharpneat.sourceforge.net)

- [Open AI Gym](https://gym.openai.com)

- [Pyevolve](http://pyevolve.sourceforge.net)

- DeepMind's [TRFL](https://deepmind.com/blog/trfl)

- [Reinforcement Learning Coach](https://nervanasystems.github.io/coach/index.html)

- [Dopamine](https://github.com/google/dopamine)

- [OpenNERO](https://github.com/nnrg/opennero/wiki)

- [ESTool](https://github.com/hardmaru/estool)

- [Neataptic](https://wagenaartje.github.io/neataptic)

- [DEAP](https://deap.readthedocs.io/en/master)

- [PyBrain](http://www.pybrain.org/pages/home)

- [Maja Machine Learning Framework](http://mmlf.sourceforge.net)

Darwin Framework was created with the following goals and design principles in mind:

- **First class support for Evolutionary Algorithms concepts** (population, generation,

    genotype/phenotype), without specializing on a particular EA flavor. This allows

    simple interfaces for implementing new algorithms (and domains) while accommodating

    a wide variety of algorithms (Neuroevolution, GP, GEP, ...)

- **Capable of running interesting experiments on easily available hardware**

    (no expensive GPU or data center required). There are plans to take advantage of both

    GPUs and distributed platforms in the future.

- **Complete package**: creating & running experiments, visualizing the progress and the final

     results, interacting with the solutions, and more. Currently Darwin Studio is the

     central part of the framework and it aims to offer a familiar integrated environment

     UI (_it may be worth noting that the Darwin Framework would score well against John

     R. Koza's wish list mentioned in the seminal Genetic Programming book_)

- **Structured approach to experimentation**: all the experiment runs are automatically

    recorded, including the experiment variation "lineage"

- **Cross-platform with minimal external dependencies**

---

This is not an officially supported Google product.

[1]: https://tlemo.github.io/darwin

[2]: https://en.wikipedia.org/wiki/Neuroevolution

[3]: https://en.wikipedia.org/wiki/Evolutionary_algorithm

[4]: bindings/python/docs/tutorial.md

[Genotype]: https://tlemo.github.io/darwin/classdarwin_1_1_genotype.html

[Brain]: https://tlemo.github.io/darwin/classdarwin_1_1_brain.html

[Population]: https://tlemo.github.io/darwin/classdarwin_1_1_population.html

[Domain]: https://tlemo.github.io/darwin/classdarwin_1_1_domain.html

[Pong]: https://tlemo.github.io/darwin/classpong_1_1_pong.html

[Universe]: https://tlemo.github.io/darwin/classdarwin_1_1_universe.html

[Experiment]: https://tlemo.github.io/darwin/structdarwin_1_1_db_experiment.html

[Variation]: https://tlemo.github.io/darwin/structdarwin_1_1_db_experiment_variation.html

[Trace]: https://tlemo.github.io/darwin/structdarwin_1_1_db_evolution_trace.html
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/tlemo/darwin

Awesome Lists containing this project

README