jharwell / sierra Goto Github PK

View Code? Open in Web Editor NEW

18.0 18.0 1.0 3.21 MB

Automation framework for the scientific method in AI research

License: MIT License

Emacs Lisp 0.18% Python 94.18% Shell 5.64%

sierra's People

Contributors

Stargazers

Watchers

sierra's Issues

feature/17-add-constant-swarm-density-criteria

Basically, increasing arena size while keeping area occupied by robots the
same.

Would take in/have a robot area parameter

feature/27-add-block-priority-batch-criteria

feature/36-add-per-exp-duration

Will be VERY helpful when trying to extrapolate scripted experiment overall
duration for MSI.

enh/44-get-avg-vel-simulation

Right now I hardcode an estimate into the footbot model. Given that the estimate
will undoubtedly change between simulations/swarm sizes, I should probably get
it from the simulation results (or even from the max speed which is specified in
the simulation input file).

feature/14-add-csv-column-statistics

For each column in an averaged .csv, graphs should be generated showing the
distribution of the averaged data:

std deviation over time
variance over time
Mean over time

feature/30-add-histogram-plots-for-distributions

feature/38-add-depth1-inter-exp-graphs

feature/30-switch-to-dictionary-for-graph-generation

feature/19-add-block-distributions

enh/50-add-starting-swarm-size-to-ranged-graph

For some comparison graphs, I need the swarm size to start at 1 (scalability),
for some, it needs to start at 2 (blocks collected). Adding a switch/argument to
control this is necessary.

feature/49-add-controller-comp-blocks-collected-graph

feature/35-add-scalability-measure-and-graphs

refactor/28-switch-to-pandas-for-averaging

The homegrown csv_class class is pretty good, but pandas is more robust,
cleaner, etc. When switching, we need to maintain the checks for same number
rows/colums in .csv files before averaging that currently exist.

This has become a bottleneck when running on MSI, as average 50-100 simulations
can take upwards of an hour with the current method, which takes upwards of an
hour. It is only a few seconds with ~4 or so simulation runs, which is totally
fine.

feature/13-add-criteria-for-static-caches

Cache footprint, max size, usage penalty

feature/21-joint-batch-criteria

This would enable me to generate a much larger set of experiments in a batch
from TWO criteria (say swarm size and # of blocks). Would really just be
generating the 2 lists of sets of changes, and then combining them such that the
resulting list of sets contains M*N changesets.

2018-12-18: Some progress has been made on this with the command line string
parsing + dynamic class generation, BUT I still can't use a batch
criteria of swarm_size + temporal_variance (for example).

2019-09-23: For now, limit this to at most 2 criteria, specified on the cmdline
separated by commas.

feature/23-add-stateful-generators

research/45-fix-CA-modeling

The current collision avoidance entry/exit/duration models, even for the super
simple stateless foraging in a square arena with random block distribution are
way off (entry) or incomplete/not implementation (exit, duration)

More accurate models for each of the 3 phases of CA should be developed so that
transition probabilities for MC models can be analytically estimated.

feature/39-add-depth1-generator

feature/20-add-sierra-root-dir

So that instead of specifying the directories for EACH and EVERY stage of the
pipeline, you can just specify the root directory and it will create everything
in there. You should still be able to specify specific directories if you want
to though.

feature/1-add-xml-parsing-and-parameter-replacement

sierra should be able to:

Parse a .argos input file
Given a path like "params.occupancy_grid.pheromone.rho" (maybe
something a little different dependeng what xml python parsing
looks like), it should be able to replace any value in the .argos
file with any other value.

docs/34-add-tutorials-for-experiments

How to add a new controller to generators
How to add a new variable
How to add a new intra-exp graph
How to add a new inter-exp graph
How run on MSI (this is sort of already there, but needs a revamp).
Update the readme to explain what the purpose of each directory in sierra
is.
Update readme to say that sierra does not delete any directories for you,
which can cause problems if you shrink size of exp/# simulations when it comes
to averaging and displaying graphs.

feature/33-add-inter-exp-histograms

I have linegraphs, but having histograms for SOME kinds of experiments
might be helpful (not for ones where I vary the swarm size, for example, but for
temporal variance they might provide insight).

docs/16-add-versioning

bugfix/31-cannot-generate-rnd40x40-inputs

bugfix/29-errorbars-make-all-lines-one-color

No idea why--some setting in matplob lib. I need to be able to pick a color for
the error bars, as otherwise they appear as the same color as the line they
correspond to, which makes it hard to see/make sense of.

feature/26-add-heterogeneous-block-stats

feature/2-post-processing-statistics

Given a set of csv file names, sierra should be able to read all them
in, and:

Average all columns together (except those given in an exception
list), and write the result to a final csv.
Compute the std deviation for each row in each column, and output
that in a second .csv.

feature/10-add-criteria-for-pheromone-experiments

Tied to fordyca issue.

docs/9-create-basic-readme

feature/7-add-batched-experiment-option

In addition to being able to run N copies of a simulation and average the
results, I also need to be able to compare results of experiments with respect
to variation in some parameter in the input file (i.e. # of robots).

My thought is to create a batched mode/batched pipeline that will:

Generate different "template" files from the original template
Feed the generated template files to the existing experimental pipeline,
which will generate the necessary simulation input files, run experiments,
and average the results.

I haven't tackled graph generation yet, so that shouldn't be considered in
this feature.

bugfix/41-swarm-size-batch-criteria-sizing-wrong

For a log 256 criteria, an experiment is run with 512 robots, which is obviously
wrong.

feature/42-add-controller-comparison-graphs

feature/14-add-criteria-block-carry-throttling

feature/8-add-stateless-intra-exp-graph-generation

Simple things that you want to graph within a single experiment from the
averaged data:

block acquisition over time
block transport over time
distance traveled over time

feature/18-add-n-blocks-criteria

refactor/40-update-fsm-stat-processing-for-movement

feature/46-add-emergence-measure

Should use the scalability measure as a template, as the process for gathering
the information it needs should be very similar.

feature/11-add-criteria-for-task-partitioning-experiments

Methods, always/never partition, as well as method parameters if applicable.

feature/48-add-multithreading-to-graph-generation

Mostly because I'm impatient.

feature/43-add-collision-trendlines-option-to-graphs

This is for validating my analytical model against experimental data. Should
include:

Predicted # robots entering CA each timestep, given swarm size, arena
geometry, and robot kinematic parameters
Predicted # robots exiting CA each timestep, given swarm size, arena
geometry, and robot kinematic parameters
Predicted CA duration, given swarm size arena geometry, and robot kinematic
parameters

feature/5-add-plot-generation

Given a final set of .csv files, for a set of N simulation runs and a
std deviation file, sierra should be able to generate a set of plots (probably with
matplotlib or similar) representing the simulation results.

feature/4-add-parameter-space-exploration-methods

The easiest method is factorial, where given a set of parameters and
ranges/values (probably in a JSON file), exhaustively test all
combinations. This will realistically only work for small sets of
parameters due to combinatorial explosion.

There are other ways of traversing the parameter space, and those
should be incorporated as well (i.e. hook into PyPet).

feature/3-create-task-processing-framework

The framework would need to be able to run N copies of a particular
simulation scenario on a single computer (serially), as well as on a cluster in a
distributed fashion (simultaneously).

For distributed execution, definitely want to hook into something like
cosmos, luigi, sciluigi, etc. that already exists--no need to reinvent
the wheel.

feature/25-add-block-pickup-drop-rates

feature/15-add-inter-exp-graph-generation

feature/24-add-block-throttling-variable

task/6-breakup-main-class

Too large/does too many things at the moment. Need to break into its
constituent pipeline stages.

jharwell / sierra Goto Github PK

sierra's People

Contributors

Stargazers

Watchers

sierra's Issues

Recommend Projects

Recommend Topics

Recommend Org

Jobs