vicariousinc / pgmax

Loopy belief propagation for factor graphs on discrete variables, in JAX!

Home Page: https://pgmax.readthedocs.io

License: MIT License

Languages: Python 100.00%
Topics: jax, python

pgmax's People

Contributors

antoine-dedieu, nishanthjkumar, pre-commit-ci[bot], shrinukushagra, stanniszhou


pgmax's Issues

Add RBM example

RBM is relevant as it is a standard model with two classes of variables (which are nicely supported here)

Maybe examples could be divided into two subfolders:

  • standard_models: Ising, RBM
  • complex_models: heretic, cut model, CMRF(?)

New factor graph interface that is aware of group structures in variables/factors

Currently the factor graph works with individual variables/factors, but in many cases factor graphs are constructed from variable groups/factor groups.

The factor graphs can exploit such structures to make certain operations more efficient/convenient. Some examples include:

  1. Each variable group can implement a customized flattening function for a given evidence array, so that we no longer need to go through the vars_to_evidence dictionary when setting the evidence (see the sketch below).
  2. We can name and organize factors according to factor groups, and set messages, potentially for an entire factor group.
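
As an illustration of point 1, here is a minimal sketch of a group-level flattening function. The names (NDVariableGroup, flatten_evidence) are assumptions for illustration, not the current API:

import numpy as np

class NDVariableGroup:
    """A grid-shaped variable group that knows its own evidence layout."""

    def __init__(self, shape, num_states):
        self.shape = shape
        self.num_states = num_states

    def flatten_evidence(self, evidence):
        # Accept evidence in the group's natural (grid) layout and flatten
        # it to the internal vector, with no vars_to_evidence dict involved.
        assert evidence.shape == self.shape + (self.num_states,)
        return evidence.reshape(-1)

grid = NDVariableGroup(shape=(3, 3), num_states=2)
flat = grid.flatten_evidence(np.zeros((3, 3, 2)))  # shape (18,)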

Experiments on PMAP learning of RBM

Setup

Learning RBMs on MNIST digits from Sec. 5.5 of the PMAP paper

Metrics

  1. Sampling quality (Fig. 3(a) of the PMAP paper)
  2. Visualization of samples (Fig. 3(b) of the PMAP paper)
  3. Speed of inference

Concrete TODOs

Design and implement core inference functions

Migrate the best solution implemented in the contrib module, taking into account new message flows.

To quote @lazarox:

I’ve never used clipping so I don’t know about that one. Maybe I would just clip the unaries and hope that everything else remains contained.

Normalization should not make any mathematical difference in the algorithm, it’s there only for numerical robustness. You should be able to normalize and renormalize at will without changing the results. So just normalize as needed so that all operations are numerically stable.

For the rest, this is how I do it:
outgoing messages: Factor-to-variable messages
incoming messages: Variable-to-factor messages
messages: vector of log-max-marginals, including all possible assignments of the variable. I.e., for binary variables it's a two-dimensional vector.

Then:
1. You have a set of outgoing messages. For each message, the maximum value should be zero. Renormalize if not. These messages are the key quantity we are updating; everything else is a derived quantity.
2. Compute the beliefs by adding the messages at each variable (parallel op)
3. Compute the incoming messages by subtracting the outgoing messages from the beliefs (parallel op)
4. For each factor compute the new outgoing messages new_outgoing (parallel op)
5. Renormalize new_outgoing so that the max of each message is 0
6. Compute the message deltas: delta=new_outgoing-outgoing (parallel op)
7. If you now updated the outgoing messages of a single factor to the new ones, you'd get serial MP. Instead do outgoing += eta*delta (parallel op). eta is the step size, i.e., 1 - damping.
8. Renormalize outgoing so that the max of each message is 0
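
For concreteness, here is a minimal NumPy sketch of steps 1-8 above, restricted to pairwise factors so the factor update (step 4) stays short. All names and shapes are illustrative assumptions, not PGMax internals; the actual implementation would operate on the flat message array and be JIT-compiled with JAX.

import numpy as np

def damped_max_product_step(outgoing, log_pots, var_of, num_vars, eta):
    # outgoing: (num_factors, 2, k) factor-to-variable messages, one per
    #           side of each pairwise factor, each max-normalized to 0
    # log_pots: (num_factors, k, k) pairwise log-potentials
    # var_of:   (num_factors, 2) index of the variable on each side
    # eta:      step size, i.e. 1 - damping
    num_factors, _, k = outgoing.shape
    # Step 2: beliefs = sum of factor-to-variable messages at each variable.
    beliefs = np.zeros((num_vars, k))
    np.add.at(beliefs, var_of, outgoing)
    # Step 3: variable-to-factor messages = belief minus the edge's own message.
    incoming = beliefs[var_of] - outgoing
    # Step 4: new outgoing messages, maximizing over the other side's state.
    new_out = np.empty_like(outgoing)
    new_out[:, 0] = (log_pots + incoming[:, 1][:, None, :]).max(axis=2)
    new_out[:, 1] = (log_pots + incoming[:, 0][:, :, None]).max(axis=1)
    # Step 5: renormalize so the max of each message is 0.
    new_out -= new_out.max(axis=2, keepdims=True)
    # Steps 6-7: damped parallel update, outgoing += eta * delta.
    outgoing = outgoing + eta * (new_out - outgoing)
    # Step 8: renormalize again.
    return outgoing - outgoing.max(axis=2, keepdims=True)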

Add HowTo example notebook

This would explain for instance:

  • how to create a factor graph
  • how to use add_factor (with its different uses)
  • how to give evidence
  • how to parallelize over scenes (see the sketch below)
  • ...

Currently this has to be derived from existing examples.
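
As a placeholder for the "parallelize over scenes" item, here is a minimal JAX sketch: if inference is a pure function of the evidence, jax.vmap maps it over a batch of scenes. run_inference below is a hypothetical stand-in for a full BP-plus-decoding pipeline, not PGMax's API.

import jax
import jax.numpy as jnp

def run_inference(evidence):
    # Placeholder for a full run_bp + decode_map_states pipeline; here we
    # just pick the highest-evidence state per variable.
    return jnp.argmax(evidence, axis=-1)

batched = jax.vmap(run_inference)        # map over a leading "scenes" axis
evidence_batch = jnp.zeros((8, 100, 2))  # 8 scenes, 100 binary variables
map_states = batched(evidence_batch)     # shape (8, 100)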

Construct `FactorGraph`s using `FactorGroup`s

Change the interface of FactorGraph to take as input a sequence of FactorGroups instead of the current flat list of individual factors. Do the various expansions (getting the flat lists of variables and factors) inside the FactorGraph class.
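
A minimal sketch of the proposed construction, using hypothetical dataclasses (the names match the issue, not necessarily the final code):

from dataclasses import dataclass, field
from typing import Any, List, Sequence

@dataclass
class FactorGroup:
    factors: Sequence[Any]  # the individual factors in this group

@dataclass
class FactorGraph:
    factor_groups: Sequence[FactorGroup]
    factors: List[Any] = field(init=False)

    def __post_init__(self):
        # The expansion to a flat factor list happens here, not at the call site.
        self.factors = [f for g in self.factor_groups for f in g.factors]

fg = FactorGraph([FactorGroup(["f0", "f1"]), FactorGroup(["f2"])])
# fg.factors == ["f0", "f1", "f2"]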

Add customized class for pairwise factors; Default to have uniform potentials

Currently, users have to manually create an array of all possible configs and a uniform potential, but it would be nice to do this behind the scenes in some easy way. Maybe we can make it so that if either of these is None at init, we assume all possible configs or a uniform potential respectively and create them automatically (sketched below).
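
A sketch of the proposed defaults; pairwise_defaults is a hypothetical helper, not existing API:

import itertools
import numpy as np

def pairwise_defaults(num_states0, num_states1, configs=None, log_potentials=None):
    if configs is None:
        # Enumerate all (num_states0 * num_states1) possible configs.
        configs = np.array(list(itertools.product(range(num_states0), range(num_states1))))
    if log_potentials is None:
        # Uniform potential: all-zero log potentials, one per config.
        log_potentials = np.zeros(len(configs))
    return configs, log_potentials

configs, log_pots = pairwise_defaults(2, 3)  # 6 configs, uniform potential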

Keep track of mapping from factors to starting indices in the flat message array

Once #45 is resolved, also keep track of mapping from factors to starting indices in the flat message array to provide finer control over things like message initialization. Concretely:

  • Keep a mapping from FactorGroups to starting indices in the flat message array, inside the refactored FactorGraph which takes a sequence of FactorGroups at initialization.
  • Within each FactorGroup, keep a mapping from individual factors (indexed using a tuple of involved variable indices) to starting indices in the flat message array.
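
The bookkeeping itself amounts to a cumulative sum over per-factor message block sizes; a sketch with illustrative numbers:

import numpy as np

message_sizes = np.array([4, 4, 6])  # messages occupied by each factor
starts = np.concatenate(([0], np.cumsum(message_sizes)[:-1]))
factor_to_start = {f: int(s) for f, s in enumerate(starts)}
# factor_to_start == {0: 0, 1: 4, 2: 8}; the same idea applies one level up,
# mapping each FactorGroup to the start of its block.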

Make `decode_map_states` output a mapping from variable keys to MAP states

Currently, the decode_map_states function outputs a mapping from Variables to integers corresponding to each variable's MAP state. However, users don't actually have access to Variables; they only use keys to index Variables through a VariableGroup, so a mapping from Variable to int is unintuitive and cumbersome. It should instead map keys to ints.
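
A sketch of the proposed translation, assuming the VariableGroup exposes its key -> Variable mapping (plain dicts stand in below; only decode_map_states is a real name):

def map_states_by_key(variable_group, var_to_state):
    # Invert the group's key -> Variable mapping once, then translate the
    # output of decode_map_states from Variables to keys.
    variable_to_key = {v: k for k, v in variable_group.items()}
    return {variable_to_key[v]: s for v, s in var_to_state.items()}

group = {("grid", 0): "v0", ("grid", 1): "v1"}  # key -> Variable stand-ins
print(map_states_by_key(group, {"v0": 1, "v1": 0}))
# {('grid', 0): 1, ('grid', 1): 0}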

Update naming/docs based on feedback from internal beta

From @lawchekun :

  • key_tuple in GenericVariableGroup -> variable_names
  • In general maybe move away from using keys and instead use names
  • Would probably be helpful if there's a short example showing that init_msgs can be used to set up/trigger the belief propagation too in run_bp

Check whether a key is a sequence before checking key length

@StannisZhou: Wanted to check the expected format for the segment

np.zeros(valid_configs_dict[edge[-1]], dtype=float)

As I got a

Traceback (most recent call last):
  File "rcn_example.py", line 153, in <module>
    np.zeros(valid_configs_dict[edge[-1]], dtype=float),  # This line causes issues
ValueError: maximum supported dimension for an ndarray is 32, found 65

Here's the shape of valid_configs_dict[edge[-1]]

In [2]: valid_configs_dict[edge[-1]].shape
Out[2]: (65, 2)

I tried

np.zeros_like(valid_configs_dict[edge[-1]], dtype=float)

But got

Traceback (most recent call last):
  File "rcn_example.py", line 154, in <module>
    np.zeros_like(valid_configs_dict[edge[-1]], dtype=float),
  File "/home/chekun/miniconda3/envs/pgmax/lib/python3.7/site-packages/pgmax/fg/graph.py", line 129, in add_factor
    self._variable_group, *new_args, **kwargs
  File "<string>", line 7, in __init__
  File "/home/chekun/miniconda3/envs/pgmax/lib/python3.7/site-packages/pgmax/fg/groups.py", line 380, in __post_init__
    self, "_keys_to_factors", MappingProxyType(self._get_keys_to_factors())
  File "/home/chekun/miniconda3/envs/pgmax/lib/python3.7/site-packages/pgmax/fg/groups.py", line 504, in _get_keys_to_factors
    for ii in range(len(self.connected_var_keys))
  File "/home/chekun/miniconda3/envs/pgmax/lib/python3.7/site-packages/pgmax/fg/groups.py", line 504, in <dictcomp>
    for ii in range(len(self.connected_var_keys))
  File "/home/chekun/miniconda3/envs/pgmax/lib/python3.7/site-packages/pgmax/fg/groups.py", line 174, in __getitem__
    if len(curr_key) < 2:
TypeError: object of type 'numpy.int32' has no len()

So I think I'm not parsing it right...

Originally posted by @lawchekun in #73 (comment)
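
A sketch of the proposed guard, to be placed before the existing len(curr_key) check in groups.__getitem__ (the surrounding logic is elided):

from collections.abc import Sequence

def check_key(curr_key):
    # Scalar keys (e.g. a numpy.int32 from iterating over an array) have no
    # len(); fail with a clear message instead of an opaque TypeError.
    if isinstance(curr_key, str) or not isinstance(curr_key, Sequence):
        raise ValueError(
            f"Expected a sequence key, got {type(curr_key).__name__}: {curr_key!r}"
        )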

Benchmark against pomegranate

Notes about pomegranate

Package Focus: easy to stack and sequence probabilistic models by considering them to just be an underlying probability distribution. Also supports parallelization and GPU computation.

Overlap: pomegranate implements loopy belief propagation on factor graphs. However, only MAP inference is implemented (so they only seem to support max-product and not sum-product).

Comparison we can make: Speed of running either sum-product or max-product, ease of specifying model (maybe in terms of number of lines required or something like that?)

Demonstrating our advantage: We should be able to specify models much more easily with PGMax than with pomegranate, since pomegranate only lets you add one variable/factor at a time. Also, since PGMax's inference is JIT-compiled end-to-end, I expect it to be more efficient than pomegranate's inference (even though pomegranate leverages GPUs).
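
A sketch of a timing harness for the speed comparison; time_inference and its arguments are illustrative, and run_bp stands in for whatever inference entry point each library exposes:

import time

def time_inference(run_bp, evidence, repeats=10):
    run_bp(evidence)  # warm-up run (also triggers JIT compilation for PGMax)
    start = time.perf_counter()
    for _ in range(repeats):
        run_bp(evidence)
    return (time.perf_counter() - start) / repeats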

Experiments on GMRF

Reproduce experiments on learning a GMRF on BO dataset.

Requires #17 and #68

Demonstrate how PGMax can be used as part of a larger NN, by implementing this experiment using Trax, Flax and Haiku.

Experiments on RCN

Reproduce RCN experiments from the Science paper.

Demonstrate inference with a learned model with 100 templates.

Implement forward and backward pass within a single PGM.

Benchmark against PGMPy

Notes about PGMPy

Package Focus: easy to use from a user perspective, with wide support for a variety of different types of PGMs, inference and learning algorithms

Overlap: Also supports belief propagation on discrete, undirected factor graphs

Comparison we can make: Speed of running either sum-product or max-product, ease of specifying model (maybe in terms of number of lines required or something like that?)

Demonstrating our advantage: PGMPy uses dicts and NumPy arrays for message passing, so we should be significantly more efficient. Also, for grid models, etc. we should be able to specify them much more easily with PGMax
