GithubHelp home page GithubHelp logo

masahi / dynamic-datalog Goto Github PK

View Code? Open in Web Editor NEW

This project forked from frankmcsherry/dynamic-datalog

0.0 1.0 0.0 82.12 MB

Engines, queries, and data for dynamic Datalog computation

Rust 100.00%

dynamic-datalog's Introduction

dynamic-datalog

Engines, queries, and data for dynamic Datalog computation


This repository maintains a list of dynamic Datalog engines, those that update query outputs in response to changes in the input fact sets.

The repository also maintains several paired Datalog queries and input data, meant to exercise dynamic Datalog execution engines in non-trivial ways. These can be found in the ./problems/ subdirectory. These problems may also be independently interesting for evaluating non-dynamic Datalog engines.

Engines

We also use the excellent Soufflé as a non-dynamic baseline.

Problems

Evaulations

For each problem we record reported times for various systems in various configurations, both to perform any query-specific compilation and then execution. These measurements are meant to be representative rather than definitive. All of the systems support multiple worker threads, and could be run in a variety of configurations on a variety of hardware platforms.

Soufflé can often benefit from join planning help; without this help it can take orders of magnitude longer than it could. Such help is currently provided for the CRDT and Doop benchmarks, and measurements for other queries could improve in the future (especially for those runs in which it did not finish in 1,000 seconds). The Soufflé measurements are all directly using the query.dl file from the decompressed input directory, either with the -c flag (for compilation) or without (for interpretation).

Unlike other measurements, the Differential Dataflow measurements are for hand-written code in a larger language, and can reflect implementation and optimizations not easily available within Datalog. The code for each problem is in the differential/ directory.

The CRDT benchmark

Engine Compilation Evaluation Cores Notes
Soufflé (interpreted) 0s 1000s+ (DNF) 1 Laptop
Soufflé (compiled) 10.15s 294.73s 1 Laptop
Differential Dataflow 166.26s 3.44s 1 Laptop
Declarative Dataflow 0s
Differential Datalog
IncA

The CRDT benchmark contains stratified negation, as well as several "maximization" idioms represented in Datalog using a quadratic number of facts, which can be more efficiently implemented as data-parallel reduce operations.

The DOOP benchmark

Engine Compilation Evaluation Cores Notes
Soufflé (interpreted) 0s 762.14s 1 Laptop
Soufflé (compiled) 93.43s 111.76s 1 Laptop
Differential Dataflow 237.43s 161.58s 1 Laptop
Declarative Dataflow 0s
Differential Datalog
IncA

The DOOP benchmark is just really quite large.

The GALEN benchmark

Engine Compilation Evaluation Cores Notes
Soufflé (interpreted) 0s 1000s+ (DNF) 1 Laptop
Soufflé (compiled) 9.31s 198.19s 1 Laptop
Differential Dataflow 111.59s 123.54s 1 Laptop
Declarative Dataflow 0s
Differential Datalog
IncA

The GALEN benchmark contains joins on highly skewed keys, for which correct or adaptive join orders are important. The most problematic rule (IR4) is presented in the least problematic order for binary joins.

dynamic-datalog's People

Contributors

frankmcsherry avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.