bloom-lang / bud

Prototype Bud runtime (Bloom Under Development)

Home Page: http://bloom-lang.net

License: Other

Ruby 100.00%

bud's Issues

Make use of unified_ruby in rewriter

Before passing an sexp to Ruby2Ruby, you should first pass it through unified_ruby. This should hopefully avoid the need for kludges like SaneR2R.

Better implementation of stdin reader

Current send-data-via-TCP method is a massive kludge; would be much better to integrate the terminal's file descriptor into the EM event loop.

Similar comments probably apply to other IO-oriented collections.

Add concept of ordered channels?

The current channel semantics don't capture the behavior provided by several common types of channels (e.g., TCP sockets, typical Unix FDs, even typical asynchronous queues): if we send tuple t_1 at time 1 and t_2 at time 2, t_1 will be delivered before t_2. Choosing the delivery timestamp in a completely non-deterministic manner doesn't capture this; in practical programs, this means that the "correct" operation of the program depends on semantics not explicitly provided by the language spec (i.e., that two writes to stdio in adjacent timesteps preserve the timestep order).
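To make the ordering guarantee concrete, here is a minimal sketch in plain Ruby of a receiver that restores send order using sequence numbers. The class `OrderedReceiver` and its methods are hypothetical illustrations and are not part of Bud; how an ordered channel would actually be declared in Bloom is left open by the issue.

```ruby
# Hypothetical sketch, not Bud code: delivers tuples in send order even if
# the underlying transport hands them over out of order.
class OrderedReceiver
  attr_reader :delivered

  def initialize
    @next_seq = 0    # next sequence number that may be delivered
    @buffer = {}     # seq => tuple, for tuples that arrived early
    @delivered = []  # tuples released to the application, in order
  end

  # Accept a (seq, tuple) pair in any arrival order.
  def receive(seq, tuple)
    @buffer[seq] = tuple
    # Release every tuple that is now contiguous with what was delivered.
    while @buffer.key?(@next_seq)
      @delivered << @buffer.delete(@next_seq)
      @next_seq += 1
    end
  end
end

rx = OrderedReceiver.new
rx.receive(1, :t_2)  # arrives first, but is held back
rx.receive(0, :t_1)  # fills the gap, so both are released
```

After both calls, `rx.delivered` holds `:t_1` followed by `:t_2`, regardless of arrival order.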

Zookeeper integration

Allow access to @ip, @port from bootstrap

Need to refactor interaction between bootstrap and EM.

BFS port/rewrite

visualizer/debugging enhancements

connect visualizer nodes to rules/lines of code
interactive visualization?

String interpolation doesn't work

stdio <~ pipe.map {|p| ["got message: #{p.msg}"]}

Produces the output: "got message:", but doesn't appear to interpolate the given string correctly.

configure for capistrano

Rails example

Discovery protocol integration

a la Bonjour

chat example + mixins

memories/guesses/apologies

Fix pingpong example

Uses deprecated syntax (explicit strata). Explicit specification of hostname/port is ugly, but harder to resolve.

Async request/response handling

via a library or sugar. Some related things:

atomic deferred/async
batch scheduling
storage: UNIX I/O, SQL
RESTful web service interaction
HTML protocol bindings
interrupt-driven/push collections vs. read/pull

"terminal" collection type + multiple Buds per process

Right now, terminal automatically tries to read from stdin. This is annoying, because stdin might be used for other purposes. It also causes problems since a given Bud instance creates sub-instances to do stratification and similar tasks. Each instance spawns a thread to read from stdin, leading to contention.

Refactoring: move "meta" code out of Bud class

Should be in a separate module with clearly-defined private state and narrow interfaces.

Add test cases for "interface" functionality

There is cart-specific stuff, but nothing that gets run as part of "ts_bud".

REPL

Heartbeat library

Fix joins for TC-backed tables

Current exception:

NoMethodError: undefined method `each_value' for nil:NilClass
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:103:in `each_from'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:102:in `each'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:102:in `each_from'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:112:in `each_storage'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:656:in `send'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:656:in `hash_join'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:590:in `each'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:585:in `each'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:584:in `each'

key-value store

Should <+ and <- be evaluated "eagerly"?

Right now, <+ and <- derivations don't cause a new tick to occur. That is, suppose I have a program:

When X occurs, add a new fact to Y via <+
When Y contains 10 facts, fire the missiles

With current Bud, if there are exactly 10 X events (and no more), the missiles will never be fired: the 10th Y fact will be "pending", but won't be added to Y until the next tick, which might never occur.
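The behavior described above can be reproduced with a toy model in plain Ruby. `TinyRuntime` is a hypothetical stand-in written for this illustration, not Bud's implementation; it only assumes that facts derived via <+ become visible at the start of the following tick.

```ruby
# Hypothetical model of deferred (<+) inserts; none of these names are Bud APIs.
class TinyRuntime
  attr_reader :y, :fired

  def initialize
    @y = []        # facts visible in the current timestep
    @pending = []  # facts derived via <+; visible only from the next tick
    @fired = false
  end

  # One timestep: install pending facts, then evaluate both rules.
  def tick(x_events = [])
    @y.concat(@pending)
    @pending = []
    x_events.each { |x| @pending << x }  # rule 1: each X adds a fact to Y via <+
    @fired = true if @y.size >= 10       # rule 2: Y contains 10 facts
  end
end

rt = TinyRuntime.new
10.times { |i| rt.tick([i]) }          # exactly 10 X events, one per tick
fired_without_extra_tick = rt.fired    # the 10th fact is still pending here
rt.tick                                # an extra tick with no new events
fired_after_extra_tick = rt.fired
```

In this model the rule fires only after the extra tick, which is the situation the issue describes: without some outside event to trigger another timestep, the pending fact is never installed.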

Rewrite failure: empty "declare" block

Given a Bud program with a "declare def foo; end" block, Bud produces the output:

Running original (ZkTableTest) code: couldn't rewrite stratified ruby (Invalid top-level clause length 1: '[:nil]')

ordering features

queue/serializer
nonce/sequence generation
- nested time?
- Solution: assign_ordinals. Given a set and a total order for that set, return the input set annotated with the ordinal of each element of the set according to the ordering.
- How does this generalize to ordered tables?
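A minimal sketch of the proposed assign_ordinals in plain Ruby follows. Only the name comes from the issue; the signature (a set plus an optional comparison block that supplies the total order) is an assumption made for illustration.

```ruby
require 'set'

# Hypothetical helper: annotate each element of a set with its position
# under a total order. The block, if given, is a comparator as for
# Enumerable#sort; otherwise the elements' natural order (<=>) is used.
def assign_ordinals(set, &order)
  sorted = order ? set.sort(&order) : set.sort
  sorted.each_with_index.map { |elem, i| [elem, i] }
end

ranked = assign_ordinals(Set["pear", "apple", "fig"])
by_length = assign_ordinals(Set["pear", "apple", "fig"]) { |a, b| a.size <=> b.size }
```

Here `ranked` pairs the strings with ordinals in alphabetical order, and `by_length` does the same under a length-based order. The comparator must define a total order for the ordinals to be deterministic.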

Instance variables can't be accessed from bootstrap blocks

Can't reference stdio in bootstrap block

Test case failure: test_visualization

.......................E...Running original (VarBudDup) code: couldn't rewrite stratified ruby (Invalid op (x) in top-level block [:attrasgn, [:self], :x=, [:array, [:lit, 4]]]
)
.Running original (VarBud) code: couldn't rewrite stratified ruby (Invalid op (x) in top-level block [:attrasgn, [:self], :x=, [:array, [:lit, 4]]]
)
..
Finished in 33.719155 seconds.

Error:
test_visualization(TestMeta):
NoMethodError: undefined method `cycle' for #<DepAnalysis:0x101791948>
./tc_meta.rb:92:in `test_visualization'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:106:in `each_from'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:103:in `each_value'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:103:in `each_from'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:102:in `each'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:102:in `each_from'
/Library/Ruby/Gems/1.8/gems/bud-0.0.1/lib/bud/collections.rb:98:in `each'
./tc_meta.rb:92:in `test_visualization'

30 tests, 138 assertions, 0 failures, 1 errors

syntax enhancements for events, lookups

think through:

periodics
message channels (possibly multiple in batch)
addressing through a networking library

Improve signal handling

We emit an ugly message on Ctrl+C.

Broadcast/Multicast protocols

scope and aliasing

currently if I want to use a module Foo, I say:

include Foo

and this brings into (global) scope the rules and relations defined by Foo. what if I want to use two Foos? what if I want to extend Foo by redeclaring one of its input interfaces and interposing additional logic? we really want something like:

import Foo as f1

this would kill both birds...

chord example

Persistent storage: on-disk state

Where should we store the on-disk state associated with persistent tables? Requirements:

If two concurrent Bud instances are running, they shouldn't try to write to the same on-disk state
- Solution: include port number in path to on-disk state
If the same Bud program is run twice, the persistent state from the previous instance should be accessible to the new instance
- What defines "a new instance of the same Bud program"? One approach would be to include the full path name of the Bud program in the path to the on-disk state; running /x/y/foo.rb on port 5555 twice in a row means that the second instance reuses the state written by the first instance
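One way to satisfy both requirements is to derive the state directory from the program's full path and the port. The sketch below is an assumption-laden illustration: the function `state_dir` and the base directory `/tmp/bud_state` are hypothetical and do not exist in Bud.

```ruby
require 'digest/sha1'

# Hypothetical layout: <base>/<hash of absolute program path>/<port>
def state_dir(program_path, port, base = "/tmp/bud_state")
  # Hash the absolute path so it is safe to use as a single directory name.
  program_id = Digest::SHA1.hexdigest(File.expand_path(program_path))
  File.join(base, program_id, port.to_s)
end

first_run  = state_dir("/x/y/foo.rb", 5555)
second_run = state_dir("/x/y/foo.rb", 5555)  # same program, same port
other_port = state_dir("/x/y/foo.rb", 5556)  # concurrent instance
```

Under this scheme, two runs of /x/y/foo.rb on port 5555 map to the same directory, while a concurrent instance on another port gets its own. It does not address two instances started on the same port, which would fail to bind anyway.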

Mapreduce example

shopping carts

Delivery/Reliable delivery

refactor collection interface

Building a collection type based on TokyoCabinet is somewhat awkward, because the base BudCollection class assumes that @storage is an in-memory hash containing the "normal" tuples in the table. It is inconvenient to maintain this invariant for TC; it would be better if BudCollection didn't make this assumption.

Other areas for improvement:

interface is large; it isn't clear what stuff an implementation needs to provide
perhaps move some functionality into a module/mixin?

safe_rewrite should actually do some validation

Currently, safe_rewrite doesn't validate things. Some validation that might be nice is "are there any variables in this code that are undefined?" and "are all of the tables referenced actually defined?"

Ideally, we'd like to catch stuff like this example (which illustrates the previous two points)

class Foo < Bud
  declare
  def rules
    j             # undefined variable
    xyz <= [[1]]  # undefined collection xyz
  end
end

UDA for disorderly cart

we need to take the disorderly schema (session, item, cnt) and transform it to the array representation (session, array_of_items) as we describe in the cidr paper. array_of_items should contain each item cnt times. so we need either a binary UDA, or some flattening trick. the current unary UDA accum() does not account for cnt.
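The required transformation can be sketched in plain Ruby as follows. This is not written against Bud's aggregate API; `flatten_cart` is a hypothetical name, and sorting the items is an added assumption to make the output order deterministic.

```ruby
# Hypothetical sketch: turn (session, item, cnt) rows into
# (session, array_of_items), repeating each item cnt times.
def flatten_cart(rows)
  carts = Hash.new { |h, k| h[k] = [] }
  rows.each do |session, item, cnt|
    carts[session].concat([item] * cnt)
  end
  # Sort items so the result does not depend on input row order.
  carts.map { |session, items| [session, items.sort] }
end

rows = [[1, "beer", 2], [1, "diapers", 1], [2, "beer", 1]]
carts = flatten_cart(rows)
```

Session 1 ends up with "beer" twice and "diapers" once, which is what a unary accumulator over the item column alone cannot produce.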

voting

code-review semi-naive

Fix deletion semantics

Currently we check for <- LHS matches only looking at the key columns; should look at the entire tuple.
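The difference between the two matching rules can be shown with a toy table in plain Ruby. The storage layout (a hash from key to full tuple, with the first column as the key) and both function names are assumptions for illustration, not Bud internals.

```ruby
# Hypothetical model: the table maps key => full tuple; column 0 is the key.

# Behavior described as current: match on key columns only.
def delete_by_key(storage, tuple)
  storage.reject { |key, _stored| key == tuple.first }
end

# Proposed behavior: match on the entire tuple.
def delete_by_tuple(storage, tuple)
  storage.reject { |_key, stored| stored == tuple }
end

table = { 1 => [1, "a"], 2 => [2, "b"] }
request = [1, "z"]  # same key as a stored tuple, different value
after_key_match = delete_by_key(table, request)
after_tuple_match = delete_by_tuple(table, request)
```

With key-only matching, the request removes `[1, "a"]` even though the tuples differ; with whole-tuple matching, the table is left unchanged.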

convert from Class to Module

Support for ephemeral ports

Binding to a port explicitly is ugly.

leader election

lelann/chan
per Paxos Made Live

Bud API: support interaction with async bud

When Bud is running as a background thread, we need a safe way to interact with it (e.g., insert tuples, probe table contents) without race conditions. The easiest way to do that is to use a thread-safe queue.
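A minimal sketch of the queue-based approach, using Ruby's built-in thread-safe `Queue`, is shown below. `BackgroundTable` and its methods are hypothetical and stand in for a Bud instance running on its own thread; only the worker thread ever touches the table contents.

```ruby
# Hypothetical sketch, not a Bud API: all access to @rows goes through
# a thread-safe queue that is drained by a single worker thread.
class BackgroundTable
  def initialize
    @inbox = Queue.new
    @rows = []
    @worker = Thread.new { run }
  end

  # Enqueue an insert; returns immediately.
  def insert(tuple)
    @inbox << [:insert, tuple]
  end

  # Ask the worker for a copy of the table and block until it replies.
  def snapshot
    reply = Queue.new
    @inbox << [:snapshot, reply]
    reply.pop
  end

  # Shut the worker down after it has processed earlier requests.
  def stop
    @inbox << [:stop]
    @worker.join
  end

  private

  def run
    loop do
      op, arg = @inbox.pop
      case op
      when :insert   then @rows << arg
      when :snapshot then arg << @rows.dup
      when :stop     then break
      end
    end
  end
end

bt = BackgroundTable.new
bt.insert([1, "a"])
bt.insert([2, "b"])
rows = bt.snapshot
bt.stop
```

Because requests are processed in FIFO order, the snapshot reflects both earlier inserts. Returning a copy keeps callers from mutating the worker's state.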

Bud API: easier insert into scratch

There isn't an easy way to insert tuples into a scratch collection between calls to tick(): right now, scratches are emptied before the next tick() begins.