GithubHelp home page GithubHelp logo

ed-science / amazon-kinesis-replay Goto Github PK

View Code? Open in Web Editor NEW

This project forked from aws-samples/amazon-kinesis-replay

0.0 1.0 0.0 47 KB

A Java application that replays events that are stored in objects in Amazon S3 into a Amazon Kinesis stream as if they occurred in real time.

License: MIT No Attribution

Java 88.23% Jupyter Notebook 11.77%

amazon-kinesis-replay's Introduction

Amazon Kinesis Replay

A simple Java application that replays Json events that are stored in objects in Amazon S3 into a Amazon Kinesis stream. The application reads the timestamp attribute of the stored events and replays them as if they occurred in real time.

By default, the application will replay a historic data set of taxi trips that made in New York City that is derived from the public dataset available from the Registry of Open Data on AWS.

$ java -jar amazon-kinesis-replay-1.0.jar -streamName «Kinesis stream name» -streamRegion «AWS region»

To increase the number of events sent per second, you can accelerate the replay using the -speedup parameter.

The following command replays one hour of data within one second.

$ java -jar amazon-kinesis-replay-1.0.jar -streamName «Kinesis stream name» -streamRegion «AWS region» -speedup 3600

To aggregate multiple events in a Kinesis Data Streams record, you can use the -aggregate option. Aggregation allows you to increase the number of records sent per API call, which effectively increases producer throughput

$ java -jar amazon-kinesis-replay-1.0.jar -streamName «Kinesis stream name» -streamRegion «AWS region» -speedup 3600 -aggregate

To specify an alternative dataset you can use the -bucket and -prefix options as long as the events in the objects are stored in minified Json format, have a timestamp attribute and are ordered by this timestamp. The name of the timestamp attribute can be customized with the timestampAttributeName parameter.

$ java -jar amazon-kinesis-replay-1.0.jar -streamName «Kinesis stream name» -streamRegion «AWS region» -bucketName «S3 bucket name» -bucketRegion «S3 bucket region» -objectPrefix «S3 prefix of objects to read»

More options can be obtained through the -help parameter.

License Summary

This sample code is made available under a modified MIT license. See the LICENSE file.

amazon-kinesis-replay's People

Contributors

dependabot[bot] avatar jpeddicord avatar jrode avatar kamikazi avatar sthm avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.