Light

seo01 / tweetngrams Goto Github PK

View Code? Open in Web Editor NEW

2.0 2.0 2.0 120 KB

An ngram tweet generator

Python 100.00%

tweetngrams's Introduction

TweetNgrams ReadMe

Tweet ngrams generates new unique documents based on given documents as a model. It was designed to generate tweets (hence the name) but will work with any corpus.

Get the damn thing to do something

Download the repo
Go to the src directory

cd {downloads}/TweetNgrams/src

Set this directory as your python path

export PYTHONPATH=pwd

See the options

nlp/nlp_cli.py -h

Given a text file with one document per line (e.g. one tweet per line if generating tweets) as a model, generate a new unique document.

nlp/nlp_cli.py -g {filename}

The rest should be apparent from the help text.

WTF!?! Why is this useful

There are two main uses for generating documents: To generate spam or generate filler. The most useful example of generating spam is to manipulate people analysing twitter. The most useful example of generating filler is as an alternative to Lorum Ipsum modelled on more representative text.

Testing

Testing is done using nose and unittest. From the src folder execute nose:

nosetests

tweetngrams's People

Contributors

Stargazers

Watchers

Forkers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs