Light

shinji94 / text_twitter_minining Goto Github PK

View Code? Open in Web Editor NEW

0.0 0.0 0.0 11.57 MB

for apally data mining project

HTML 81.04% Python 18.96%

text_twitter_minining's Introduction

text_twitter_minining

for apally data mining project

I strongly recommend reading this chapter before this week's practical to give you further background in addition to the lecture material: http://www.nltk.org/book/ch06.html

The instruction of the code is not design in a 'click and run' format, so to use the code, user should open it in python and change the location of dataset (line 132 's = 'C:\Users\Hasee\Desktop\final project of adm\gb-celebs\gb-celebs') After which you can run the file.

The tokenizer is list in the line 60~64 you should use the function 'define_tokenizer(tokenizer = tokenise)' to use the tokenizer you want to use (line _77,several tokenizer is placed.)

And to use corpus from text , you should uncomment line 258

The code is design using a 10 cross validation with a 8-2 split, this is also what can be managed by user.line 237

If you want to use other classiers check the sklearn website for more. also proivided tokenizer not show in the report like standford or regtokenizer. you can explore it if you wish!!

text_twitter_minining's People

Contributors

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs