hhy5277 Goto Github PK
Name: haiyang
Type: User
Blog: https://git.io/fhsNr
Name: haiyang
Type: User
Blog: https://git.io/fhsNr
使用scrapy,redis, mongodb,graphite实现的一个分布式网络爬虫,底层存储mongodb集群,分布式使用redis实现,爬虫状态显示使用graphite实现
TensorFlow分布式MNIST手写字体识别实例
分布式新浪微博爬虫
1. 主要分为三个模块,一个爬虫抓取模块,一个是数据处理模块,一个是用户模块。 2. 爬虫抓取模块主要是从直播吧、新浪体育、网易体育上爬取有关足球的新闻和用户关于足球的评论,利用集群HADOOP抓取网页,分析得出URL集,提取特征URL 3. 网页linux脚本过滤得到原始网页,然后二次过滤得到文本,并使用分布式储存。 4. 处理模块主要是根据训练集规则一和规则二,得到分词器,然后对文本进行操作,得出训练结果。 5. 通过特征脚本得到训练结果的特征词分类,然后提取出球队模糊集和球星模糊集。 6. 过滤得到球队精确集和球星精确集,并存入MYSQL数据库。 7. 从数据库中提取球星和球队的信息进行图表分析,并动态显示WIKI信息,调入显示模块中和用户进行交换
分布式爬虫,redis缓存,mysql持久化,rpc实现分布式。可用docker部署
This is what I do with Pthon distributed crawler
Short, simple, direct scripts for creating ASCII graphical histograms in the terminal.
The Docker toolset to pack, ship, store, and deliver content
NodeSource Node.js Binary Distributions
🥑 Language focused docker images, minus the operating system.
Class materials for a distributed systems lecture series
Lightweight Markdown Documentation System
:elephant: Ditto is a scripting language implemented in C
A tool for exploring each layer in a docker image
本项目将《动手学深度学习》原书中的MXNet代码实现改为PyTorch实现。
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为TensorFlow 2.0实现,项目已得到李沐老师的同意
Dive into Machine Learning with Python Jupyter notebook and scikit-learn
全面的Webpack教程《深入浅出Webpack》电子书
深入Go并发编程研讨课
A copy of Mark Pilgrim’s “Dive Into HTML5” book, hosted by HTML5 Doctors. To help improve submit a pull request or add an issue. More info at http://html5doctor.com/dive-into-html5-doctor/
webpack 源码解析系列
build you own robot in one hour! (this is the entry version "green" NO Bluetooth, for latest updates please go here:
Machine Learning Tool Guides and Theory Notes
DIY a simple Vuex
django+es搭建的前后端分离,唐诗宋词搜索引擎。
The Web framework for perfectionists with deadlines.
Let AngularJS play well with Django
An example repository of combining Django Rest Framework with AngularJS
django blog demo
基于 Python3.5 和 Django 1.10 的 Django Blog 项目。
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.