
dogspider's Introduction

dogSpider

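This is a website crawler written for practice. It is still a work in progress.

Spider.py: the individual interface functions

dogSpider.py: the main program

config.py: the configuration file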


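1. List the site's category directory and choose which categories to crawl (currently the first board is downloaded by default)

2. Create folders that mirror the site's directory (done)

3. Create a folder for each title and store the page's images and downloaded files in it; for collections, split by content and create further subfolders. (Collections are not handled yet; currently the last torrent file and all images are downloaded)

4. Multi-threaded download of the items the user wants (not written yet)

5. Record already-downloaded links in MySQL (not written yet)

6. Update mode / full download mode (not written yet)

7. Bug fixes and exception handling (incomplete)

8. Code optimization (last)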
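
As an illustration of items 2, 3 and 5 above, here is a minimal sketch, not the project's actual code. The names `safe_name`, `make_post_dir` and `SeenLinks` are hypothetical, and SQLite from the standard library stands in for MySQL so the example runs without a database server.

```python
import os
import re
import sqlite3
import tempfile

# Characters that are not allowed in file names on common platforms.
_UNSAFE = re.compile(r'[\\/:*?"<>|\r\n\t]+')


def safe_name(title, max_len=80):
    """Turn a category or post title into a usable folder name (hypothetical helper)."""
    cleaned = _UNSAFE.sub("_", title).strip(" ._")
    return cleaned[:max_len] or "untitled"


def make_post_dir(root, category, title):
    """Create root/category/title (items 2 and 3) and return the path."""
    path = os.path.join(root, safe_name(category), safe_name(title))
    os.makedirs(path, exist_ok=True)  # no error if the folder already exists
    return path


class SeenLinks:
    """Remembers downloaded links (item 5). SQLite is an assumption, standing in for MySQL."""

    def __init__(self, db_path=":memory:"):
        self.conn = sqlite3.connect(db_path)
        self.conn.execute("CREATE TABLE IF NOT EXISTS seen (url TEXT PRIMARY KEY)")

    def mark(self, url):
        """Record url; return True if it was new, False if it was already recorded."""
        cur = self.conn.execute(
            "INSERT OR IGNORE INTO seen (url) VALUES (?)", (url,)
        )
        self.conn.commit()
        return cur.rowcount == 1


if __name__ == "__main__":
    root = tempfile.mkdtemp()  # throwaway download root for the demo
    seen = SeenLinks()
    url = "http://example.com/thread/1"  # placeholder URL
    if seen.mark(url):
        print("would download into", make_post_dir(root, "Board 1", "Post: one"))
    if not seen.mark(url):
        print("already downloaded, skipping", url)
```

Keeping the link record behind a small class like this would let the storage backend be swapped for MySQL later without touching the crawl loop, and a check of `mark()` before each download is one possible basis for the update mode in item 6.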


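Requires Python 3.x or later

Required libraries

urllib (part of the Python 3 standard library), bs4 (third-party, installed as the beautifulsoup4 package)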


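For privacy reasons, config.py and the other configuration files are not published. Alipay sponsorship: [email protected] (1 yuan or more). Leave your QQ number in the transfer note and I will send the config file to your QQ mailbox.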

Contributors

hitaian
