GithubHelp home page GithubHelp logo

NuGet Package? about dotnetspider HOT 7 CLOSED

dotnetcore avatar dotnetcore commented on May 12, 2024
NuGet Package?

from dotnetspider.

Comments (7)

ZhaoXiangXML avatar ZhaoXiangXML commented on May 12, 2024

I found a DotnetSpider2.Core and a DotnetSpider2.Extension on NuGet, which looks like this project

from dotnetspider.

zlzforever avatar zlzforever commented on May 12, 2024

NUGET上的DotnetSpider2.Core, DotnetSpider2.Extension就是这个项目的, 但是包比较老. 想用最新的可以配置我的私有源http://zlzforever.6655.la:40001/

from dotnetspider.

ZhaoXiangXML avatar ZhaoXiangXML commented on May 12, 2024

谢谢。

我用NuGet上的库暂时没有发现什么问题,除了有个Download拼错了。

另外我现在做的事情是下载网页上的图片,全部下载好之后和网页本身拼到一起。如果用现在的pipeline机制的话,怎样在图片全部下载完成后执行后面的事情呢?

from dotnetspider.

zlzforever avatar zlzforever commented on May 12, 2024

HttpClientDownloader集成了下载图片的功能,不确定NUGET上版本有。你只需要在Processor的解析中,把所有IMAGE的链接解析出来,通过page.AddTargetRequests最终会添加到队列中,downloader发现是文件是会下载到相对的对应目录中。比如 www.a.com/b/1.png, 则会下载到 /b/1.png中

from dotnetspider.

ZhaoXiangXML avatar ZhaoXiangXML commented on May 12, 2024

那有办法在同一网页上的所有图片完成后得到通知吗?

from dotnetspider.

zlzforever avatar zlzforever commented on May 12, 2024

没有。你这个需求不是一个常用的场景,不太好抽象。你可以自己实现一个Processor, 把图片的链接按URL归类, 在Pipeline中下载,就样可以保留到这一状态信息。

from dotnetspider.

ZhaoXiangXML avatar ZhaoXiangXML commented on May 12, 2024

我现在就是这么干的。。谢谢了

from dotnetspider.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.