These options would cause that all HTML images and CSS files would not be loaded by th

Add skipImages and skipCss to launchPuppeteer() about crawlee HOT 5 CLOSED

apify commented on May 18, 2024

Add skipImages and skipCss to launchPuppeteer()

from crawlee.

Comments (5)

mnmkng commented on May 18, 2024 1

This cannot be reliably done before we have a middleware system for request interception: #204 since it cannot be done on Chromium level (with a flag) and adding request interception prevents adding more request interceptions.

I'm just gonna drop a doc here for referencing it later perhaps:

@param {Boolean|Array} [blockResources=false]
 *   Uses the [`Apify.utils.puppeteer.blockResources()`](../api/puppeteer#puppeteer.blockResources)
 *   function to block downloads of resources such as images, videos or CSS. It accepts either
 *   a boolean `true`, which will enable the default blocking or a `string[]` listing
 *   the resources that should be blocked. See the
 *   [`Apify.utils.puppeteer.blockResources()`](../api/puppeteer#puppeteer.blockResources)
 *   for details.

from crawlee.

jancurn commented on May 18, 2024

This should go to the new class called PuppeteerEx - see other issue.

from crawlee.

jancurn commented on May 18, 2024

Now we have new Apify.utils.puppeteer namespace for this!

from crawlee.

jancurn commented on May 18, 2024

And now we even have Apify.utils.puppeteer.blockResources() that can be used. Perhaps we can only add option blockResourceTypes to launchPuppeteer and that's it.

from crawlee.

mnmkng commented on May 18, 2024

Since request interception disables cache, this might not bring the expected benefits. Closing for now.

from crawlee.

Add skipImages and skipCss to launchPuppeteer() about crawlee HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs