Image scrapper is a Chrome extension that extracts image URLs from any web page and downloads them in bulk.
- It can also download files of other types (for example, PDF).
- Images can optionally be uploaded directly to Google Photos (this counts against your Google Drive storage quota).
- Pause downloads, or download images in the desired order.
- Save the list of URLs to Chrome's local storage to download later.
- Import and export configurations.

To use the extension:
- Study the web page you want to get images from and configure the extension accordingly on the settings page. See How to Configure for details.
- Click the accept button in the notification.
- Wait until the application fetches the URLs, then click the download button. Do not close the content page until all the images have been fetched.

How to Configure: each configuration entry has three fields.
- url : match pattern that selects the desired URLs.
- path : address of the DOM elements in the document that hold the URLs of the pictures you want to fetch.
- next : address of the element that links to the document containing the next set of pictures. Leave it blank if there are no more pictures to fetch.
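
As a rough illustration of the url field, the sketch below treats the pattern as a simple wildcard in which * matches any run of characters. That reading is an assumption rather than the extension's confirmed matching logic, and patternToRegExp and urlMatches are hypothetical names introduced only for this example.

```javascript
// Hypothetical helper: turn a wildcard pattern into a RegExp.
// Assumption: "*" matches any run of characters; everything else is literal.
function patternToRegExp(pattern) {
  const escaped = pattern
    .split('*')
    // Escape regex metacharacters in the literal parts of the pattern.
    .map((part) => part.replace(/[.*+?^${}()|[\]\\]/g, '\\$&'))
    .join('.*');
  return new RegExp('^' + escaped + '$');
}

// Hypothetical helper: does the given page URL match the pattern?
function urlMatches(pattern, url) {
  return patternToRegExp(pattern).test(url);
}
```

Under this assumption, urlMatches('https://www.unsplash.com/search/photos/*', pageUrl) would accept any page below the photo search path and reject everything else.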

Like XPath, an address is a location path. The location path consists of steps separated by /. Each step has the form X, X=X, or X=X=X:
- id=x --> get the element with id = x
- childNodes=x --> get the xth childNode
- class=x --> get the elements with class name = x
- class=x=y --> get the yth element with class name = x
- tag=x --> get the elements with tag name = x
- tag=x=y --> get the yth element with tag name = x
- ?=x --> get the value of the attribute named x
- navigate --> get the document reached by navigating to the address obtained so far

In JavaScript, class=gallery=0/tag=input=0/?=value
translates to
document.getElementsByClassName('gallery')[0].getElementsByTagName('input')[0].value
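
The translation above can be sketched as a small parser. This is a minimal illustration, not the extension's actual code: parsePath and toExpression are hypothetical names, attribute steps are rendered as property access to mirror the example, and the navigate step is deliberately left unsupported because it requires loading another document.

```javascript
// Hypothetical parser: split a location path into step objects.
function parsePath(path) {
  return path
    .split('/')
    .filter((step) => step.length > 0)
    .map((step) => {
      // A step is X, X=X, or X=X=X.
      const [kind, name, index] = step.split('=');
      return { kind, name, index: index === undefined ? undefined : Number(index) };
    });
}

// Hypothetical translator: build the equivalent JavaScript expression as a string.
function toExpression(path) {
  let expr = 'document';
  for (const { kind, name, index } of parsePath(path)) {
    const pick = index === undefined ? '' : `[${index}]`;
    switch (kind) {
      case 'id':
        expr += `.getElementById('${name}')`;
        break;
      case 'class':
        expr += `.getElementsByClassName('${name}')${pick}`;
        break;
      case 'tag':
        expr += `.getElementsByTagName('${name}')${pick}`;
        break;
      case 'childNodes':
        expr += `.childNodes[${name}]`;
        break;
      case '?':
        // Assumption: rendered as property access, as in the example above.
        expr += `.${name}`;
        break;
      default:
        // "navigate" and unknown steps are not handled in this sketch.
        throw new Error(`Unsupported step: ${kind}`);
    }
  }
  return expr;
}
```

Calling toExpression('class=gallery=0/tag=input=0/?=value') reproduces the expression shown above as a string.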
The configuration shown in the demonstration is:
{
  "https://www.unsplash.com/search" : {
    "path":"class=_1pn7R/tag=a=0/?=href/navigate/class=_2yFK- IEpfq=0/tag=img=0/?=src",
    "next":"",
    "url":"https://www.unsplash.com/search/photos/*"
  }
}
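
Before importing a configuration like this one, it may help to check its shape. The sketch below is a hypothetical validator, not part of the extension: validateConfig is an invented name, and the rule that every entry needs string url, path, and next fields (with only next allowed to be blank) is an assumption based on the field descriptions above.

```javascript
// Hypothetical validator for a configuration given as JSON text.
// Returns a list of human-readable problems; an empty list means it looks fine.
function validateConfig(jsonText) {
  const config = JSON.parse(jsonText); // throws SyntaxError on malformed JSON
  if (config === null || typeof config !== 'object' || Array.isArray(config)) {
    return ['configuration must be a JSON object'];
  }
  const errors = [];
  for (const [key, entry] of Object.entries(config)) {
    if (entry === null || typeof entry !== 'object') {
      errors.push(`${key}: entry must be an object`);
      continue;
    }
    // Assumption: all three fields are required and must be strings.
    for (const field of ['url', 'path', 'next']) {
      if (typeof entry[field] !== 'string') {
        errors.push(`${key}: "${field}" must be a string`);
      }
    }
    // Assumption: "next" may be blank, but "path" may not.
    if (typeof entry.path === 'string' && entry.path.trim() === '') {
      errors.push(`${key}: "path" must not be empty`);
    }
  }
  return errors;
}
```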