GithubHelp home page GithubHelp logo

seco / web-crawler Goto Github PK

View Code? Open in Web Editor NEW

This project forked from faruktemur/web-crawler

0.0 1.0 0.0 165 KB

WebCrawler is simple xpath based crawler library for PHP developers.

PHP 100.00%

web-crawler's Introduction

What is WebCrawler ?

WebCrawler is simple xpath based crawler library for PHP developers.

How to use it ?

  • Install autoloader with composer.
	$ composer install
  • Create your own resource to be crawle.
	<?php
	namespace WebCrawler\Resource;

	class SampleResource extends AbstractResource 
								  implements ResourceInterface
	{
		public function getImageUrl()
		{
			$xpath = '//*[@class="slider"]/li[1]/a/img';
			return sprintf('%s%s', $this->getBaseUrl(), 
						   $this->getAttributeValue($xpath, 'src'));
		}

		public function getPrice()
		{
			$xpath = '//*[@id="ctl00_u14_ascUrunDetay_dtUrunD'. 
					 'etay_ctl00_lblSatisFiyat"]/text()[1]';
			return $this->getNodeValue($xpath);
		}

		public function getCurrency()
		{
			return 'TRL';
		}
	}
  • Use it!
	<?php
	require('vendor/autoload.php');

	use WebCrawler\Crawler;
	use WebCrawler\Resource\SampleResource;

	$crawler = new Crawler();
	$resource = new SampleResource(
		'http://www.vatanbilgisayar.com/' .
		'iphone-6-16-gb-akilli-telefon-silver.html#genel-bakis'
	);
	$crawler->crawleResource($resource);

	print $resource->getPrice() . PHP_EOL;
	print $resource->getImageUrl() . PHP_EOL;

Output:

	2.349
	http://www.vatanbilgisayar.com/UPLOAD/PRODUCT/APPLE/Thumb/v2-74694-4_medium.jpg

web-crawler's People

Contributors

ibrahimgunduz34 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.