Most page scrapers built in PHP can be tedious to use, while providing unintended results. These page scrapers iterate through the HTML for each independent "DOM Extraction", thus making it slow to use. Once the results are received, you still need to manipulate and sort the data yourself, which can be difficult without knowledge of JavaScript.
Aldo aims to make it almost effortless to fetch results from a remote website.
- HTTP Requests
- Element Manager
- Selectors for ID, class (TODO: and other types)
- Sorting
- Filtering (getting emails)
- Rebuild HTML
- Parent/children
- Set value of element, instead of creating a new array for value
- Handle HTML empty elements: input, br, etc
- Do not include comments in sequence
- Alias functions for certain attributes; href => link(), src => source(), value => val(), etc
- Support multiple classes in element
- Turn arrays into objects