rvest
rvest helps you scrape information from web pages. It currently provides two main features:
-
Select parts of a document using css selectors:
doc[sel("table td")]
-
Extract important components of html tags with
html_tag()
,html_text()
,html_attr()
andhtml_attrs()
. -
Parse tables into data frames with
html_table()
. -
Extract, modify and submit forms with
html_form()
,set_values()
andsubmit_form()
-
Navigate around a website as if you're in a browser with
html_session()
,jump_to()
,follow_link()
,back()
,forward()
,submit_format()
and so on.
Inspirations
- Python: Robobrowser