Data tidying and importing
Increasing amount of data is available on the web
These data are provided in an unstructured format: you can always copy & paste, but it’s time-consuming and prone to errors
Web scraping is the process of extracting this information automatically and transform it into a structured dataset
Two different scenarios:
|>
read_html(): Read HTML data from a url or character stringhtml_element(): Select a specified HTML elementhtml_elements(): Select specified HTML elementshtml_table(): Parse an HTML table into a data framehtml_text(): Extract text from an HTML elementhtml_name(): Extract the name of an HTML elementhtml_attr(): Extract all HTML element attributes by namehtml_attr(): Extract a single HTML element attribute by name





Through this process of selection and rejection, SelectorGadget facilitates discovering the appropriate CSS selector for your needs.
