URL (Web Scraping)
Web Scraping is used to bring data directly into Phrazor to perform analysis and generate insights.
To fetch data from the internet you can use a URL connector to web scrape a website.
- Go to Data / New Data Connections / Fetch Data from URL
- Enter a Source Name for the data being retrieved, a name for the dataset to be uploaded on Phrazor
data:image/s3,"s3://crabby-images/503d9/503d94a1ad9fbdf72cd2d72c2d74c995757c15fc" alt=""
- Click ADD URL Enter values of the variables required to scrape data from the source website
- Name - A unique name for the Table
- URL - Website URL
- xPath - Location with information (table) on the webpage / full XPath
- Header-Index - Default (0)
- Skip-Begin - Default (1) To skip the Header
- Skip-End - Default (0)
To locate the xPath value -
Right click on the webpage and click Inspect
In the window that opens up locate table-summary, right click on it and then click on Copy and Copy full xPath
data:image/s3,"s3://crabby-images/49879/49879f341e45c0791251a465beff8b0fe3045f71" alt=""
- Click Submit The table will be added.
- Click View tables
data:image/s3,"s3://crabby-images/86fee/86feedaba15fbc7973772deee3442df97b5c9cc2" alt=""
- In the Web Scraping screen click Confirm to submit the table
data:image/s3,"s3://crabby-images/38ce5/38ce54e1f92c26b201274dde6a76084fc0da76b1" alt=""
- The saved Data source appears under Datasets
data:image/s3,"s3://crabby-images/dda6b/dda6b08f52acb4b3a7f0ecc59c4c57dcf1e6d264" alt=""