Configure Scrape Data Window

Configures the scrape data definition.

This window allows you to define how data should be extracted from a webpage. A definition consists of columns, a Next button selector, and the total page count.

Columns:

Each column represents a specific data field you want to extract. For each column, you can specify a list of elements that help identify a pattern within the webpage. This pattern is then used to extract the data belonging to that column. You can add new columns using the "Add" button and update existing columns using the "Update" button.
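To illustrate how several selected elements can be turned into one pattern, here is a minimal Python sketch. It is not the actual algorithm used by Visualyze, which this page does not describe. It assumes, purely for illustration, that each element is identified by a simple slash-separated path such as table/tr[1]/td[2]; the names generalize, count_matches, and PAGE_PATHS are hypothetical.

```python
import re
from typing import List


def generalize(paths: List[str]) -> str:
    """Merge example element paths into one pattern.

    Any step whose index differs between the examples becomes "[*]".
    """
    split = [p.split("/") for p in paths]
    if len({len(steps) for steps in split}) != 1:
        raise ValueError("example paths must have the same depth")
    merged = []
    for steps in zip(*split):
        if len(set(steps)) == 1:
            # All examples agree on this step, so keep it unchanged.
            merged.append(steps[0])
            continue
        # Strip the trailing index and check that only the index differs.
        tags = {re.sub(r"\[\d+\]$", "", s) for s in steps}
        if len(tags) != 1:
            raise ValueError("examples differ in more than an index")
        merged.append(tags.pop() + "[*]")
    return "/".join(merged)


def count_matches(pattern: str, page_paths: List[str]) -> int:
    """Count how many element paths on the page fit the pattern."""
    regex = re.escape(pattern).replace(re.escape("[*]"), r"\[\d+\]")
    return sum(1 for p in page_paths if re.fullmatch(regex, p))


# Hypothetical page: a table with 3 rows and 2 cells per row.
PAGE_PATHS = [f"table/tr[{r}]/td[{c}]" for r in range(1, 4) for c in range(1, 3)]

# One selected element matches only itself.
one_example = generalize(["table/tr[1]/td[2]"])
# A second selected element generalizes the row index.
two_examples = generalize(["table/tr[1]/td[2]", "table/tr[3]/td[2]"])
```

In this sketch, a single selected element yields a pattern that matches one row, while selecting a second element from the same table column widens the pattern to every row. That mirrors how adding elements to a column can raise the number of rows identified.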

Next Button Selector and Pages:

By default, only the current page is scraped. However, you can enable scraping on multiple pages by setting the next button selector and specifying the total page count. The next button selector allows the automation to navigate to the next page, and the pages value determines the total number of pages to be scraped.
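The multi-page behavior described above can be sketched as a simple loop. This is an illustrative Python model only, not the product's implementation. The function scrape_pages, its parameters, and the FakeSite class are hypothetical, and stopping early when no next page is available is an assumption of this sketch.

```python
from typing import Callable, List, Optional


def scrape_pages(
    scrape_current: Callable[[], List[dict]],
    click_next: Optional[Callable[[], bool]] = None,
    pages: int = 1,
) -> List[dict]:
    """Collect rows from up to `pages` pages, starting with the current one."""
    rows: List[dict] = []
    # Without a next button selector, only the current page is scraped.
    total = pages if click_next is not None else 1
    for page_index in range(total):
        rows.extend(scrape_current())
        if page_index == total - 1:
            break  # Page count reached; do not navigate further.
        if not click_next():
            break  # Assumption: stop when there is no next page.
    return rows


class FakeSite:
    """Stand-in for a paginated website, used only to exercise the loop."""

    def __init__(self, pages: List[List[dict]]):
        self.pages = pages
        self.position = 0

    def scrape_current(self) -> List[dict]:
        return list(self.pages[self.position])

    def click_next(self) -> bool:
        if self.position + 1 >= len(self.pages):
            return False
        self.position += 1
        return True


site = FakeSite([[{"title": "A"}], [{"title": "B"}], [{"title": "C"}]])
rows = scrape_pages(site.scrape_current, site.click_next, pages=2)
```

Here the pages value caps the total number of pages visited, including the current one, so pages=2 collects rows from the first two pages of the three-page example site.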

User Interface

  • Add Button: Clicking on this button allows you to add a new column. You can select the element that represents the data for that column.

  • Update Button: This button allows you to update an existing column. It helps in modifying the pattern used to capture the data.

  • Delete Button: This button deletes a column.

  • Next Button Selector Button: Clicking on this button enables you to select the 'Next' button on the webpage, which facilitates navigation to the next page of data.

  • Pages: This field allows you to set the total page count for scraping.

Selecting an Element

When selecting an element, the Count displayed indicates the total number of rows identified for the column. Using the update button adds more rows to the column, allowing you to refine the pattern.

Count doesn't indicate the number of rows that will be scraped when the automation is executed.

Pressing the Esc key on the keyboard cancels the selection.

To perform element selection, click on the "Add," "Update," or "Next Button Selector" buttons. This initiates the element selection process in the web browser. Please make sure that the Visualyze Web Automation extension is installed in your browser for this functionality to work.

Using the Configure Scrape Data window, you can define and customize how data is extracted for web scraping tasks.
