Data Table Extraction (Pdf to Excel)

This robot extract multiple data tables from a Pdf file and write into the different sheets in an excel file.

Features:

  • Quicker and accurate way of extracting data tables from a PDF and writing it to an excel sheet

  • Excel sheets will be created according to the number of data tables in the pdf file.

  • Easy and faster method to extract and write all data tables of a pdf file to consecutive excel worksheets without any human interactions.

Work Flow Diagram:

Working:

  • Select File activity can be used to Select the input PDF file from which the data need to be extracted.

  • We have to use the Select file activity again to assign the file path for the output excel file.

  • Extract Table from PDF activity extract tables from a user specified PDF to the excel format. We get multiple of data tables if the pdf contains multiple number of tables.

  • Open Excel File activity create and open the excel file in which the data needs to be written.

  • Next, we can use For Each activity. It takes an item from the data table collection, and uses it to execute the rest of the steps until it encounters the loop end. When the execution reaches the loop end it will check if all items in the collection are processed, the next activity is executed.

  • We can use Counter activity together with the increment activity in order to get the count of iteration. Counter activity is used before the loop and the increment activity is used inside the loop.

  • The output of the counter activity can be converted to text by using convert to text activity

  • To write the data from the obtained data table from the output of the Extract Table from PDF activity into the excel sheet, write range activity can be used.

  • End loop activity will mark the end of the loop, if there are no more items to process in the collection.

  • Close Excel File activity closes the specified excel file in Open Excel File activity

INPUT:

The input of this robot is a Pdf file that contain data tables.

OUTPUT

The robot saves the output as an excel file and based on the number of data table, robot creates excel sheets and writes the data table in to it.

Last updated