Data Table Extraction (PDF to Excel)

This robot extracts multiple data tables from a PDF file and writes each one to a separate sheet in an Excel file.

Features:

  • A quick and accurate way to extract data tables from a PDF and write them to an Excel file.

  • One Excel sheet is created for each data table found in the PDF file.

  • All data tables in the PDF file are written to consecutive Excel worksheets without any human interaction.

Workflow Diagram:

Workflow

Working:

  • Use the Select File activity to choose the input PDF file from which the data is to be extracted.

Property pane of Select File activity
  • Use the Select File activity again to set the file path of the output Excel file.

Property Pane of Select File Activity
  • The Extract Table from PDF activity extracts tables from a user-specified PDF to the Excel format. If the PDF contains more than one table, the output is a collection of data tables.

Property Pane for Extract table from pdf activity
  • The Open Excel File activity creates and opens the Excel file to which the data will be written.

Property pane of Open Excel File activity
  • Next, use the For Each activity. It takes one item at a time from the data table collection and runs the steps that follow until it reaches the loop end. At the loop end, it checks whether all items in the collection have been processed: if not, the loop repeats with the next item; if so, the activity after the loop is executed.

Property pane of For Each activity
  • Use the Counter activity together with the Increment activity to keep count of the iterations. The Counter activity is placed before the loop, and the Increment activity is placed inside the loop.

Property pane of create counter activity
Property pane of increment activity
  • The output of the Counter activity can be converted to text with the Convert to Text activity.

Property pane of Convert to text activity
Property pane of Get workbook sheet activity.
  • To write each data table returned by the Extract Table from PDF activity to an Excel sheet, use the Write Range activity.

Property pane of write range activity
  • The End Loop activity marks the end of the loop. The loop exits here once there are no more items to process in the collection.

Property pane of close loop Activity
  • The Close Excel File activity closes the Excel file that was opened by the Open Excel File activity.

Property pane of close excel file activity
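
The loop described above (counter created before the loop, one sheet written per table, counter incremented inside the loop) can be sketched in plain Python. This is only an illustration of the control flow, not the robot's actual implementation: the function name write_tables_to_sheets, the starting counter value of 1, the "Sheet" name prefix, and the use of a dictionary in place of a real Excel workbook are all assumptions made for the example.

```python
from typing import Dict, List

# One extracted table: a list of rows, each row a list of cell values.
Table = List[List[str]]


def write_tables_to_sheets(tables: List[Table]) -> Dict[str, Table]:
    """Hypothetical stand-in for the For Each / Counter / Write Range steps."""
    workbook: Dict[str, Table] = {}  # stands in for the opened Excel file
    counter = 1  # Counter activity, created before the loop (start value assumed)
    for table in tables:  # For Each over the data table collection
        # Convert to Text: the counter becomes part of the sheet name.
        # The "Sheet" prefix is an assumption, not taken from the robot.
        sheet_name = "Sheet" + str(counter)
        # Write Range: copy the table's rows into that sheet.
        workbook[sheet_name] = [list(row) for row in table]
        counter += 1  # Increment activity, inside the loop
    return workbook  # loop end reached with no items left to process


# Two made-up tables standing in for the output of the extraction step.
tables = [
    [["Name", "Qty"], ["Bolt", "10"]],
    [["Region", "Sales"], ["North", "250"], ["South", "300"]],
]
workbook = write_tables_to_sheets(tables)
print(list(workbook))  # ['Sheet1', 'Sheet2']
```

Because the counter advances once per table, the number of sheets always equals the number of tables extracted, which matches the behavior described in the Features list.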

INPUT:

The input to this robot is a PDF file that contains data tables.

Input PDF file

OUTPUT:

The robot saves the output as an Excel file. It creates one sheet for each data table and writes each table into its own sheet.

Output Excel sheet
Output Excel sheet
