How can I extract table rows from a document?
You can extract tables from documents by creating either a Table Rows or a Line Items parsing rule. The following screencast gives a quick overview of how to create a PDF table extraction parsing rule. Further below you'll find a more detailed step-by-step guide.
1) Create a new parsing rule
Navigate to the Rules tab, click on Add Parsing Rule, and choose Table Data when prompted for the type of data you want to extract.
2) Define the table borders
The first step is to visually define where the table is located inside your document.
- Move the existing column separators so that they fit the column borders of your table
- Add as many column separators as needed with the + buttons to the left and to the right of the screen
- You can also define the area where the table is located by clicking and cropping a selection over the area on the page. A defined table with column separators is shown below:
Please note: Cropping an area selection is not always necessary, and sometimes it is not recommended. If your table spans several pages, it is usually better not to crop an area selection in this step. Instead, just set the column dividers and add table row filters in the next step (see below) to extract the table data you need.
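To make the idea of column separators more concrete, here is a minimal Python sketch of what cutting text lines into cells at fixed positions could look like. It is only an illustration and not how the parser is implemented; the function name `split_by_separators`, the sample lines, and the separator offsets are all hypothetical.

```python
def split_by_separators(line, separators):
    """Cut one text line into cells at the given character offsets."""
    # Add the start and end of the line so every cell has two boundaries.
    bounds = [0, *sorted(separators), len(line)]
    return [line[start:end].strip() for start, end in zip(bounds, bounds[1:])]


# Hypothetical sample lines, padded so the columns line up at offsets 12 and 18.
page_lines = [
    f"{'Item':<12}{'Qty':<6}Price",
    f"{'Widget':<12}{'2':<6}9.99",
    f"{'Gadget':<12}{'10':<6}24.50",
]

separators = [12, 18]  # assumed positions of the two column separators

rows = [split_by_separators(line, separators) for line in page_lines]

for row in rows:
    print(row)
```

Because the separators are applied to every line, the same setup keeps working when the table continues on later pages, which is why row filters are often a better fit than a cropped area for long tables.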
3) Refine parsed results
The results of the table extraction become visible after you click CONFIRM in the bottom right. The table you defined visually in the previous step gives you a first result, which you will likely want to refine further. For example, you can choose which rows to keep, filter out unwanted rows, format dates, or remove columns. You refine parsing results by chaining multiple filters on the right side. Clicking 'Add Table Filter' reveals a menu with the available options.
The screenshot below shows the data produced by the selection area and column separators defined above, with one filter added that keeps only the rows where Column #2 has a value.
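Conceptually, a chain of table filters behaves like a series of functions applied one after another to the extracted rows. The Python sketch below illustrates this with a "keep rows where Column #2 has a value" step; it is an assumption-laden illustration, not the product's implementation, and the names `keep_rows_with_value`, `apply_filters`, and the sample rows are hypothetical.

```python
def keep_rows_with_value(rows, column_number):
    """Keep only rows whose given column (1-based) is not empty."""
    index = column_number - 1
    return [row for row in rows if len(row) > index and row[index].strip()]


def apply_filters(rows, filters):
    """Run each filter in order, feeding its output to the next one."""
    for table_filter in filters:
        rows = table_filter(rows)
    return rows


# Hypothetical raw extraction result, including rows we do not want.
raw_rows = [
    ["Widget", "2", "9.99"],
    ["Subtotal", "", "19.98"],
    ["Gadget", "10", "24.50"],
    ["", "", ""],
]

filters = [
    lambda rows: keep_rows_with_value(rows, 2),  # Column #2 must have a value
]

refined_rows = apply_filters(raw_rows, filters)

for row in refined_rows:
    print(row)
```

Further steps, such as removing a column or reformatting dates, would simply be added to the `filters` list in the order they should run.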
Once you are happy with the results, select Save Parsing Rule. Don't forget to then queue your file for parsing to view the final result.
For more complex ways of refining tables, see the links below: