Run Extractor
Extracts data from a document
The Run Extractor activity takes an extractor definition file and a preprocessed document as its inputs. It performs document extraction in which the required data is extracted from the document according to the extractor definition. It outputs an ExtractionResult variable which contains the extracted data.
If the user provides a ClassificationResult output for the document from Classify Document activity and if the extractor definition is multi-class then the activity only extracts the data defined under the class predicted in ClassificationResult.
Input
Processed Document :
ProcessedDocument VariableRequiredThe preprocessed document from the preprocess document activity.Extractor Definition :
String ArgumentRequiredPath to the classifier definition file (*.extractor.vdai).Document AI Client :
DocAIClient VariableRequiredThe configuredDocument AI Client.Classification Result :
ClassificationResult VariableRequiredTheClassificationResultif the user has classified the document and the definition is multi-class.
Options
Split Pages :
RequiredIftrueand the document has multiple pages then the extraction is applied on each page independently. Available values are :TrueFalse
Output
Extraction Result :
ExtractionResult VariableRequiredThe result of the document extraction.
Last updated