Class PDFExtractor

    • Method Detail

      • extractInvoices

        public List<ExtractedPDF> extractInvoices​(List<InvoiceSplitterV1InvoicePageGroup> pageIndexes,
                                                  boolean strict)
                                           throws IOException
        Extract invoices from the given page indexes (from an invoice-splitter prediction).
        Parameters:
        pageIndexes - List of page indexes.
        strict - Whether the extraction should strictly follow the confidence scores or not.
        Returns:
        a list of extracted files.
        Throws:
        IOException - Throws if the file can't be accessed.