Package org.apache.pdfbox.text
package org.apache.pdfbox.text
-
ClassesClassDescriptionLEGACY text calculations which are known to be incorrect but are depended on by PDFTextStripper.This is an stream engine to extract the marked content of a pdf.This class will take a pdf document and strip out all of the text and ignore the formatting and such.internal marker class.wrapper of TextPosition that adds flags to track status as linestart and paragraph start positions.Internal class that maps strings to lists of
TextPosition
arrays.This will extract text from a specified region in the PDF.This represents a string and a position on the screen of those characters.This class is a comparator for TextPosition operators.