src.preprocessing.pdf_to_text
Extract text from PDFs.
Classes
- PdfToText(files): Extract the text from a list of PDF files.
- class src.preprocessing.pdf_to_text.PdfToText(files)
Extract the text from a list of PDF files.
- Parameters:
files (list[Path]) – The list of PDF files to extract the text from.
- files: list[Path]
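
A minimal sketch of how the files argument might be assembled and passed to the constructor. This page documents only the constructor signature and the files attribute, so the PdfToText defined below is a hypothetical stand-in that mirrors just those two things. It is not the project's implementation and performs no extraction. The collect_pdfs helper is likewise an assumption introduced for illustration, and the annotations assume Python 3.9 or later.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass
class PdfToText:
    """Hypothetical stand-in: mirrors only the documented constructor and attribute."""

    files: list[Path]


def collect_pdfs(directory: Path) -> list[Path]:
    """Hypothetical helper: PDF files directly inside `directory`, sorted by path."""
    return sorted(p for p in directory.glob("*.pdf") if p.is_file())


# Build the list[Path] value that the `files` parameter expects.
pdf_files = collect_pdfs(Path("."))

# Construct the object; the paths are then available on the `files` attribute.
extractor = PdfToText(files=pdf_files)
print(f"{len(extractor.files)} PDF file(s) queued")
```

In a real project the stand-in would be replaced by an import of PdfToText from src.preprocessing.pdf_to_text. Sorting the paths keeps the input order reproducible across runs, since the order returned by glob is not guaranteed.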