pdf2docx.page.Pages module
Collection of Page instances.
- class pdf2docx.page.Pages.Pages(instances: list = None, parent=None)
Bases: BaseCollection
A collection of Page instances.
- parse(fitz_doc, **settings)
Analyze document structure, e.g. page section, header, footer.
- Args:
fitz_doc (fitz.Document): PyMuPDF Document instance.
settings (dict): Parsing parameters.
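The call shape of parse(fitz_doc, **settings) can be illustrated with a minimal sketch. The Page and Pages classes below are hypothetical stand-ins written for this example, not the pdf2docx classes, and the example_option key is invented. They only mirror the documented signatures so the snippet runs without pdf2docx or PyMuPDF installed. The real method analyzes document structure; this stand-in merely records the settings on each page.

```python
# Hypothetical stand-ins that mirror the documented signatures only.
# They are NOT the pdf2docx implementation.

class Page:
    """Stand-in for a single page; stores the settings it was parsed with."""

    def __init__(self, index):
        self.index = index
        self.settings = {}


class Pages:
    """Stand-in collection mirroring Pages(instances=None, parent=None)."""

    def __init__(self, instances=None, parent=None):
        # Copy the list so a shared default or caller-owned list is never mutated.
        self._instances = list(instances) if instances else []
        self.parent = parent

    def __len__(self):
        return len(self._instances)

    def __iter__(self):
        return iter(self._instances)

    def parse(self, fitz_doc, **settings):
        # The documented method analyzes sections, headers and footers.
        # This sketch only shows how keyword settings reach every page.
        for page in self._instances:
            page.settings = dict(settings)


# A plain list stands in for the fitz.Document argument (assumption).
fake_doc = ["first page", "second page"]
pages = Pages([Page(i) for i in range(len(fake_doc))])

# example_option is an invented key, used only to show the **settings form.
pages.parse(fake_doc, example_option=True)
```

Because settings is collected through **settings, parameters are passed as keyword arguments, or unpacked from an existing dict with pages.parse(fake_doc, **my_settings).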