The London Perl and Raku Workshop takes place on 26th Oct 2024. If your company depends on Perl, please consider sponsoring and/or attending.

NAME

Renard::Block::Format::PDF::Document - document that represents a PDF file

VERSION

version 0.005

EXTENDS

CONSUMES

METHODS

get_rendered_page

  method get_rendered_page( (PageNumber) :$page_number )

See Renard::Incunabula::Document::Role::Renderable.

get_textual_page

  method get_textual_page( (PageNumber) $page_number ) :ReturnType(InstanceOf['String::Tagged'])

Returns a String::Tagged representation of the PDF textual data for a given page. The return value contains tags that indicate the extent of each level as defined by Renard::API::MuPDF::mutool::get_mutool_text_stext_xml:

  • page,

  • block,

  • line,

  • span, and

  • char

The values associated with these tags can be used to find the bounding box for the symbols on the page.

AUTHOR

Project Renard

COPYRIGHT AND LICENSE

This software is copyright (c) 2017 by Project Renard.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.