pl.beone.promena.transformer:page-extractor-pdfbox-root


License

Categories

PDFBox Business Logic Libraries ORM Data PDF
GroupId

pl.beone.promena.transformer
ArtifactId

page-extractor-pdfbox-root
Last Version

1.0.1
Release Date

Type

pom
Description

pl.beone.promena.transformer:page-extractor-pdfbox-root
Project URL

https://beone.pl
Project Organization

BeOne Sp. z o.o.
Source Code Management

https://github.com/BeOne-PL/promena-transformer-page-extractor-pdfbox

Download page-extractor-pdfbox-root

How to add to project

Maven

<!-- https://jarcasting.com/artifacts/pl.beone.promena.transformer/page-extractor-pdfbox-root/ -->
<dependency>
    <groupId>pl.beone.promena.transformer</groupId>
    <artifactId>page-extractor-pdfbox-root</artifactId>
    <version>1.0.1</version>
    <type>pom</type>
</dependency>

Gradle (Groovy DSL)

// https://jarcasting.com/artifacts/pl.beone.promena.transformer/page-extractor-pdfbox-root/
implementation 'pl.beone.promena.transformer:page-extractor-pdfbox-root:1.0.1'

Gradle (Kotlin DSL)

// https://jarcasting.com/artifacts/pl.beone.promena.transformer/page-extractor-pdfbox-root/
implementation("pl.beone.promena.transformer:page-extractor-pdfbox-root:1.0.1")

Buildr

'pl.beone.promena.transformer:page-extractor-pdfbox-root:pom:1.0.1'

Ivy

<dependency org="pl.beone.promena.transformer" name="page-extractor-pdfbox-root" rev="1.0.1">
  <artifact name="page-extractor-pdfbox-root" type="pom" />
</dependency>

Grape

@Grapes(
@Grab(group='pl.beone.promena.transformer', module='page-extractor-pdfbox-root', version='1.0.1')
)

SBT

libraryDependencies += "pl.beone.promena.transformer" % "page-extractor-pdfbox-root" % "1.0.1"

Leiningen

[pl.beone.promena.transformer/page-extractor-pdfbox-root "1.0.1"]

Dependencies

There are no dependencies for this project. It is a standalone project that does not depend on any other jars.

Project Modules

  • application-model
  • implementation
  • example
  • configuration

Promena Transformer - page extractor - PDFBox

This transformer provides functionality to extract a range of pages from application/pdf documents using PDFBox 2.0.16.

Visit Promena#Transformer to understand the repository structure.

Transformation PdfBoxPageExtractorDsl, PdfBoxPageExtractorParametersDsl

The DataDescriptor has to contain at least one descriptor. If more than one descriptor is passed, the transformation will be performed on each of them separately.

Support PdfBoxPageExtractorSupport

Media type PdfBoxPageExtractorSupport.MediaTypeSupport

  • application/pdf; UTF-8 ➡️ application/pdf; UTF-8

Parameters PdfBoxPageExtractorSupport.ParametersSupport

  • pages, List<List<Int>>, optional - each inner list of page numbers (indexed from 1) is extracted to a separate TransformedDataDescriptor
  • splitByBarcodeMetadata, Boolean, optional - extracts pages based on the barcode-detector-metadata produced by barcode detector transformers. When set, the pages between consecutive barcodes are extracted to a separate TransformedDataDescriptor that carries the metadata for that range of pages
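
The grouping behind the two parameters can be illustrated with a small, library-free Java sketch. It does not use the Promena or PDFBox API; the class and method names are hypothetical. It also assumes that a barcode page starts a new group and stays in the output, which the description above does not confirm.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Promena: models only the page grouping.
class PageGroupsSketch {

    /** Converts 1-based page numbers to 0-based indexes, rejecting pages outside the document. */
    static List<List<Integer>> toZeroBasedIndexes(List<List<Integer>> pages, int pageCount) {
        List<List<Integer>> result = new ArrayList<>();
        for (List<Integer> group : pages) {
            List<Integer> indexes = new ArrayList<>();
            for (int page : group) {
                if (page < 1 || page > pageCount) {
                    throw new IllegalArgumentException("Page " + page + " is outside 1.." + pageCount);
                }
                indexes.add(page - 1);
            }
            result.add(indexes); // one group corresponds to one output document
        }
        return result;
    }

    /**
     * Splits pages 1..pageCount into groups. ASSUMPTION: every barcode page
     * opens a new group and is kept as the first page of that group.
     */
    static List<List<Integer>> splitAtBarcodePages(List<Integer> barcodePages, int pageCount) {
        List<List<Integer>> groups = new ArrayList<>();
        List<Integer> current = new ArrayList<>();
        for (int page = 1; page <= pageCount; page++) {
            if (barcodePages.contains(page) && !current.isEmpty()) {
                groups.add(current);
                current = new ArrayList<>();
            }
            current.add(page);
        }
        if (!current.isEmpty()) {
            groups.add(current);
        }
        return groups;
    }

    public static void main(String[] args) {
        // pages = [[1, 2], [4]] on a 5-page document
        System.out.println(toZeroBasedIndexes(
                Arrays.asList(Arrays.asList(1, 2), Arrays.asList(4)), 5));
        // barcodes detected on pages 1 and 4 of a 6-page document
        System.out.println(splitAtBarcodePages(Arrays.asList(1, 4), 6));
    }
}
```

In this sketch each resulting group stands for one TransformedDataDescriptor; copying the actual pages would be done by the transformer itself.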

Dependency

<dependency>
    <groupId>pl.beone.promena.transformer</groupId>
    <artifactId>page-extractor-pdfbox-configuration</artifactId>
    <version>1.0.1</version>
</dependency>

promena-docker-maven-plugin

<dependency>
    <groupId>pl.beone.promena.transformer</groupId>
    <artifactId>page-extractor-pdfbox</artifactId>
    <version>1.0.1</version>
</dependency>

Properties

transformer.pl.beone.promena.transformer.pageextractor.pdfbox.PdfBoxPageExtractorTransformer.priority=1
transformer.pl.beone.promena.transformer.pageextractor.pdfbox.PdfBoxPageExtractorTransformer.actors=1

transformer.pl.beone.promena.transformer.pageextractor.pdfbox.settings.memoryUsageSetting=org.apache.pdfbox.io.MemoryUsageSetting::setupMainMemoryOnly
transformer.pl.beone.promena.transformer.pageextractor.pdfbox.settings.fallbackMemoryUsageSetting=org.apache.pdfbox.io.MemoryUsageSetting::setupTempFileOnly

transformer.pl.beone.promena.transformer.pageextractor.pdfbox.default.parameters.split-by-barcode-metadata=true
transformer.pl.beone.promena.transformer.pageextractor.pdfbox.default.parameters.timeout=
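
The memoryUsageSetting values have the form Class::method and the default timeout is left empty. The following Java sketch shows one way such entries could be read with java.util.Properties. How Promena itself resolves them is not stated here, so the class name, the helper methods and the reading of an empty timeout as "no default" are assumptions made for illustration.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

// Hypothetical reader, not part of Promena.
class TransformerPropertiesSketch {

    static final String PREFIX =
            "transformer.pl.beone.promena.transformer.pageextractor.pdfbox.";

    /** Splits "fully.qualified.Class::method" into {className, methodName}. */
    static String[] parseMethodReference(String value) {
        int separator = value.indexOf("::");
        if (separator <= 0 || separator + 2 >= value.length()) {
            throw new IllegalArgumentException("Expected Class::method but got: " + value);
        }
        return new String[] {value.substring(0, separator), value.substring(separator + 2)};
    }

    /** Loads properties from text in the standard .properties format. */
    static Properties load(String text) {
        Properties properties = new Properties();
        try {
            properties.load(new StringReader(text));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return properties;
    }

    public static void main(String[] args) {
        Properties properties = load(
                PREFIX + "settings.memoryUsageSetting="
                        + "org.apache.pdfbox.io.MemoryUsageSetting::setupMainMemoryOnly\n"
                + PREFIX + "default.parameters.timeout=\n");

        String[] reference =
                parseMethodReference(properties.getProperty(PREFIX + "settings.memoryUsageSetting"));
        System.out.println(reference[0] + " -> " + reference[1]);

        // ASSUMPTION: an empty value means that no default timeout is configured.
        String timeout = properties.getProperty(PREFIX + "default.parameters.timeout");
        System.out.println(timeout.isEmpty() ? "no default timeout" : timeout);
    }
}
```
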

Versions

Version
1.0.1
1.0.0