openPR Logo
Press release

BFO adds text extraction to PDF Library

10-27-2005 04:17 PM CET | IT, New Media & Software

Press release from: BFO

London, England, 27 October 2005, - BFO (Big Faceless Organization), a global supplier of java reporting solutions, strengthens the acclaimed Big Faceless PDF Library with the addition of text and image extraction.

The 2.6.2 release adds the ability to extract text and bitmap images from PDF documents, as well as index the PDF using the Apache Lucene search engine. The library extracts and indexes text in Unicode from the form fields, annotations and document metadata as well as the document body, and at roughly 50 pages a second for large documents.

Speed and accuracy of text extraction coupled with the existing features of the PDF Library makes it a wise choice for developers involved in data mining, content management systems and form processing environments. As well as being beneficial in settings that require the ability to search or extract text from large numbers of PDF files.

Text and image extraction requires the Big Faceless PDF Library Extended Edition plus Viewer license, which can be downloaded from BFO’s website.

About BFO: BFO is a leading global provider of Java based reporting solutions founded in 1998. They produce a stable of robust Java components for the international B2B market. Such components include Report Generator, Graph and PDF Library. Report Generator comprises both Libraries and converts XML to PDF documents. Using JSP, ASP or similar technology, it is possible to create dynamic PDF reports as quickly and easily as HTML.

This release was published on openPR.

Permanent link to this press release:

Copy
Please set a link in the press area of your homepage to this press release on openPR. openPR disclaims liability for any content contained in this release.

You can edit or delete your press release BFO adds text extraction to PDF Library here

News-ID: 389 • Views:

More Releases from BFO

PDF Library introduces consolidated logging
London, England, 08 July 2009, - Big Faceless Organization (BFO), provider of high quality Java software components, have enhanced the PDF library by introducing consolidated logging. Version 2.11.6 of the Big Faceless PDF Library also includes a thread clean up. Previously log messages were written to System.err. If problems were encountered PDFs could throw a large number of messages, and although these could always be turned off, there was no way
Superior Text Extraction with the Big Faceless Java PDF Library and Viewer
London, England, 19 March 2007, - Big Faceless Organization (BFO), has released a new version of their market leading Java PDF Library. Version 2.7.8 provides significant improvements to clients using the PDF Viewer extension to build interactive viewers. CTO Mike Bremford, says “significant modifications have been made to the PageExtractor class. Text that was previously being extracted as individual letters or smaller groups are now being reassembled where possible into
BFO and Hallmark deliver interactive Christmas
London, England, 11 December 2006, - Big Faceless Organization (BFO), provider of high quality Java software components, are excited to add Venspro, Holland’s leading Marketing and PR Consultancy, to their enviable client base. Venspro purchased the Big Faceless PDF Library Extended Edition, the smartest Java class library for creating, editing, displaying and printing Acrobat PDF documents. The Extended Edition allows users to load and edit existing PDF documents as templates
Render Reports Rapidly with BFO
London, England, 21 September 2006, - Big Faceless Organization (BFO), provider of high quality Java software components, are delighted to announce the simultaneous release of updated editions of the Big Faceless Report Generator 1.1.32 and Big Faceless PDF Library 2.7.2. Version 1.1.32 of the Report Generator provides significant improvements to the speed of text rendering. Text rendering will now be between 5% and 10% faster! The Report Generator is built on

All 5 Releases


More Releases for PDF

Damaged PDF Viewing Solution: Corrupt PDF Viewer
Today PDFFixer.com released their new freeware program Corrupt PDF Viewer, which allows users to open and view the corrupt PDF content instantly, and repairs damaged PDFs by saving to new files or printing. Users may get the "file has been damaged" or the "this is a corrupt PDF" error messages while opening a PDF document with PDF reader programs, that means the PDF file is damaged and cannot be opened. Corrupt
All-About-PDF: Versatile PDF Toolkit for Windows
We would like to announce the available of All-About-PDF for Windows. With All-About-PDF, you can quickly: - Convert PDF documents to MS Word, MS Excel, MS PowerPoint, HTML and JPG Images - Set PDF documents to expire after a certain date - Merge multiple PDF documents into a single document - Split a single PDF document into multiple files based of page numbers or text search - Use Watch Folders to convert PDF documents to
BatchOutput PDF Now Supports macOS Mojave: PDF Automation Tool
Zevrix Solutions releases BatchOutput PDF 2.2.30, a compatibility update to company's PDF printing automation solution for Mac. The app prints PDF from watched hot folders. With BatchOutput PDF, users only need to drop PDFs into hot folders and the files will be printed automatically using the assigned output settings. The app saves users significant time and effort of printing PDF files manually. The new version introduces support for macOS 10.14
PDF to PPTX, PDF/A, TIFF Conversions With Enhanced PDF Text Replacement using .N …
What's New in this Release? Aspose team is pleased to announce the release of Aspose.Pdf for .NET 17.7.0. This release provides better inter file format conversion. In this release, It has specifically focused on the improvement of PDF to PDF/A conversion, PDF to TIFF, PDF to PPTX, Form fields flattening, Text replacement and much more. This release majorly contains fixes related to inter file format conversion as some of the customers
Latex to PDF Conversion, Enhanced PDF to DOC/DOCX & HTML to PDF Conversion in Ja …
What's New in this Release? Aspose team is pleased to announce the release of Aspose.Pdf for Java 17.1.0. This version includes a new feature to convert Latex file to PDF document along with all the enhancements and improvements introduced in its corresponding version of Aspose.Pdf for .NET. This version has introduced Latex to PDF conversion feature. Aspose team has included a LatexLoadOptions class to load Latex files in Aspose.Pdf DOM
PDF/A_2U Standard Support, PDF Size Optimization & Rendering XML Stream to PDF u …
What's New in this Release? Aspose team is pleased to announce the release of Aspose.Pdf for .NET 16.12.0. A new feature, the support of PDF/A_2U standard along with number of enhancements and improvements are included in this release. Some of the enhancements are improved PDF Optimization and support of XML stream conversion to PDF. It also contains number of fixes of bugs reported in previous versions by Aspose valued customers, that