This page has been archived and is no longer updated.

This product is no longer for sale.

PDFlib TET PDF IFilter (English edition)

Extract text and metadata from PDF documents

A product by PDFlib
Sold in Japan through ComponentSource since 2003.

Version: 5.5 Updated: Jan 12, 2024

Please note: PDFlib TET PDF IFilter was officially retired as of December 19, 2024. If you are interested in this product, consider PDFlib instead.

PDFlib TET PDF IFilter 5.3

Released: May 4, 2021

Updates in 5.3

Features

  • Optimized PDF resource handling to improve performance for documents with excessive numbers of images, patterns or other resources.
  • Security and performance updates of all third-party components.
  • Hardened processing of damaged and illegal PDF documents by testing against the full Issue Tracker PDF corpus, which contains tens of thousands of stressful PDF files.
  • Expanded platform and CPU support including macOS on ARM64 and Linux on ARM64.
  • A timeout can be specified to limit processing time for large or complex files.
  • Enhancements in all language bindings and updates for the latest language versions including .NET 5, PHP 8, Perl 5.32 and Ruby 3.0.
  • Support for native UTF-8, UTF-16 and UTF-32 Unicode strings in C++17 and C++20.
  • Implemented detection of certain kinds of attacks that use legal PDF constructs with overly large data structures.
  • Improved TETML output for edge cases.
  • Improved word boundary, list and paragraph detection.
  • Support for Unicode 13.
  • Improved performance for the Classic .NET binding.
  • Many minor bug fixes and improvements.
  • Updated CJK character collections and CMaps for PDF 2.0.