About GroupDocs.Search for Java

Document Search and Indexing

GroupDocs.Search for Java lets you run text searches across your documents. You can create and merge multiple indexes, then search them with simple, boolean, regular expression (regex), fuzzy, and other query types. Because GroupDocs.Search for Java supports all popular file formats, you can retrieve information from files, documents, emails, and archives.
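To make the idea of creating, merging, and querying indexes more concrete, here is a minimal sketch of a toy inverted index. It is not the GroupDocs.Search API and says nothing about how the library is implemented; the class name TinyIndex and its methods (add, merge, search, searchAll) are hypothetical names chosen for this illustration only.

```java
import java.util.*;

// Toy inverted index. Illustration only: NOT the GroupDocs.Search API.
// All names here (TinyIndex, add, merge, search, searchAll) are hypothetical.
class TinyIndex {
    // Maps each term to the names of the documents that contain it.
    private final Map<String, Set<String>> postings = new HashMap<>();

    // Split text on non-alphanumeric characters and record each lower-cased term.
    void add(String docName, String text) {
        for (String term : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!term.isEmpty()) {
                postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docName);
            }
        }
    }

    // Merge another index into this one by unioning the document sets per term.
    void merge(TinyIndex other) {
        other.postings.forEach((term, docs) ->
            postings.computeIfAbsent(term, k -> new TreeSet<>()).addAll(docs));
    }

    // Simple query: documents that contain a single term (case-insensitive).
    Set<String> search(String term) {
        return new TreeSet<>(
            postings.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptySet()));
    }

    // Boolean AND query: documents that contain every given term.
    Set<String> searchAll(String... terms) {
        Set<String> result = null;
        for (String term : terms) {
            Set<String> docs = search(term);
            if (result == null) {
                result = docs;
            } else {
                result.retainAll(docs);
            }
        }
        return result == null ? new TreeSet<>() : result;
    }

    public static void main(String[] args) {
        TinyIndex first = new TinyIndex();
        first.add("report.txt", "Quarterly sales report");

        TinyIndex second = new TinyIndex();
        second.add("memo.txt", "Sales memo for the team");

        first.merge(second);
        System.out.println(first.search("sales"));            // [memo.txt, report.txt]
        System.out.println(first.searchAll("sales", "memo")); // [memo.txt]
    }
}
```

A production index also has to handle text extraction from each file format, persistence, and the richer query types listed above; the sketch only shows why merging indexes and combining terms are cheap once documents have been indexed.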

Supported File Formats

Popular Office Formats

  • Portable: PDF
  • Word: DOC, DOCX, DOCM, DOT, DOTX, DOTM
  • Excel: XLS, XLSX, XLSM, XLT, XLTX, XLTM, XLSB, XLA, XLAM, CSV, TSV
  • PowerPoint: PPT, PPTX, POT, POTX, PPS, PPSX, PPTM, PPSM, POTM
  • OpenDocument: ODT, ODP, ODS, OTT, OTS
  • Text: TXT, RTF

Media Formats

  • Popular image formats: BMP, JP2, PNG, EMF, WMF, JPG, PSD
  • Multi-page images: GIF, WEBP, TIFF
  • Audio: MP3, WAV
  • Video: AVI, MOV, QT, FLV, ASF

Other

  • Email: PST, OST, MSG, EML, EMLX
  • Microsoft Visio: VSD, VSS
  • Web: XML, HTM, HTML, XHTML, MHT, MHTML
  • Others: TORRENT, ZIP, DCM, DJVU, EPUB, FB2

GroupDocs.Search for Java Features

  • Customizable Search Parameters - Refine searches with date ranges and case-sensitivity options.
  • Enhanced Spell Check - Search with spell checking and wildcards, and optionally ignore special characters.
  • Filtered Search Results - Apply filters to focus search results based on specific document types or criteria.
  • Import and Export Index Data - Easily import data for indexing or export results to files for further use.
  • Skip Unneeded Files - Optimize indexing by excluding specific files or words.
  • HTML and URL Processing - Extract HTML content to files and generate URLs for navigation through search results.
  • Fast Search in Large Indexes - Speed up search operations by dividing large indexes into manageable chunks.
  • Stream-Based Indexing - Index data directly from streams or data structures.
  • Handle Misspelled Queries - Detect misspellings and suggest alternative words for better search accuracy.
  • Comprehensive Archive Support - Index nested archives and retrieve detailed lists of files within ZIP files.
  • Space-Saving Indexing - Compact indexes to save disk space, and process password-protected files.
  • Custom Synonym Support - Expand the synonym dictionary to enhance search accuracy with tailored options.
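
The misspelling and fuzzy-search features above rest on some measure of how close two words are. As a hedged illustration, the sketch below uses Levenshtein edit distance to suggest dictionary words near a misspelled query term. This is a commonly used technique, not a statement about the algorithm GroupDocs.Search uses internally; the class SpellingSuggester and its methods are hypothetical names introduced for this example.

```java
import java.util.*;

// Illustration only: NOT the GroupDocs.Search API or its internal algorithm.
// SpellingSuggester, editDistance, and suggest are hypothetical names.
class SpellingSuggester {
    // Levenshtein distance: the minimum number of single-character insertions,
    // deletions, and substitutions needed to turn a into b.
    static int editDistance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // turning "" into the first j characters of b takes j insertions
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // turning the first i characters of a into "" takes i deletions
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Return the dictionary words within maxDistance edits of word, sorted alphabetically.
    static List<String> suggest(String word, Collection<String> dictionary, int maxDistance) {
        List<String> suggestions = new ArrayList<>();
        for (String candidate : dictionary) {
            if (editDistance(word, candidate) <= maxDistance) {
                suggestions.add(candidate);
            }
        }
        Collections.sort(suggestions);
        return suggestions;
    }

    public static void main(String[] args) {
        List<String> dictionary = Arrays.asList("search", "merge", "index");
        System.out.println(suggest("serch", dictionary, 1)); // [search]
    }
}
```

Raising maxDistance tolerates more typing errors at the cost of more false matches, which is the usual trade-off when tuning a fuzzy search.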