GroupDocs.Classification for .NET について

.NETアプリケーション開発用のテキストと文書の分類API

GroupDocs.Classification for .NET is a document and text classification API for C#, ASP.NET, VB.NET, J# or any other .NET based application. Developers can work with four different types of taxonomies to perform advanced classification, either by using IAB-2 for assigning standardized text categories to text, document taxonomy developed by Aspose for different document types or Sentiment (and Sentiment3) for sentiment analysis. The library analyses text, sentences, even words and supports classifying a variety of industry standard document formats including PDF, Microsoft Word, OpenDocument, RTF and text. Sentiment analysis (classification) supports English, Chinese, Spanish, and German languages with language auto-detection. GroupDocs.Classification for .NET uses its own document processing engine and does not require any external tools be installed on the system.

Supported file formats

Microsoft Office Formats

  • Word: DOC, DOCM, DOCX, DOT, DOTM, DOTX, RTF

OpenDocument & Other Formats

  • OpenOffice: ODT, OTT
  • Fixed Layout: PDF
  • Other: TXT

GroupDocs.Classification Features
Classify text and documents using advanced taxonomies and options.

  • Multiple Taxonomies - Supports IAB-2, Document, and Sentiment taxonomies for versatile classification.
  • Multi-Language Support - Perform sentiment classification in both English and Chinese.
  • Customizable Results - Specify the number of classification results to return.
  • Precision Control - Adjust precision/recall balance for Documents taxonomy classification.
  • Multiple File Formats - Compatible with various document formats including PDF, DOC, DOCX, RTF, and TXT.
  • Easy Integration - Seamlessly integrate with any .NET application, including ASP.NET and Windows apps.