This plugin provides file utility pipelets:
- GateTextExtractionPipelet - parse source text and extract metadata annotations
Overview
| Name |
GATE Pipelet |
| Vendor |
brox IT-Solutions GmbH |
| Authors |
|
| Homepage |
http://www.brox.de |
| Issue Management |
http://support.eccenca.com |
| Continuous Integration |
n/a |
| Categories |
Pipelet |
| Most Recent Version (see older versions) |
Version 0.5.0 |
| Availability (see older versions) |
|
| State |
|
| Support |
|
| License |
GNU LIBRARY GENERAL PUBLIC LICENSE Version 2, June 1991
Pipelet: Freeware eccenca Component License |
| Price |
Free |
| Release Docs |
|
| Java API Docs |
n/a |
| Download Source |
|
| Download JAR |
|
It's a text-extraction pipelet which can extract the annotations from the text based on the rules configured using the GATE (General Architecture for Text Engineering).
Example of using GateTextExtractionPipelet is covered in How to use GateTextExtractionPipelet
How to configure it.
Configuration properties
- Annotation - a list of annotations that should be extracted by GATE
- Source - a list of sources from Record (attributes or attachments)
- SourceType - a list of source types (ATTRIBUTE or ATTACHMENT), if only one specified then its type for all Sources
- Target - a list of target Attributes for result of GATE processing, the quantity should be the same as Source
- CopySource - boolean parameter that specify should source text be copied also in target attribute
- Plugin - the name of GATE plug-in used (subfolder of org.eccenca.smila.pipelets.textextraction.gate/plugins in the configuration); Now GATE shipped with one plug-in "eccenca"
If you experience any problems with this pipelet, please go to the
support system
Frequently Asked Questions about this extension.