This repository was archived by the owner on Feb 22, 2024. It is now read-only.

fusepool/fusepool-sma

Fusepool-SMA

This project is an implementation of a dictionary-matching algorithm as an enhancer engine.

### What is Fusepool SMA?

Fusepool SMA is a pure-Java Apache Stanbol enhancement engine based on the Aho-Corasick string-matching algorithm. It was created as an alternative approach to entity extraction, because a dictionary-based approach is much more flexible than model-based NER solutions. Aho-Corasick is a dictionary-matching algorithm: it locates every occurrence of the entries of a finite set of words and expressions, called the dictionary, within the input text.
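To make the dictionary-matching idea concrete, here is a minimal Java sketch of Aho-Corasick. It is an illustration only, not the code used in Fusepool SMA: the class name `AhoCorasickSketch`, the `Match` type, and the sample dictionary are all assumptions made for this example. Matching here is case-sensitive and character-based, with no word-boundary checks.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Queue;

/** Illustrative Aho-Corasick matcher (hypothetical, not the Fusepool SMA implementation). */
class AhoCorasickSketch {

    /** Trie node: outgoing edges, failure link, and dictionary entries that end here. */
    private static final class Node {
        final Map<Character, Node> next = new HashMap<>();
        Node fail;
        final List<String> outputs = new ArrayList<>();
    }

    /** A dictionary hit: the matched entry and its start offset in the text. */
    static final class Match {
        final String term;
        final int start;

        Match(String term, int start) {
            this.term = term;
            this.start = start;
        }

        @Override
        public String toString() {
            return term + "@" + start;
        }
    }

    private final Node root = new Node();

    AhoCorasickSketch(List<String> dictionary) {
        // Step 1: insert every dictionary entry into a trie.
        for (String term : dictionary) {
            if (term.isEmpty()) {
                continue;
            }
            Node node = root;
            for (char c : term.toCharArray()) {
                node = node.next.computeIfAbsent(c, k -> new Node());
            }
            node.outputs.add(term);
        }
        buildFailureLinks();
    }

    /** Step 2: breadth-first pass that links each node to its longest proper suffix in the trie. */
    private void buildFailureLinks() {
        Queue<Node> queue = new ArrayDeque<>();
        root.fail = root;
        for (Node child : root.next.values()) {
            child.fail = root;
            queue.add(child);
        }
        while (!queue.isEmpty()) {
            Node current = queue.remove();
            for (Map.Entry<Character, Node> edge : current.next.entrySet()) {
                char c = edge.getKey();
                Node child = edge.getValue();
                Node fallback = current.fail;
                while (fallback != root && !fallback.next.containsKey(c)) {
                    fallback = fallback.fail;
                }
                Node target = fallback.next.get(c);
                child.fail = (target != null) ? target : root;
                // Entries that end at the suffix node also end at this node.
                child.outputs.addAll(child.fail.outputs);
                queue.add(child);
            }
        }
    }

    /** Step 3: scan the text once, following failure links on mismatches. */
    List<Match> search(String text) {
        List<Match> matches = new ArrayList<>();
        Node node = root;
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            while (node != root && !node.next.containsKey(c)) {
                node = node.fail;
            }
            node = node.next.getOrDefault(c, root);
            for (String term : node.outputs) {
                matches.add(new Match(term, i - term.length() + 1));
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        // Sample dictionary invented for this example.
        AhoCorasickSketch matcher =
                new AhoCorasickSketch(List.of("asthma", "diabetes", "diabetes mellitus"));
        System.out.println(matcher.search("asthma and diabetes mellitus"));
    }
}
```

Because the dictionary is compiled into the automaton up front, the text is scanned in a single pass regardless of how many entries the dictionary holds, and overlapping entries (such as "diabetes" inside "diabetes mellitus") are all reported.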

### Stanbol Enhancement Engine

The outcome of this project is an OSGi bundle (an enhancement engine) for Apache Stanbol that takes plain, unstructured text as input and outputs RDF triples containing the extracted entities and additional information about the enhancement.

The enhancer engine can contain multiple SMA instances. Each instance is a separate module with its own configuration page in the Stanbol Configuration Manager, so different SMA instances can be part of different enhancer chains.

Currently Fusepool-SMA has one enhancer instance, which uses the Disease Ontology from http://disease-ontology.org/. The instance is named "smaDisease"; this exact name must be used to reference it when configuring a chain.
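Once a chain containing "smaDisease" exists, text can be sent to it over HTTP. The following Java sketch assumes the usual Stanbol enhancer REST endpoint at `/enhancer/chain/<chain name>` and a Stanbol instance on localhost port 8080. The chain name `smaChain`, the class name, and the sample sentence are hypothetical and not defined by this project.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

/** Illustrative client for a Stanbol enhancement chain (names are assumptions). */
class EnhancerRequestSketch {

    /** Builds a POST of plain text to the given chain, asking for RDF in Turtle syntax. */
    static HttpRequest buildRequest(String stanbolBase, String chainName, String text) {
        return HttpRequest.newBuilder(URI.create(stanbolBase + "/enhancer/chain/" + chainName))
                .header("Content-Type", "text/plain")
                .header("Accept", "text/turtle")
                .POST(HttpRequest.BodyPublishers.ofString(text))
                .build();
    }

    public static void main(String[] args) throws Exception {
        // "smaChain" is a hypothetical chain assumed to include the smaDisease engine.
        HttpRequest request = buildRequest(
                "http://localhost:8080", "smaChain", "The patient was diagnosed with asthma.");
        HttpResponse<String> response =
                HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println(response.body());
    }
}
```

Running `main` requires a reachable Stanbol instance with such a chain configured; the response body would then carry the RDF triples produced by the engines in that chain.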

### Install

To clone the repository to your local machine, run:

    git clone https://github.com/fusepool/fusepool-sma.git

To compile the bundle, run:

    mvn install

To deploy the engine to a Stanbol instance running on localhost port 8080, run:

    mvn org.apache.sling:maven-sling-plugin:install
