Dice Group Gerbil Versions Save

GERBIL - General Entity annotatoR Benchmark

v1.2.2

8 years ago

Features

  • Implemented a better dataset handling (#66).
  • Implemented the checking of entities (#98).
    • Implemented a cache for the entity checking that stores checks for 1 month to increase the performance.
  • Implemented the GSInKB measures
  • Added a domain based sameAs retrieval manager that can call different sameAs retrievers based on the domain of a URI.
    • Added a sameAs retriever that serves as a bridge between Wikipedia and DBpedia.
    • Added certain domains to the properties file for which the HTTP-based same as retriever is used preventing the system from performing unnecessary HTTP requests.
    • Added a SameAsRetriever handling the URI encoding problem.
  • Added a sameAs retriever decorator that implements a crawling strategy instead of asking only once for the first given URI.
  • Refactored the evaluation by implementing a classification of the markings that is done before the main evaluation starts. The confidence scores are considered more directly inside the evaluation at the stage of the F-measure calculation. Thus, the matching counting is only repeated for the single documents with the different confidence values present inside these documents instead of repeating it several times for the complete corpus.
  • Added the FREME NER annotator configuration (#109).
  • Added the FRED annotator configuration (#112).
  • Added a client for the AIDA web service (#107).

Bug Fixes

  • Fixed a bug inside the DBpedia Spotlight client.
  • Fixed a bug that disabled the drawing of the SVG diagrams in Chrome-based browsers.
  • Fixed a bug in the configuration page (#100).
  • Updated the start.sh script.
  • Fixed the problem of unmerged MSNBC entities (#103).
  • Fixed a bug in the time measuring that was caused by the fact, that a single annotator instance could have been blocked by a Semaphore while the time measurement already started (#108).
  • Fixed tooltips in the front end that haven't been shown correctly.
  • Added the gerbil.nif.transfer library version 1.2.1 that fixes communication problems with the FREME NER annotator.

Known Problems

  • Because of the retrieving of sameAs relations as well as the checking of all entity URIs, the evaluation takes more time than in older versions.

gerbil.nif.transfer-v1.2.1

8 years ago

This is the fourth release of the gerbil.nif.transfer library. This simple lib translates a document from java into NIF for being transfered to a webservice or client and retranslates it into a java object. It is part of the GERBIL project and can be used by others to implement a NIF based web service for their own annotators.

  • Added clone methods
  • Implemented a common behaviour regarding the setting of Set<String> attributes
  • All Span implementations that are translated into NIF nodes additionally get the type nif:Phrase

V1.2.1

8 years ago

Changes from 1.2.0 to 1.2.1

  • Added a filter to the D2KB and Entity Typing experiment tasks that are removing all markings that do not (strongly) match the positions of the mentions that have been given in the request (refs #98).

Changes from 1.1.4 to 1.2.0

Features

  • Contains a new benchmark core that is based on URIs instead of Wikipedia IDs (refs #51 and #74). Thus, the evaluation is knowledge base agnostic (refs #88).
  • Implements a better URI matching. Added the concept of well-known knowledge bases to ensure the identification of emerging entities (solves #94).
  • The evaluation is more flexible and open to new metrics (refs #5).
  • Added Task 1 as well as Task 2 of the OKE Challenge as new experiment types.
  • Added Entity Typing (#48) as experiment type and a hierarchical f-measure for its evaluation.
  • Added the new concept of sub tasks. Thus, it can be seen whether the linking or the recognition of an annotator caused the most problems.
  • The single responses of the annotators can be stored in a NIF file for later debugging.
  • Offers backwards compatibility to the BAT-framework
  • Gives a better way to activate and deactivate single annotators and datasets that is based on property files instead of coded java classes (refs #22).
  • Added run time measurement (refs #63).

Bug Fixes

  • Implemented the observing of every single request sent to an annotator. Thus, requests that are not answered in a given time can be canceled (refs #78 and #95).
  • The cheating described in #69 is not possible anymore.

gerbil.nif.transfer-v1.2.0

8 years ago

This is the third release of the gerbil.nif.transfer library. This simple lib translates a document from java into NIF for being transfered to a webservice or client and retranslates it into a java object. It is part of the GERBIL project and can be used by others to implement a NIF based web service for their own annotators.

  • Added scored spans

OKE2015

8 years ago

This is an adapted version of GERBIL that has been used for the OKE Challenge 2015.

Features:

  • Contains a new benchmark core that is not based on the BAT-Framework but on URIs instead of Wikipedia IDs (refs #51). Thus, benchmarking the linkage to other knowledge bases than Wikipedia as well as the identification of emerging entities is now supported.
  • The evaluation is more flexible and open to new metrics (refs #5).
  • Added the concept of well-known knowledge bases to ensure the identification of emerging entities.
  • Added Task 1 as well as Task 2 of the OKE Challenge as new experiment types.
  • Added Entity Typing as experiment type and a hierarchical f-measure for its evaluation.
  • Added the new concept of sub tasks. Thus, it can be seen whether the linking or the recognition of an annotator caused the most problems.
  • Used version 1.1.0 of the gerbil.nif.transfer library and its more detailed datatype hierarchy.
  • Compared to the last version of GERBIL, this adapted version has not all features of the "normal" GERBIL since they are not needed for the OKE Challenge 2015 and their adaption to the new benchmarking core is still work in progress.

Missing Features:

  • C2KB, Rc2KB, Sc2KB and Sa2KB are missing in this release
  • Some adapters (annotators as well as datasets) might be missing

gerbil.nif.transfer-v1.1.0

8 years ago

This is the second release of the gerbil.nif.transfer library. This simple lib translates a document from java into NIF, transfers it to a webservice or client and retranslates it into a java object. It is part of the GERBIL project and can be used by others to implement a NIF based web service for their own annotators.

  • Created a structure of classes making the library more modular and, thus, more reusable
  • Created a clearer hierarchy of markings (see the wiki for details)
  • Extended the Meaning interface to be able to have more than one single URI
  • Added a type inference supporting different ways to express the type of a named entity
  • Extended the IO part of the library

v1.1.3

9 years ago

Features:

  • Added the calculation of correlations of dataset features and annotator performance.
  • Added the new version of Babelfy (pre-release version 1.0). This version does not need a local instance of BabelNet.
  • Added the usage of a small worker library that eases the observing of single tasks.

Bug Fixes:

  • Prepared a better handling of the time measurement. But it is not used by the system, yet. (refs issue #63)
  • Added a new URL for the DEXTER web service
  • Removed the usage of the BAT-Framework's internal cache that caused large problems in combination with GERBIL's structure.

v1.1.2

9 years ago

Bug Fixes:

  • fixed an error in the start script
  • fixed a bug that caused a wrong alignment from Microposts2014 to Gerbil structures (fixes issue #81)
  • update jersey-* from 1.9 to 1.17.1 (fixes issue #83)

v1.1.1

9 years ago

Features:

  • Added a mechanism to the config page that checks the given URL of a NIF based web service before adding it to the list of annotators. (solves issue #80)

Bug Fixes:

  • Synchronized the requests send to the Babelfy service because the client is not thread safe. (related to issue #78)

v1.1.0

9 years ago

Features:

  • Added a better error handling to the DataIDGenerator. Now this class won't throw a NullpointerException if an experiment type or a matching is missing. refs #73
  • Added the mapping for the google analytics file. refs #72
  • Enhanced the performance problem of the database adapter if it has to load the lates experiments.
  • Added a way to use always the same color for a single adapter. Moved the spider diagram java script files into the js folder.
  • Ordered experiment types, annotators and datasets alphabetically.
  • Added the version of the program as property to the properties file.
    • Since the version is set automatically by maven, the property files have been moved into an own folder.Thus, the filtering done by maven will not influence other resource files. refs issue #56

Bug Fixes:

  • Added new URL of gerbil_data.zip
    •  Added the URL of the gerbil_data zip file pointing to version 1.0.0 of GERBIL on GitHub
  • Fixed macrof1 description in json-ld format
  • Added the missing C2W experiment type to the GERBIL vocabulary.
  • Changed the mapping classes.
    • They are adding a suffix to uploaded datasets and NIF based web services that have been added using the configuration screen.
  • Added the missing parent pom for jena.
  • Added the missing jena-iri jar to the local repo.
  • Added a clean shutdown of the database.
  • Added the missing close method to the SimpleLoggingDAO.
  • Added the creation of parent folder for cache files if they are missing.
    • Added a JUnit test for the SingletonWikipediaApi class. refs issue #70
  • Added the logo and icon to the project (instead loading it from another URL).