
GERBIL - General Entity annotatoR Benchmark

qa-v0.2.4

3 years ago
  • Bug Fixes
    • Dependency issues
    • Exception was thrown when no answer was sent (Fixed)
    • Default Language is English, was null before
    • Fixed: QA Matching problems with language tag
    • URL Validator had some problems, fixed by using URI instead (which is more correct).
  • Enhancements
    • #250, #344 - added option to upload dataset and answer file
    • HTTPS instead of HTTP for maven repository
    • More convenient addition of a QA system (removed the need to add the system in NLIWODBasedSystem)
  • Clean Code
    • Added additional F-Measure test
  • Additions
    • #234 Added LCQUAD Dataset
    • QALD 9
    • QALD Generator CLI
    • dbpedia entity V1 added
  • Removals

v1.2.8

3 years ago
  • Several bug fixes
    • Dependency Issues
    • fixed correlation figure bug
    • Relaxed Redirection handling
    • Several Annotators using the Wrong IP
      • Rewritten WAT and TagMe2, now working again (#207)
    • Updated MSNBC Dataset
    • #348
  • Enhancements
    • Added DB Unit Tests
    • Derczynski dataset renamed
    • Maven using HTTPS instead of HTTP
  • Clean Code
    • Added Test Coverage to Codacy and Travis
    • Codacy Clean Code fixes
  • Removals
    • Removed DoSeR since it no longer exists

v1.2.7

4 years ago

Features

  • Added the handling of changes that were applied to the document by the benchmarked system (see #205)
  • Improved database structure to increase performance (#260)
  • Added OpenTapioca annotator (#316)

Bug Fixes

  • Fixed dataset properties
  • Fixed the usage of a wrong matching type for RE
  • Fixed experiment type hierarchy
  • Fixed FOX URL
  • Updated AGDISTIS service URL and renamed it to AGDISTIS/MAG (#257)
  • Added the handling of empty datasets (#295)

v1.2.6

6 years ago

Features

  • Added Tasks
    • Relation Extraction (RE): Extract the Relation to known Entities
    • OKE2018Task4: RE + A2KB
  • Added Datasets
    • OKE 2018 training sets

Bug Fixes

  • OKE 2017 configuration added
  • DBpedia Spotlight URL fixed to new URL
  • The available Microposts datasets may contain lines that consist of an ID only, which caused errors. This is fixed by ignoring those lines. (#206)
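
The Microposts fix above can be pictured with a minimal sketch. It is not GERBIL's actual dataset reader: the class name IdOnlyLineFilter is hypothetical, and the tab-separated layout with the ID in the first column is an assumption made for illustration only.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper for illustration; not part of GERBIL. */
final class IdOnlyLineFilter {

    /**
     * Returns only the lines that carry content beyond a leading ID.
     * Assumption: columns are tab-separated and the first column is the ID.
     */
    static List<String> dropIdOnlyLines(List<String> lines) {
        List<String> kept = new ArrayList<>();
        for (String line : lines) {
            // Split into the ID column and "everything else".
            String[] columns = line.split("\t", 2);
            // Keep the line only if something non-blank follows the ID.
            if (columns.length == 2 && !columns[1].trim().isEmpty()) {
                kept.add(line);
            }
        }
        return kept;
    }
}
```

Dropping such lines silently, instead of failing on them, matches the behaviour described in the bullet; a stricter reader could log the skipped IDs.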

Known Problems

qa-v0.2.0

7 years ago

Features

  • Added tooltips in front-end
  • Added Lucene-index-based sameAs retrieval to improve the runtime of experiments
  • QA Answers can now be logged into csv files for debug comparison (use org.aksw.gerbil.qa.matching.printAnswers=true in gerbil.properties)
  • Added multilingual support with language codes (e.g., fr for French). WARNING: To keep your previous results, you have to uncomment the following line once: database-schema
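
As an illustration of the answer-logging switch mentioned above, the following sketch reads the key org.aksw.gerbil.qa.matching.printAnswers from properties text using the standard java.util.Properties class. The class PrintAnswersFlagSketch and the default of false are assumptions; GERBIL's own configuration loading may work differently.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.Properties;

/** Hypothetical helper for illustration; not part of GERBIL. */
final class PrintAnswersFlagSketch {

    /** Property key as given in the release notes. */
    static final String KEY = "org.aksw.gerbil.qa.matching.printAnswers";

    /**
     * Parses properties-file content and reports whether answer logging is on.
     * Assumption: a missing key is treated as "false".
     */
    static boolean isEnabled(String propertiesText) throws IOException {
        Properties props = new Properties();
        props.load(new StringReader(propertiesText));
        return Boolean.parseBoolean(props.getProperty(KEY, "false"));
    }
}
```

In gerbil.properties itself, the setting is the single line org.aksw.gerbil.qa.matching.printAnswers=true.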

Systems

Added 2 pre-implemented systems

  • Qanary
  • OKBQA

  • File Based Annotators can now be in QALD or ExtendedJson format

Datasets

  • Added QALD 7 training datasets
  • Fixed bug with QALD 6 datasets not loading
  • File Based Datasets can now be in QALD or ExtendedJson format

v1.2.5

7 years ago

Features

  • Added annotators
    • DoSeR (#152)
    • NERFGUN (#160)
    • PBOH (#153)
    • xLisa (#113)
  • Added datasets
    • Derczynski (#45)
    • ERD14 and GERDAQ (#158)
    • Microposts 2015 and 2016 (#82, #132)
    • Ritter (#43)
    • Senseval 2 and 3 (#17)
    • UMBC (#44)
    • WSDM 2012 (#46)
  • Introduced the RT2KB experiment type that comprises recognition and typing of entities
  • Introduced index-based sameAs relation retrieval and entity checking for KBs that do not change very often (e.g., DBpedia). Downloading the indexes is optional and GERBIL can run without them (but then has the same performance drawbacks as the previous versions).
  • A warning should be shown in the GUI if the server is busy at the moment.
  • Implemented checks for certain datasets and annotators. If dataset files (because of licenses) or annotator API keys are missing, the affected datasets and annotators are not available in the front end.

Bug Fixes

  • Fixed the problem that the NIFWSTestingController was not able to check whether a NIF-based web service is available for ETyping experiments. (#151)
  • Fixed encoding problems by setting the encoding to UTF-8 where needed instead of relying on the default encoding (#165)
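
The encoding fix above follows a common Java pattern: pass the charset explicitly instead of relying on the platform default. The sketch below only illustrates that pattern; the class Utf8Sketch is hypothetical and does not show the places GERBIL actually changed.

```java
import java.nio.charset.StandardCharsets;

/** Hypothetical helper for illustration; not part of GERBIL. */
final class Utf8Sketch {

    /** Encodes text as UTF-8, independent of the JVM's default encoding. */
    static byte[] encode(String text) {
        return text.getBytes(StandardCharsets.UTF_8);
    }

    /** Decodes bytes as UTF-8, independent of the JVM's default encoding. */
    static String decode(byte[] bytes) {
        return new String(bytes, StandardCharsets.UTF_8);
    }
}
```

The no-argument variants text.getBytes() and new String(bytes) use the default charset, which can differ between machines and is the kind of dependency the fix removes.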

Known Problems

  • Because of the retrieving of sameAs relations as well as the checking of all entity URIs, the evaluation takes more time than in older versions. With the newly introduced indexes this time is reduced by one half.

gerbil.nif.transfer-v1.2.3

7 years ago
  • Eased the access to internal writers and parsers of the different document (list) and NIF writer and parser classes.
  • Fixed the parsing and writing of TypedSpanImpl instances. (#150)

v1.2.4

7 years ago

Features

  • Added the Microposts 2013 dataset (#41).
  • Added the Derczynski dataset (#45).
  • Added the UMBC dataset (#44).
  • Added the xLisa annotator (#113).

Bug Fixes

  • Implemented the HTTPS connection to d4science.org for the new TagMe 2 instance.
  • Added a simple retry strategy if an org.apache.http.NoHttpResponseException is thrown (#121).
  • Removed the Wikiminer since this service does not exist anymore.
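
A simple retry strategy like the one mentioned for #121 could look roughly like the sketch below. It is not GERBIL's implementation: the class RetrySketch and the maxAttempts parameter are hypothetical, and java.io.IOException stands in for org.apache.http.NoHttpResponseException (which is a subclass of IOException) so that the example needs no external library.

```java
import java.io.IOException;
import java.util.concurrent.Callable;

/** Hypothetical helper for illustration; not part of GERBIL. */
final class RetrySketch {

    /**
     * Runs the request and repeats it if an IOException is thrown,
     * up to maxAttempts attempts in total. Other exceptions are not retried.
     */
    static <T> T callWithRetry(Callable<T> request, int maxAttempts) throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        IOException last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return request.call();
            } catch (IOException e) {
                // Remember the failure and try again if attempts remain.
                last = e;
            }
        }
        // All attempts failed; rethrow the most recent exception.
        throw last;
    }
}
```

A production version would usually wait between attempts; the sketch retries immediately to stay short.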

Known Problems

  • Because of the retrieving of sameAs relations as well as the checking of all entity URIs, the evaluation takes more time than in older versions.

qa-v0.1.0

7 years ago

This release is an extension of GERBIL to benchmark Question Answering systems. This is the first release of GERBIL for this area and the first comprehensive online benchmark platform for QA.

Features

v1.2.3

7 years ago

Features

  • Decreased the time needed to load the most recent results (#77)

Bug Fixes

  • Fixed the problem that uploaded NIF-file-based datasets had not been initialized using the EntityChecker and SameAsRetriever mechanism.
  • Fixed the AIDA/CoNLL Dataset adapter. (#118)
  • Fixed a bug that prevented the correct loading of a dataset if more than one annotator should be tested.
  • Fixed the RDF representation of experiment results.

Known Problems

  • Because of the retrieving of sameAs relations as well as the checking of all entity URIs, the evaluation takes more time than in older versions.