LemmatizedAncientGreekXML Versions

v1.2.5

6 years ago

Changes:

  • Lemmas for prepositions and particles have been corrected, along with a few clear mistakes in article lemmas. This has dramatically increased the number of lemmas available in the corpus: 21493806 lemmas against 25522507 tokens.
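
The two counts above give the share of tokens that carry a lemma. A minimal sketch of the arithmetic, assuming that "lemmas" here means tokens with a lemma assigned; the function name `lemma_coverage` is illustrative and not part of the corpus tooling:

```python
def lemma_coverage(lemmatized: int, tokens: int) -> float:
    """Return the fraction of tokens that have a lemma."""
    if tokens <= 0:
        raise ValueError("tokens must be positive")
    return lemmatized / tokens


# Counts reported for v1.2.5 (assumed to be lemmatized tokens vs. all tokens)
coverage = lemma_coverage(21_493_806, 25_522_507)
print(f"{coverage:.1%}")  # prints 84.2%
```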

v1.2.4

6 years ago

Changes:

  • The codepoint "’" is used both as an apostrophe and as a quotation mark. Some known issues stemming from (wrong) Betacode conversion make tokenization for this codepoint hard to handle. This version corrects some tokenization errors by moving the quotation mark "’" into a separate token.
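
To illustrate why this codepoint is ambiguous, here is a rough sketch of one possible heuristic: a trailing "’" stays attached when the word is a known elided form and is split off as a quotation mark otherwise. This is not the method used in this release; the function name and the tiny `KNOWN_ELISIONS` sample are hypothetical, and a real list of elided forms would be far longer.

```python
from typing import List

QUOTE = "\u2019"  # RIGHT SINGLE QUOTATION MARK, also used as an apostrophe

# Hypothetical sample of elided forms, for illustration only.
KNOWN_ELISIONS = {stem + QUOTE for stem in ("ἀλλ", "δ", "τ", "ἐπ", "κατ")}


def split_closing_quote(token: str) -> List[str]:
    """Split a trailing quotation mark into its own token.

    The mark stays attached when the token is a known elided form,
    because there it acts as an apostrophe.
    """
    if len(token) > 1 and token.endswith(QUOTE) and token not in KNOWN_ELISIONS:
        return [token[:-1], QUOTE]
    return [token]
```

With this sketch, `split_closing_quote("δ’")` leaves the elided particle intact, while `split_closing_quote("λόγος’")` yields the word and the quotation mark as two tokens. An elided form that happens to be followed by a closing quotation mark would still need separate handling.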

v1.2.3

6 years ago

Changes:

  • The position of "’" in elisions has been corrected, i.e., it is now placed in the same element as the elided word. (The error arose because the apostrophe had been encoded with different codepoints.)
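
A common way to avoid problems of this kind is to map all apostrophe-like codepoints to a single one before tokenizing. The sketch below is an assumption about how that could be done, not the procedure used for this corpus; the set of source codepoints and the choice of U+2019 as the target are illustrative.

```python
# Apostrophe-like codepoints that often appear in Greek texts,
# all mapped to U+2019 RIGHT SINGLE QUOTATION MARK (assumed target).
APOSTROPHE_LIKE = {
    "\u0027": "\u2019",  # APOSTROPHE
    "\u02BC": "\u2019",  # MODIFIER LETTER APOSTROPHE
    "\u1FBD": "\u2019",  # GREEK KORONIS
    "\u1FBF": "\u2019",  # GREEK PSILI
}
_TABLE = str.maketrans(APOSTROPHE_LIKE)


def normalize_apostrophes(text: str) -> str:
    """Replace every apostrophe-like codepoint with U+2019."""
    return text.translate(_TABLE)
```

After normalization, an elided word such as `ἀλλ` followed by any of the variants above ends in the same codepoint, so a single rule can keep the apostrophe in the element of the elided word.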

v1.2.2

6 years ago

Changes:

  • In tlg0018.tlg010.opp-grc1.xml and tlg0018.tlg015.opp-grc1.xml, the erroneous ’Kv at the beginning of the first sentence has been corrected to Ἐν.
  • In tlg0018.tlg019.opp-grc1.xml, the erroneous ’Η at the beginning of the first sentence has been corrected to Ἡ.
  • The position of "’" has been corrected, i.e., it is now placed at the end of a sentence.
  • Duplicate l1 and l2 have been deleted.

v1.2.1

6 years ago

Improved documentation in README.md; lemma redundancy deleted.

v1.2.0

6 years ago

v1.0.1

6 years ago

An XML version of the Morpheus database, which was used to retrieve lemmas, has been added.

v1.0.0

6 years ago

This release contains the data as they have been automatically produced, without attempting manual corrections.