ShallowLearn Versions Save

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

0.0.5

7 years ago
  • Online learning and better pre-training in GensimFastTex:
    • Hashing trick for building the vocabulary, similar to the original fastText approach
    • It is possible to pre-fit word embeddings from a dataset with word2vec
    • True online earning with partial_fit, the vocabulary is incrementally updated
  • New version of fastText.py: 0.8.2
  • New version of Gensim: 0.13.4
  • Fixed predict_proba output format

0.0.4

7 years ago
  • Faster prediction for multiple labels with one predict call
  • Better persistence with save and load methods
  • Fixed parameter names convention

0.0.3

7 years ago
  • FastText classifier based on version 0.8.0 of https://github.com/salestock/fastText.py
  • GensimFastText has now:
    • negative sampling
    • softmax as alternative output function
    • almost complete LabeledWord2Vec as subclass of Gensim's Word2Vec
  • More tests

0.0.2

7 years ago

this is broken, valid commit for this release is 263bfb1

  • Cython code for fastText in Gensim
  • Script for benchmarks

0.0.1

7 years ago
  • GensimFTClassifier can train and predict