Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
This contains addition of new feature - get_similar_sentences - with which you can augment and multiply your data in supported languages
New Features:
get_sentence_encoding
- supported for all languages in iNLTKget_sentence_similarity
- supported for all languages in iNLTK.New Model:
from inltk.inltk import reset_models
>> reset_models('pa')
>> setup('pa')
Added Urdu support to iNLTK - thanks to @anuragshas contributions Added Windows 10 support - thanks to @ibrahiminfinite contributions
Added get_embedding_vectors function to allow users to get embedding vectors for their words/sentences/documents
Added tamil support