Llm Embed Jina Save

Embedding models from Jina AI

Project README

llm-embed-jina

Embedding models from Jina AI

Background

Jina AI Launches World's First Open-Source 8K Text Embedding, Rivaling OpenAI introduces these models.

See also Embeddings: What they are and why they matter for background on embeddings and an explanation of the LLM embeddings tool.

Here's my blog post about how I built this plugin.

Installation

Install this plugin in the same environment as LLM.

llm install llm-embed-jina

Usage

This plugin adds support for three new embedding models:

jina-embeddings-v2-small-en: 33 million parameters.
jina-embeddings-v2-base-en: 137 million parameters.
jina-embeddings-v2-large-en: 435 million parameters - not yet released, but it will work once it has been released.

The models will be downloaded the first time you try to use them.

See the LLM documentation for everything you can do.

To get started embedding a single string, run the following:

llm embed -m jina-embeddings-v2-small-en -c 'Hello world'

This will output a JSON array of 512 floating point numbers to your terminal.

To calculate and store embeddings for every README in the current directory (try this somewhere with a node_modules directory to get lots of READMEs) run this:

llm embed-multi jina-readmes \
    -m jina-embeddings-v2-small-en \
    --files . '**/README.md' --store

Then you can run searches against them like this:

llm similar jina-readmes -c 'utility functions'

Add | jq to pipe it through jq for pretty-printed output, or | jq .id to just see the matching filenames.

Development

To set up this plugin locally, first checkout the code. Then create a new virtual environment:

cd llm-embed-jina
python3 -m venv venv
source venv/bin/activate

Now install the dependencies and test dependencies:

llm install -e '.[test]'

To run the tests:

pytest

Open Source Agenda is not affiliated with "Llm Embed Jina" Project. README Source: simonw/llm-embed-jina

Stars

Open Issues

Last Commit

3 months ago

Repository

simonw/llm-embed-jina

License

Apache-2.0

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/llm-embed-jina"><img src="https://www.opensourceagenda.com/projects/llm-embed-jina/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022