The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
Thanks @mkmenta!
This is identical to v2.4.0, but includes the LICENSE file which was missing from v2.4.0.
Added a symmetric flag to SelfSupervisedLoss. If True, then the embeddings in both embeddings and ref_emb are used as anchors. If False, then only the embeddings in embeddings are used as anchors. The previous behavior was equivalent to symmetric=False. Now the default is symmetric=True, because this is usually what is done in self-supervised papers (e.g. SimCLR).
You don't have to create labels for self-supervised learning anymore:
from pytorch_metric_learning.losses import SelfSupervisedLoss
loss_func = SelfSupervisedLoss(TripletMarginLoss())
embeddings = model(data)
augmented = model(augmented_data)
loss = loss_func(embeddings, augmented)
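Conceptually, SelfSupervisedLoss treats embeddings[i] and ref_emb[i] as a positive pair by giving both the same pseudo-label, and the symmetric flag controls which of those entries act as anchors. Here is a rough mental model in plain Python; this is a sketch of the idea, not the library's actual implementation:

```python
# Rough mental model of SelfSupervisedLoss (NOT the library's real code):
# embeddings[i] and ref_emb[i] form a positive pair, so both get the
# same pseudo-label i. The symmetric flag controls which entries of the
# concatenation [embeddings; ref_emb] act as anchors.
def ssl_pseudo_labels(n, symmetric=True):
    # Pseudo-labels for the concatenation [embeddings; ref_emb].
    labels = list(range(n)) + list(range(n))
    # Anchor indices into that concatenation.
    anchors = list(range(2 * n)) if symmetric else list(range(n))
    return labels, anchors

labels, anchors = ssl_pseudo_labels(3)  # symmetric=True (the new default)
# labels  -> [0, 1, 2, 0, 1, 2]
# anchors -> [0, 1, 2, 3, 4, 5]
```

With symmetric=False, only the first n indices (the original embeddings) would be anchors, matching the previous behavior.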
Thanks @cwkeam!
The order and naming of arguments have changed.
Before:
get_accuracy(
    query,
    reference,
    query_labels,
    reference_labels,
    embeddings_come_from_same_source=False,
)
Now:
get_accuracy(
    query,
    query_labels,
    reference=None,
    reference_labels=None,
    ref_includes_query=False,
)
The benefits of this change are:
- If query is reference, then you only need to pass in query, query_labels.
- ref_includes_query is shorter and clearer in meaning than embeddings_come_from_same_source.
Some example usage of the new format:
# Accuracy of a query set, where the query set is also the reference set:
get_accuracy(query, query_labels)
# Accuracy of a query set with a separate reference set:
get_accuracy(query, query_labels, ref, ref_labels)
# Accuracy of a query set with a reference set that includes the query set:
get_accuracy(query, query_labels, ref, ref_labels, ref_includes_query=True)
BaseMiner instead of BaseTupleMiner
Miners must extend BaseMiner, because BaseTupleMiner no longer exists.
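For context on what a miner produces, here is a standalone sketch of exhaustive pair mining in plain Python. This is only an illustration of the tuple format (anchors, positives, anchors, negatives), not the actual BaseMiner API; a real subclass would implement equivalent logic in its mining method (see the library docs for the exact signature):

```python
# Standalone sketch of pair-mining logic (plain Python, not the real
# BaseMiner API). Given per-sample labels, return (anchor, positive)
# and (anchor, negative) index pairs.
def mine_all_pairs(labels):
    a1, p, a2, n = [], [], [], []
    for i, li in enumerate(labels):
        for j, lj in enumerate(labels):
            if i == j:
                continue  # a sample is never paired with itself
            if li == lj:
                a1.append(i)  # anchor of a positive pair
                p.append(j)   # its positive
            else:
                a2.append(i)  # anchor of a negative pair
                n.append(j)   # its negative
    return a1, p, a2, n

a1, p, a2, n = mine_all_pairs([0, 0, 1])
# a1, p -> [0, 1], [1, 0]        (both orderings of the same-label pair)
# a2, n -> [0, 1, 2, 2], [2, 2, 0, 1]
```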
enqueue_idx is now enqueue_mask
Before, enqueue_idx specified the indices of embeddings that should be added to the memory bank. Now, enqueue_mask[i] should be True if embeddings[i] should be added to the memory bank.
The benefit of this change is that it fixed an issue in distributed training.
Here's an example of the new usage:
# enqueue the second half of a batch
enqueue_mask = torch.zeros(batch_size, dtype=torch.bool)
enqueue_mask[batch_size // 2:] = True
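If existing code builds a list of enqueue_idx indices, a small migration helper can convert it to the new boolean mask. Note that idx_to_mask is a name invented here for illustration, not part of the library:

```python
# Hypothetical migration helper (not part of the library): convert the
# old enqueue_idx list of indices into the new boolean enqueue_mask.
def idx_to_mask(enqueue_idx, batch_size):
    mask = [False] * batch_size
    for i in enqueue_idx:
        mask[i] = True  # embeddings[i] should be added to the memory bank
    return mask

idx_to_mask([2, 3], 4)
# -> [False, False, True, True]  (wrap in torch.tensor(...) to get a mask)
```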
Before:
loss_fn = VICRegLoss()
loss_fn(emb, ref_emb)
Now:
loss_fn = VICRegLoss()
loss_fn(emb, ref_emb=ref_emb)
The reason is that VICRegLoss now uses the forward method of BaseMetricLossFunction, to allow for possible generalizations in the future without causing more breaking changes.
mining_funcs and dataset have swapped order
This is to allow mining_funcs to be optional.
Before, if you didn't want to use miners:
MetricLossOnly(
    models,
    optimizers,
    batch_size,
    loss_funcs,
    mining_funcs={},
    dataset=dataset,
)
Now:
MetricLossOnly(
    models,
    optimizers,
    batch_size,
    loss_funcs,
    dataset,
)
The following classes/functions were removed:
- losses.CentroidTripletLoss (it contained a bug that I don't have time to figure out)
- miners.BaseTupleMiner (use miners.BaseMiner instead)
- miners.BaseSubsetBatchMiner (rarely used)
- miners.MaximumLossMiner (rarely used)
- trainers.UnsupervisedEmbeddingsUsingAugmentations (rarely used)
- utils.common_functions.Identity (use torch.nn.Identity instead)