Trak Versions Save

A fast, effective data attribution method for neural networks in PyTorch

v0.3.2

4 months ago

Bug fixes and enhancements:

  • fix excessive CPU memory load from logging
  • remove unnecessary dependencies between featurizing & scoring methods
  • fix abstraction violation in projectors
  • bring back an example of iterative (as opposed to functional) gradient computer
  • add a ridge regularization option for the computation of the XTX inverse term in the TRAK estimator.

v0.3.1

6 months ago
  • community extensions in trak/contrib (see CONTRIBUTING.md)
  • updates to docs, tests, and README

v0.3.0

6 months ago

0.3.0 by @kristian-georgiev and @AlaaKhaddaj in https://github.com/MadryLab/trak/pull/50

  • Added support for large models and datasets (ChunkedCudaProjector, (much) faster scoring by removing I/O bottlenecks)
  • Allow taking gradients with respect to a selected set of parameter groups (e.g., only wrt last layer)
  • black codestyle
  • bug fixes

v0.2.2

7 months ago

What's Changed

  • 0.2.2 by @kristian-georgiev in https://github.com/MadryLab/trak/pull/49

  • more tests

  • better formatting

  • minor bug fixes:

    • controllable random seed for projector

    • fix bug with dtype and device of gradients

    • fix bug with init_projector when device is CPU


Co-authored-by: Sung Min Park spark@mslurm Co-authored-by: Alaa Khaddaj [email protected]

Full Changelog: https://github.com/MadryLab/trak/compare/v0.2.1...v0.2.2

v0.2.1

1 year ago

What's Changed

  • bug fixes

  • updated docs

Full Changelog: https://github.com/MadryLab/trak/compare/v0.2.0...v0.2.1

v0.2.0

1 year ago

Some (mild) backward incompatibilities introduced. In particular, exp_name is now a required argument when scoring.

What's Changed

  • handle pre-emption for featurizing

  • support scoring & featurizing data shards in parallel

  • reduce memory footprint by ~1.5x

  • migrate to torch.func

  • bump torch dep requirement to 2.0.0 because of torch.func

  • python >=3.8 for pytorch 2.0

  • project and store in float16 by default

  • tie experiment name to scoring targets; simplify saver; add logging

  • save scores as mmap

  • normalization factor for numerical stability

  • clean up quickstart

  • no-op projector

  • pass in an instance of a class for tasks, rather than init inside of gradientcomputer

  • bug fixes


New Contributors

@AlaaKhaddaj made their first contribution in #38


Full Changelog: https://github.com/MadryLab/trak/compare/v0.1.3...v0.2.0

v0.1.3

1 year ago

What's Changed

  • 0.1.3 by @kristian-georgiev in https://github.com/MadryLab/trak/pull/32

    • allow skipping model IDs in finalize scores

    • allow subclassing of saver and score_computer directly from traker args

    • default to BasicProjector if CudaProjector projeciton step errors out

    • add another type of error that sometime occurs when fast_jl has issues

    • update quickstart notebook

    • Add link to colab with pre-computed trak scores to readme

    • add dropbox links to quickstart nb

    • update training code in quickstart tutorial

    • bump version

Full Changelog: https://github.com/MadryLab/trak/compare/v0.1.2...v0.1.3

v0.1.2

1 year ago

What's Changed

New Contributors

Full Changelog: https://github.com/MadryLab/trak/compare/v0.1.1...v0.1.2

v0.1.1

1 year ago

trak v0.1.1

What's Changed

Full Changelog: https://github.com/MadryLab/trak/compare/v0.1.0...v0.1.1

v0.1.0

1 year ago

trak v0.1.0