VELOC Versions

Very-Low Overhead Checkpointing System

veloc-1.7

9 months ago

Feature highlights:

  • Alternative configuration using environment variables
  • Asynchronous POSIX caching (can be used to hide the overhead of I/O to external storage using node-local storage)
  • Serialization and C++ interface improvements
  • Performance and scalability improvements
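
The asynchronous caching idea in the highlights above can be illustrated with a minimal sketch. This is not VELOC's implementation: the function name checkpoint_async and both path parameters are hypothetical, and a plain file copy on a background thread stands in for VELOC's internal flush. It only shows the general pattern of writing to fast node-local storage first and moving the data to external storage in the background.

```cpp
#include <cassert>
#include <filesystem>
#include <fstream>
#include <future>
#include <stdexcept>
#include <string>

namespace fs = std::filesystem;

// Hypothetical helper (not part of the VELOC API): write the checkpoint to
// node-local storage synchronously, then copy it to external storage on a
// background thread so the caller does not wait for the slow transfer.
std::future<void> checkpoint_async(const fs::path& local_file,
                                   const fs::path& external_file,
                                   const std::string& data) {
    assert(local_file != external_file);
    {
        std::ofstream out(local_file, std::ios::binary);
        if (!out) {
            throw std::runtime_error("cannot open " + local_file.string());
        }
        out.write(data.data(), static_cast<std::streamsize>(data.size()));
    }  // the stream is flushed and closed here, before the copy starts

    // The returned future lets the caller wait for the flush when needed.
    return std::async(std::launch::async, [local_file, external_file] {
        fs::copy_file(local_file, external_file,
                      fs::copy_options::overwrite_existing);
    });
}
```

The caller can continue computing after checkpoint_async returns and call get() on the future only when the external copy must be complete, for example before shutting down.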

veloc-1.6

2 years ago

Feature highlights:

  • Support for custom storage modules (enables non-POSIX external storage)
  • Preliminary support for DAOS
  • Better support for the threaded mode (VELOC backend runs as an application thread instead of a separate process)
  • Performance improvements

veloc-1.5

2 years ago

Feature highlights:

  • C++ and Python interface
  • Serialization support for C++ (see example here: test/serial_test.cpp)
  • Threaded mode (VELOC backend runs as an application thread instead of a separate process)
  • Performance improvements

veloc-1.4

3 years ago

Highlights:

  • Async mode automatically starts the VELOC backend if it is not already running, which brings two advantages:
    • the VELOC config file can be generated dynamically at runtime before calling VELOC_Init
    • no extra scripting is needed to start the VELOC backend on each node
  • The VELOC backend starts as a daemon and by default logs to /dev/shm/veloc-backend-$HOSTNAME-$UID.log
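
As a rough sketch of the two points above, the following helpers generate a config file at runtime and reconstruct the default backend log path. Both function names are hypothetical. The config keys (scratch, persistent, mode) are assumptions based on commonly shown VELOC config files and should be checked against the VELOC documentation. The sketch also assumes that $HOSTNAME matches the result of gethostname() and that $UID is the numeric user id.

```cpp
#include <cassert>
#include <fstream>
#include <stdexcept>
#include <string>
#include <unistd.h>  // gethostname, getuid (POSIX)

// Hypothetical helper: writes a minimal config file before VELOC_Init is
// called. The key names below are assumptions, not confirmed by this page.
void write_veloc_config(const std::string& cfg_path,
                        const std::string& scratch_dir,
                        const std::string& persistent_dir) {
    assert(!cfg_path.empty());
    std::ofstream out(cfg_path);
    if (!out) {
        throw std::runtime_error("cannot open " + cfg_path);
    }
    out << "scratch = " << scratch_dir << "\n"
        << "persistent = " << persistent_dir << "\n"
        << "mode = async\n";
}

// Hypothetical helper: builds the default backend log path following the
// pattern /dev/shm/veloc-backend-$HOSTNAME-$UID.log from the release note.
std::string default_backend_log_path() {
    char host[256] = {0};
    // Leave the last byte untouched so the name is always NUL-terminated.
    if (gethostname(host, sizeof(host) - 1) != 0) {
        throw std::runtime_error("gethostname failed");
    }
    return "/dev/shm/veloc-backend-" + std::string(host) + "-" +
           std::to_string(getuid()) + ".log";
}
```

An application would call write_veloc_config first and then pass the same path to VELOC_Init. The log path helper is useful for locating the backend log of the current node when debugging.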

veloc-1.3

3 years ago

Highlights and API changes:

  • Many fixes and performance improvements; upgrading is highly recommended
  • Alternative communication protocols between clients and active backend (UNIX sockets, Thallium)
  • Fully asynchronous MD5 checksumming (as a separate module in the checkpointing pipeline)
  • VELOC_Finalize(bool): the flag now toggles waiting for checkpoints to flush (previously, it removed checkpoints on finish)

Please join our mailing list to discuss and provide feedback: https://lists.mcs.anl.gov/mailman/listinfo/veloc-users

veloc-1.2

4 years ago

Highlights and API changes:

  • non-collective checkpointing does not need MPI
  • file-based mode can restart with custom file names
  • Python bindings

veloc-1.1

5 years ago

Several new features and API changes:

  • non-collective checkpoint-restart
  • selective recovery in memory mode
  • lots of improvements

veloc-1.0

5 years ago

First public release.