Dennwc Cas Save

Content Addressible Storage

Project README

Content Addressable Storage

This project implements a simple and pragmatic approach to Content Addressable Storage (CAS). It was heavily influenced by Perkeep (aka Camlistore) and Git.

For more details, see concepts and comparison with other systems.

Status

The project is stable, and further work is ongoing on designing CAS2 - more flexible and performant version. This project will receive bug fixed and maintenance work. New features will likely end up in CAS2.

Check the Quick start guide for a list of basic commands.

Goals

Simplicity: the core specification should be trivial to implement.
Interop: CAS should play nicely with existing tools and technologies, either content-addressable or not.
Easy to use: CAS should be a single command away, similar to git init.

Use cases

Immutable and versioned archives: CAS supports files with multiple TBs of data, folders with millions of files and can index and use remote data without storing it locally.
Data processing pipelines: CAS caching capabilities allows to use it for incremental data pipelines.
Git for large files: CAS stores files with an assumption that they can be multiple TBs and is optimized for this use case, while still supporting tags and branches, like Git.

Features and the roadmap

Implemented:

Fast file hashing
- SHA-256, other can be used
- Stores results in file attributes (cache)
Support for large archives
- Large contiguous files (> TB)
- Large multipart files (> TB)
- Large directories (> millions of files)
- Zero-copy file fetch (BTRFS)
Integrations
- Can index and sync web content
- HTTP(S) caching (as a Go library)
Remote storage
- Self-hosted HTTP CAS server (read-only)
- Google Cloud Storage
Usability
- Mutable objects (pins)
- Local storage in Git fashion
Data pipelines
- Extendable
- Caches results
- Incremental

Planned (for CAS2):

Support for large multipart files (> TB)
- Support multilevel parts
- Support blob splitters (rolling checksum, new line, etc)
Remote storage
- AWS, etc
- Self-hosted HTTP CAS server (read-write)
Integration with Git
- Zero-copy fetch from Git (either remote or local)
- LFS integration
Integration with Docker
- Zero-copy fetch of an image from Docker
- Unpack FS images to CAS
- Use containers in pipelines
Integration with BitTorrent:
- Store torrent files
- Download torrent data directly to CAS
- To consider: expose CAS as a peer
Integration with other CAS systems:
- Perkeep
- Upspin
- IPFS
Windows and OSX support
Better support for pipelines

Open Source Agenda is not affiliated with "Dennwc Cas" Project. README Source: dennwc/cas

Stars

Open Issues

Last Commit

1 month ago

Repository

dennwc/cas

License

Apache-2.0

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/dennwc-cas"><img src="https://www.opensourceagenda.com/projects/dennwc-cas/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022