Sybil Versions

columnar storage + NoSQL OLAP engine | https://logv.org

0.5.1

6 years ago

This is a minor release: sybil 0.5.1 is released alongside snorkel 0.5.1.

Miscellany

  • [idiomatic go] updating code style
  • [query] the interstitial stage of querying can now choose which metric to sort by before pruning (previously it used "count" by default)
  • [fake data] add fake data generation through https://github.com/plaitpy/plaitpy
  • [cli] improve output for insufficient/invalid arguments (#33)

v0.5.0

6 years ago

Features

  • add loghist support
  • add per block query cache support
  • add loglogbeta usage for count distinct
  • add a combine step to the middle of aggregation, so memory is freed to the OS earlier and high-cardinality columns are handled better
  • add initial distributed query support (list tables, get table info and run query are supported)
  • add a node aggregator for combining results from distributed queries
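The combine step mentioned above can be pictured with a small sketch. This is not sybil's code: the names `partial`, `combine` and `aggregate`, and the `batchSize` parameter, are hypothetical. It only shows the general idea of folding per-block partial results into a running total every few blocks, instead of holding every partial until the end of the query.

```go
package main

import "fmt"

// partial holds per-group counts produced by aggregating one block.
// (Hypothetical type, for illustration only.)
type partial map[string]int64

// combine folds src into dst.
func combine(dst, src partial) {
	for k, v := range src {
		dst[k] += v
	}
}

// aggregate counts keys per block and merges the pending partials into the
// running total every batchSize blocks, so at most batchSize partials are
// alive at any time.
func aggregate(blocks [][]string, batchSize int) partial {
	if batchSize < 1 {
		batchSize = 1
	}
	total := partial{}
	pending := make([]partial, 0, batchSize)

	flush := func() {
		for i, p := range pending {
			combine(total, p)
			pending[i] = nil // drop the reference so the GC can reclaim it
		}
		pending = pending[:0]
	}

	for _, block := range blocks {
		p := partial{}
		for _, key := range block {
			p[key]++
		}
		pending = append(pending, p)
		if len(pending) >= batchSize {
			flush() // the mid-aggregation combine step
		}
	}
	flush() // merge whatever is left
	return total
}

func main() {
	blocks := [][]string{{"a", "b"}, {"a"}, {"c", "a"}}
	total := aggregate(blocks, 2)
	fmt.Println(total["a"], total["b"], total["c"]) // prints: 3 1 1
}
```

With high-cardinality group-by columns each partial can be large, so merging early keeps the peak number of live maps bounded by `batchSize` rather than by the block count.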

Bugfixes

  • lower total number of opened files when decoding large tables
  • support quoted CSV better (using encoding/csv now)
  • correct treatment of eq and neq int filters when examining block extents
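The release notes do not say what was wrong with the eq/neq handling, so the following is only a sketch of the general technique, with hypothetical names (`extent`, `canSkipEq`, `canSkipNeq`). A block's extent for an int column is its minimum and maximum value. An equality filter can skip a block whose extent excludes the value, while an inequality filter can skip a block only when every value in it equals the filtered value.

```go
package main

import "fmt"

// extent is the min and max of an int column within one block.
// (Hypothetical type, for illustration only.)
type extent struct {
	min, max int64
}

// canSkipEq reports whether a block can be skipped for "col == v":
// no row can match if v lies outside [min, max].
func canSkipEq(e extent, v int64) bool {
	return v < e.min || v > e.max
}

// canSkipNeq reports whether a block can be skipped for "col != v":
// no row can match only if every value in the block equals v,
// which is the case exactly when min == max == v.
func canSkipNeq(e extent, v int64) bool {
	return e.min == v && e.max == v
}

func main() {
	e := extent{min: 10, max: 20}
	fmt.Println(canSkipEq(e, 5))   // true: 5 is outside the extent
	fmt.Println(canSkipEq(e, 15))  // false: 15 may be present
	fmt.Println(canSkipNeq(e, 15)) // false: other values exist in the block
	fmt.Println(canSkipNeq(extent{min: 7, max: 7}, 7)) // true: block holds only 7
}
```

The asymmetry is the easy part to get wrong: a value inside the extent says nothing definite for either filter, so the block still has to be read.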

v0.2.0

6 years ago

Fixes & Features

  • auto compact during ingestion
  • add export to TSV option
  • support custom field and filter delimiters
  • don't double print samples in ingestion log during samples queries
  • support for ingesting directly from file (for use with fluentd) instead of stdin

0.1.0

6 years ago

New Features & Behavior:

  • add support for gzip compressed files in DB
  • CLI is quiet by default
  • experimentally embedding luajit mapreduce
  • static linking of sybil with cgo
  • add bounce rate to sessionization code

0.0.8

7 years ago

New Features:

  • None

Bugfixes:

  • Fixing broken tests & data races
  • Fixing cache lock recovery
  • Fixing group by issue with time series

0.0.7

7 years ago

New Features:

  • Recycle records memory (and avoid usage of GC heap)
  • Extreme outlier detection and removal (values more than 10,000 standard deviations away)
  • Deal with high volume datasets by caching capped block info for datasets with large block counts (caches 64 blocks per file)
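As a rough illustration of the outlier removal listed above, here is a minimal two-pass sketch. It is an assumption about the general approach, not sybil's implementation: `removeOutliers` and its `maxDevs` parameter are hypothetical, and the notes do not say how sybil computes its mean and deviation.

```go
package main

import (
	"fmt"
	"math"
)

// removeOutliers returns the values that lie within maxDevs population
// standard deviations of the mean. (Hypothetical helper, for illustration.)
func removeOutliers(values []float64, maxDevs float64) []float64 {
	if len(values) == 0 {
		return nil
	}

	// First pass: mean.
	var sum float64
	for _, v := range values {
		sum += v
	}
	mean := sum / float64(len(values))

	// Second pass: population standard deviation.
	var sq float64
	for _, v := range values {
		d := v - mean
		sq += d * d
	}
	std := math.Sqrt(sq / float64(len(values)))

	kept := make([]float64, 0, len(values))
	for _, v := range values {
		// When std is 0 all values are identical, so nothing is an outlier.
		if std == 0 || math.Abs(v-mean) <= maxDevs*std {
			kept = append(kept, v)
		}
	}
	return kept
}

func main() {
	values := []float64{1, 1, 1, 1, 1, 1, 1, 1, 1, 100}
	// A small threshold is used here so the effect is visible on ten values.
	fmt.Println(len(removeOutliers(values, 2))) // prints: 9
}
```

One caveat with this naive version: when the outlier itself contributes to the mean and deviation, no value among n samples can sit more than sqrt(n-1) deviations from the mean, so a threshold of 10,000 would only ever trigger on very large inputs or with a baseline computed some other way.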

Bugfixes:

  • Correctness: Filter samples in row store when using filters and samples query
  • Thread safety: Grab the lock for accessing the block list when loading row store records into a row store block

v0.0.6

7 years ago

Small fixes for compiling on multiple architectures

v0.0.5

7 years ago

update for building with Go 1.6

  • fix race condition when deleting a block from the table blocklist during aggregation
  • read from the ingestion log when getting samples

v0.0.4

7 years ago

small bugfix release

  • small speed improvements in unpacking columns
  • fix GROUP BY with missing column values
  • don't align time filters automatically unless we are doing a time query

v0.0.3

7 years ago

New Features:

  • support for per table memory limits
  • better printing for console based time series

Bugfixes:

  • remove excess docs from repo
  • don't create new dataset dirs willy nilly during query or digest, only during ingestion
  • don't read ingestion log unless specified on command line