CeresDB Versions

HoraeDB is a high-performance, distributed, cloud native time-series database.

v2.0.0

3 weeks ago

Upgrade from 1.x.x to 2.0.0

The transition from CeresDB to Apache HoraeDB introduces several breaking changes. Upgrading from a 1.x.x release to v2.0.0 therefore requires the configuration changes and upgrade order described in the steps that follow.

Upgrade Steps

Set up the required environment variables

export HORAEDB_DEFAULT_CATALOG=ceresdb

Update config

The etcd root path must be configured in both horaedb and horaemeta.

For horaedb

[cluster_deployment.etcd_client]
server_addrs = ['127.0.0.1:2379']
root_path = "/rootPath"

For horaemeta

storage-root-path = "/rootPath"
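As a quick sanity check before restarting anything, the two settings can be compared programmatically. The sketch below assumes the two root paths are meant to be identical (as the examples above suggest) and that both files are TOML; the function name `etcd_roots_match` is hypothetical and not part of any HoraeDB tooling.

```python
try:
    import tomllib  # standard library from Python 3.11
except ModuleNotFoundError:  # older interpreters: needs the third-party tomli package
    import tomli as tomllib


def etcd_roots_match(horaedb_toml: str, horaemeta_toml: str) -> bool:
    """Return True when both configs name the same etcd root path.

    Hypothetical helper: key names are taken from the snippets above.
    """
    db_cfg = tomllib.loads(horaedb_toml)
    meta_cfg = tomllib.loads(horaemeta_toml)
    db_root = db_cfg["cluster_deployment"]["etcd_client"]["root_path"]
    meta_root = meta_cfg["storage-root-path"]
    return db_root == meta_root


# Config fragments copied from the upgrade notes.
HORAEDB = """
[cluster_deployment.etcd_client]
server_addrs = ['127.0.0.1:2379']
root_path = "/rootPath"
"""
HORAEMETA = 'storage-root-path = "/rootPath"\n'

print(etcd_roots_match(HORAEDB, HORAEMETA))
```

In practice the strings would be read from the actual config files; a mismatch is worth resolving before moving on to the upgrade steps.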

Upgrade horaemeta

At this stage horaedb will log errors like the following, which is expected:

2024-01-23 14:37:57.726 ERRO [src/cluster/src/cluster_impl.rs:136] Send heartbeat to meta failed, err:Failed to send heartbeat, cluster:defaultCluster, err:status: Unimplemented, message: "unknown service meta_service.MetaRpcService", details: [], metadata: MetadataMap { headers: {"content-type": "application/grpc"} }
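When watching logs during the upgrade, it can help to separate this expected heartbeat error from anything else. The following is a minimal sketch that matches only on fragments visible in the log line above; the helper name `is_expected_upgrade_error` and the choice of fragments are assumptions, not an official filter.

```python
import re

# Fragments taken from the sample log line; the pattern is an assumption
# about which parts of the message stay stable across occurrences.
EXPECTED_HEARTBEAT_ERROR = re.compile(
    r"Send heartbeat to meta failed"
    r".*status: Unimplemented"
    r".*unknown service meta_service\.MetaRpcService"
)


def is_expected_upgrade_error(line: str) -> bool:
    """Return True if the log line looks like the expected heartbeat error."""
    return EXPECTED_HEARTBEAT_ERROR.search(line) is not None


SAMPLE = (
    '2024-01-23 14:37:57.726 ERRO [src/cluster/src/cluster_impl.rs:136] '
    'Send heartbeat to meta failed, err:Failed to send heartbeat, '
    'cluster:defaultCluster, err:status: Unimplemented, '
    'message: "unknown service meta_service.MetaRpcService", details: [], '
    'metadata: MetadataMap { headers: {"content-type": "application/grpc"} }'
)

print(is_expected_upgrade_error(SAMPLE))
```

Any other `ERRO` line seen during the upgrade would not match and deserves a closer look.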

Upgrade horaedb

After all servers are upgraded, the cluster should be ready for reads and writes, and existing data can be queried as before.

What's Changed

Breaking Changes

refactor!: refactor shard version logic by @ZuLiangWang in https://github.com/apache/incubator-horaedb/pull/1286

Features

feat: support re-acquire shard lock in a fast way by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1251
feat: support alter partition table by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1244
feat: support access etcd with tls by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1254
feat: support schema validate in remote write by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1256
feat: avoid flush when drop table by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1257
feat: opentsdb api support gzip body by @tanruixiang in https://github.com/apache/incubator-horaedb/pull/1261
feat: infer timestamp constraint for single-timestamp column by @Dennis40816 in https://github.com/apache/incubator-horaedb/pull/1266
feat: primary keys support sample by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1243
feat: cache space total memory by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1278
feat: skip record column values for level0 sst by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1282
feat: support write wal logs in columnar format by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1179
feat: support stack size of read threads configurable by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1305
feat: impl DoNothing wal by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1311
feat: slow log include remote query by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1316
feat: use string for request id by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1349
feat: support metrics for number of bytes fetched from object storage by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1363
feat: avoid building dictionary for massive unique column values by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1365
feat: utilize the column cardinality for deciding whether to do dict by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1372
feat: avoid pulling unnecessary columns when querying append mode table by @Rachelint in https://github.com/apache/incubator-horaedb/pull/1307
feat: dist sql analyze by @baojinri in https://github.com/apache/incubator-horaedb/pull/1260
feat: impl priority runtime for read by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1303
feat: upgrade horaedbproto by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1408
feat: block rules support query by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1420
feat: try load page indexes by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1425
feat: support setting meta_addr&etcd_addrs by env by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1427
feat: add table status check by @ZuLiangWang in https://github.com/apache/incubator-horaedb/pull/1418
feat: support docker-compose and update README by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1429
feat: impl layered memtable to reduce duplicated encode during scan by @Rachelint in https://github.com/apache/incubator-horaedb/pull/1271
feat: update disk cache in another thread to avoid blocking normal query process by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1431
feat: update pgwire to 0.19 by @sunng87 in https://github.com/apache/incubator-horaedb/pull/1436
feat: filter out MySQL federated components' emitted statements by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1439
feat: add system_stats lib to collect system stats by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1442
feat(horaectl): initial commit by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1450
feat: support collect statistics about the engine by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1451
feat: persist sst meta size by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1440
feat: add sst level config for benchmark by @zealchen in https://github.com/apache/incubator-horaedb/pull/1482
feat: add exponential backoff when retry by @zealchen in https://github.com/apache/incubator-horaedb/pull/1486

Refactor

refactor: move wal structs and traits to wal crate by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1263
refactor: improve error readability by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1265
refactor: move wal crate to under src folder by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1270
refactor: use notifier::RequestNotifiers instead of dedup_requests::RequestNotifiers by @baojinri in https://github.com/apache/incubator-horaedb/pull/1249
refactor: conditionally compile wal impls by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1272
refactor: remove unused min/max timestamp in the RowGroup by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1297
refactor: avoid duplicate codes by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1371
refactor: avoid returning metrics in non-analyze sql by @baojinri in https://github.com/apache/incubator-horaedb/pull/1410
refactor: move sub crates to the src directory by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1443
refactor: adjust cpu's stats by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1457
refactor: refactor compaction process for remote compaction by @Rachelint in https://github.com/apache/incubator-horaedb/pull/1476

Fixed

fix: dist query dedup by @Rachelint in https://github.com/apache/incubator-horaedb/pull/1269
fix: log third party crates by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1289
fix: ensure primary key order by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1292
fix: use flag in preflush to indicate whether reorder is required by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1298
fix: alter partition table tag column by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1304
fix: increase wait duration for flush by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1315
fix: add license to workspace members by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1317
fix: ensure channel size non zero by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1345
fix: fix create table result by @ZuLiangWang in https://github.com/apache/incubator-horaedb/pull/1354
Revert "fix: fix create table result" by @ZuLiangWang in https://github.com/apache/incubator-horaedb/pull/1355
fix: fix test create table result by @ZuLiangWang in https://github.com/apache/incubator-horaedb/pull/1357
fix: no write stall by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1388
fix: collect metrics for get_ranges by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1364
fix: ignore collecting fetched bytes stats when sst file is read only once by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1369
fix: publich nightly image by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1396
fix: missing and verbose logs by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1398
fix: fix broken link by @caicancai in https://github.com/apache/incubator-horaedb/pull/1399
fix: the broken link about the issue status by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1402
fix: skip wal encoding when data wal is disabled by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1401
fix: disable percentile for distributed tables by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1406
fix: compatible for old table options by @Rachelint in https://github.com/apache/incubator-horaedb/pull/1432
fix: get_ranges is not spawned in io-runtime by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1426
fix: table name is normalized when find timestamp column by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1446
fix: changes required for migrate dev to main by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1455
fix: missing filter index over the primary keys by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1456
fix: random failure of test_collect_system_stats by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1459
fix(ci): refactor ci trigger conditions by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1474

Docs

chore(docs): rename CeresDB to HoraeDB by @caicancai in https://github.com/apache/incubator-horaedb/pull/1337
chore/docs: remove broken link by @caicancai in https://github.com/apache/incubator-horaedb/pull/1341
docs: update CONTRIBUTING.md by @suyanhanx in https://github.com/apache/incubator-horaedb/pull/1382
doc: fix broken link by @caicancai in https://github.com/apache/incubator-horaedb/pull/1358
chore/doc: rename ceresdb to horaedb by @caicancai in https://github.com/apache/incubator-horaedb/pull/1332
doc: add MySQL-Client in README by @jackwener in https://github.com/apache/incubator-horaedb/pull/1331
doc: fix link in illegal markdown format by @jackwener in https://github.com/apache/incubator-horaedb/pull/1334
style: normalize comments/doc in rustfmt by @jackwener in https://github.com/apache/incubator-horaedb/pull/1335
docs: add sudo for install commands by @caicancai in https://github.com/apache/incubator-horaedb/pull/1347
docs: sync GH activities to commits only by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1385
chore(docs): fix invalid repo links by @SYaoJun in https://github.com/apache/incubator-horaedb/pull/1452
chore(docs): fix invalid repo links by @Apricity001 in https://github.com/apache/incubator-horaedb/pull/1472

Chore

chore(deps): bump golang.org/x/net from 0.5.0 to 0.17.0 in /integration_tests/sdk/go by @dependabot in https://github.com/apache/incubator-horaedb/pull/1258
chore: delete the configuration related to github cache by @tanruixiang in https://github.com/apache/incubator-horaedb/pull/1259
chore: remove backtrace of blocked table by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1267
ci: setup golang in CI by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1275
chore: remove default features in analytic_engine by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1277
chore(deps): bump google.golang.org/grpc from 1.53.0 to 1.56.3 in /integration_tests/sdk/go by @dependabot in https://github.com/apache/incubator-horaedb/pull/1280
test: simplify ceresmeta-server installation by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1287
chore: enable blank issue by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1290
chore: add metrics to inspect write path by @Rachelint in https://github.com/apache/incubator-horaedb/pull/1264
chore: refactor build_meta.sh in integration-test by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1306
chore: rename ceresdb to horaedb by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1310
edit: add schema id, schema name, catalog name in TableData by @dust1 in https://github.com/apache/incubator-horaedb/pull/1294
chore: ignore seq check for DoNothing wal by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1314
chore: remove community by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1318
chore: try to clear ceresdb stuff by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1320
chore: change copyright owner by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1321
ci: stop release docker image before we finish the rename and transfer by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1323
chore: bump deps by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1325
chore: rename ceresmeta to horaemeta by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1327
chore: rename binary to horaedb-server and more by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1330
chore(license): rename license-header.txt's CeresDB to HoraeDB by @caicancai in https://github.com/apache/incubator-horaedb/pull/1336
chore: replace ceresdb with horaedb by @jackwener in https://github.com/apache/incubator-horaedb/pull/1338
chore: more rename to horaedb by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1340
chore: update create table integration test result by @ZuLiangWang in https://github.com/apache/incubator-horaedb/pull/1344
chore: disable frequently failed tests by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1352
test: add integration test for alter table options by @caicancai in https://github.com/apache/incubator-horaedb/pull/1346
chore: ignore flush failure when flush by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1362
chore: disable timeout for http api by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1367
chore: disable block for http api by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1368
config: add .asf.yaml by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1377
ci: remove missing Required status by @tisonkun in https://github.com/apache/incubator-horaedb/pull/1383
chore: git repo link type fix by @fengmk2 in https://github.com/apache/incubator-horaedb/pull/1378
chore: apply ASF license header by @tanruixiang in https://github.com/apache/incubator-horaedb/pull/1384
chore: add dev mail list and rename ceresdb to horaedb by @tanruixiang in https://github.com/apache/incubator-horaedb/pull/1375
chore: more rename to horaedb by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1387
chore: add push-nightly-image in workflow by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1389
chore: update README by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1390
chore: refactor for better readability by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1400
chore: add error log for remote server by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1407
chore: update website url by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1404
chore: upload horaedb logo by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1409
chore: add slack link by @tanruixiang in https://github.com/apache/incubator-horaedb/pull/1395
chore: update logo by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1414
chore: update horaedb logo by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1415
chore: rename ceresformat to logformat by @ZuLiangWang in https://github.com/apache/incubator-horaedb/pull/1417
chore: fix logo link in readme by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1416
chore: update github pages by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1421
chore: more rename to horaedb by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1419
chore: fix error message by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1412
chore: remove github pages in asf.yaml by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1428
chore: skip wal seq check when wal is disabled by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1430
chore: enable merge on github by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1435
chore: merge change sets on the dev branch by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1423
chore: fix issue status of README-CN.md by @ShiKaiWi in https://github.com/apache/incubator-horaedb/pull/1437
chore(deps): bump h2 from 0.3.17 to 0.3.24 by @dependabot in https://github.com/apache/incubator-horaedb/pull/1448
chore(deps): bump shlex from 1.1.0 to 1.3.0 by @dependabot in https://github.com/apache/incubator-horaedb/pull/1458
chore: update create tables result by @ZuLiangWang in https://github.com/apache/incubator-horaedb/pull/1454
chore: merge HoraeMeta code into HoreaDB repository by @ZuLiangWang in https://github.com/apache/incubator-horaedb/pull/1460
chore(deps): bump google.golang.org/grpc from 1.47.0 to 1.56.3 in /horaemeta by @dependabot in https://github.com/apache/incubator-horaedb/pull/1464
chore(deps): bump golang.org/x/net from 0.16.0 to 0.17.0 in /horaemeta by @dependabot in https://github.com/apache/incubator-horaedb/pull/1465
chore(deps): bump golang.org/x/crypto from 0.14.0 to 0.17.0 in /horaemeta by @dependabot in https://github.com/apache/incubator-horaedb/pull/1462
chore: rename ci's prefix name by @tanruixiang in https://github.com/apache/incubator-horaedb/pull/1467
chore: fix github issue template by @ZuLiangWang in https://github.com/apache/incubator-horaedb/pull/1470
chore(horaemeta&horaectl): refactor clusters/diagnose response body by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1475
chore: free disk for ci by @jiacai2050 in https://github.com/apache/incubator-horaedb/pull/1484
deps: bump datafusion by @tanruixiang in https://github.com/apache/incubator-horaedb/pull/1445
horaectl: remove go implementation of horaectl by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1490
chore: update version to 2.0.0, prepare for releasing v2.0.0 by @chunshao90 in https://github.com/apache/incubator-horaedb/pull/1487

New Contributors

@Dennis40816 made their first contribution in https://github.com/apache/incubator-horaedb/pull/1266
@caicancai made their first contribution in https://github.com/apache/incubator-horaedb/pull/1332
@jackwener made their first contribution in https://github.com/apache/incubator-horaedb/pull/1331
@suyanhanx made their first contribution in https://github.com/apache/incubator-horaedb/pull/1382
@fengmk2 made their first contribution in https://github.com/apache/incubator-horaedb/pull/1378
@sunng87 made their first contribution in https://github.com/apache/incubator-horaedb/pull/1436
@SYaoJun made their first contribution in https://github.com/apache/incubator-horaedb/pull/1452
@Apricity001 made their first contribution in https://github.com/apache/incubator-horaedb/pull/1472

Full Changelog: https://github.com/apache/incubator-horaedb/compare/v1.2.7...v2.0.0

v1.2.7

7 months ago

Major Features

Partition Table

Support random partition rule #1193
Avoid memory allocation during partition write requests #1208
Fix wrong text of show create table for partition table #1214
Improved partitioned table tests powered by tsbs #1195

Performance

Teach ceresdb to run the whole dist query process #1204
Support aggr push down in distributed query #1232
Store real time range in sst #1225
Use real time range to filter memtable #1233
Rewrite not in expr to in #1236
Dedup requests in proxy #1125
Support dedup execute physical plan #1237

Bug Fix

Fix deadlock when dedup stream read #1199
Fix panic when read data out of range by disk cache #1206
Fix lock contention on acquiring the arena stats #1207
Fix panic if dedupped query fails #1229

What's Changed

feat: improved partitioned table tests powered by tsbs by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1195
chore(deps): bump webpki from 0.22.0 to 0.22.1 by @dependabot in https://github.com/CeresDB/ceresdb/pull/1198
feat: support random partition rule by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1193
feat: limit multiple threads fetch the same block simultaneous by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1190
fix: deadlock when dedup stream read by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1199
chore: add meta stable check into integration test by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1202
feat: refactor Resolver in dist sql query by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1186
refactor: stream read metric by @baojinri in https://github.com/CeresDB/ceresdb/pull/1203
fix: panic when read data out of range by disk cache by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1206
fix: lock contention on acquiring the arena stats by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1207
fix: use ExecutionGuard to ensure notifiers released when futures got cancelled by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1200
chore: modify enable_others default true for happy debugging by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1209
fix: skip update shard status when create/remove table by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1210
fix: add table status to cancel background jobs by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1212
refactor: use encoded_size as memory usage by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1213
chore(deps): bump bcder from 0.7.2 to 0.7.3 by @dependabot in https://github.com/CeresDB/ceresdb/pull/1216
fix: wrong text of show create table for partition table by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1214
refactor: avoid memory allocation during partition write requests by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1208
fix: ignore error when open partition table failed by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1220
chore: http route directly from meta by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1221
build(deps): upgrade rust-rocksdb by @tisonkun in https://github.com/CeresDB/ceresdb/pull/1223
fix: loop all sub tables to get table info by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1224
refactor: stop reschedule when pending task larger than max ongoing by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1219
fix: disk cache deduped get_ranges by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1218
refactor: first step to move all packages under src folder by @tisonkun in https://github.com/CeresDB/ceresdb/pull/1226
feat: store real time range in sst by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1225
fix: set and fetch environment variables error by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1227
feat: dedup requests in proxy by @baojinri in https://github.com/CeresDB/ceresdb/pull/1125
test: add integration test for query plan by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1228
feat: teach ceresdb to run the whole dist query process by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1204
fix: panic if dedupped query fails by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1229
feat: support aggr push down in distributed query by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1232
feat: use real time range to filter memtable by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1233
fix: when drop table first check its existing by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1234
feat: support dedup execute physical plan by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1237
fix: throw error when create a table with a different table id by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1238
chore(deps): bump webpki from 0.22.1 to 0.22.2 by @dependabot in https://github.com/CeresDB/ceresdb/pull/1239
chore: upgrade obkv table client by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/1240
feat: rewrite not in expr to in by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1236
feat: improve query path observability by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1235
chore: add wechat group qrcode by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/1245
fix: bug about logging nothing by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1250
chore: fix the missing example.toml in README-CN by @zuston in https://github.com/CeresDB/ceresdb/pull/1253

New Contributors

@tisonkun made their first contribution in https://github.com/CeresDB/ceresdb/pull/1223
@zuston made their first contribution in https://github.com/CeresDB/ceresdb/pull/1253

Full Changelog: https://github.com/CeresDB/ceresdb/compare/v1.2.6...v1.2.7

v1.2.6

8 months ago

Major Features

Query

Optimize datafusion plan, remove unnecessary node #1150
Optimize disk cache, avoid panic when cache file is corrupted #1130
Support PostgreSQL protocol #1138

Remote engine

Optimize remote server's protocol, reduce payload overhead when write batch is small #1146

WAL

Open wal parallelly #1129
Introduce columnar encoding

What's Changed

chore: remove the codes about the reverse reading by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1116
fix: meta service change to meta_runtime by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1121
feat: add obkv operation metrics by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1122
chore: check response header when query failed in proxy by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/1119
feat: add metrics for prom route query by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1123
chore: add metric for manifest recover by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1124
refactor: use BinaryExpr to present regex filter by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1128
feat: make obkv wal opening more parallelly by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1129
fix: upgrade obkv cilent to fix panic in mysql crate by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/1131
feat: hotspot record remote engine requests by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1127
chore: bump datafusion by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1133
chore: add apache license checker by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1134
refactor: refactor query engine by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1137
chore: don't trigger image build via tag push by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1139
feat: support PostgreSQL wire protocol by @holicc in https://github.com/CeresDB/ceresdb/pull/1138
chore: bump obkv by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1141
fix: record remote engine requests by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1140
fix: avoid panic when the file is corrupted in disk cache by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1130
chore: bump to 1.2.6-alpha by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1142
chore: add apache license template by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1143
chore: update pre-commit config by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1145
feat: remove filter plan node in pipeline by @dust1 in https://github.com/CeresDB/ceresdb/pull/1126
refactor: add datafusion default optimizer rules by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1147
refactor: use new protocol for remote engine service write by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1146
refactor: add support_pushdown in table trait by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1150
refactor: improve the partition compute by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1151
refactor: add request id in context by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1153
feat: teach ceresdb to generate UnresolvedPartitionedScan for partitioned table by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1148
fix: deadline check by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1159
test: add integration test for distinct by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1155
chore: define QueryEngine and wrap all things into it by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1160
fix: add order by for integration test case by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1162
feat: support columar encoding for datums by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1158
refactor: pass the TaskContext when execute physical plan rather than holding it by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1163
chore: add metrics for write logs in wal by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1166
chore(deps): bump rustls-webpki from 0.100.1 to 0.100.2 by @dependabot in https://github.com/CeresDB/ceresdb/pull/1168
feat: implement compression encoding for columar bytes values by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1165
feat: implement columnar encoding for integer by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1169
feat: implement boolean columnar encoding by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1170
feat: separate metadata from parquet's kv_metadata by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1120
feat: make float as number encoding by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1171
chore: modified Acknowledgements by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1174
feat: teach ceresdb to convert the inexecutable partitioned scan to executable(resolving process) by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1161
chore: add fields metrics by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1176
chore: remove hybrid related logic by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1172
feat: implement a basic columnar memory table by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/1164
feat: teach ceresdb to encode/decode datafusion physical plan by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1177
chore: upgrade nightly rust to 1.72 by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1184
refactor: make enable_partition_table_access a config by @baojinri in https://github.com/CeresDB/ceresdb/pull/1182
chore: modified README-CN's Acknowledgements by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1183
fix: recover memtable misuse ColumnarMemTable by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/1187
fix: not consume all the datums if some of them is empty by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1181
chore: add metrics for meta data cache hit rate by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1188
chore: bump to 1.2.6 by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1192

New Contributors

@holicc made their first contribution in https://github.com/CeresDB/ceresdb/pull/1138

Full Changelog: https://github.com/CeresDB/ceresdb/compare/v1.2.5...v1.2.6

v1.2.5

9 months ago

Major Features

Support OceanBase as object store backend (stable now!)
Compaction
- support compact same table concurrently https://github.com/CeresDB/ceresdb/pull/1101
- support picking sst by sequence order; this is required to avoid data corruption for overwrite tables https://github.com/CeresDB/ceresdb/pull/1041
Improve the stability of CeresDB
- ensure shard is opened once https://github.com/CeresDB/ceresdb/pull/1080
- add shard status when heartbeat to meta https://github.com/CeresDB/ceresdb/pull/1082
Enhancement on query and write
- avoid write queue full block https://github.com/CeresDB/ceresdb/pull/1065
- avoid prefetching all sst streams at once https://github.com/CeresDB/ceresdb/pull/1069
- improve performance of thetasketch distinct https://github.com/CeresDB/ceresdb/pull/1102
- query requests dedup https://github.com/CeresDB/ceresdb/pull/1100
- string type supports dictionary; this reduced memory consumption by 30% in our experiments https://github.com/CeresDB/ceresdb/pull/993, https://github.com/CeresDB/ceresdb/pull/1049, https://github.com/CeresDB/ceresdb/pull/1068
Improve the performance of recovery
- open obkv wal with more parallelism https://github.com/CeresDB/ceresdb/pull/1129
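
To illustrate why dictionary support for string columns can save memory, here is a minimal Python sketch of dictionary encoding in general. It is not CeresDB's implementation (see the linked PRs for that); the function names `dict_encode` and `dict_decode` are hypothetical and exist only for this example. Each distinct string is stored once, and every row keeps a small integer code instead of a full copy of the string.

```python
def dict_encode(values):
    """Return (dictionary, codes): unique strings plus one integer code per row."""
    dictionary = []  # distinct values, in first-seen order
    index = {}       # value -> its position in `dictionary`
    codes = []       # one code per input row
    for value in values:
        if value not in index:
            index[value] = len(dictionary)
            dictionary.append(value)
        codes.append(index[value])
    return dictionary, codes


def dict_decode(dictionary, codes):
    """Rebuild the original column from the dictionary and the codes."""
    return [dictionary[code] for code in codes]
```

The saving depends on how often values repeat: a tag column with a few distinct host names benefits a lot, while a column of mostly unique strings gains little or nothing.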

What's Changed

feat: expose more rocksdb options by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/1033
feat: add metrics for memtable by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1036
feat: support opentsdb put api by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/1037
test: add integration test for opentsdb put api by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/1043
feat: sst-metadata support sort by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1042
feat: use dictionary type to store column by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/993
fix: compaction support pick by max_seq by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1041
feat: sql support dictionary column by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1049
chore(deps): bump google.golang.org/grpc from 1.47.0 to 1.53.0 in /integration_tests/sdk/go by @dependabot in https://github.com/CeresDB/ceresdb/pull/1052
refactor: add http write metrics by @baojinri in https://github.com/CeresDB/ceresdb/pull/1045
feat: memtable bitmap for null by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1046
fix: persist sst id by @baojinri in https://github.com/CeresDB/ceresdb/pull/1009
fix: fix kafka wal logs deletion and recovery logic by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1048
feat: split operations of Cluster and Shard, and serialize operations of Shard by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1056
feat: add record batch mem stats by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1058
fix: fix fully flushed region open in kafka wal by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1061
fix: directly return when found no table datas to replay by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1062
feat: warning when query's timestamp is exceeding table's ttl by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1054
refactor: use varint to encode string/bytes length by @baojinri in https://github.com/CeresDB/ceresdb/pull/1060
feat: support query opened shards info by @baojinri in https://github.com/CeresDB/ceresdb/pull/1070
fix: make is_dictionary optional by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1074
fix: add missing dict_id by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1068
feat: support cancellation safe future by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1071
chore: bump version by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1079
feat: use manifest updates stats in TableData to trigger snapshot by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1076
chore: add testcases for time range predicate by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1078
fix: avoid write queue full block by @baojinri in https://github.com/CeresDB/ceresdb/pull/1065
refactor: separate common_util to multiple components by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1077
refactor: remove type encode by @baojinri in https://github.com/CeresDB/ceresdb/pull/1073
fix: ensure shard is opened once by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1080
feat: avoid prefetching all sst streams at once by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1069
fix: separate some modules from common_types by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1084
feat: add shard status when heartbeat to meta by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1082
fix: quote column name in prom query filter by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1083
fix: update pprof version by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1088
refactor: add grpc query nums metrics by @baojinri in https://github.com/CeresDB/ceresdb/pull/1090
chore: bump datafusion version by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1086
refactor: pprof in toml use workspace by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1092
fix: out of range error in module ObjectStore base on OBKV by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/1089
feat: refactor stats method in wal by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1098
fix: wrong encoding when write schema is different by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1103
feat: Improve performance of thetasketch distinct by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1102
feat: support compact table concurrently by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1101
fix: ensure shard lock is release in corner case by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1104
chore: bump pprof version by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1107
chore: add actual scan duration stats into chain iterator by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1109
fix: disable read parquet page index by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1110
chore: remove unused codes about the page index by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1111
fix: logic of keep scheduling compaction by @baojinri in https://github.com/CeresDB/ceresdb/pull/1113
chore: remove space_and_table from the table_impl by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1114
feat: query requests dedup by @baojinri in https://github.com/CeresDB/ceresdb/pull/1100
fix: obkv wal open by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1117

Full Changelog: https://github.com/CeresDB/ceresdb/compare/v1.2.4...v1.2.5

v1.2.4

10 months ago

Major Features

Support shard based recovery to improve performance #976
Improve the stability of Kafka based wal
- Some enhancements on Kafka client #980 #1005
- Refactor wal deletion algorithm #1064
Improve performance of query and write
- Introduce parquet page filter to accelerate query #664
- Optimize bloom filter building process to accelerate flush #967 #975
Support more object store backends
- Support S3 as object store backend #969
- Support OceanBase as object store backend (unstable) #887 #970 #971
Other new features
- Support hex literal in sql #1030
- Improve sst-metadata tool for better debugging #1019

What's Changed

chore: add rationale part in pr template by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/957
chore: remove duplicated metrics by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/956
refactor: replace *join_all with forloop in flush by @baojinri in https://github.com/CeresDB/ceresdb/pull/947
fix: capacity should equal to total / part_num by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/960
chore: add log for stream read of remote engine service by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/961
feat: ignore row group filter when certain column type by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/958
fix: missing metrics when using chain iterator by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/964
refactor: open shard impl of TableEngine by @Rachelint in https://github.com/CeresDB/ceresdb/pull/954
feat: make remote client configurable by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/948
feat: use OceanBase as object store by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/887
feat: add parquet page filter by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/664
refactor: add grpc write failed counter metric by @baojinri in https://github.com/CeresDB/ceresdb/pull/963
feat: expose s3 object store setting by @baojinri in https://github.com/CeresDB/ceresdb/pull/969
fix: use uuid as upload_id in obkv object store by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/970
refactor: optimize sst filter build to consume less CPU by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/967
chore: bump xor8 by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/972
feat: use get_batch to implement get_range by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/971
refactor: optimize sst iterator and filter build to consume less CPU by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/975
refactor: disk cache use partition lock by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/974
refactor: add generic support to generate hasher by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/977
test: add test for hashers by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/979
feat: allow setting multiple kafka boost brokers by @Rachelint in https://github.com/CeresDB/ceresdb/pull/980
refactor: add partition num as param in partition lock's init_fn by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/981
chore: deny dbg-macro in clippy by @Rachelint in https://github.com/CeresDB/ceresdb/pull/983
fix: schema mismatch during write by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/966
refactor: function args of object store by @baojinri in https://github.com/CeresDB/ceresdb/pull/978
chore: reduce kafka logs by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/986
chore: install git in Dockerfile by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/992
refactor: profiling by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/989
feat: support region based wal replay by @Rachelint in https://github.com/CeresDB/ceresdb/pull/976
fix: datumkind size by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/994
chore: add integration test about recovery by @Rachelint in https://github.com/CeresDB/ceresdb/pull/996
refactor: add grpc handler metrics by @baojinri in https://github.com/CeresDB/ceresdb/pull/988
refactor: avoid grpc forwarding twice by @baojinri in https://github.com/CeresDB/ceresdb/pull/991
fix: ensure files can only be picked once by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/995
fix: arrow meta data is lost when decode custom meta data by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1004
fix: avoid any updates after table is closed by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/998
fix: add page index for metadata by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/1000
feat: use instead forked rskafka to support limited retry by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1005
chore: add logs and metric to recovery by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1007
chore: fix logs and style by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1011
chore: bump ceresdbproto by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1021
fix: avoid crash due to empty sql by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1024
feat: add more details about the sst in sst-metadata tool by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/1019
revert: "fix: add page index for metadata (#1000)" by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1026
fix: test_suggest_duration_and_ranges() occasional fail. by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/1028
chore: bump rust client by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1025
feat: add page indexes for metadata by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/1027
feat: support hex literal by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1030
Revert "fix: avoid any updates after table is closed (#998)" by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1034
fix: avoid flush too many small sst file by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/1003
feat: expose more rocksdb options by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/1033
feat: add metrics for memtable by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1036
feat: support opentsdb put api by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/1037
test: add integration test for opentsdb put api by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/1043
feat: sst-metadata support sort by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1042
feat: use dictionary type to store column by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/993
fix: compaction support pick by max_seq by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1041
feat: sql support dictionary column by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1049
chore(deps): bump google.golang.org/grpc from 1.47.0 to 1.53.0 in /integration_tests/sdk/go by @dependabot in https://github.com/CeresDB/ceresdb/pull/1052
refactor: add http write metrics by @baojinri in https://github.com/CeresDB/ceresdb/pull/1045
feat: memtable bitmap for null by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/1046
fix: persist sst id by @baojinri in https://github.com/CeresDB/ceresdb/pull/1009
fix: fix kafka wal logs deletion and recovery logic by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1048
feat: split operations of Cluster and Shard, and serialize operations of Shard by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1056
feat: add record batch mem stats by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/1058
fix: fix fully flushed region open in kafka wal by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1061
fix: directly return when found no table datas to replay by @Rachelint in https://github.com/CeresDB/ceresdb/pull/1062
feat: warning when query's timestamp is exceeding table's ttl by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/1054
refactor: use varint to encode string/bytes length by @baojinri in https://github.com/CeresDB/ceresdb/pull/1060

Full Changelog: https://github.com/CeresDB/ceresdb/compare/v1.2.2...v1.2.4

v1.2.2

11 months ago

Major Features

Enhancement on proxy module:
- #875 Support influxdb api with proxy
- #886 #896 Refactor sql query with proxy
- #932 Support querying partitioned sub-table by http query
- #878 Support forwarding of Prometheus remote write request
Improvement of write performance:
- #879 Support merge small write requests
- #918 Skip building column schema when columns already exist
Enhancement on debugging tools:
- #909 Support tokio console for debugging
- #927 Add sst-metadata tool to query sst metadata

Bug Fix

Cluster
- #908 Fix the problem that shard cannot be closed normally
- #941 Fix the problem that the shard cannot be opened normally due to deadlock
Compaction
- #910 Limit input sst size when compact for old bucket
- #915 Ensure pick at least 2 files for compaction
Proxy
- #911 Fix auto create table without CeresMeta
PromQL
- #901 Fix reserve column case when build plan

What's Changed

chore: add integration test for dropping partition table by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/870
chore: remove udeps in CI by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/877
feat: impl influxdb api with proxy by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/875
feat: place table datas into manifest, update them together by @Rachelint in https://github.com/CeresDB/ceresdb/pull/863
fix: remove explanation in PR tmpl by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/883
chore: bump the version to 1.2.0 and update the config example by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/880
fix: remove unused write_worker module by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/884
fix: snappy decode error for prom remote write by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/882
feat: impl forwarding prom write by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/878
feat: add metrics for http requests by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/876
chore: add aliyun docker hub by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/881
feat: impl mysql query with proxy by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/886
fix: new multiple write requests when tag names are different by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/888
chore: remove garbage create table logs by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/890
feat: support merge small write requests by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/879
refactor: proxy write and route by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/889
chore: reflush memory tables after flush failed by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/891
chore: bump datafusion by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/894
refactor: proxy sql read by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/896
fix: auto create table by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/895
chore: adjust log level by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/899
feat: support for disabling router cache by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/903
chore: remove parse table name in integration-test by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/905
fix: reserve column case when build plan by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/901
feat: simplify the PR tmpl by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/885
feat: support tokio console for debugging by @Rachelint in https://github.com/CeresDB/ceresdb/pull/909
feat: add shard related methods to table engine by @Rachelint in https://github.com/CeresDB/ceresdb/pull/897
fix: limit input sst size when compact for old bucket by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/910
fix: auto_create_table without ceresmeta by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/911
fix: remove shard from cluster topology after shard closed by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/908
chore: remove useless code by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/906
refactor: use binary bits to determine the number of partitions by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/919
refactor: adopt parquet arrow async writer by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/922
fix: ensure pick at least 2 files for compaction by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/915
feat: Implement multipart upload of object store by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/913
refactor: find new columns to improve write performance by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/918
chore: replace DefaultHash with AHasher by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/928
fix: Optimize write time cost when periodic full compaction by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/907
fix: http query support querying partitioned sub-table by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/932
chore: add hint about ahash usage by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/931
feat: add sst-metadata tool to query sst metadata by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/927
chore: add cargo clippy --fix to Makefile by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/937
chore: bump obkv client version by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/938
refactor: use partition lock for memory cache by @tanruixiang in https://github.com/CeresDB/ceresdb/pull/936
fix: deadlock when stop keepalive bg task by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/941
fix: too frequent flush by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/943
fix: write cancel when flush pending write queue by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/940
chore: check compact_files is empty by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/949
feat: upgrade vergen and remove dep libgit2 by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/951
chore: bump the version to 1.2.2 by @ZuLiangWang in https://github.com/CeresDB/ceresdb/pull/952
fix: the problem of inconsistent error message returned when auto_create_table fails by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/942

New Contributors

@tanruixiang made their first contribution in https://github.com/CeresDB/ceresdb/pull/919

Full Changelog: https://github.com/CeresDB/ceresdb/compare/v1.2.0...v1.2.2

v1.2.0

1 year ago

Upgrade Guide

NOTE: this guide applies only when upgrading CeresDB v1.1.0 to CeresDB v1.2.0. Ignore it if you want to deploy a brand new CeresDB cluster with v1.2.0.

v1.2.0 contains some incompatible changes, so it's important to upgrade carefully:

1. First, stop all the instances of CeresDB and CeresMeta;
2. Upgrade CeresMeta first by referring to the Upgrade Guide of CeresMeta;
3. When upgrading CeresDB, update the config:

- Change the config section [analytic.compaction_config] to [analytic.compaction] if you use it;
- Add the config section [cluster_deployment.etcd_client] if your CeresDB cluster is in WithMeta mode:

[cluster_deployment.etcd_client]
server_addrs = ['127.0.0.1:2379']
root_path = "/rootPath"

NOTE: the root_path must be /rootPath if upgrading from v1.1.0.

4. After updating the CeresDB config, start the CeresDB server.
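
The two config changes in the upgrade guide above can be applied by hand, but for several nodes a small script may help. The following is a minimal Python sketch, assuming the config is a TOML file read into a string; the helper name `migrate_config` is hypothetical and not part of CeresDB. It only renames the section header and appends the etcd client section from the guide; it does not check whether any keys inside the renamed section also need to change.

```python
import re

# Section taken from the upgrade guide; adjust server_addrs for your cluster.
ETCD_CLIENT_SECTION = """[cluster_deployment.etcd_client]
server_addrs = ['127.0.0.1:2379']
root_path = "/rootPath"
"""


def migrate_config(text: str, with_meta: bool = True) -> str:
    """Apply the two v1.2.0 config changes to a TOML config held as text."""
    # 1. Rename the section header. Only the exact header line is touched;
    #    the keys inside the section are left as they are.
    text = re.sub(
        r"^\[analytic\.compaction_config\]",
        "[analytic.compaction]",
        text,
        flags=re.MULTILINE,
    )
    # 2. In WithMeta mode, append the etcd client section if it is missing.
    if with_meta and "[cluster_deployment.etcd_client]" not in text:
        text = text.rstrip("\n") + "\n\n" + ETCD_CLIENT_SECTION
    return text
```

Running the function a second time on its own output leaves the text unchanged, so it is safe to re-run. Review the result before starting the server, and keep a backup of the original file.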

Major Features

Enhancement on InfluxQL support:
- Support query with aggregators;
- #854 optimizes the influxql planner to load tables on demand instead of loading all of them when initializing the planner;
- Replace influxdb_iox with CeresDB/influxql to remove unnecessary dependencies introduced by influxdb_iox;
Enhancement on proxy module:
- Implement the proxy as a separate module;
- Support forward table requests in proxy;
- Support read and write on partition table in proxy;
- Recover the metadata of partition table from CeresMeta instead of CeresDB in proxy;
Improvement of write performance:
- #822 fixes the problem that the compaction schedule triggered by the flush procedure may block the write procedure;
- #814 is a big change set that replaces the write queue with a table-level lock to reduce write contention;
- #843 adjusts the flush strategy to avoid frequent write stalls;
- #861 introduces level 1 for SSTs; currently a level 0 SST, which is generated by flushing, won't contain complex indexes (e.g. xor-filter), leading to faster flushing and fewer write stalls;
Enhancement on observability:
- #774 introduces the hotspot recorder that can be used to find out the top tables with the highest write/read throughput in a specific time window;
- #827 #831 provide more metrics for all the stages of the write procedure, which can be used to troubleshoot write performance problems; the grafana dashboard config has already been updated;
- #817 introduces the CPU profiler; a CPU flamegraph can be generated with a single HTTP request to the CeresDB server;
Support the new mechanism of failover and load balancing; for more details, refer to the [Release Note v1.2.0] of CeresMeta:
- #706 #853 implement distributed shard locks based on etcd: a shard can only be opened or closed while its shard lock is held, so data corruption caused by multiple shard leaders is avoided completely;
- Support automatic failover of CeresDB nodes, so that service recovery is handled without any manual intervention;
- Support automatic load balancing based on consistent hashing, which keeps shards evenly distributed across the nodes of the cluster when the number of cluster nodes increases or decreases;
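
As background for the load balancing item above, here is a minimal Python sketch of consistent hashing with virtual nodes. It illustrates the general technique only and is not CeresMeta's implementation; the class `HashRing`, the helper `moved_shards`, the node names and the choice of SHA-256 are all assumptions made for this example. The property it demonstrates is that when a node joins, shards move only to the new node and never between the existing ones.

```python
import bisect
import hashlib


def _hash(key: str) -> int:
    """Map a string to a stable 64-bit position on the ring."""
    return int.from_bytes(hashlib.sha256(key.encode("utf-8")).digest()[:8], "big")


class HashRing:
    """Consistent-hash ring with virtual nodes (illustrative only)."""

    def __init__(self, nodes, vnodes=64):
        # Each node is placed on the ring `vnodes` times to smooth the spread.
        self._ring = sorted(
            (_hash(f"{node}#{i}"), node) for node in nodes for i in range(vnodes)
        )
        self._positions = [pos for pos, _ in self._ring]

    def node_for(self, shard_id: int) -> str:
        """Return the node owning the first ring position after the shard's hash."""
        idx = bisect.bisect_right(self._positions, _hash(f"shard-{shard_id}"))
        return self._ring[idx % len(self._ring)][1]


def moved_shards(before: HashRing, after: HashRing, shard_ids):
    """List the shards whose owner differs between two rings."""
    return [s for s in shard_ids if before.node_for(s) != after.node_for(s)]
```

For example, building one ring with three nodes and another with a fourth node added, then calling `moved_shards` on both, returns only shards that now belong to the fourth node. How evenly shards spread depends on the hash function and the number of virtual nodes, so a real scheduler may add further balancing on top.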

Thanks

Heartfelt thanks to @zouxiang1993 for helping troubleshoot write performance issues.

What's Changed

fix: simplify the logs in query path (#770) by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/776
fix: remove FixedSizeArena by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/772
chore(deps): bump time from 0.1.44 to 0.3.15 by @dependabot in https://github.com/CeresDB/ceresdb/pull/761
feat: add default schema config by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/782
fix: remove body limit for influxql request by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/783
feat: add integration tests for influxql request by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/784
feat: add java integration tests by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/786
chore(deps): bump log4j-core from 2.8.2 to 2.17.1 in /integration_tests/sdk/java by @dependabot in https://github.com/CeresDB/ceresdb/pull/789
chore(deps): bump junit from 4.12 to 4.13.1 in /integration_tests/sdk/java by @dependabot in https://github.com/CeresDB/ceresdb/pull/788
fix: timestamp column should not be auto added by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/787
chore: route use read_runtime by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/794
feat: influxql support show measurements by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/795
chore: bump version to 1.1.0 by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/797
feat: impl getTableInfo in remoteEngine service by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/793
feat: add rust sdk test by @Rachelint in https://github.com/CeresDB/ceresdb/pull/791
fix: avoid error when disk cache miss by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/790
feat: impl get_table_info in remote_engine_client by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/798
fix: avoid send empty record batch to client by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/796
chore: remove useless cluster_version by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/804
refactor: make tsbs more configurable by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/805
fix: avoid break when drop wal table failed by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/806
feat: implement route interface in http protocol by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/803
refactor: bump datafusion, add influxql aggregator support by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/778
fix: add router when build request context for mysql by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/809
feat: hotspot recorder by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/774
feat: introduce TableOperator to encapsulate operation of tables by @Rachelint in https://github.com/CeresDB/ceresdb/pull/808
feat: expose rocksdb background jobs option by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/812
feat: integration test support env filter by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/811
chore: bump datafusion by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/810
feat: convert nanoseconds to milliseconds automatically by @dust1 in https://github.com/CeresDB/ceresdb/pull/780
feat: add cpu profiler by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/817
feat: upgrade rust-rocksdb by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/821
feat: avoid blocking the write procedure because of compaction schedule by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/822
feat: query partition table with proxy in grpc service by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/802
feat: influxql support fill syntax by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/824
feat: install dev dependencies in make file by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/815
chore: remove unused dependency by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/823
feat: replace bg runtime with default and compact runtime by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/826
chore: add commit id of nightly docker image by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/829
chore: add write batch metrics by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/827
feat: http query with proxy by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/807
feat: add metrics for write procedure by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/831
feat: impl prom query with proxy by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/833
feat: support write partition table in grpc service by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/828
fix: improve remote write performance by using separate runtime by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/837
chore: update ob client version by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/835
chore: remove unnecessary deps by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/838
chore(deps): bump h2 from 0.3.16 to 0.3.17 by @dependabot in https://github.com/CeresDB/ceresdb/pull/841
feat: tsbs support more write options by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/839
feat: support write batch in remote engine by @Rachelint in https://github.com/CeresDB/ceresdb/pull/840
feat: serialize table operations by lock rather than queue by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/814
feat: avoid frequent write stall by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/843
fix: wrong default write batch size for run_tsbs by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/845
chore: clean forward configs by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/847
feat: refactor manifest to get snapshot in memory by @Rachelint in https://github.com/CeresDB/ceresdb/pull/825
chore: rename module sql to query_frontend by @Rachelint in https://github.com/CeresDB/ceresdb/pull/849
feat: forward request in grpc write by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/844
chore: bump obkv client version by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/850
feat: support domain name as the ceresdb node addr by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/852
refactor: implement the distributed lock of shard by @ZuLiangWang in https://github.com/CeresDB/ceresdb/pull/706
feat: compaction support different level by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/848
fix: avoid panic when convert prom result by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/851
feat: only collecting all tables on demand in influxql planner by @Rachelint in https://github.com/CeresDB/ceresdb/pull/854
refactor: shard lock module by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/853
feat: support prom remote query forward by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/855
feat: support querying partition table in prom query and http query by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/857
fix: build filter when needed by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/861
feat: rename the compaction_config to compaction and adjust interval by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/862
refactor: implement prom remote query by convert to datafusion plan directly by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/860
refactor: remove runtime from request context by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/859
test: add prometheus integration tests by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/864
chore: proxy as a separate module by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/865
fix: fix write partition table by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/869
chore: add some commands in Makefile by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/866
fix: fix evict logic in remote client by @Rachelint in https://github.com/CeresDB/ceresdb/pull/872
fix: drop partition table by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/871
chore!: rename table name in table_kv based wal by @Rachelint in https://github.com/CeresDB/ceresdb/pull/868
Revert "chore!: rename table name in table_kv based wal" by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/873

Full Changelog: https://github.com/CeresDB/ceresdb/compare/v1.1.0...v1.2.0

v1.1.0

1 year ago

Major features

Initial support for InfluxQL; usage can be found here.
Introduce proxy module:
- Support automatically updating the schema when a new column appears in a write request;
- Support forwarding streaming SQL queries; usage can be found here.
Optimize the SST write/read process to consume less memory.
The explain analyze [SQL] statement can now show the details of the scan procedure.
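
The automatic schema update listed above can be pictured with a minimal sketch. It assumes a schema modeled as a plain dict of column name to type name, and the helper names `missing_columns` and `auto_add_columns` are hypothetical; this is not the proxy module's actual code.

```python
def missing_columns(schema, row):
    """Return the columns present in the row but absent from the schema, in row order."""
    return [name for name in row if name not in schema]


def auto_add_columns(schema, row):
    """Extend the schema in place with any new columns found in the row.

    Assumption for illustration: the column type is taken from the Python
    type of the written value.
    """
    added = missing_columns(schema, row)
    for name in added:
        schema[name] = type(row[name]).__name__
    return added


if __name__ == "__main__":
    schema = {"t": "int", "value": "float"}
    print(auto_add_columns(schema, {"t": 1, "value": 0.5, "host": "node-a"}))
    print(schema)
```

A write that carries only known columns leaves the schema untouched and returns an empty list.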

What's Changed

chore: push nightly image to ghcr.io by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/684
fix: nightly docker image name by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/687
refactor: move grpc create table to sql crates by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/689
chore: image name must be lowercase by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/691
feat: add go sdk tests by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/686
refactor: replace tokio lock with std lock in some sync scenarios by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/694
docs: change the description of qr-code by @archerny in https://github.com/CeresDB/ceresdb/pull/697
fix: Parse string without specify time zone to local time stamp by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/692
ci: upgrade the Rust version used in CI by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/700
refactor: remove table flush policy by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/704
fix: avoid file purge when they are used in queries by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/699
feat: support rewrite basic raw query in influxql by @Rachelint in https://github.com/CeresDB/ceresdb/pull/683
refactor: remove cluster version by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/669
chore: update pull_request_template.md by @hehex9 in https://github.com/CeresDB/ceresdb/pull/708
chore: modify code coverage trigger conditions by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/709
chore: add concrete tag to log when write failed by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/707
feat: support the simplest influxql raw query by @Rachelint in https://github.com/CeresDB/ceresdb/pull/710
feat: replace native-tls with rustls by @dust1 in https://github.com/CeresDB/ceresdb/pull/701
feat: support auto create table config by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/713
feat: support integration tests for influxql by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/719
refactor: remove custom oss impl by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/720
docs: replace qr-code by @archerny in https://github.com/CeresDB/ceresdb/pull/725
feat: add influxdb write by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/723
fix: Panicked when OceanBase table client is initialing by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/728
chore: update issue template by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/731
feat: new crate trace_metric for collecting metrics in read procedure by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/714
fix: div zero when compaction by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/734
fix: failure to open a single table does not interrupt the shard's opening process by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/722
fix: remove compaction retry when memory limit by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/739
feat: http debug api for config by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/733
feat: reuse logical planner in influxdb_iox by @Rachelint in https://github.com/CeresDB/ceresdb/pull/730
fix: start http server after table recovery finished by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/741
feat: return error while encountering unsupport from in influxql by @Rachelint in https://github.com/CeresDB/ceresdb/pull/745
feat: don't allow create table which failed when open by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/743
feat: remove replace table level metrics with aggregate metrics by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/740
feat: introduce proxy module by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/732
feat: implement route cache by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/748
feat: block all query requests by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/751
chore: modify workflows by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/744
feat: build sst in stream way by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/747
feat: auto add column by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/749
fix: move table engine proxy to table_engine crate by @Rachelint in https://github.com/CeresDB/ceresdb/pull/755
chore: upgrade influxql-logical-planner version and modify CI setting by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/753
feat: support split write request to batches for small wal logs by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/754
chore: use influxdb line protocol in crates by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/757
feat: configurable record batches in flight by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/759
fix: rename as_bytes to as_byte in ReadableSize (#428) by @zouxiang1993 in https://github.com/CeresDB/ceresdb/pull/767
feat: convert the influxql result using influxdb format by @Rachelint in https://github.com/CeresDB/ceresdb/pull/758
chore: fix insert-license pre-commit by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/756
chore: replace unfold with async-stream by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/763
feat: make influxql query interface compatible with influxdb1.8 by @Rachelint in https://github.com/CeresDB/ceresdb/pull/773

New Contributors

@hehex9 made their first contribution in https://github.com/CeresDB/ceresdb/pull/708

Full Changelog: https://github.com/CeresDB/ceresdb/compare/v1.0.0...v1.1.0

v1.0.0

1 year ago

Major features

Support prometheus remote storage protocol
Query performance improvement and resource control
- Replace bloom filter with XOR8 filter
- Add timeout for query
- Add a route cache to the remote engine client used by partitioned tables
- Support locating partitions for SQL IN expressions on partitioned tables
Internal refactor
- Refactor sst module for better extensibility
- Refactor manifest module
Refactor grpc storage service
Add integration test for cluster mode
Bug fix
- Fix SQL identifier case sensitivity
- Correct the order of sync meta snapshot and clean logs in wal on kafka
- Update flushed_sequence_num after compaction
- Fix varbinary type error
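
The item about locating partitions for IN expressions in the list above can be sketched as follows: each value in the IN list is mapped to a partition, and only the resulting set of partitions needs to be queried. The sketch assumes a hash-modulo partition rule with CRC32 as a stand-in hash; `locate_partition` and `partitions_for_in` are hypothetical names, not the project's API.

```python
import zlib


def locate_partition(value, partition_num):
    """Map one partition-key value to a partition index (assumed hash-modulo rule)."""
    if partition_num <= 0:
        raise ValueError("partition_num must be positive")
    return zlib.crc32(str(value).encode("utf-8")) % partition_num


def partitions_for_in(values, partition_num):
    """Return the sorted partition indexes that can hold any value of an IN list."""
    return sorted({locate_partition(v, partition_num) for v in values})


if __name__ == "__main__":
    # e.g. WHERE name IN ('host-1', 'host-2') on a table with 8 partitions
    print(partitions_for_in(["host-1", "host-2"], 8))
```

The result never contains more partitions than there are values in the IN list, so a short list prunes most sub-tables from the scan.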

What's Changed

Revert "refactor: separate object store from parquet sst async reader… by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/536
chore: add version in PartitionInfo by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/537
feat: support drop partition table by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/532
docs: wrong schema in the static routing example config by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/543
chore(deps): bump tokio from 1.22.0 to 1.24.1 by @dependabot in https://github.com/CeresDB/ceresdb/pull/546
chore(deps): bump lz4-sys from 1.9.3 to 1.9.4 by @dependabot in https://github.com/CeresDB/ceresdb/pull/545
docs: update example cluster config by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/548
avoid routing for non-existent table by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/551
fix: Revert "feat: support bloom filter in hybrid format (#479)" by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/549
feat: add open_table_on_shard and close_table_on_shard in meta_event_service by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/542
feat: disable do_snapshot when recover table data by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/550
feat: use arrow-ipc to communicate between remote server and client by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/552
feat: add timeout for query by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/539
Supplement ceresdb docs by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/555
support schema header for http/grpc service by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/544
feat: add cached router for remote engine client by @Rachelint in https://github.com/CeresDB/ceresdb/pull/547
chore: add some metrics for PartitionTable and grpc service by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/560
fix: truncated error message by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/564
chore: change the prefix of sub partition table name to "__" by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/566
feat: pass partition info to ceresmeta when create table by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/568
chore: add code of conduct by @archerny in https://github.com/CeresDB/ceresdb/pull/575
chore: add llvm cov codecov by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/574
feat: support querying partition tables through select in by @ZuLiangWang in https://github.com/CeresDB/ceresdb/pull/530
feat: add permission check for sub tables in table partition by @Rachelint in https://github.com/CeresDB/ceresdb/pull/541
feat: improve the release profile and add release-slim profile by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/580
feat: add some logs for reading sst by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/581
feat: merge breaking changes to the main branch by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/583
chore: update the roadmap by @archerny in https://github.com/CeresDB/ceresdb/pull/584
chore(deps): bump git2 from 0.14.2 to 0.16.1 by @dependabot in https://github.com/CeresDB/ceresdb/pull/588
chore(deps): bump bumpalo from 3.10.0 to 3.12.0 by @dependabot in https://github.com/CeresDB/ceresdb/pull/586
chore: add codecov.yml to ignore some dirs that don't need code coverage by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/591
feat: implement prom remote storage api by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/578
feat restrict partition num by @Rachelint in https://github.com/CeresDB/ceresdb/pull/598
chore: remove docs by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/599
chore(deps): bump warp from 0.3.2 to 0.3.3 by @dependabot in https://github.com/CeresDB/ceresdb/pull/602
fix: re-enable SingleDistinctToGroupBy by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/594
chore: fix missing deps in CI by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/603
chore: make nightly image name readable by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/604
fix: remove unnecessary type conversion in MysqlWorker by @Huachao in https://github.com/CeresDB/ceresdb/pull/606
chore: remove lto by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/609
feat: print error in datanode by @Rachelint in https://github.com/CeresDB/ceresdb/pull/613
chore(deps): bump tokio from 1.24.1 to 1.25.0 by @dependabot in https://github.com/CeresDB/ceresdb/pull/610
feat: prevent from pushing down filter of non primary key or ts key by @Rachelint in https://github.com/CeresDB/ceresdb/pull/611
test: add testcase for timestamp not in primary key by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/619
feat: adapt new protocol by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/585
feat: make bloom filter optional on column level by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/615
primary keys defaults to tsid, timestamp by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/621
chore: add logs by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/614
feat: Upgrade to Datafusion 17 by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/601
fix: avoid some unwrap with result by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/624
feat: store manifest to oss by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/607
chore: rename meta module to manifest in analytic crate by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/625
feat: add BoxError trait to simplify the way to create generic error by @Rachelint in https://github.com/CeresDB/ceresdb/pull/627
test: add test for issue 302 by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/618
chore: remove useless code by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/630
chore: modify wal config by @Rachelint in https://github.com/CeresDB/ceresdb/pull/629
chore: run ci in parallel by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/632
feat: add cluster mode system table by @ZuLiangWang in https://github.com/CeresDB/ceresdb/pull/464
feat: use ConfigOptions to transfer custom settings by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/639
docs: udpate readme-cn by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/600
feat: replace bloom with xor8 filter by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/631
chore: refacor ceresdb config by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/640
feat: support request context by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/636
feat: allow mapping between sequence and kafka offset incomplete by @Rachelint in https://github.com/CeresDB/ceresdb/pull/642
fix: correct the order of sync meta snapshot and clean logs in wal on kafka by @Rachelint in https://github.com/CeresDB/ceresdb/pull/643
chore: move proto to ceresdbproto repo by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/644
feat: map shard id to region id in manifest by @Rachelint in https://github.com/CeresDB/ceresdb/pull/645
fix: sql identifier default to case-sensitive by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/641
fix: update flushed_sequence_num after compaction by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/651
refactor: adjust configuration by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/649
feat: replace column index with id when persisting schema in manifest by @Rachelint in https://github.com/CeresDB/ceresdb/pull/652
refactor: reuse client in manifest wal and normal wal by @Rachelint in https://github.com/CeresDB/ceresdb/pull/656
fix: fix varbinary type error by @ZuLiangWang in https://github.com/CeresDB/ceresdb/pull/653
refactor: use enum instead of version for encoding format in pb by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/655
chore: fix some clippy errors & add clippy.toml by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/658
refactor: remove encoder method from WalManager by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/661
chore: add integration test for cluster mode. by @ZuLiangWang in https://github.com/CeresDB/ceresdb/pull/646
feat: Support DataTypes of Date and Time by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/657
chore: normalize http api by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/659
fix: xor filter build by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/666
chore: fix CI upload log by @ZuLiangWang in https://github.com/CeresDB/ceresdb/pull/667
feat: remove avro encoder by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/660
feat: remove status in readme by @MachaelLee in https://github.com/CeresDB/ceresdb/pull/671
chore: add rustc version by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/672
test: add partition_table test in integration_test by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/668
chore: bump client by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/673
chore: rename some files and configuration by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/674
chore: update log level by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/677
fix: can't recognize aliyun oss NotFoundError when load manifest by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/678
fix: remove auto create table option by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/679
chore: update readme by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/680
chore: release 1.0.0 by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/676

New Contributors

@dependabot made their first contribution in https://github.com/CeresDB/ceresdb/pull/546

Full Changelog: https://github.com/CeresDB/ceresdb/compare/v1.0.0-alpha02...v1.0.0

v1.0.0-alpha02

1 year ago

Major features

Table Partition
- Support key partition (MySQL-like syntax).
- Implement PartitionTable to support query and write for partitioned tables.
Improve query performance
- Add disk based object store cache. (unstable)
- Implement parallel get_byte_ranges for ObjectStoreReader.
- Scan row groups in one sst file in parallel.
- Support bloom filter in hybrid format.
- Support MergeIterator to pull data concurrently.
Support auto query forwarding for grpc.
Normalize SQL case handling and make clear that SQL is case-sensitive.
Chore
- Migrate current harness tests to sqlness, by @dust1.
- Support memory usage limit on background compaction.
- Make the bloom filter optional in sst meta.
Bug fix
- Fix wrong primary key when defining tsid and timestamp key as primary key.
- Fix wrong path in the result from StoreWithPrefix.
- Fix lru-weighted-cache memory leak.
- Fix some bugs in background compaction.
- Fix wrong profile output os path for heap profiling.
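
To illustrate the key partition feature listed above, here is a minimal sketch that hashes the partition key values and takes the result modulo the partition count. It is an assumption-laden illustration rather than the actual partition rule: the function name `locate_partition` is hypothetical, and CRC32 stands in for whatever hash the real implementation uses.

```python
import zlib


def locate_partition(key_values, partition_num):
    """Map a tuple of partition-key values to a partition index.

    Hypothetical helper: CRC32 is only a stand-in hash for the example.
    """
    if partition_num <= 0:
        raise ValueError("partition_num must be positive")
    # Join with a NUL separator so that ("ab", "c") and ("a", "bc")
    # are encoded differently before hashing.
    encoded = "\x00".join(str(v) for v in key_values).encode("utf-8")
    return zlib.crc32(encoded) % partition_num


if __name__ == "__main__":
    for host in ["host-1", "host-2", "host-3"]:
        print(host, "->", locate_partition((host,), 4))
```

Because the mapping depends only on the key values and the partition count, every write and query for the same key is routed to the same partition.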

What's Changed

feat: convert table name to lowercase when not quoted by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/444
fix: object_store config by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/443
feat: add disk based object store cache by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/420
feat: add dynamic setting log level by @ZuLiangWang in https://github.com/CeresDB/ceresdb/pull/445
feat: add disk cache value crc by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/447
refactor: make http max body size configurable and modify default value by @Rachelint in https://github.com/CeresDB/ceresdb/pull/451
chore: use default value for StorageOptions by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/455
chore: modify some config in workflows by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/457
fix: wrong primary key when define tsid and timestamp key as primary key by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/453
feat: impl parallel get_byte_ranges for ObjectStoreReader by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/450
feat: convert table name to quoted style by @Rachelint in https://github.com/CeresDB/ceresdb/pull/454
refactor: remove enable_tsid_primary_key by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/459
chore: bump oss sdk by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/458
feat: make rocksdb wal compatible with other distributed implementation by @Rachelint in https://github.com/CeresDB/ceresdb/pull/460
fix: kill ceresdb-server if run harness failed by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/461
feat: support prefix for object store by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/463
fix: wrong path in the result from StoreWithPrefix by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/465
feat: make initializing of buffered streams in MergeIterator concurrent by @Rachelint in https://github.com/CeresDB/ceresdb/pull/466
chore: add metrics oss by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/478
chore: bump oss sdk by @ZuLiangWang in https://github.com/CeresDB/ceresdb/pull/475
feat: support memory usage limit on background compaction by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/476
fix: lru-weighted-cache mem leak by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/480
chore: support Cargo.toml format check by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/481
feat: support bloom filter in hybrid format by @Rachelint in https://github.com/CeresDB/ceresdb/pull/479
fix: compact table in background scheduler by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/482
fix: wrong profile output os path for heap profiling by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/484
feat: add max_input_sstable_size by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/483
feat: scan row groups in one sst file parallelly by @Rachelint in https://github.com/CeresDB/ceresdb/pull/474
feat: migrate current harness tests to sqlness by @dust1 in https://github.com/CeresDB/ceresdb/pull/473
feat: make the bloom filter optional in sst meta by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/488
feat: support store with readonly cache by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/490
feat: support ignore bloomfilter when compaction by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/491
fix: remove EncodingWriter by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/493
feat: support create partition table by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/485
feat: support parsing partition table creating by @Rachelint in https://github.com/CeresDB/ceresdb/pull/487
feat: support query limit by rule by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/494
chore: adjust usage of sst type by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/499
feat: introduce ObjectStorePicker to replace the two object stores by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/496
feat: define remote table engine trait by @Rachelint in https://github.com/CeresDB/ceresdb/pull/502
feat: define partition rule trait by @Rachelint in https://github.com/CeresDB/ceresdb/pull/501
refactor: separate object store from parquet sst async reader by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/503
refactor: recovery in standalone mode by @Rachelint in https://github.com/CeresDB/ceresdb/pull/414
feat: support parse key partition by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/506
chore: define remote_engine grpc service by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/505
feat: impl key partition rule by @Rachelint in https://github.com/CeresDB/ceresdb/pull/507
feat: auto forward query for grpc by @ShiKaiWi in https://github.com/CeresDB/ceresdb/pull/511
feat: impl remote_engine grpc service by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/508
feat: Impl remote sdk by @Rachelint in https://github.com/CeresDB/ceresdb/pull/509
feat: upgrade sqlness to latest version by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/517
feat: update ceresdb proto & adapter to partition info by @ZuLiangWang in https://github.com/CeresDB/ceresdb/pull/515
fix: fix conversion by avro by @Rachelint in https://github.com/CeresDB/ceresdb/pull/519
fix: fix convert record batch by @ZuLiangWang in https://github.com/CeresDB/ceresdb/pull/521
feat: impl PartitionTable by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/512
fix: fix partition table affected_rows by @chunshao90 in https://github.com/CeresDB/ceresdb/pull/522
chore: add more context for query log by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/523
chore: bump sqlness to 0.1.1 by @jiacai2050 in https://github.com/CeresDB/ceresdb/pull/524
fix: fix integration test's bad case by @Rachelint in https://github.com/CeresDB/ceresdb/pull/527

Full Changelog: https://github.com/CeresDB/ceresdb/compare/v1.0.0-alpha01...v1.0.0-alpha02