# Blacklite
[![Maven central](https://img.shields.io/badge/mavencentral-com.tersesystems.blacklite%3Ablacklite--logback-blue.svg)](https://search.maven.org/#search%7Cgav%7C1%7Cg%3A%22com.tersesystems.blacklite%22%20AND%20a%3A%22blacklite-logback%22)
[![License Apache-2.0](https://img.shields.io/badge/license-Apache--2.0-blue.svg)](https://www.tldrlegal.com/l/apache2)[![CI](https://github.com/tersesystems/blacklite/actions/workflows/gradle.yml/badge.svg)](https://github.com/tersesystems/blacklite/actions/workflows/gradle.yml)
Blacklite is an [SQLite](https://www.sqlite.org/index.html) appender that is intended for cases where you want a buffer of logging data, and also want the option of querying logs from different processes with a built-in query language.
So why use Blacklite? Logback does come with a built-in [circular buffer appender](https://github.com/qos-ch/logback/blob/master/logback-core/src/main/java/ch/qos/logback/core/read/CyclicBufferAppender.java), but there is no way to dump it on command. The terse-logback [ring buffer](https://tersesystems.github.io/terse-logback/1.0.0/guide/ringbuffer/) classes will dump a ring buffer, but they flush the entire buffer to the logs. This works in a single-user or client scenario as described in [Using Ring Buffer Logging to Help Find Bugs](http://www.exampler.com/writing/ring-buffer.pdf), but not in a server-side environment, where many concurrent operations and a much greater volume of logs would make "dumping" inappropriate.
Blacklite provides this functionality by using a queue **roughly equivalent to an in-memory ring buffer**, and then writing to a database configured for write throughput using [memory mapping](https://www.sqlite.org/mmap.html) and [write ahead logging](https://sqlite.org/wal.html). Using SQLite, the buffer can be queried instead of dumped, and a much broader ecosystem of tools can be used than with an in-memory ring buffer.
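The write-oriented SQLite settings mentioned above can be sketched with Python's built-in `sqlite3` module. The PRAGMA values and the `live.db` path here are illustrative, not Blacklite's actual defaults:

```python
import sqlite3

# Open a database tuned for write throughput, along the lines described above.
conn = sqlite3.connect("live.db")
conn.execute("PRAGMA journal_mode=WAL")     # write-ahead logging: appends, not rewrites
conn.execute("PRAGMA mmap_size=268435456")  # memory-map up to 256 MB of the file
conn.execute("PRAGMA synchronous=NORMAL")   # fewer fsyncs; durable enough under WAL
```

With WAL enabled, readers do not block the single writer, which is what makes querying the live buffer from other processes practical.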
Practically speaking, with some decent hardware you can budget around 800 debugging statements per 1 ms request -- see [benchmarks](BENCHMARKS.md). Using [conditional logging](https://github.com/tersesystems/echopraxia#why-conditions), you can turn on debug logging in production and get a complete picture of what a single request is doing. See [echopraxia-examples](https://github.com/tersesystems/echopraxia-examples) and [terse-logback-showcase](https://github.com/tersesystems/terse-logback-showcase) for a live demonstration.
Blog post [here](https://tersesystems.com/blog/2020/11/26/queryable-logging-with-blacklite/).
## Core Features
Blacklite supports both [Logback](http://logback.qos.ch/) and [Log4J 2](https://logging.apache.org/log4j/2.x/).
Blacklite writes to a single table with the following structure:
```sql
CREATE TABLE IF NOT EXISTS entries (
  epoch_secs LONG,  -- number of seconds since epoch
  nanos INTEGER,    -- nanoseconds within the second
  level INTEGER,    -- numeric logging level
  content BLOB      -- raw bytes from the logging framework encoder / layout
);
```

The `content` column contains the log entry itself, as bytes. The only other columns are longs and integers. There are no indexes and no autoincrement field, so logs stored in Blacklite are the same size as raw files. In addition, using an SQLite file means [total compatibility](https://sqlite.org/locrsf.html) and support across all platforms.

The appender incorporates a queue that is bounded by default to a maximum capacity of 1,048,576 entries; you can add a [budget filter](https://tersesystems.github.io/terse-logback/1.0.0/guide/budget/) to impose a limit on the number of entries logged in a given duration. The queue must be bounded because if the filesystem fails completely, e.g. there is no space left on the device, the queue cannot be left to fill indefinitely and must error out at some point.
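As a sketch of how little is going on in the schema, here is the table exercised directly with Python's stdlib `sqlite3` module. The level value `10000` mirrors Logback's `DEBUG` constant; the payload is made up:

```python
import sqlite3
import time

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE IF NOT EXISTS entries (
      epoch_secs LONG,  -- number of seconds since epoch
      nanos INTEGER,    -- nanoseconds within the second
      level INTEGER,    -- numeric logging level
      content BLOB      -- raw bytes from the encoder / layout
    )
""")

# The appender stores whatever bytes the encoder produced, unchanged.
now = time.time()
payload = b'{"level":"DEBUG","message":"hello"}'
conn.execute(
    "INSERT INTO entries (epoch_secs, nanos, level, content) VALUES (?, ?, ?, ?)",
    (int(now), int((now % 1) * 1_000_000_000), 10000, payload),
)

# Reading it back is plain SQL; the content round-trips byte for byte.
stored = conn.execute("SELECT content FROM entries").fetchone()[0]
```

Because `content` is an opaque BLOB, the database adds essentially no overhead on top of the encoded entry itself.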
## Archiving and Compression
Beyond raw append speed, Blacklite has a number of additional features:
* Built-in archiving and rollover based on number of rows.
* Automatic ZStandard dictionary training and compression for 4x disk space savings in archives.
* `blacklite-core` module allows direct entry writing with no logging framework needed.
* Database reader to search logs from the command line by "natural language" date ranges.

Blacklite also provides a codec for [zstandard](https://facebook.github.io/zstd/), using the [zstd-jni](https://github.com/luben/zstd-jni) library, which is extremely fast and can be tweaked to be competitive with LZ4 using "negative" compression levels like "-4". This codec is provided with the archiver so that older records can be compressed automatically.

In addition, the archiver includes a [dictionary compression](https://facebook.github.io/zstd/#small-data) option. If a dictionary is found, the archiver will write the compressed content to the archive file. If no dictionary is found, the archiver will train a dictionary on the incoming log entries, then switch over to dictionary compression once the dictionary has been trained.

Using a dictionary provides both speed and size improvements. An entry that is typically 185 bytes with JSON can shrink down to as few as 32 bytes. This adds up extremely quickly when you start working with larger log files.

This is all very abstract, so here's a real-life example using 2,001,000 log entries written out as JSON by the logstash logback encoder.

For the unencoded content:
```
❱ ls -lh blacklite.json
-rw-rw-r-- 1 wsargent wsargent 431M Oct 18 14:14 blacklite.json
```

Compare with the encoded SQLite database using dictionary compression:
```
❱ ls -lh archive.db
-rw-rw-r-- 1 wsargent wsargent 177M Oct 18 14:14 archive.db
```

The database still has the same number of records:
```
❱ sqlite3 archive.db "select count(*) from entries"
2001000
❱ wc blacklite.json
2001000 6002000 451212069 blacklite.json
```

## Reading
Providing data in SQLite format means you can leverage tools built using SQLite. I typically set [DB Browser](https://sqlitebrowser.org/) as the default application for `*.db` files in IntelliJ IDEA, so double-clicking a file brings up a GUI.
### Editor / IDE Plugins
* [sqlite VS Code Plugin](https://marketplace.visualstudio.com/items?itemName=alexcvzz.vscode-sqlite)
* [Database Navigator for IntelliJ IDEA](https://plugins.jetbrains.com/plugin/1800-database-navigator)

### GUI Tools
* [SQLite Browser](https://sqlitebrowser.org/)
* [SQLite Speed](https://sqlitespeed.com/)

### Command Line Tools
* [blacklite-reader](https://github.com/tersesystems/blacklite/tree/main/blacklite-reader/)
* [sqlite-utils](https://sqlite-utils.readthedocs.io/en/stable/): Read and process SQLite files from the command line

### Web Applications
* [Datasette](https://docs.datasette.io/en/stable/): Exposing SQLite files as web applications
* [Observable HQ](https://observablehq.com/@mbostock/sqlite): Using SQLite data in visualization notebooks

### Scripts
There are scripts available for manipulating SQLite databases in REPL environments and for processing entries as JSON through small programs.
See the [jbang scripts](scripts/jbang/README.md) and the [Python scripts](scripts/python/README.md) for more detail.
You can also work with SQLite [directly](SQLITE.md).
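Because the logs are plain SQL, ad-hoc filtering needs no special tooling at all. A small illustration with Python's stdlib `sqlite3` module, using made-up entries (Logback's numeric levels are 10000 for DEBUG, 20000 for INFO, 40000 for ERROR):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE entries (epoch_secs LONG, nanos INTEGER, level INTEGER, content BLOB)"
)
conn.executemany(
    "INSERT INTO entries VALUES (?, ?, ?, ?)",
    [
        (1603000000, 0, 20000, b"starting up"),       # INFO, before the window
        (1603003600, 0, 10000, b"debug detail"),      # DEBUG, in the window
        (1603007200, 0, 40000, b"something failed"),  # ERROR, in the window
    ],
)

# Everything at INFO or above within a one-hour window.
found = conn.execute(
    "SELECT content FROM entries WHERE epoch_secs BETWEEN ? AND ? AND level >= ?",
    (1603003600, 1603007200, 20000),
).fetchall()
```

The same query works against a live or archived Blacklite database from any of the tools listed above.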
## Installation
### Gradle
Add the following resolver:
```
repositories {
mavenCentral()
}
```

And then add the libraries and codecs that you want.
For logback:
```
implementation 'com.tersesystems.blacklite:blacklite-logback:'
implementation 'com.tersesystems.blacklite:blacklite-codec-zstd:'
```

Or for log4j:
```
implementation 'com.tersesystems.blacklite:blacklite-log4j2:'
implementation 'com.tersesystems.blacklite:blacklite-log4j2-codec-zstd:'
```

### Maven
For logback:
```xml
<dependency>
  <groupId>com.tersesystems.blacklite</groupId>
  <artifactId>blacklite-logback</artifactId>
  <version>$latestVersion</version>
</dependency>
<dependency>
  <groupId>com.tersesystems.blacklite</groupId>
  <artifactId>blacklite-codec-zstd</artifactId>
  <version>$latestVersion</version>
</dependency>
```
or log4j:
```xml
<dependency>
  <groupId>com.tersesystems.blacklite</groupId>
  <artifactId>blacklite-log4j</artifactId>
  <version>$latestVersion</version>
</dependency>
<dependency>
  <groupId>com.tersesystems.blacklite</groupId>
  <artifactId>blacklite-log4j2-codec-zstd</artifactId>
  <version>$latestVersion</version>
</dependency>
```
### SBT
SBT installation is fairly straightforward.
```sbt
libraryDependencies += "com.tersesystems.blacklite" % "blacklite-logback" % ""
libraryDependencies += "com.tersesystems.blacklite" % "blacklite-codec-zstd" % ""
```

Or log4j:
```sbt
//libraryDependencies += "com.tersesystems.blacklite" % "blacklite-log4j" % ""
//libraryDependencies += "com.tersesystems.blacklite" % "blacklite-log4j2-codec-zstd" % ""
```

## Configuration
### Logback
The logback appender uses [JCTools](https://jctools.github.io/JCTools/) internally as an asynchronous queue. This means you don't need to use an `AsyncAppender` or `LoggingEventAsyncDisruptorAppender` on top.
You should always use a `shutdownHook` to allow Logback to drain the queue before exiting.
The appender consists of a `file` property, and an `encoder` which encodes the bytes written to the `content` field in an entry.
The `batchInsertSize` property determines the number of entries to batch before writing to the database. This is a highwater mark that only applies when the number of inserts has gone over a certain point without idling -- this situation only usually applies when using an archiver which will take over the connection for the duration. When archiving, new entries will buffer in the queue, and then be drained and inserted in batches. Under normal circumstances, when the thread is idle, it will `executeBatch/commit` any outstanding inserts, meaning you will see database entries immediately.
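The buffer-then-batch behaviour described above can be sketched in Python with the stdlib `sqlite3` module. Names like `BATCH_INSERT_SIZE`, `append`, and `flush` are illustrative, not Blacklite's API:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE entries (epoch_secs LONG, nanos INTEGER, level INTEGER, content BLOB)"
)

BATCH_INSERT_SIZE = 1000  # plays the role of the batchInsertSize property
batch = []

def append(entry):
    """Buffer an entry, flushing a full batch in a single transaction."""
    batch.append(entry)
    if len(batch) >= BATCH_INSERT_SIZE:
        flush()

def flush():
    """executeBatch/commit equivalent: drain whatever is buffered."""
    conn.executemany("INSERT INTO entries VALUES (?, ?, ?, ?)", batch)
    conn.commit()
    batch.clear()

for i in range(2500):
    append((1603000000 + i, 0, 10000, b"entry"))
flush()  # on idle, any outstanding inserts are committed immediately
```

Batching amortizes the per-transaction commit cost across many inserts, which is where most of the write throughput comes from.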
If not defined, the default archiver is the `DeletingArchiver` set to `10000` rows.
```xml
<!-- element names below were lost when this README was extracted; they are
     reconstructed from the property names described above -->
<appender name="BLACKLITE" class="com.tersesystems.blacklite.logback.BlackliteAppender">
  <batchInsertSize>1000</batchInsertSize>
  <file>${db.dir}/live.db</file>
  <!-- the original element name for this value is unknown: 10000000 -->
  <encoder>...</encoder>
</appender>
```
#### Deleting Archiver
The deleting archiver will delete the oldest entries in the database when the highwater mark is reached.
Note that after deletion the database file may remain notably larger than the row count suggests, because SQLite keeps freed pages for reuse rather than shrinking the file. You can run `VACUUM` at regular intervals to recover space.
The maximum number of rows in the table is set using the `archiveAfterRows` property. There is no facility for unbounded growth, but you can set this number to `Long.MaxValue`, which is 2^63 - 1.
```xml
<!-- element names reconstructed; the original markup was lost in extraction -->
<archiver class="com.tersesystems.blacklite.archive.DeletingArchiver">
  <archiveAfterRows>10000</archiveAfterRows>
</archiver>
```
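The deleting archiver's behaviour amounts to a keep-the-newest-N delete. A sketch with Python's stdlib `sqlite3` module (the SQL here is illustrative, not Blacklite's actual implementation):

```python
import sqlite3

ARCHIVE_AFTER_ROWS = 100  # plays the role of the archiveAfterRows property

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE entries (epoch_secs LONG, nanos INTEGER, level INTEGER, content BLOB)"
)
conn.executemany(
    "INSERT INTO entries VALUES (?, 0, 10000, ?)",
    [(1603000000 + i, b"entry") for i in range(250)],
)

# Keep only the newest ARCHIVE_AFTER_ROWS rows; the oldest go first.
conn.execute(
    """DELETE FROM entries WHERE rowid NOT IN (
         SELECT rowid FROM entries ORDER BY epoch_secs DESC, nanos DESC LIMIT ?
       )""",
    (ARCHIVE_AFTER_ROWS,),
)
count, oldest = conn.execute("SELECT count(*), min(epoch_secs) FROM entries").fetchone()
```

This is what makes the live database behave like a ring buffer: a fixed row budget with the oldest entries evicted first.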
#### Rolling Archiver
The rolling archiver can be a bit complicated, but it works much the same way that rolling file appenders do.
The archiver has an `archiveAfterRows` property that is the maximum number of rows in the live database. When there are more rows than that, archiving takes place.
The rolling archiver will keep older log entries by moving them into other SQLite databases. When the maximum number of rows is reached, the oldest rows are moved into the archive specified by the `file` property. A codec can be applied when rows are moved into the archive, to save disk space.

The archive file is rolled over when the triggering policy is matched. In the case of the `RowBasedTriggeringPolicy`, this is the maximum number of rows in the archive database; after that, the archive database is renamed according to the rolling strategy and another archive file is created.

```xml
<!-- element names reconstructed from the surrounding prose; the original
     markup was lost when this README was extracted -->
<archiver class="com.tersesystems.blacklite.archive.RollingArchiver">
  <file>/tmp/blacklite/archive.db</file>
  <archiveAfterRows>10000</archiveAfterRows>
  <codec class="com.tersesystems.blacklite.codec.zstd.ZStdCodec">
    <level>9</level>
  </codec>
  <triggeringPolicy class="com.tersesystems.blacklite.archive.RowBasedTriggeringPolicy">
    <maximumNumRows>500000</maximumNumRows>
  </triggeringPolicy>
  <rollingStrategy class="com.tersesystems.blacklite.archive.TimeBasedRollingStrategy">
    <fileNamePattern>logs/archive.%d{yyyyMMdd'T'hhmm,utc}.db</fileNamePattern>
    <maxHistory>20</maxHistory>
  </rollingStrategy>
</archiver>
```
##### Codec
The rolling archiver can take a codec that compresses the bytes produced by the encoder. This can be very effective.
```xml
<!-- element names reconstructed; the original markup was lost in extraction -->
<codec class="com.tersesystems.blacklite.codec.zstd.ZStdCodec">
  <level>9</level>
</codec>
```
If using dictionary compression, the codec is `ZStdDictCodec`, and the dictionary must be defined in a repository. There are two repositories for dictionaries: `ZstdDictFileRepository`, which points directly to a zstandard dictionary on the filesystem, and `SqliteRepository`, which keeps dictionaries in an SQLite database.

Blacklite will automatically train a dictionary from the incoming content if one does not exist. You can tweak the dictionary parameters, but the defaults work fine.

```xml
<!-- element names reconstructed; the original markup was lost in extraction -->
<codec class="com.tersesystems.blacklite.codec.zstd.ZStdDictCodec">
  <level>9</level>
  <repository class="com.tersesystems.blacklite.codec.zstd.ZstdDictFileRepository">
    <file>logs/dictionary</file>
  </repository>
</codec>
```
You can also specify a SQLite database containing dictionaries, using the zstandard dictionary ids as a lookup. This lets you use multiple dictionaries.
```xml
<!-- element names reconstructed; the original markup was lost in extraction -->
<repository class="com.tersesystems.blacklite.codec.zstd.SqliteRepository">
  <file>logs/dictionary.db</file>
</repository>
```
Be aware that if you use a zstandard dictionary, you must have it available to read the logs. If you lose it, the logs will be unreadable!
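Blacklite uses zstandard via zstd-jni, but the idea behind dictionary compression can be shown with the preset-dictionary support in Python's stdlib `zlib` (the entry and dictionary contents below are made up for the example):

```python
import zlib

# A typical small JSON log entry: most bytes are shared structure, not data.
entry = b'{"ts":"2020-10-18T14:14:00Z","level":"DEBUG","logger":"app.web","message":"request started"}'

# A dictionary seeded with the structure shared by previous entries.
zdict = b'{"ts":"","level":"DEBUG","logger":"app.web","message":"request started"}'

def deflate(data, zdict=None):
    """Compress with an optional preset dictionary."""
    c = zlib.compressobj(level=9, zdict=zdict) if zdict else zlib.compressobj(level=9)
    return c.compress(data) + c.flush()

plain = deflate(entry)
with_dict = deflate(entry, zdict)
print(len(entry), len(plain), len(with_dict))  # the dictionary version is smallest

# The same dictionary is required to read the entry back.
inflated = zlib.decompressobj(zdict=zdict).decompress(with_dict)
```

Small entries barely compress on their own because there is little internal repetition; a dictionary supplies that repetition up front, which is also why losing the dictionary makes the archives unreadable.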
##### Triggering Policy
There is one triggering policy, using the maximum number of rows in the archive.
```xml
<!-- element names reconstructed; the original markup was lost in extraction -->
<triggeringPolicy class="com.tersesystems.blacklite.archive.RowBasedTriggeringPolicy">
  <maximumNumRows>500000</maximumNumRows>
</triggeringPolicy>
```
##### Rolling Strategies
Fixed Window Rolling Strategy will set up a number of SQLite archive databases, using `%i` to indicate the index.
```xml
<!-- element names reconstructed; the original markup was lost in extraction -->
<rollingStrategy class="com.tersesystems.blacklite.archive.FixedWindowRollingStrategy">
  <fileNamePattern>logs/archive.%i.db</fileNamePattern>
  <minIndex>1</minIndex>
  <maxIndex>10</maxIndex>
</rollingStrategy>
```
The time-based rolling strategy uses a date pattern, rolling over by renaming the file to the given date.
```xml
<!-- element names reconstructed; the original markup was lost in extraction -->
<rollingStrategy class="com.tersesystems.blacklite.archive.TimeBasedRollingStrategy">
  <fileNamePattern>logs/archive.%d{yyyyMMdd'T'hhmm,utc}.db</fileNamePattern>
  <maxHistory>20</maxHistory>
  <totalSizeCap>10M</totalSizeCap>
  <cleanHistoryOnStart>true</cleanHistoryOnStart>
</rollingStrategy>
```
### Log4J 2
The Log4J 2 appender is similar to the Logback appender:
```xml
<!-- The original attribute and element names were lost when this README was
     extracted; only the configuration values survive: 3, 102400000,
     10485760, 500000. -->
```
It is broadly similar to the Logback system, with the same settings.
#### NoOpArchiver
The no-op archiver does nothing:
```xml
<!-- element name assumed from the archiver name; the original markup was lost -->
<NoOpArchiver/>
```
#### DeletingArchiver
The deleting archiver will delete the oldest rows once the row count exceeds the `archiveAfterRows` property:
```xml
<!-- element name assumed from the archiver name; the original markup was lost -->
<DeletingArchiver archiveAfterRows="100"/>
```
#### RollingArchiver
The rolling archiver is as follows:
```xml
<!-- the original configuration markup was lost when this README was extracted -->
```
##### Fixed Window Rolling Strategy
The fixed window rolling strategy is as follows:
```xml
<!-- the original configuration markup was lost when this README was extracted -->
```
There is no time based rolling strategy for Log4J2 at this time: I don't understand how to extract the functionality and make it available.
## Benchmarks
See [BENCHMARKS.md](BENCHMARKS.md)
Logging takes between 25 and 60 ns to enter the in-memory queue, depending on the queue size. The appender will happily accept bursts of logging to the queue, and will drain from queue and insert into the database in batches.
On the backend, the SQLite consumer is single-threaded and can sustain roughly 2 µs per insert of small entries, using batched commits with an SQLite instance mounted on a `tmpfs` filesystem. For comparison, Logback with a file appender and `immediateFlush=false` runs between [636 and 850 ns/op](https://github.com/wsargent/slf4j-benchmark), but lacks the row-based truncation, querying, indexing, and backup that come with SQLite.
All of this is of course subject to your encoding, your logging framework, and your specific hardware.
## Setting up tmpfs
In cases where you want to use Blacklite as a persistent ring buffer, using a `tmpfs` filesystem as a backing store is a great way to avoid fsync. This is a tactic used by [Alluxio](https://github.com/Alluxio/alluxio/blob/master/core/common/src/main/java/alluxio/worker/block/meta/StorageTier.java#L141), for example.
The easiest thing to do is to set up `/var/log` as [tmpfs](https://forums.gentoo.org/viewtopic-t-371889-start-0-postdays-0-postorder-asc-highlight-tmpfs.html?sid=13bc57e79de631391821d1869615eb45) and go from there.

Using a `tmpfs` filesystem does not require that you constrain your logs to the amount of memory you have, but it does mean that the logs will be removed when the server shuts down. To get around this, you can [run some scripts on shutdown](https://web.archive.org/web/20200809170437/https://debian-administration.org/article/661/A_transient_/var/log) to transfer the log files.
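As a sketch, a dedicated `tmpfs` mount for the live database might look like the following (the mount point and size here are illustrative, not recommendations):

```
# one-off: mount a 512 MB tmpfs for the live database
sudo mount -t tmpfs -o size=512m tmpfs /var/log/blacklite

# or persist it across boots with an /etc/fstab entry
tmpfs  /var/log/blacklite  tmpfs  size=512m,mode=0755  0  0
```

Point the appender's `file` property at a path under that mount, and writes never touch a physical disk.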