Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/lukaszkorecki/mokujin

Structured logging for Clojure. Thin layer on top of clojure.tools.logging with MDC support
https://github.com/lukaszkorecki/mokujin

clojure clojure-library log4j2 logback logging logging-library

Last synced: 2 months ago
JSON representation

Structured logging for Clojure. Thin layer on top of clojure.tools.logging with MDC support

Awesome Lists containing this project

README

        

# Mokujin

> Mokujin (木人 Wooden person?) is a character in the Tekken series of fighting games. It is a spiritually sensitive animated training dummy, and serves as a guardian of good against supernatural evil threats.

[![Clojars Project](https://img.shields.io/clojars/v/org.clojars.lukaszkorecki/mokujin.svg)](https://clojars.org/org.clojars.lukaszkorecki/mokujin)

> [!WARNING]
> While Mokujin has been used in live, production applications - it's still under *active* development,
> consider the API stable-ish, with possible small changes here and there. See [the roadmap](#todo) section.

## Just enough (structured) logging

### Rationalle

`clojure.tools.logging` is Good Enough :tm: solution for majority (if not all) your logging needs. It's fast, works with any
logging backend supported on the JVM and is incredibly simple to use.

The only area where it falls short is producing structured logs.

In the past I'd use [`logfmt`-like](https://brandur.org/logfmt) formatting style when using `infof` (and friends) to produce logs.
Then I'd configure the log ingester to parse log lines using this format. That's fine for simple cases, but as soon as
exceptions (and stack traces) got thrown into the mix, I fell into the deep rabbit hole ofmulti-line log
parsers in Fluentd and vector.dev.

There's a way out of this though - [Mapped Diagnostic Context (MDC)](https://logback.qos.ch/manual/mdc.html) - as
way of adding **structured** information to your logs. This way you get both structured data that can be used to
dervive metrics, alerts etc in your log collector and human readable logs for debugging pruposes.

Mokujin emerged after years of working with different logging solutions in Clojure (and other languages), log aggregation
systems and monitoring platforms. It strikes a balance between familiar API, and leveraging existing
JVM logging ecosystem and all of its good (and less good) parts.

### How does it work?

Mokujin wraps `clojure.tools.logging` and inject SLF4J's MDC into logging events, if provided. It tries to maintain a balance between new
features and the API of a widely used library. Mokujin wraps standard logging macros (`info`, `warn` etc) and adds
support for the context map argument, as the first argument.

Keep in mind that `infof` (and friends) variants are present, but do not support passing the MDC (more on that later).

#### Performance

While effort was made to keep things as efficient as possible, there is some impact
introduced by manipulating the MDC object. Typical "raw" call to `clojure.tools.logging/info` is measured in nano-seconds.
Same call to `mokujin.log/info` will have nearly the same performance characteristics. Only when introducing several levels of MDC
processing you can expect a small slow down but still maintain sub-microsecond performance. This is absolutely fine for
your typical usage - most of applications running out there do a lot of I/O, where processing times are measured in milliseconds
or seconds event, so any overhead introduced by logging is negligble.

You can run the benchmark via `clj -M:benchmark`. Latest results (as of April 9th, 2024):

```
#'mokujin.log-bench/mokujin-log : 121.528074 ns
#'mokujin.log-bench/mokujin-log+context : 614.801815 ns
#'mokujin.log-bench/tools-logging-log : 127.343391 ns
#'mokujin.log-bench/tools-logging-log+context : 383.410334 ns
```

> [!INFO]
> Benchmarks are tricky, and always should be taken with a grain of salt.
> My sole focus with these is to ensure that Mokujin keeps up with `tools.logging` as far as performance is concerned.
> There's many more variables that need to be taken into account when measuring performance impact of logs in your application.

### Show me the codes!

Rather than logging something like this (you probably shouldn't include emails in your logs, but that's a different topic):

```clojure
(require '[clojure.tools.logging :as log])

(defn do-signup [req]
(let [user (create-user .... )]
(if (:id user)
(log/infof "signup user_id=%s email=%s" (:id user) (:email user))
(log/errorf "signup failed email=%s" (:email (:body req))))))

```

You are (probably) be better off doing something like this, and structure your logs to avoid your log collector parsing them using regex patterns:

```clojure
(defn create-user [{:keys [email]}]
{:id "foo" :email email})

(require '[mokujin.log :as log])

(defn do-sign-up [req]
(log/with-context {:flow "signup" :email (:email (:body req))}
(let [user (create-user req)]
(if (:id user)
(log/info "success" {:id (:id user)})
(log/error "failed")))))

(do-sign-up {:body {:email "[email protected]"}})
```

It would produce the following:

```
2023-11-03 17:53:25,850 [app.server] [user] [INFO] success flow=signup, [email protected], id=foo
```

```json
{
"timestamp": "2023-11-03T17:53:25.850957Z",
"message": "success",
"logger_name": "user",
"thread_name": "app.server",
"level": "INFO",
"flow": "signup",
"email": "[email protected]",
"id": "foo"
}

```

> [!NOTE]
> Output depends on the logging backend and appender configuration

From here, you can add/write a Ring middleware to inject request data to the logging context and make all your logging statements simpler.

In my own projects, I'm using Mokujin with Logback and Logback's Logstash appender for producing JSON logs that get sent to a JSON-aware log collector.
Very simple middlewares for Ring, Hato and the background processing system ensure that all logs are enriched by contextual data (uri's, request/response time etc).

## Logging backend support

Technically, anything that supports SLF4j and MDC should work with some configuration. See `examples` directory and
Getting started section - Logback and log4j2 examples are provided.

## API & Usage

The API is close enough that Mokujin is *almost* a drop-in replacement for `c.t.logging`, **however** to force good practices,
logging functions that support format strings e.g. `log/infof` or `log/errorf` **do not suport the context map**.

That's because in 99% of the cases where I'd use `log/infof` what I wanted to do was `(log/info "message" context)` instead.
In cases where you really really want to use formatted strings and the context, this Works Just Fine :tm: :

```clojure
(log/with-context {:some :ctx}
(log/infof "thing %s happened to %s" thing-a thing-b))
```

### The context

The context is a `Map[String]String` internally. To make it easier to work with standard log ingestion infrastructure,
all keywords present in the keys and values are converted to `"snake_case"` strings. Other value types are stringified.
This frees you up from figuring out what can and cannot be serialized by the appender that your logging backend is using.
If you need to change the layout of the produced log line, you can and should defer it to the appender configuration instead.

> [!WARNING]
> Nested maps or other "rich" data types are not allowed - this is something that might change in the future.

### Full API

#### Logging statements

```clojure
(log/info [msg] [msg ctx])
(log/warn [msg] [msg ctx])
(log/debug [msg] [msg ctx])
(log/error [msg] [exc msg] [exc msg ctx])
(log/infof [fmt & fmt-args])
(log/warnf [fmt & fmt-args])
(log/debugf [fmt & fmt-args])
(log/errorf [fmt & fmt-args])
(log/with-context [ctx & body])
```

Mokujin preserves caller context, and ensures that the right information (namespace, line numbers) is injected into the log statement, under the `logger` field.

> [!WARNING]
> `log/error` doesn't support the context/MDC argument for 1- and 2-arity variants, again - you're better off wrapping the form in `log/with-context` (see below) and using that as extra information attached to all logs
> carried over with the exception - that's how I enrich exceptions tracked by Sentry's LogBack appender.
> Additionally, figuring out the types of passed in arguments gets complicated as we'd have to distinguish between the context map, exception, message string and rest of the args
> and that introduces overhead that we don't want.
> This is the only area of Mokujin's user-facing API that *might* change, maybe.

#### Context macro

You can use `with-context` macro, to set context for a form, request handler body etc.
Contexts can be nested.

All map keys and values will be stringified using `name` + `str` combo.

```clojure
(log/with-context ctx body)
```

Example:

``` clojure
(defn handle-notification [user notification]
(log/with-context {:user-id (:id user) :notification-type (:type notification)}
(let [{:keys [success? response-code] result} (do-something-with-notification user notification)]
(log/with-context {:response-code response-code}
(if success?
(log/info "success")
(log/error "problem with notification processing"))))))
```

As expected, context are merged - including nested `with-context` forms and contexts added to individual logging statements.

# Getting started

To get started, add Mokujin to your `deps.edn` or `project.cj`

[![Clojars Project](https://img.shields.io/clojars/v/org.clojars.lukaszkorecki/mokujin.svg)](https://clojars.org/org.clojars.lukaszkorecki/mokujin)

But wait, **you're not done yet**! You need to include a logging backend which suports MDC. For Logback, this would be:

```clojure
{io.github.lukaszkorecki/mokujin {:mvn/version "...."}
org.slf4j/jcl-over-slf4j {:mvn/version "2.0.9"}
ch.qos.logback/logback-classic {:mvn/version "1.4.11"
:exclusions [org.slf4j/slf4j-api]}
;; for JSON output, you can use:
net.logstash.logback/logstash-logback-encoder {:mvn/version "7.4"}}
```

and creating a configuration file, stored in the class path (usually `resources/logback.xml`).


Sample Logback configuration

Here's my Logback configuration that I use in all production projects, with logs shipped to something that understands JSON/structured logging (Loki, Better Stack's Logs etc)
Production config, stored in `resources/logback.xml`:

```xml





timestamp

[ignore]
[ignore]


email









```

Development/test config, stored in `dev-resources/logback-test.xml`:

```xml



%date [%thread] [%logger] [%level] %msg %mdc%n




```

## TODO

- [x] Maven release?
- [x] Improve performance of adding/removing keys from the MDC - see Cambium's or Aviso-logging source for a good approach
- [x] Finalize `log/error` API - it works, but there are cases where its usage can be confusing
- [ ] Provide a way to customize how context data is transformed