# slog: Parquet handler

[![tag](https://img.shields.io/github/tag/samber/slog-parquet.svg)](https://github.com/samber/slog-parquet/releases)
![Go Version](https://img.shields.io/badge/Go-%3E%3D%201.21-%23007d9c)
[![GoDoc](https://godoc.org/github.com/samber/slog-parquet?status.svg)](https://pkg.go.dev/github.com/samber/slog-parquet)
![Build Status](https://github.com/samber/slog-parquet/actions/workflows/test.yml/badge.svg)
[![Go report](https://goreportcard.com/badge/github.com/samber/slog-parquet)](https://goreportcard.com/report/github.com/samber/slog-parquet)
[![Coverage](https://img.shields.io/codecov/c/github/samber/slog-parquet)](https://codecov.io/gh/samber/slog-parquet)
[![Contributors](https://img.shields.io/github/contributors/samber/slog-parquet)](https://github.com/samber/slog-parquet/graphs/contributors)
[![License](https://img.shields.io/github/license/samber/slog-parquet)](./LICENSE)

A [Parquet](https://parquet.apache.org/) handler for the Go [slog](https://pkg.go.dev/log/slog) library.




**Sponsored by:** Quickwit

Cloud-native search engine for observability - An OSS alternative to Splunk, Elasticsearch, Loki, and Tempo.

**See also:**

- [slog-multi](https://github.com/samber/slog-multi): `slog.Handler` chaining, fanout, routing, failover, load balancing...
- [slog-formatter](https://github.com/samber/slog-formatter): `slog` attribute formatting
- [slog-sampling](https://github.com/samber/slog-sampling): `slog` sampling policy

**HTTP middlewares:**

- [slog-gin](https://github.com/samber/slog-gin): Gin middleware for `slog` logger
- [slog-echo](https://github.com/samber/slog-echo): Echo middleware for `slog` logger
- [slog-fiber](https://github.com/samber/slog-fiber): Fiber middleware for `slog` logger
- [slog-chi](https://github.com/samber/slog-chi): Chi middleware for `slog` logger
- [slog-http](https://github.com/samber/slog-http): `net/http` middleware for `slog` logger

**Loggers:**

- [slog-zap](https://github.com/samber/slog-zap): A `slog` handler for `Zap`
- [slog-zerolog](https://github.com/samber/slog-zerolog): A `slog` handler for `Zerolog`
- [slog-logrus](https://github.com/samber/slog-logrus): A `slog` handler for `Logrus`

**Log sinks:**

- [slog-datadog](https://github.com/samber/slog-datadog): A `slog` handler for `Datadog`
- [slog-betterstack](https://github.com/samber/slog-betterstack): A `slog` handler for `Betterstack`
- [slog-rollbar](https://github.com/samber/slog-rollbar): A `slog` handler for `Rollbar`
- [slog-loki](https://github.com/samber/slog-loki): A `slog` handler for `Loki`
- [slog-sentry](https://github.com/samber/slog-sentry): A `slog` handler for `Sentry`
- [slog-syslog](https://github.com/samber/slog-syslog): A `slog` handler for `Syslog`
- [slog-logstash](https://github.com/samber/slog-logstash): A `slog` handler for `Logstash`
- [slog-fluentd](https://github.com/samber/slog-fluentd): A `slog` handler for `Fluentd`
- [slog-graylog](https://github.com/samber/slog-graylog): A `slog` handler for `Graylog`
- [slog-quickwit](https://github.com/samber/slog-quickwit): A `slog` handler for `Quickwit`
- [slog-slack](https://github.com/samber/slog-slack): A `slog` handler for `Slack`
- [slog-telegram](https://github.com/samber/slog-telegram): A `slog` handler for `Telegram`
- [slog-mattermost](https://github.com/samber/slog-mattermost): A `slog` handler for `Mattermost`
- [slog-microsoft-teams](https://github.com/samber/slog-microsoft-teams): A `slog` handler for `Microsoft Teams`
- [slog-webhook](https://github.com/samber/slog-webhook): A `slog` handler for `Webhook`
- [slog-kafka](https://github.com/samber/slog-kafka): A `slog` handler for `Kafka`
- [slog-nats](https://github.com/samber/slog-nats): A `slog` handler for `NATS`
- [slog-parquet](https://github.com/samber/slog-parquet): A `slog` handler for `Parquet` + `Object Storage`
- [slog-channel](https://github.com/samber/slog-channel): A `slog` handler for Go channels

## 🚀 Install

```sh
go get github.com/samber/slog-parquet/v2
```

**Compatibility**: go >= 1.21

No breaking changes will be made to exported APIs before v3.0.0.

## 💡 Usage

GoDoc: [https://pkg.go.dev/github.com/samber/slog-parquet/v2](https://pkg.go.dev/github.com/samber/slog-parquet/v2)

### Handler options

```go
type Option struct {
	// log level (default: debug)
	Level slog.Leveler

	// parquet rows buffer
	Buffer slogparquet.ParquetBuffer

	// optional: customize json payload builder
	Converter Converter
	// optional: fetch attributes from context
	AttrFromContext []func(ctx context.Context) []slog.Attr

	// optional: see slog.HandlerOptions
	AddSource   bool
	ReplaceAttr func(groups []string, a slog.Attr) slog.Attr
}
```
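`Option.ReplaceAttr` has the same signature as `slog.HandlerOptions.ReplaceAttr`, so a function written for one should fit the other. The sketch below is a minimal illustration, not part of slog-parquet: it masks a hypothetical `password` attribute and exercises the function through the standard library's JSON handler so that it runs without any object storage. The names `redactSecrets` and `render` are made up for this example.

```go
package main

import (
	"bytes"
	"fmt"
	"log/slog"
)

// redactSecrets matches the ReplaceAttr signature from the Option struct.
// It masks the value of any attribute whose key is "password"
// (a hypothetical key chosen for this example).
func redactSecrets(groups []string, a slog.Attr) slog.Attr {
	if a.Key == "password" {
		return slog.String(a.Key, "[REDACTED]")
	}
	return a
}

// render logs one record through a stdlib JSON handler configured with the
// given ReplaceAttr function and returns the encoded line.
func render(replace func(groups []string, a slog.Attr) slog.Attr) string {
	var buf bytes.Buffer
	handler := slog.NewJSONHandler(&buf, &slog.HandlerOptions{ReplaceAttr: replace})
	slog.New(handler).Info("login", "user", "alice", "password", "hunter2")
	return buf.String()
}

func main() {
	fmt.Print(render(redactSecrets))
}
```

The same function could then be passed as `slogparquet.Option{ReplaceAttr: redactSecrets, ...}`.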

Other global parameters:

```go
slogparquet.SourceKey = "source"
slogparquet.ErrorKeys = []string{"error", "err"}
```

### Parquet buffer

```go
func NewParquetBuffer(bucket objstore.Bucket, prefix string, maxRecords int, maxInterval time.Duration) slogparquet.ParquetBuffer
```

Attributes are injected into the log payload.
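The `maxRecords` and `maxInterval` parameter names suggest that the buffer is flushed either when enough rows have accumulated or when enough time has passed. The exact behavior is not documented here, so the following is only a sketch of that assumed policy. `flushPolicy` and `shouldFlush` are hypothetical names and are not part of slog-parquet.

```go
package main

import (
	"fmt"
	"time"
)

// flushPolicy is a hypothetical type that mirrors the two limits
// accepted by NewParquetBuffer. It is NOT part of slog-parquet.
type flushPolicy struct {
	maxRecords  int
	maxInterval time.Duration
}

// shouldFlush reports whether a buffer holding `records` rows, last flushed
// `sinceLastFlush` ago, is due for an upload under the assumed policy:
// flush when either limit is reached, and never flush an empty buffer.
func (p flushPolicy) shouldFlush(records int, sinceLastFlush time.Duration) bool {
	if records == 0 {
		return false
	}
	return records >= p.maxRecords || sinceLastFlush >= p.maxInterval
}

func main() {
	p := flushPolicy{maxRecords: 1000, maxInterval: 5 * time.Second}

	fmt.Println(p.shouldFlush(1000, time.Second))  // record limit reached
	fmt.Println(p.shouldFlush(10, 6*time.Second))  // interval elapsed
	fmt.Println(p.shouldFlush(10, time.Second))    // neither limit reached
}
```

Under this reading, a small `maxInterval` would trade more, smaller Parquet objects for fresher data in the bucket.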

### Object storage

See [github.com/thanos-io/objstore](https://github.com/thanos-io/objstore) for the supported object storage providers.

### Example

```go
import (
	"fmt"
	"log"
	"log/slog"
	"os"
	"time"

	slogparquet "github.com/samber/slog-parquet/v2"
	"github.com/thanos-io/objstore/providers/s3"
)

func main() {
	// open the S3 bucket that will receive the parquet files
	bucket, err := s3.NewBucketWithConfig(
		slogparquet.NewLogger(),
		s3.Config{
			Endpoint:  os.Getenv("AWS_S3_ENDPOINT"),
			Region:    os.Getenv("AWS_S3_REGION"),
			Bucket:    os.Getenv("AWS_S3_BUCKET"),
			AccessKey: os.Getenv("AWS_ACCESS_KEY"),
			SecretKey: os.Getenv("AWS_SECRET_KEY"),
			PartSize:  16 * 1024 * 1024, // 16MB
		},
		"logger",
	)
	if err != nil {
		log.Fatal(err)
	}

	// maxRecords = 10*1024*1024, maxInterval = 5s (illustrative values)
	buffer := slogparquet.NewParquetBuffer(bucket, "api/logs", 10*1024*1024, 5*time.Second)

	logger := slog.New(slogparquet.Option{Level: slog.LevelDebug, Buffer: buffer}.NewParquetHandler())
	logger = logger.
		With("environment", "dev").
		With("release", "v1.0.0")

	// log error
	logger.
		With("category", "sql").
		With("query.statement", "SELECT COUNT(*) FROM users;").
		With("query.duration", 1*time.Second).
		With("error", fmt.Errorf("could not count users")).
		Error("caramba!")

	// log user signup
	logger.
		With(
			slog.Group("user",
				slog.String("id", "user-123"),
				slog.Time("created_at", time.Now()),
			),
		).
		Info("user registration")

	// push buffered rows to object storage before exiting
	buffer.Flush(true)
	bucket.Close()
}
```

Output:

```bash
$ parquet meta ~/Downloads/00_17_08.d4d9f.parquet

File path:  /Users/samber/Downloads/00_17_08.d4d9f.parquet
Created by: github.com/samber/slog-parquet version (devel)(build )
Properties: (none)
Schema:
message log {
  required int64 time (TIMESTAMP(NANOS,true));
  required binary log_level (STRING);
  required binary message (STRING);
  required binary attributes;
  required binary source (STRING);
}

Row group 0:  count: 2  279.00 B records  start: 51  total(compressed): 558 B total(uncompressed):644 B
--------------------------------------------------------------------------------
            type      encodings count     avg size   nulls   min / max
time        INT64     F   _     2         22.50 B            "2023-08-19T00:17:08.14408..." / "2023-08-19T00:17:08.14420..."
log_level   BINARY    F         2         26.50 B            "ERROR" / "INFO"
message     BINARY    F         2         35.00 B            "caramba!" / "user registration"
attributes  BINARY    F         2         155.50 B           "0x7B2263617465676F7279223..." / "0x7B22656E7669726F6E6D656..."
source      BINARY    F         2         39.50 B            "samber/slog-parquet" / "samber/slog-parquet"
```
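The `attributes` column is stored as raw binary, which is why `parquet meta` prints its min/max values as truncated hex. Decoding those prefixes shows that they are the start of a JSON document. The snippet below is a small sketch using only the standard library. `decodePrefix` is a helper made up for this example, and the inputs are the two truncated values copied from the output above.

```go
package main

import (
	"encoding/hex"
	"fmt"
	"strings"
)

// decodePrefix turns a hex literal such as "0x7B22..." into text.
// A truncated value may end in half a byte, so a trailing odd
// hex digit is dropped before decoding.
func decodePrefix(s string) (string, error) {
	s = strings.TrimPrefix(s, "0x")
	if len(s)%2 == 1 {
		s = s[:len(s)-1]
	}
	raw, err := hex.DecodeString(s)
	if err != nil {
		return "", err
	}
	return string(raw), nil
}

func main() {
	// Truncated min/max values of the `attributes` column.
	for _, v := range []string{
		"0x7B2263617465676F7279223",
		"0x7B22656E7669726F6E6D656",
	} {
		text, err := decodePrefix(v)
		if err != nil {
			fmt.Println("decode error:", err)
			continue
		}
		fmt.Println(text)
	}
}
```

The first value decodes to `{"category"` and the second to `{"environme`, matching the attributes attached to the two log records in the example.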

### Tracing

Import the [samber/slog-otel](https://github.com/samber/slog-otel) library to attach trace and span IDs to log records.

```go
import (
	"context"
	"log/slog"

	slogotel "github.com/samber/slog-otel"
	slogparquet "github.com/samber/slog-parquet/v2"
	"go.opentelemetry.io/otel/sdk/trace"
)

func main() {
	tp := trace.NewTracerProvider(
		trace.WithSampler(trace.AlwaysSample()),
	)
	tracer := tp.Tracer("hello/world")

	ctx, span := tracer.Start(context.Background(), "foo")
	defer span.End()

	span.AddEvent("bar")

	logger := slog.New(
		slogparquet.Option{
			// ...
			AttrFromContext: []func(ctx context.Context) []slog.Attr{
				slogotel.ExtractOtelAttrFromContext([]string{"tracing"}, "trace_id", "span_id"),
			},
		}.NewParquetHandler(),
	)

	// the context carries the span, so trace_id and span_id are added to the record
	logger.ErrorContext(ctx, "a message")
}
```
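`AttrFromContext` accepts any function of type `func(ctx context.Context) []slog.Attr`, so it is not limited to the OpenTelemetry extractor. As a hedged sketch, the extractor below reads a request ID from the context. `requestIDKey`, `withRequestID`, `requestIDAttrs` and `attrsForRequest` are hypothetical names invented for this example, which uses only the standard library.

```go
package main

import (
	"context"
	"fmt"
	"log/slog"
)

// requestIDKey is an unexported context key type (hypothetical, for this example).
type requestIDKey struct{}

// withRequestID stores a request ID in the context.
func withRequestID(ctx context.Context, id string) context.Context {
	return context.WithValue(ctx, requestIDKey{}, id)
}

// requestIDAttrs has the shape expected by Option.AttrFromContext.
// It returns no attributes when the context carries no request ID.
func requestIDAttrs(ctx context.Context) []slog.Attr {
	id, ok := ctx.Value(requestIDKey{}).(string)
	if !ok {
		return nil
	}
	return []slog.Attr{slog.String("request_id", id)}
}

// attrsForRequest builds a context for the given ID and runs the extractor,
// imitating what a handler would do for a *Context logging call.
func attrsForRequest(id string) []slog.Attr {
	return requestIDAttrs(withRequestID(context.Background(), id))
}

func main() {
	fmt.Println(attrsForRequest("req-42"))
	fmt.Println(len(requestIDAttrs(context.Background()))) // no ID stored in this context
}
```

It could be registered next to the OpenTelemetry extractor by adding `requestIDAttrs` to the `AttrFromContext` slice.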

## 🤝 Contributing

- Ping me on twitter [@samuelberthe](https://twitter.com/samuelberthe) (DMs, mentions, whatever :))
- Fork the [project](https://github.com/samber/slog-parquet)
- Fix [open issues](https://github.com/samber/slog-parquet/issues) or request new features

Don't hesitate ;)

```bash
# Install some dev dependencies
make tools

# Run tests
make test
# or
make watch-test
```

## 👤 Contributors

![Contributors](https://contrib.rocks/image?repo=samber/slog-parquet)

## 💫 Show your support

Give a ⭐️ if this project helped you!

[![GitHub Sponsors](https://img.shields.io/github/sponsors/samber?style=for-the-badge)](https://github.com/sponsors/samber)

## 📝 License

Copyright © 2023 [Samuel Berthe](https://github.com/samber).

This project is [MIT](./LICENSE) licensed.