https://github.com/gnana997/1billion-row-challenge

A terminal application that collects multiple solutions, each taking a different approach, to the 1 Billion Row Challenge. The challenge is to read a file with 1 billion rows, each holding a city name and its reported temperature, and print each city with its min/avg/max temperature.

1brc 1brc-go cobra-cli golang terminal-app


Host: GitHub
URL: https://github.com/gnana997/1billion-row-challenge
Owner: gnana997
Created: 2024-07-20T22:01:59.000Z (12 months ago)
Default Branch: main
Last Pushed: 2024-07-23T20:45:04.000Z (12 months ago)
Last Synced: 2024-12-26T23:26:43.039Z (7 months ago)
Topics: 1brc, 1brc-go, cobra-cli, golang, terminal-app
Language: Go
Homepage:
Size: 18.6 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: Readme.md

README

## Terminal App for 1 Billion Row Challenge

# Description

# Features

Create Command: Quickly create files with a specified name and content. This command is optimized to handle large inputs and can be used to generate files with a substantial amount of data.

- Takes around 4 minutes to create a file with 1 billion rows of data.

```bash
./out/1brc create --file=largefile.txt --rows=1000000000
```
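
As a rough illustration of how a generator like this can keep up with large inputs, here is a minimal sketch that writes rows through a large buffered writer. It is not the repository's code: the `city;temperature` row format follows the original 1 Billion Row Challenge, and `formatRow`, `writeRows`, and the `cities` list are hypothetical names chosen for this example.

```go
package main

import (
	"bufio"
	"fmt"
	"io"
	"math/rand"
)

// cities is a hypothetical sample list; a real generator would use many more.
var cities = []string{"Hamburg", "Oslo", "Lima", "Nairobi"}

// formatRow renders one row as "<city>;<temperature>\n". The temperature is
// passed in tenths of a degree so it can be printed with exactly one decimal.
func formatRow(city string, tenths int) string {
	sign := ""
	if tenths < 0 {
		sign = "-"
		tenths = -tenths
	}
	return fmt.Sprintf("%s;%s%d.%d\n", city, sign, tenths/10, tenths%10)
}

// writeRows writes the requested number of random rows to w. The 1 MiB
// buffer batches many small rows into few large writes.
func writeRows(w io.Writer, rows int, rng *rand.Rand) error {
	bw := bufio.NewWriterSize(w, 1<<20)
	for i := 0; i < rows; i++ {
		city := cities[rng.Intn(len(cities))]
		tenths := rng.Intn(1999) - 999 // -99.9 to 99.9 degrees
		if _, err := bw.WriteString(formatRow(city, tenths)); err != nil {
			return err
		}
	}
	return bw.Flush()
}
```

Passing a file opened with `os.Create` and a seeded `rand.New(rand.NewSource(1))` to `writeRows` would produce a reproducible data file under these assumptions.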

Simple Process Command: Read and display the content of a specified file. This command processes the file sequentially, in a deliberately simple and inefficient way.

- Takes around 2 minutes 15 seconds to process a file with 1 billion rows of data.

```bash
./out/1brc simple-process --file=largefile.txt
```
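
A sequential baseline of this kind can be sketched as below. This is an assumption-laden example rather than the repository's implementation: it assumes `city;temperature` rows as in the original challenge, and `stats`, `addLine`, and `aggregate` are hypothetical names.

```go
package main

import (
	"bufio"
	"fmt"
	"io"
	"strconv"
	"strings"
)

// stats accumulates the running min, max, sum and count for one city.
type stats struct {
	min, max, sum float64
	count         int
}

// mean returns the average temperature recorded so far.
func (s *stats) mean() float64 { return s.sum / float64(s.count) }

// addLine parses one "<city>;<temperature>" row and folds it into m.
func addLine(m map[string]*stats, line string) error {
	name, value, ok := strings.Cut(line, ";")
	if !ok {
		return fmt.Errorf("malformed row %q", line)
	}
	t, err := strconv.ParseFloat(value, 64)
	if err != nil {
		return fmt.Errorf("bad temperature in %q: %w", line, err)
	}
	s, seen := m[name]
	if !seen {
		s = &stats{min: t, max: t}
		m[name] = s
	}
	if t < s.min {
		s.min = t
	}
	if t > s.max {
		s.max = t
	}
	s.sum += t
	s.count++
	return nil
}

// aggregate reads rows one at a time from r, on a single goroutine.
func aggregate(r io.Reader) (map[string]*stats, error) {
	m := make(map[string]*stats)
	sc := bufio.NewScanner(r)
	for sc.Scan() {
		if err := addLine(m, sc.Text()); err != nil {
			return nil, err
		}
	}
	return m, sc.Err()
}
```

Every row here costs a string allocation, a map lookup, and a float parse on one core, which is the kind of overhead the later commands try to reduce.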

Use MMap Command: Read and display the content of a specified file using memory mapping. This command processes the file through a memory mapping and is slightly more efficient than the simple process command.

- Takes around 2 minutes 10 seconds to process a file with 1 billion rows of data.

```bash
./out/1brc use-basic-mmap --file=largefile.txt
```
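
For readers unfamiliar with memory mapping in Go, the following sketch shows one way to map a file read-only and walk its lines without copying them. It assumes a Unix-like system, since it relies on `syscall.Mmap`; the helper names `mapFile` and `forEachLine` are hypothetical, and the repository may use a different approach or library.

```go
package main

import (
	"bytes"
	"os"
	"syscall"
)

// mapFile maps the whole file read-only and returns its bytes together with
// a function that releases the mapping. Unix-like systems only.
func mapFile(path string) (data []byte, unmap func() error, err error) {
	f, err := os.Open(path)
	if err != nil {
		return nil, nil, err
	}
	defer f.Close() // the mapping stays valid after the descriptor is closed

	fi, err := f.Stat()
	if err != nil {
		return nil, nil, err
	}
	if fi.Size() == 0 {
		// A zero-length mapping is an error, so treat empty files specially.
		return nil, func() error { return nil }, nil
	}
	data, err = syscall.Mmap(int(f.Fd()), 0, int(fi.Size()),
		syscall.PROT_READ, syscall.MAP_SHARED)
	if err != nil {
		return nil, nil, err
	}
	return data, func() error { return syscall.Munmap(data) }, nil
}

// forEachLine calls fn once per newline-separated line. Each line is a
// sub-slice of data, so no per-line copy or allocation is made.
func forEachLine(data []byte, fn func(line []byte)) {
	for len(data) > 0 {
		i := bytes.IndexByte(data, '\n')
		if i < 0 {
			fn(data) // last line without a trailing newline
			return
		}
		fn(data[:i])
		data = data[i+1:]
	}
}
```

With only one goroutine consuming the mapping, parsing and map updates still run on a single core, which fits the small gain over the simple process command reported above.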

Use Parallel Mmap Command: Read and display the content of a specified file using memory mapping and processing the data in parallel.

- This command memory-maps the file, breaks it into smaller chunks, and processes the chunks in parallel using goroutines, which makes it more efficient than the basic mmap command.

- Takes around 36 seconds to process a file with 1 billion rows of data.

```bash
./out/1brc use-parallel-mmap --file=largefile.txt
```
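
The chunk-and-merge idea described above can be sketched as follows. This is an illustrative example under stated assumptions, not the repository's code: it works on any byte slice (such as a memory-mapped file), assumes `city;temperature` rows, silently skips malformed rows, and uses the hypothetical names `splitChunks`, `processChunk`, and `processParallel`.

```go
package main

import (
	"bytes"
	"strconv"
	"sync"
)

type stats struct {
	min, max, sum float64
	count         int
}

// splitChunks cuts data into at most n pieces, moving each cut forward to
// just past the next newline so that no row is split between two chunks.
func splitChunks(data []byte, n int) [][]byte {
	if n < 1 {
		n = 1
	}
	size := len(data)/n + 1
	var chunks [][]byte
	for len(data) > 0 {
		end := len(data)
		if size < len(data) {
			if i := bytes.IndexByte(data[size:], '\n'); i >= 0 {
				end = size + i + 1
			}
		}
		chunks = append(chunks, data[:end])
		data = data[end:]
	}
	return chunks
}

// processChunk aggregates one chunk into its own map, so workers share no state.
func processChunk(chunk []byte) map[string]*stats {
	local := make(map[string]*stats)
	for len(chunk) > 0 {
		line := chunk
		if i := bytes.IndexByte(chunk, '\n'); i >= 0 {
			line, chunk = chunk[:i], chunk[i+1:]
		} else {
			chunk = nil
		}
		name, value, ok := bytes.Cut(line, []byte(";"))
		if !ok {
			continue // skip malformed rows in this sketch
		}
		t, err := strconv.ParseFloat(string(value), 64)
		if err != nil {
			continue
		}
		s := local[string(name)]
		if s == nil {
			s = &stats{min: t, max: t}
			local[string(name)] = s
		}
		if t < s.min {
			s.min = t
		}
		if t > s.max {
			s.max = t
		}
		s.sum += t
		s.count++
	}
	return local
}

// processParallel runs one goroutine per chunk and merges the partial maps.
func processParallel(data []byte, workers int) map[string]*stats {
	chunks := splitChunks(data, workers)
	partials := make([]map[string]*stats, len(chunks))
	var wg sync.WaitGroup
	for i, c := range chunks {
		wg.Add(1)
		go func(i int, c []byte) {
			defer wg.Done()
			partials[i] = processChunk(c) // each goroutine writes its own slot
		}(i, c)
	}
	wg.Wait()

	total := make(map[string]*stats)
	for _, p := range partials {
		for name, s := range p {
			t := total[name]
			if t == nil {
				total[name] = s
				continue
			}
			if s.min < t.min {
				t.min = s.min
			}
			if s.max > t.max {
				t.max = s.max
			}
			t.sum += s.sum
			t.count += s.count
		}
	}
	return total
}
```

Giving each goroutine a private map and merging once at the end avoids locking on the hot path; a natural choice for `workers` would be `runtime.NumCPU()`.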
