# StreamGuard

**StreamGuard** is a high-performance data management Node.js application designed to handle high-velocity data streams while reducing the load on MongoDB. It routes real-time data through Kafka, making it well suited to scenarios such as live GPS tracking and other applications that require rapid data handling.

## Features

- **Real-Time Data Processing**: Handles fast-moving data streams efficiently.
- **Reduced MongoDB Load**: Buffers writes through Kafka to relieve pressure on the database.
- **Bulk Insertion**: Batches buffered entries into bulk insert operations to cut per-write overhead.

## Installation

Clone the repository and install the dependencies:

```bash
git clone https://github.com/codeterrayt/StreamGuard.git
cd StreamGuard
npm install
```

## Running the Application

1. **Start the Required Services**:

- MongoDB:
```bash
docker run -p 27017:27017 mongo
```
- Zookeeper:
```bash
docker run -p 2181:2181 zookeeper
```
- Kafka:
```bash
# Point KAFKA_ZOOKEEPER_CONNECT at your ZooKeeper host; host.docker.internal
# works on Docker Desktop — substitute your host/container IP elsewhere.
docker run -p 9092:9092 \
  -e KAFKA_ZOOKEEPER_CONNECT=host.docker.internal:2181 \
  -e KAFKA_ADVERTISED_LISTENERS=PLAINTEXT://localhost:9092 \
  -e KAFKA_OFFSETS_TOPIC_REPLICATION_FACTOR=1 \
  confluentinc/cp-kafka
```

2. **Run the Server and Consumer Applications**:

```bash
node server.js
node consumer-app.js
```
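
As a rough illustration of what the server side does, here is a hypothetical producer sketch built on `kafkajs` (which the project uses); the endpoint path, topic name, and port are assumptions, not values confirmed by the repository:

```js
// Hypothetical sketch only: endpoint path, topic name, and port are assumptions.
const express = require('express');
const { Kafka } = require('kafkajs');

const kafka = new Kafka({ clientId: 'streamguard-server', brokers: ['localhost:9092'] });
const producer = kafka.producer();

const app = express();
app.use(express.json());

// Each incoming GPS update is published to Kafka rather than written to MongoDB directly.
app.post('/track', async (req, res) => {
  await producer.send({
    topic: 'gps-updates', // assumed topic name
    messages: [{ value: JSON.stringify(req.body) }],
  });
  res.sendStatus(202);
});

producer.connect().then(() => app.listen(3000));
```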

## Environment Variables

- **`MAX_DATA_LENGTH_BUFFER`**: The maximum number of data entries to buffer before performing a bulk insertion. For example, setting it to `100` triggers a bulk operation once 100 entries have accumulated.

Example configuration in `.env` file:

```env
MAX_DATA_LENGTH_BUFFER=100
```
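
For illustration, a minimal sketch of how the consumer might read this value, assuming the `dotenv` package is used to load the `.env` file:

```js
// Load .env and parse the buffer threshold; falls back to 100 if unset.
require('dotenv').config();
const MAX_DATA_LENGTH_BUFFER = parseInt(process.env.MAX_DATA_LENGTH_BUFFER, 10) || 100;
```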

## Usage

- **Server**: Manages data flow and interacts with Kafka.
- **Consumer**: Processes incoming data and performs bulk insertions once the configured buffer length is reached (see the sketch below).
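
A minimal sketch of that consumer logic, assuming `kafkajs` and the official `mongodb` driver; the topic, database, and collection names are illustrative, not the project's actual values:

```js
// Hypothetical sketch: buffer Kafka messages and flush them to MongoDB in one bulk insert.
require('dotenv').config();
const { Kafka } = require('kafkajs');
const { MongoClient } = require('mongodb');

const MAX_DATA_LENGTH_BUFFER = parseInt(process.env.MAX_DATA_LENGTH_BUFFER, 10) || 100;

async function run() {
  const mongo = await MongoClient.connect('mongodb://localhost:27017');
  const collection = mongo.db('streamguard').collection('locations'); // assumed names

  const kafka = new Kafka({ clientId: 'streamguard-consumer', brokers: ['localhost:9092'] });
  const consumer = kafka.consumer({ groupId: 'streamguard' });
  await consumer.connect();
  await consumer.subscribe({ topic: 'gps-updates', fromBeginning: false });

  let buffer = [];
  await consumer.run({
    eachMessage: async ({ message }) => {
      buffer.push(JSON.parse(message.value.toString()));
      // One insertMany call replaces many single-document writes;
      // entries below the threshold stay buffered until more arrive.
      if (buffer.length >= MAX_DATA_LENGTH_BUFFER) {
        await collection.insertMany(buffer);
        buffer = [];
      }
    },
  });
}

run().catch(console.error);
```

Batching this way trades a little latency (entries sit in the buffer until the threshold is hit) for far fewer round trips to MongoDB.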

## Testing

To test the data ingestion and processing, you can use the provided test script:

1. **Run the Test Script**:

```bash
node test/test.js
```

This script sends 3 requests per second with incrementing latitude and longitude values to simulate data streaming.
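
A hypothetical approximation of what such a script does; the endpoint path and starting coordinates are assumptions:

```js
// Sketch of a load generator: three requests per second with incrementing coordinates.
let lat = 19.0760;
let lng = 72.8777;

setInterval(() => {
  lat += 0.0001;
  lng += 0.0001;
  // Node 18+ ships a global fetch; use axios or http on older versions.
  fetch('http://localhost:3000/track', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ latitude: lat, longitude: lng }),
  }).catch(console.error);
}, 1000 / 3);
```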

## Contributing

Feel free to contribute by submitting issues or pull requests. For any questions or feedback, open an issue on the [GitHub repository](https://github.com/codeterrayt/StreamGuard).