https://github.com/desininja/airbnb-stream-data-ingestion
EventBridge Pipe for streaming data ingestion
https://github.com/desininja/airbnb-stream-data-ingestion
aws eventbridge eventbridge-pipes lambda
Last synced: 2 months ago
JSON representation
EventBridge Pipe for streaming data ingestion
- Host: GitHub
- URL: https://github.com/desininja/airbnb-stream-data-ingestion
- Owner: desininja
- Created: 2024-03-15T20:27:24.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-11-13T20:35:43.000Z (6 months ago)
- Last Synced: 2025-01-02T21:17:03.639Z (4 months ago)
- Topics: aws, eventbridge, eventbridge-pipes, lambda
- Language: Python
- Homepage:
- Size: 429 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
---
# Airbnb Stream Data Ingestion
This repository implements a real-time data ingestion pipeline for Airbnb data using AWS services. The project leverages Lambda functions and other AWS technologies to produce, enrich, and consume data in a streaming architecture, providing scalability and efficient data processing.
## Architecture Overview
The pipeline consists of three key Lambda functions:
1. **Producer Lambda Function**:
- Responsible for generating and sending streaming data to a designated AWS service like Kinesis.2. **Enrichment Lambda Function**:
- Processes incoming data to add additional information, perform transformations, or apply filtering.3. **Consumer Lambda Function**:
- Consumes enriched data, storing it in a database or forwarding it for further analysis or visualization.
## Technologies Used
- **AWS Lambda**: For serverless data processing.
- **Amazon S3**: For data storage.
- **Amazon CloudWatch**: For monitoring and logging.## Getting Started
### Prerequisites
- **AWS CLI** configured
- Basic knowledge of AWS Lambda, Kinesis, or similar streaming service### Setup
1. Clone the repository:
```bash
git clone https://github.com/desininja/AirBnb-Stream-Data-Ingestion.git
cd AirBnb-Stream-Data-Ingestion
```2. Deploy the Lambda functions by following the deployment script or manual setup documentation provided in the repo.
3. Configure the necessary AWS services, such as IAM roles, Kinesis, and DynamoDB, as outlined in the setup guide.
## Usage
1. Trigger the **Producer Lambda** to initiate data streaming.
2. Observe the **Enrichment Lambda** processing the data in real-time.
3. Check the **Consumer Lambda** output to see the final processed data.## Contributing
Contributions are welcome! Please submit a pull request or open an issue to suggest improvements or report bugs.
---