An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with bigdataanalytics

A curated list of projects in awesome lists tagged with bigdataanalytics.

https://github.com/freeipcc/freedatascrm

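Business registration data, phone-based customer acquisition, intelligent customer relationship management, data-driven marketing, automated sales leads, B2B marketing, customer insight analysis, precision marketing!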

ai bigdata bigdataanalytics data scrm

Last synced: 08 Feb 2026

https://github.com/divithraju/divith-raju-openmetadata

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

automation bigdata bigdataanalytics data data-structures dataengineering datascience hacktoberfest2022 metadata metadata-extraction

Last synced: 20 Feb 2025

https://github.com/bobergot/large-scale-data-processing-design-patterns

Explore essential MapReduce design patterns for big data processing! This repository includes practical implementations of patterns from the "MapReduce Design Patterns" book, complete with examples across summarization, filtering, organization, joins, and more.

bigdata bigdataanalytics cloudcomputing dataengineering dataprocessing datascience designpatterns distributedcomputing hadoop java mapreduce

Last synced: 16 Mar 2025

https://github.com/rushikeshshinde14/decoding-hotel-reservation-patterns-data-analysis

The aim of the project is to gain insights into customers' booking behaviors, preferences, and decision-making processes when reserving hotel accommodations. The data analysis process involves cleaning and organizing the hotel reservation data, then generating visualizations and reports to present the findings.

bigdataanalytics dataanalysis datainsights datavisualization datavisualization-project msword oracle pandas python sql

Last synced: 16 Mar 2025

https://github.com/withrvr/bda_mini_project

IPL data analysis

bda bigdataanalytics

Last synced: 29 Aug 2025

https://github.com/juanparias29/bigdataprocessing

Repository with data processing projects and labs using Databricks, Apache Spark, and Python. Covers key concepts of Big Data, storage, processing, analysis, and machine learning.

apache-spark bigdata bigdataanalytics bigdatainfrastructure data-science database nosql-database python sql

Last synced: 10 Jul 2025

https://github.com/npdao1992/bigdataanalysis_counterfeitcreditcards-kafka_sparkstreaming

Analyze fraudulent credit card transactions using Kafka, Spark Streaming, and a Random Forest classifier in PySpark.

bigdata bigdataanalytics kafka python spark-streaming

Last synced: 03 Mar 2025

https://github.com/jotstolu/retail-orders-end-to-end-data-engineering-project-using-azure-databricks

This project demonstrates the development of a scalable end-to-end data pipeline for processing and reporting retail order data using Azure Cloud services, Delta Lake architecture, and Databricks.

azure-devops azurecloud azuredatabricks azuredatafactory bigdataanalytics databricks databricks-notebooks delta-lake medallion-architecture pyspark pyspark-notebook starschema unitycatalog

Last synced: 23 Oct 2025