An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with mrjob

A curated list of projects in awesome lists tagged with mrjob .

https://github.com/groda/big_data

Tutorials on Big Data essentials: Hadoop, MapReduce, Spark. Explore a variety of tutorials and demonstrations on Big Data technologies, primarily in the form of Jupyter notebooks. Most notebooks are self-contained and live—ready to run with a click.

apache-sedona apache-spark big-data bigdata bigtop docker gutenberg-ebooks hadoop hadoop-cluster hadoop-hdfs hadoop-mapreduce jupyter-notebook mapreduce mapreduce-bash mrjob pyspark spark spark-sql testdfsio

Last synced: 06 Apr 2025

https://github.com/jehiah/gomrjob

gomrjob - a Go Framework for Hadoop Map Reduce Jobs

dataproc go hadoop mapreduce mrjob

Last synced: 17 Mar 2025

https://github.com/hearthsim/articles

Analysis of Hearthstone replays

emr hearthstone mrjob replays

Last synced: 15 Apr 2025

https://github.com/burhanahmed1/big-data-analytics

Practice tasks in Python programming language using Hadoop, MRJob, PySpark for Big Data Analytics.

apache-spark hadoop hadoop-mapreduce jupyter-notebook mrjob pyspark python spark spark-sql sparksql

Last synced: 22 Feb 2025

https://github.com/aromoh/basic-sentiment-analysis-mrjob-twitter-

Project developed to make an sentiment analysis using dictionary implemented with MrJob applying a map-reduce model. It can be executed locally or in HDFS enviroments (such as Hadoop or AWS)

aws-ec2 hadoop hdfs-enviroments map-reduce mrjob sentiment-analysis twiiter

Last synced: 29 Mar 2025

https://github.com/e-panourgia/big-data

Big Data Management Systems course assignments

analytics azure bigdata data hadoop json latex mrjob neo4j python redis stream

Last synced: 09 Apr 2025

https://github.com/heracliteanflux/exercises-scala

Exercises in the Scala programming language with an emphasis on big data programming and applications in Apache Hadoop and Apache Spark.

apache-hadoop apache-maven apache-spark distributed-computing distributed-file-system distributed-systems hadoop map-reduce mrjob scala spark

Last synced: 12 Mar 2025