{"id":18924183,"url":"https://github.com/jleetutorial/python-spark-streaming","last_synced_at":"2025-05-07T07:19:13.289Z","repository":{"id":49358933,"uuid":"109314382","full_name":"jleetutorial/python-spark-streaming","owner":"jleetutorial","description":null,"archived":false,"fork":false,"pushed_at":"2018-04-04T00:37:09.000Z","size":148850,"stargazers_count":150,"open_issues_count":2,"forks_count":165,"subscribers_count":11,"default_branch":"master","last_synced_at":"2025-05-07T07:18:56.364Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/jleetutorial.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2017-11-02T20:17:18.000Z","updated_at":"2025-04-22T13:13:33.000Z","dependencies_parsed_at":"2022-09-24T00:52:08.434Z","dependency_job_id":null,"html_url":"https://github.com/jleetutorial/python-spark-streaming","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jleetutorial%2Fpython-spark-streaming","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jleetutorial%2Fpython-spark-streaming/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jleetutorial%2Fpython-spark-streaming/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jleetutorial%2Fpython-spark-streaming/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/jleetutorial","download_url":"https://codeload.github.com/jleetutorial/python-spark-streaming/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252831271,"owners_count":21810784,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-08T11:06:03.588Z","updated_at":"2025-05-07T07:19:13.250Z","avatar_url":"https://github.com/jleetutorial.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Python Spark Streaming\n\n### Overview\n\nProject source code for James Lee's Aparch Spark with Python (Pyspark) course.\n\n### Description\n\nTools like spark are incredibly useful for processing data that is continuously appended. The python bindings for Pyspark not only allow you to do that, but also allow you to combine spark streaming with other Python tools for Data Science and Machine learning. This course goes through some of the basics of using Apache Spark, as well as more advanced concepts like accumulators, combining Pyspark with Apache Kafka, using Pyspark with AWS tools like Kinesis, streaming data from sources like Twitter, and how to get the most out of the Structured Streaming paradigm in the recently-released Spark 2.3.0.\n\nThis course is a one-stop-shop for all your pyspark streaming education needs.\n\n\n### What's in this Repo?\n\nIn this repo are the notebooks, data files, exercise files, and everything else you need to learn how to use the streaming capabilities of Pyspark.\n\n### More content like this\n\nCheck out the full list of DevOps and Big Data courses that James and Tao teach [here](https://www.level-up.one/courses/)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjleetutorial%2Fpython-spark-streaming","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjleetutorial%2Fpython-spark-streaming","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjleetutorial%2Fpython-spark-streaming/lists"}