{"id":14982386,"url":"https://github.com/learningjournal/spark-streaming-in-python","last_synced_at":"2025-09-04T03:42:41.120Z","repository":{"id":39714304,"uuid":"281429307","full_name":"LearningJournal/Spark-Streaming-In-Python","owner":"LearningJournal","description":"Apache Spark 3 - Structured Streaming Course Material","archived":false,"fork":false,"pushed_at":"2023-08-19T11:44:44.000Z","size":20329,"stargazers_count":121,"open_issues_count":0,"forks_count":159,"subscribers_count":7,"default_branch":"master","last_synced_at":"2025-04-10T01:11:59.109Z","etag":null,"topics":["apache-spark","big-data","bigdata","data-lake","pyspark","python","spark-sql","spark-streaming"],"latest_commit_sha":null,"homepage":"https://www.learningjournal.guru","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/LearningJournal.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2020-07-21T15:04:21.000Z","updated_at":"2024-11-28T14:34:20.000Z","dependencies_parsed_at":"2022-07-13T07:20:29.105Z","dependency_job_id":"8120811d-393b-40c0-ae62-ec873604135d","html_url":"https://github.com/LearningJournal/Spark-Streaming-In-Python","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LearningJournal%2FSpark-Streaming-In-Python","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LearningJournal%2FSpark-Streaming-In-Python/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LearningJournal%2FSpark-Streaming-In-Python/releases","manifests_url":"https:/
/repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LearningJournal%2FSpark-Streaming-In-Python/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/LearningJournal","download_url":"https://codeload.github.com/LearningJournal/Spark-Streaming-In-Python/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248137888,"owners_count":21053775,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["apache-spark","big-data","bigdata","data-lake","pyspark","python","spark-sql","spark-streaming"],"created_at":"2024-09-24T14:05:19.447Z","updated_at":"2025-04-10T01:12:06.371Z","avatar_url":"https://github.com/LearningJournal.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Apache Spark 3 - Real-time Stream Processing using Python\nThis is the central repository for all the materials related to \u003cem\u003eApache Spark 3 -Real-time Stream Processing using Python\u003c/em\u003e \u003cbr\u003eCourse by Prashant Pandey.\n\u003cbr\u003e You can get the full course at \u003ca href=\"https://www.udemy.com/course/draft/3184584/?referralCode=77E18B4F800479A263D5\"\u003e \n  Apache Spark Course @ Udemy.\n\u003c/a\u003e\n\n\u003cdiv\u003e\n\u003ca href=\"https://www.udemy.com/course/spark-streaming-using-python/?referralCode=127B048D9EBD2D1278DC\"\u003e\n\u003cimg src=\"https://www.learningjournal.guru/_resources/img/jpg-5x/spark-beginners-course.jpg\" alt=\"Apache Spark 3 - Real-time Stream Processing using Python\" width=\"300\" align=\"left\"\u003e 
\n\u003c/a\u003e\n\n\u003ch2\u003e Description \u003c/h2\u003e\n\u003cp align=\"justify\"\u003e\n  I am creating \u003cem\u003eApache Spark 3 - Real-time Stream Processing using Python \u003c/em\u003ecourse to help you understand stream processing with Apache Spark and apply that knowledge to build stream processing solutions. This course is example-driven and follows a working-session-like approach. We will take a live-coding approach and explain all the needed concepts along the way.\n\u003c/p\u003e\n\n\u003ch3\u003eWho should take this Course?\u003c/h3\u003e\n\u003cp align=\"justify\"\u003e\nI designed this course for software engineers who want to develop stream processing pipelines and applications using Apache Spark. I am also creating this course for data architects and data engineers who are responsible for designing and building the organization’s data-centric infrastructure. Another audience is managers and architects who do not work directly on Spark implementation but work with the people who implement Apache Spark at the ground level.\n\u003c/p\u003e\n\n\u003ch3\u003eSpark and source code version\u003c/h3\u003e\n\u003cp align=\"justify\"\u003e\nThis course uses Apache Spark 3.x. I have tested all the source code and examples used in this course on the Apache Spark 3.0.0 open-source distribution.\n\u003c/p\u003e\n\n\u003c/div\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flearningjournal%2Fspark-streaming-in-python","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flearningjournal%2Fspark-streaming-in-python","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flearningjournal%2Fspark-streaming-in-python/lists"}