{"id":26509629,"url":"https://github.com/viniciusvdias/pdm","last_synced_at":"2026-04-13T22:02:19.902Z","repository":{"id":283387651,"uuid":"925321128","full_name":"viniciusvdias/pdm","owner":"viniciusvdias","description":"DCC/UFLA course \"Big-Data: Massive Data Processing\"","archived":false,"fork":false,"pushed_at":"2026-03-11T19:00:16.000Z","size":729124,"stargazers_count":3,"open_issues_count":0,"forks_count":12,"subscribers_count":1,"default_branch":"main","last_synced_at":"2026-03-12T00:32:44.866Z","etag":null,"topics":["bigdata","computer-science","education","kafka","neo4j","spark","systems","ufla"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/viniciusvdias.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2025-01-31T16:50:50.000Z","updated_at":"2026-03-11T19:00:35.000Z","dependencies_parsed_at":"2026-03-11T21:02:39.623Z","dependency_job_id":null,"html_url":"https://github.com/viniciusvdias/pdm","commit_stats":null,"previous_names":["viniciusvdias/pdm"],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/viniciusvdias/pdm","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viniciusvdias%2Fpdm","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viniciusvdias%2Fpdm/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viniciusvdias%2Fpdm/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viniciusvdias%2Fpdm/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/viniciusvdias","download_url":"https://codeload.github.com/viniciusvdias/pdm/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viniciusvdias%2Fpdm/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31772642,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-13T20:17:16.280Z","status":"ssl_error","status_checked_at":"2026-04-13T20:17:08.216Z","response_time":93,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bigdata","computer-science","education","kafka","neo4j","spark","systems","ufla"],"created_at":"2025-03-21T01:19:54.585Z","updated_at":"2026-04-13T22:02:19.888Z","avatar_url":"https://github.com/viniciusvdias.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Big-Data: Massive Data Processing\n\n## Getting started\n\n1. This course's tools assume a Linux/UNIX system. In case you do not have\naccess yet, [install it in your computer natively](https://ubuntu.com/tutorials/install-ubuntu-desktop#1-overview), or install on Windows via [Windows Linux Subsystem (WSL)](https://learn.microsoft.com/pt-br/windows/wsl/)\n2. Clone this repository:\n\n```\ngit clone https://github.com/viniciusvdias/pdm\n```\n\n3. Change directory to the repository and build necessary tools (this may take\n   a while, but must be done once):\n\n```\ncd pdm\nmake\n```\n\n## Getting to know this repository\n\n- `docs/`: Misc documentation and mini-tutorials, links, study references, etc.\n- `hostdir/`: Mapped as a volume in docker containers for persisting of your work (notebooks, processed files, etc.)\n- `jupytercli/`: Client for interacting with Docker containers via [Jupyter](https://jupyter.org/)\n- `neo4j/`: Graph system used in this course\n- `spark/`: General-purpose system used in this course\n- `vmaccess/`: Used to access this course's VMs\n- (Assignment) `exercises/`: hands-on exercises, day-to-day individual assignments,\n- (Assignment) `seminars/`: topics in Big-Data, group assignment\n- (Assignment) `finalproject/`: final project, group assignment\n\n## Additional course material\n\n- [INTRO - Introduction](https://docs.google.com/presentation/d/18VEGdulAowiQJ9OXPXUxge2fQg38etU7NN3YSMVvmC0/edit?usp=sharing)\n- [SYS - Fundamentals of Parallel/Distributed Systems](https://docs.google.com/presentation/d/11KmZmuRXqUfmWVTNKh2wqtjtwFwfAsf0RqWaTh6Gwnk/edit?usp=sharing)\n- [DATA - Fundamentals of Big-Data Systems](https://docs.google.com/presentation/d/1etproR5qdOgRkG-aY1EKkErUmd1Nm7fMKGSjy3zD6Qk/edit?usp=sharing)\n- [BATCH - Batch processing 1](https://docs.google.com/presentation/d/17iuE9aKG_NRJ1ui_YVQ5IV3yIyxG1OTvY-iQfOdJy6E/edit?usp=sharing)\n- [BATCH - Batch processing 2](https://docs.google.com/presentation/d/1jv5srrBMUQhbORXOA9bhhfFG3goHJQiR0Anxrn9Hrnw/edit?usp=sharing)\n- [STRUCT - Structured Data processing 1](https://docs.google.com/presentation/d/1oLYEyYSp-gUPbP5Kx2affD-SpdzPYrKdLGXH32ezSnQ/edit?usp=sharing)\n- [STRUCT - Structured Data processing 2](https://docs.google.com/presentation/d/1LYTM0Nk91MLxUdZZca5VUkBLXjvlJ-70ZyhTSR6ix0A/edit?usp=sharing)\n- [STREAM - Stream processing 1](https://docs.google.com/presentation/d/1Rl6a1lHvzS3kI8c1umg9pDwXj5NgbOcmOh_om3ZDnH0/edit?usp=sharing)\n- [STREAM - Stream processing 2](https://docs.google.com/presentation/d/1aZlHeMqU8l7AmBQl58yJWqr2Nv-0aD6vIJ-r2tXfeSs/edit?usp=sharing)\n- [GRAPH - Graph processing 1](https://docs.google.com/presentation/d/16q9uV-SLYzERZsdAxeJfKMhhgQLFyUkkl4xeapCBM0c/edit?usp=sharing)\n- [GRAPH - Graph processing 2](https://docs.google.com/presentation/d/1BodgQ8EtKJd4UWqIONqz0OTeSqNSkdb7SQwTD6cdtCY/edit?usp=sharing)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fviniciusvdias%2Fpdm","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fviniciusvdias%2Fpdm","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fviniciusvdias%2Fpdm/lists"}