{"id":18675658,"url":"https://github.com/miferreiro/cdap-map-reduce","last_synced_at":"2025-07-25T09:04:11.232Z","repository":{"id":102727750,"uuid":"274185536","full_name":"miferreiro/CDAP-Map-Reduce","owner":"miferreiro","description":"Map/Reduce exercises for the subject of \"Computación Distribuída e de Altas Prestacións\" in the Master Degree of Computer Engineering of the University of Vigo in 2020","archived":false,"fork":false,"pushed_at":"2020-06-22T16:19:27.000Z","size":33183,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2024-12-27T20:36:21.282Z","etag":null,"topics":["map-reduce","python"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/miferreiro.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-06-22T16:12:56.000Z","updated_at":"2020-06-22T16:21:41.000Z","dependencies_parsed_at":null,"dependency_job_id":"5cca2232-99f6-406a-8913-343f5edd7080","html_url":"https://github.com/miferreiro/CDAP-Map-Reduce","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/miferreiro%2FCDAP-Map-Reduce","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/miferreiro%2FCDAP-Map-Reduce/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/miferreiro%2FCDAP-Map-Reduce/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/miferreiro%2FCDAP-Map-Reduce/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/miferreiro","download_url":"https://codeload.github.com/miferreiro/CDAP-Map-Reduce/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239520244,"owners_count":19652659,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["map-reduce","python"],"created_at":"2024-11-07T09:25:49.184Z","updated_at":"2025-02-18T17:43:05.131Z","avatar_url":"https://github.com/miferreiro.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Starting on Map/reduce\n\nThese three exercises were made in the subject of \"Computación Distribuída e de Altas Prestacións\" in the Master Degree of Computer Engineering of the University of Vigo in 2020\n\n### Exercise 1\n\nThis exercise is composed of a series of files containing audience data on topics broadcast on radio stations:\n- The join_cad?.txt files consist of a list of music tracks and, for each track, the radio station where it was broadcast.\n- The join_num?.txt files also contain playlists and, for each track, the number of listeners it has had.\n\nThe objective of this section is to implement a map/reduce task that provides an answer to the following question:\n\n*What has been the total number of listeners (in all radio stations) to the topics that have been broadcast by RNE1?*\n\nNOTE 1: the mapper for this task is simple. Once implemented, its operation can be checked in the terminal:\n\n`$ cat join_*.txt | ./join_mapper.py | sort`\n\nNOTE 2: the reducer will be a little more complex, but we must not lose sight of the fact that at its entry the data will be ordered alphabetically.\n\n### Exercise 2\n\nIn order to do this exercise, the file containing information on the sales made in a chain of department stores in January 2012 is used as a starting point. Each line of the purchases.txt file contains the following fields: date, time, city, section, amount, means of payment.\n\nWe ask that you implement map/reduce programs that will allow you to answer the following questions:\n- What is the most widely used payment method for the purchase of computers?\n- For each means of payment, which section makes the most sales?\n\nA small pdf document should be attached briefly justifying the decision taken on the content of the \u003ckey,value\u003e fields and briefly explaining the implementation and results.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmiferreiro%2Fcdap-map-reduce","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmiferreiro%2Fcdap-map-reduce","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmiferreiro%2Fcdap-map-reduce/lists"}