{"id":15056849,"url":"https://github.com/gares95/data-modeling_apache-cassandra","last_synced_at":"2026-02-12T01:38:55.661Z","repository":{"id":218120523,"uuid":"304933210","full_name":"Gares95/Data-Modeling_Apache-Cassandra","owner":"Gares95","description":"Create an Apache Cassandra database. Project based on Udacity's template. ","archived":false,"fork":false,"pushed_at":"2020-10-17T18:43:25.000Z","size":340,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-03-14T07:45:59.408Z","etag":null,"topics":["apache-cassandra","udacity","udacity-data-engineer-nanodegree"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Gares95.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2020-10-17T17:32:30.000Z","updated_at":"2020-10-17T18:43:27.000Z","dependencies_parsed_at":"2024-01-19T21:31:16.331Z","dependency_job_id":"1a8797e3-e6e3-42a9-836e-55662de53a4e","html_url":"https://github.com/Gares95/Data-Modeling_Apache-Cassandra","commit_stats":null,"previous_names":["gares95/data-modeling_apache-cassandra"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Gares95%2FData-Modeling_Apache-Cassandra","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Gares95%2FData-Modeling_Apache-Cassandra/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Gares95%2FData-Modeling_Apache-Cassandra/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Gares95%2FData-Modeling_Apache-Cassandra/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Gares95","download_url":"https://codeload.github.com/Gares95/Data-Modeling_Apache-Cassandra/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243544664,"owners_count":20308168,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["apache-cassandra","udacity","udacity-data-engineer-nanodegree"],"created_at":"2024-09-24T21:57:08.559Z","updated_at":"2026-02-12T01:38:55.634Z","avatar_url":"https://github.com/Gares95.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Data Modeling with Apache Cassandra\n***\nThis repository simulates the creation of an Apache Cassandra database for a music streaming startup whose data currently resides in a directory of CSV files and are looking for an easy way to query the data and to analyze it. \n\nThis project will include the files to create and define tables for a Apache Cassandra database and will serve as an example on how to make a connection to a Cassandra instance in your local machine and create a cluster to model data creating tables in order to be able to run queries of the data and process it and analyze it. \n\n# Data Files\n***\n### Event_data\nThe data that we are going to use is store in \u003cem\u003eevent_data\u003c/em\u003e directory which contains csv files with the next structure:\n\n![alt text](https://raw.githubusercontent.com/Gares95/Data-Modeling_Apache-Cassandra/master/images/image_event_datafile_new.jpg)\n\nThe directory in this repository will only contain an example file.\n\n# Python files\n***\n## Project_1B_Project_GuillermoGarcia.ipynb\n\nThis Jupyter notebook contains the steps to read the CSV files and create the cluster using Apache Cassandra to process the data and create tables in which we will load this data and we will visualize it using some queries. The notebook includes descriptive commentary and explanatory text indicating the different queries and statements and it also includes the lines of code to finally drop the tables and close the session and cluster connection.\n\n\n### Credits\n***\nUdacity provided the template and the guidelines to start this project.\nThe completion of this was made by Guillermo Garcia and the review of the program and the verification that the project followed the proper procedures was also made by my mentor from udacity.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgares95%2Fdata-modeling_apache-cassandra","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgares95%2Fdata-modeling_apache-cassandra","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgares95%2Fdata-modeling_apache-cassandra/lists"}