{"id":23679970,"url":"https://github.com/luciarevaliente/shell_script_data_cleaning","last_synced_at":"2026-02-04T12:36:44.607Z","repository":{"id":262308369,"uuid":"548829282","full_name":"luciarevaliente/Shell_script_data_cleaning","owner":"luciarevaliente","description":"This project focuses on cleaning and processing datasets using Shell scripts. It is part of the Fundamentals of Informatics course (2022-23) and involves handling movie and show data to create cleaned and filtered datasets for further analysis.","archived":false,"fork":false,"pushed_at":"2024-12-08T20:07:03.000Z","size":4003,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-05-21T18:33:48.947Z","etag":null,"topics":["data","data-cleaning","shell-script"],"latest_commit_sha":null,"homepage":"","language":"Shell","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/luciarevaliente.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2022-10-10T08:46:16.000Z","updated_at":"2024-12-08T20:09:14.000Z","dependencies_parsed_at":"2025-05-21T18:32:30.681Z","dependency_job_id":"415c61f4-8007-4b34-b735-ccc2543c0542","html_url":"https://github.com/luciarevaliente/Shell_script_data_cleaning","commit_stats":null,"previous_names":["luciarevaliente/fon_info_practica1"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/luciarevaliente/Shell_script_data_cleaning","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luciarevaliente%2FShell_script_data_cleaning","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luciarevaliente%2FShell_script_data_cleaning/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luciarevaliente%2FShell_script_data_cleaning/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luciarevaliente%2FShell_script_data_cleaning/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/luciarevaliente","download_url":"https://codeload.github.com/luciarevaliente/Shell_script_data_cleaning/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/luciarevaliente%2FShell_script_data_cleaning/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29084406,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-04T03:31:03.593Z","status":"ssl_error","status_checked_at":"2026-02-04T03:29:50.742Z","response_time":62,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data","data-cleaning","shell-script"],"created_at":"2024-12-29T17:57:00.373Z","updated_at":"2026-02-04T12:36:44.602Z","avatar_url":"https://github.com/luciarevaliente.png","language":"Shell","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Fundamentals of Informatics: Cleaning a Dataset\nThis repository contains the first practice of the Fundamentals of Informatics course (2022-23), which involves cleaning a dataset.\n\n## Project Description\nThe goal of this practice is to learn how to handle and clean datasets using Shell scripts. Several CSV files with movie and show data have been provided, and scripts have been created to filter and clean this data, generating final files that are more manageable and useful for further analysis.\n\n## Repository Contents\n- **Movies.csv**: Original file with movie data.\n- **Movies_columna12.csv** to **Movies_columna16.csv**: Files with specific columns extracted from the original dataset.\n- **Movies_f.csv** and **Movies_net.csv**: Files with filtered and cleaned movie data.\n- **Shows.csv**: Original file with show data.\n- **Shows_columna12.csv** to **Shows_columna15.csv**: Files with specific columns extracted from the original dataset.\n- **Shows_f.csv** and **Shows_net.csv**: Files with filtered and cleaned show data.\n- **practica1.sh**: Script used for data cleaning and processing.\n- **prova.txt** and **prova_script_pas4**: Test files used during the development of the practice.\n- **titles.cvs**: File with titles of movies and shows.\n\n## Instructions\n1. **Clone the repository**:\n    ```bash\n    git clone https://github.com/luciarevaliente/fon_info_practica1.git\n    cd fon_info_practica1\n    ```\n\n2. **Run the cleaning script**:\n    ```bash\n    ./practica1.sh\n    ```\n\n## Contributions\nThis project is part of an academic course and does not accept external contributions.\n\n## License\n\nThis project does not have a specific license and is for educational purposes only.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fluciarevaliente%2Fshell_script_data_cleaning","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fluciarevaliente%2Fshell_script_data_cleaning","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fluciarevaliente%2Fshell_script_data_cleaning/lists"}