{"id":28277862,"url":"https://github.com/cyblx/clustering","last_synced_at":"2025-07-18T08:36:18.823Z","repository":{"id":260464530,"uuid":"874912545","full_name":"CybLX/Clustering","owner":"CybLX","description":"This project explores clustering techniques and supervised learning applied to World Cup team performance analysis. The methodologies include K-Means, DBSCAN, K-Nearest Neighbors, Gaussian Mixture Models (GMM), and Agglomerative Clustering.","archived":false,"fork":false,"pushed_at":"2024-10-18T18:02:33.000Z","size":1681,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-05-28T07:58:06.300Z","etag":null,"topics":["clustering","data-analysis","dbscan","gmm","kmeans","supervised-learning","unsupervised-learning","world-cup"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/CybLX.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-10-18T17:34:09.000Z","updated_at":"2025-04-05T19:57:24.000Z","dependencies_parsed_at":"2024-10-31T13:41:35.366Z","dependency_job_id":"38fae4e4-3b66-44f8-a346-a4fc93a7e7a5","html_url":"https://github.com/CybLX/Clustering","commit_stats":null,"previous_names":["cyblx/clustering"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/CybLX/Clustering","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CybLX%2FClustering","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CybLX%2FClustering/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CybLX%2FClustering/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CybLX%2FClustering/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/CybLX","download_url":"https://codeload.github.com/CybLX/Clustering/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CybLX%2FClustering/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":265728947,"owners_count":23818733,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["clustering","data-analysis","dbscan","gmm","kmeans","supervised-learning","unsupervised-learning","world-cup"],"created_at":"2025-05-21T07:13:45.785Z","updated_at":"2025-07-18T08:36:18.814Z","avatar_url":"https://github.com/CybLX.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Clustering Techniques and Supervised Learning\n\n## Overview\nThis project explores various clustering techniques and supervised learning applied to the analysis of team performance in the World Cup. The methodologies covered include K-Means, DBSCAN, K-Nearest Neighbors, Gaussian Mixture Models (GMM), and Agglomerative Clustering.\n\n## Dataset Features\nThe dataset used in this project contains information such as:\n\n- **Position**: Team's ranking position\n- **Team**: Name of the team\n- **Games Played**: Total number of games played\n- **Win**: Total number of wins\n- **Draw**: Total number of draws\n- **Loss**: Total number of losses\n- **Goals For**: Total goals scored by the team\n- **Goals Against**: Total goals conceded by the team\n- **Goal Difference**: Difference between goals scored and conceded\n- **Points**: Total points accumulated\n- **Year**: Year of the competition\n\n## Project Goals\nThe main objective of this project is to apply clustering techniques to gain a better understanding of the data structure and the relationships among the variables. We aim to identify groups of similar teams, effectively segment the data, and evaluate the performance of machine learning algorithms in different scenarios, with an emphasis on teaching unsupervised learning techniques.\n\n## Tools Used\n- Python\n- Jupyter Notebook\n- Libraries: Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn, among others.\n\n## How to Use\n\n1. Clone the repository to your local machine:\n   ```bash\n   git clone https://github.com/cyblx/clustering.git\n   ```\n\n2. Install the required libraries:\n   ```bash\n   pip install -r requirements.txt\n   ```\n\n3. Open Jupyter Notebook and run the analysis:\n   ```bash\n   jupyter notebook\n   ```\n\n4. Follow the instructions within the notebook to explore the dataset and view the analysis results.\n\n## For More Information\nFor more information, codes, tutorials, and exciting projects, visit the links below:\n\n- Email: alves_lucasoliveira@usp.br\n- GitHub: [cyblx](https://github.com/cyblx)\n- LinkedIn: [Cyblx](https://www.linkedin.com/in/cyblx)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcyblx%2Fclustering","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcyblx%2Fclustering","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcyblx%2Fclustering/lists"}