{"id":34501471,"url":"https://github.com/basemax/qt-dataset-explorer","last_synced_at":"2026-04-21T12:03:37.120Z","repository":{"id":329920766,"uuid":"1120765661","full_name":"BaseMax/qt-dataset-explorer","owner":"BaseMax","description":"Interactive desktop GUI application for exploring datasets and performing statistical analysis. Built using Python, PyQt5, and scientific libraries (pandas, matplotlib, seaborn, scipy). Interactive desktop app for exploring datasets and statistics visually. Built using Python, PyQt5, and scientific libraries.","archived":false,"fork":false,"pushed_at":"2025-12-22T09:47:40.000Z","size":16,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-12-23T20:55:58.608Z","etag":null,"topics":["dataset","dataset-checker","dataset-explorer","datasets","py","py3","pyqt","pyqt5","python","python3","qt","qt5"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/BaseMax.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2025-12-21T22:33:32.000Z","updated_at":"2025-12-22T09:47:44.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/BaseMax/qt-dataset-explorer","commit_stats":null,"previous_names":["basemax/qt-dataset-explorer"],"tags_count":null,"template":false,"template_full_name":null,"purl":"pkg:github/BaseMax/qt-dataset-explorer","repository_url":"ht
tps://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BaseMax%2Fqt-dataset-explorer","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BaseMax%2Fqt-dataset-explorer/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BaseMax%2Fqt-dataset-explorer/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BaseMax%2Fqt-dataset-explorer/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/BaseMax","download_url":"https://codeload.github.com/BaseMax/qt-dataset-explorer/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BaseMax%2Fqt-dataset-explorer/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":27992996,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-12-24T02:00:07.193Z","response_time":83,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["dataset","dataset-checker","dataset-explorer","datasets","py","py3","pyqt","pyqt5","python","python3","qt","qt5"],"created_at":"2025-12-24T02:02:05.664Z","updated_at":"2025-12-24T02:02:22.743Z","avatar_url":"https://github.com/BaseMax.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Qt Dataset Explorer\n\nInteractive desktop GUI application for exploring datasets and performing statistical 
analysis. Built using Python, PyQt5, and scientific libraries (pandas, matplotlib, seaborn, scipy).\n\n![Qt Dataset Explorer](https://private-user-images.githubusercontent.com/2658040/529159004-83791923-3698-4493-b335-8aa38485a3e9.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3NjYzOTcxMzUsIm5iZiI6MTc2NjM5NjgzNSwicGF0aCI6Ii8yNjU4MDQwLzUyOTE1OTAwNC04Mzc5MTkyMy0zNjk4LTQ0OTMtYjMzNS04YWEzODQ4NWEzZTkucG5nP1gtQW16LUFsZ29yaXRobT1BV1M0LUhNQUMtU0hBMjU2JlgtQW16LUNyZWRlbnRpYWw9QUtJQVZDT0RZTFNBNTNQUUs0WkElMkYyMDI1MTIyMiUyRnVzLWVhc3QtMSUyRnMzJTJGYXdzNF9yZXF1ZXN0JlgtQW16LURhdGU9MjAyNTEyMjJUMDk0NzE1WiZYLUFtei1FeHBpcmVzPTMwMCZYLUFtei1TaWduYXR1cmU9OGVjZmU5ZTIwY2ZhYWJiZGNiMDg0ZjY5MDdkNzMwNjMyOWZjNDY1MDg5ZjQ0MWJlNjE2NjkzN2IyZDVmNGVmNyZYLUFtei1TaWduZWRIZWFkZXJzPWhvc3QifQ.UvhhhGFTm6KlzB6_o-hjzqsX4VNjdRjgh8PBN2pxzA4)\n\n![Qt Dataset Explorer](https://private-user-images.githubusercontent.com/2658040/529159084-567764a6-96be-462c-bdd0-95f322cbbcb4.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3NjYzOTcxMzUsIm5iZiI6MTc2NjM5NjgzNSwicGF0aCI6Ii8yNjU4MDQwLzUyOTE1OTA4NC01Njc3NjRhNi05NmJlLTQ2MmMtYmRkMC05NWYzMjJjYmJjYjQucG5nP1gtQW16LUFsZ29yaXRobT1BV1M0LUhNQUMtU0hBMjU2JlgtQW16LUNyZWRlbnRpYWw9QUtJQVZDT0RZTFNBNTNQUUs0WkElMkYyMDI1MTIyMiUyRnVzLWVhc3QtMSUyRnMzJTJGYXdzNF9yZXF1ZXN0JlgtQW16LURhdGU9MjAyNTEyMjJUMDk0NzE1WiZYLUFtei1FeHBpcmVzPTMwMCZYLUFtei1TaWduYXR1cmU9MmFjNjRiYmUxOTU1Yjk4ODBkNzc1ZWE4NDQ3MjkzZjRiMjljZGQ3M2M1MjNhYzQ0NzJlNGI5NTZiNjU1NGJmNiZYLUFtei1TaWduZWRIZWFkZXJzPWhvc3QifQ.FMpY_MgvoIA43E6J8TTS5vVuSwv5BEAYTZ1BOG-xZi4)\n\n![Qt Dataset 
Explorer](https://private-user-images.githubusercontent.com/2658040/529159092-59228d2d-f9a1-4ea8-8902-8bd1db47365d.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3NjYzOTcxMzUsIm5iZiI6MTc2NjM5NjgzNSwicGF0aCI6Ii8yNjU4MDQwLzUyOTE1OTA5Mi01OTIyOGQyZC1mOWExLTRlYTgtODkwMi04YmQxZGI0NzM2NWQucG5nP1gtQW16LUFsZ29yaXRobT1BV1M0LUhNQUMtU0hBMjU2JlgtQW16LUNyZWRlbnRpYWw9QUtJQVZDT0RZTFNBNTNQUUs0WkElMkYyMDI1MTIyMiUyRnVzLWVhc3QtMSUyRnMzJTJGYXdzNF9yZXF1ZXN0JlgtQW16LURhdGU9MjAyNTEyMjJUMDk0NzE1WiZYLUFtei1FeHBpcmVzPTMwMCZYLUFtei1TaWduYXR1cmU9NmFjYWY0ZDdkMTM0NzA4NGY2Yzc5MDI5MDk4NWMxZTFmOTNkMDg4NzI3NTIzN2RhYTBhM2U1YjdhNGQ3NzY2MCZYLUFtei1TaWduZWRIZWFkZXJzPWhvc3QifQ.utP8psG3eZ5sBL0x-9Stnp9ieaChcUJkrowgpfRv33M)\n\n## Features\n\n- **Dataset Loading**: Support for CSV and Excel files\n- **Dataset Preview**: Interactive table view with up to 1000 rows displayed\n- **Data Filtering**: Apply filters using various operators (==, !=, \u003e, \u003c, \u003e=, \u003c=, contains, startswith, endswith)\n- **Descriptive Statistics**: Comprehensive statistical analysis including:\n  - Mean, median, standard deviation, min, max\n  - Variance, skewness, kurtosis\n  - Missing value counts\n  - Categorical value distributions\n- **Visualization**: Multiple plot types:\n  - Histograms\n  - Box plots\n  - Scatter plots\n  - Correlation heatmaps\n  - Bar plots\n- **Hypothesis Testing**: Statistical tests including:\n  - Independent T-Test\n  - Paired T-Test\n  - Chi-Square Test\n  - ANOVA\n  - Correlation Test (Pearson \u0026 Spearman)\n- **Export Tools**:\n  - Export filtered data to CSV/Excel\n  - Export statistics to text file\n  - Save plots as PNG/PDF\n\n## Installation\n\n1. Clone the repository:\n```bash\ngit clone https://github.com/BaseMax/qt-dataset-explorer.git\ncd qt-dataset-explorer\n```\n\n2. 
Install required dependencies:\n```bash\npip install -r requirements.txt\n```\n\n## Usage\n\nRun the application:\n```bash\npython dataset_explorer.py\n```\n\n### Quick Start Guide\n\n1. **Load a Dataset**:\n   - Click the \"Load Dataset\" button\n   - Select a CSV or Excel file\n   - The dataset will be loaded and displayed in the Preview tab\n\n2. **Filter Data**:\n   - Go to the \"Preview \u0026 Filter\" tab\n   - Select a column from the dropdown\n   - Choose an operator (==, !=, \u003e, \u003c, contains, etc.)\n   - Enter a value to filter by\n   - Click \"Apply Filter\"\n   - Click \"Reset Filter\" to show all data again\n\n3. **View Statistics**:\n   - Go to the \"Statistics\" tab\n   - Statistics are automatically calculated when you load data\n   - Click \"Refresh Statistics\" to update after filtering\n\n4. **Generate Plots**:\n   - Go to the \"Plots\" tab\n   - Select a plot type (Histogram, Box Plot, Scatter Plot, etc.)\n   - Choose columns for X and Y axes (if applicable)\n   - Click \"Generate Plot\"\n   - Click \"Save Plot\" to export as PNG or PDF\n\n5. **Run Hypothesis Tests**:\n   - Go to the \"Hypothesis Testing\" tab\n   - Select a test type\n   - Choose columns to test\n   - Click \"Run Test\"\n   - View detailed results including p-values and conclusions\n\n6. **Export Data**:\n   - Click \"Export Data\" to save the filtered dataset\n   - Click \"Export Statistics\" to save the statistical summary\n\n## Sample Data\n\nA sample dataset (`sample_data.csv`) is included for testing the application. It contains employee information with columns: Name, Age, Gender, Salary, Department, and Experience.\n\n## Requirements\n\n- Python 3.9+\n- PyQt5 5.15.10\n- pandas 2.0.3\n- matplotlib 3.7.2\n- seaborn 0.12.2\n- scipy 1.11.2\n- numpy 1.24.3\n- openpyxl 3.1.2\n\n## License\n\nSee LICENSE file for details.\n\n## Author\n\n- GitHub: [@BaseMax](https://github.com/BaseMax)\n\n## Contributing\n\nContributions are welcome! 
Please feel free to submit a Pull Request.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbasemax%2Fqt-dataset-explorer","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fbasemax%2Fqt-dataset-explorer","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbasemax%2Fqt-dataset-explorer/lists"}