{"id":13600684,"url":"https://github.com/dexplo/dexplo","last_synced_at":"2025-07-29T10:42:31.909Z","repository":{"id":62567940,"uuid":"118955784","full_name":"dexplo/dexplo","owner":"dexplo","description":"Data exploration library with a pandas-like API","archived":false,"fork":false,"pushed_at":"2020-06-16T12:57:14.000Z","size":1069,"stargazers_count":74,"open_issues_count":0,"forks_count":7,"subscribers_count":7,"default_branch":"master","last_synced_at":"2025-07-04T15:12:47.477Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"bsd-3-clause","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dexplo.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-01-25T19:08:59.000Z","updated_at":"2024-01-04T16:20:17.000Z","dependencies_parsed_at":"2022-11-03T16:30:43.423Z","dependency_job_id":null,"html_url":"https://github.com/dexplo/dexplo","commit_stats":null,"previous_names":[],"tags_count":9,"template":false,"template_full_name":null,"purl":"pkg:github/dexplo/dexplo","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dexplo%2Fdexplo","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dexplo%2Fdexplo/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dexplo%2Fdexplo/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dexplo%2Fdexplo/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dexplo","download_url":"https://codeload.github.com/dexplo/dexplo/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dexplo%2Fdexplo/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":267673275,"owners_count":24125707,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-07-29T02:00:12.549Z","response_time":2574,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-08-01T18:00:46.566Z","updated_at":"2025-07-29T10:42:31.856Z","avatar_url":"https://github.com/dexplo.png","language":"Python","readme":"# dexplo\n\n[![](https://img.shields.io/pypi/v/dexplo)](https://pypi.org/project/dexplo)\n[![Build Status](https://travis-ci.org/dexplo/dexplo.svg?branch=master)](https://travis-ci.org/dexplo/dexplo)\n[![PyPI - License](https://img.shields.io/pypi/l/dexplo)](LICENSE)\n\nA data analysis library comparable to pandas\n\n# Installation\nYou must have cython installed. Run `python setup.py build_ext --use-cython -i`\n\n# Main Goals\n\n* A minimal set of features \n* Be as explicit as possible\n* There should be one-- and preferably only one --obvious way to do it.\n\n### Data Structures\n* Only DataFrames\n* No Series\n\n### Only Scalar Data Types\nAll data types allow nulls\n- [x] bool - always 8 bits\n- [x] int\n- [x] float\n- [x] str - stored as a categorical\n- [x] datetime\n- [x] timedelta\n\n### Column Labels\n* No hierarchical index\n* Column names must be strings\n* Column names must be unique\n\n### Row Labels\n* No row labels for now\n* Only a number display on the output\n\n### Subset Selection\n* Only one way to select data - `[ ]`\n* Subset selection will be explicit and necessitate both rows and columns\n* Rows will be selected only by integer location\n* Columns will be selected by either label or integer location. Since columns must be strings, this will not be amibguous\n* Slice notation is also OK\n\n### Development\n* Must use type hints\n* Must use 3.6+ - fstrings\n* numpy\n\n### Advantages over pandas\n* Easier to write idiomatically\n* String processing will be much faster\n* Nulls allowed in each data type\n* Nearly all operations will be faster\n\n## API\n\n#### Attributes\n\n- [x] size\n- [x] shape\n- [x] values\n- [x] dtypes\n\n#### Methods\n\n**Stats**\n\n- [x] abs\n- [x] all\n- [x] any\n- [x] argmax\n- [x] argmin\n- [x] clip\n- [x] corr\n- [x] count\n- [x] cov\n- [x] cummax\n- [x] cummin\n- [x] cumprod\n- [x] cumsum\n- [x] describe\n- [x] max\n- [x] min\n- [x] median\n- [x] mean\n- [x] mode\n- [x] nlargest\n- [x] nsmallest\n- [x] prod\n- [x] quantile\n- [x] rank\n- [x] round\n- [x] std\n- [x] streak\n- [x] sum\n- [x] var\n- [x] unique\n- [x] nunique\n- [x] value_counts\n\n**Selection**\n\n- [x] drop\n- [x] head\n- [x] isin\n- [x] rename\n- [x] sample\n- [x] select_dtypes\n- [x] tail\n- [x] where\n\n**Missing Data**\n\n- [x] isna\n- [x] dropna\n- [x] fillna\n- [ ] interpolate\n\n**Other**\n\n- [x] append\n- [x] astype\n- [x] factorize\n- [x] groupby\n- [x] iterrows\n- [ ] join\n- [x] melt\n- [x] pivot\n- [x] replace\n- [x] rolling\n- [x] sort_values\n- [x] to_csv\n\n**Other (after 0.1 release)**\n- [ ] cut\n- [ ] plot\n- [ ] profile\n\n**Functions**\n\n- [x] read_csv\n- [ ] read_sql\n- [ ] concat\n\n**Group By** - specifically with `groupby` method\n\n- [x] agg\n- [x] all\n- [x] apply\n- [x] any\n- [x] corr\n- [x] count\n- [x] cov\n- [x] cumcount\n- [x] cummax\n- [x] cummin\n- [x] cumsum\n- [x] cumprod\n- [x] head\n- [x] first\n- [ ] fillna\n- [x] filter\n- [x] last\n- [x] max\n- [x] median\n- [x] min\n- [x] ngroups\n- [x] nunique\n- [x] prod\n- [ ] quantile\n- [ ] rank\n- [ ] rolling\n- [x] size\n- [x] sum\n- [x] tail\n- [x] var\n\n**str** - `df.str.\u003cmethod\u003e`\n\n- [x] capitalize\n- [x] cat\n- [x] center\n- [x] contains      \n- [x] count         \n- [x] endswith      \n- [x] find         \n- [x] findall\n- [x] get           \n- [x] get_dummies\n- [x] isalnum\n- [x] isalpha\n- [x] isdecimal\n- [x] isdigit\n- [x] islower\n- [x] isnumeric\n- [x] isspace\n- [x] istitle\n- [x] isupper\n- [x] join\n- [x] len\n- [x] ljust\n- [x] lower         \n- [x] lstrip\n- [x] partition\n- [x] repeat\n- [x] replace\n- [x] rfind\n- [x] rjust\n- [x] rpartition\n- [x] rsplit\n- [x] rstrip\n- [x] slice\n- [x] slice_replace\n- [x] split\n- [x] startswith\n- [x] strip\n- [x] swapcase\n- [x] title\n- [x] translate\n- [x] upper         \n- [x] wrap\n- [x] zfill\n\n**dt** - `df.dt.\u003cmethod\u003e`\n\n- [x] ceil\n- [x] day\n- [x] day_of_week\n- [x] day_of_year\n- [x] days_in_month\n- [x] floor\n- [ ] freq\n- [x] hour\n- [x] is_leap_year\n- [x] is_month_end\n- [x] is_month_start\n- [x] is_quarter_end\n- [x] is_quarter_start\n- [x] is_year_end\n- [x] is_year_start\n- [x] microsecond\n- [x] millisecond\n- [x] minute\n- [x] month\n- [x] nanosecond\n- [x] quarter\n- [x] round\n- [x] second\n- [x] strftime\n- [x] to_pydatetime\n- [x] to_pytime\n- [ ] tz\n- [ ] tz_convert\n- [ ] tz_localize\n- [x] weekday_name\n- [x] week_of_year\n- [x] year\n\n**td** - `df.td.\u003cmethod\u003e`\n\n- [ ] ceil\n- [ ] components\n- [x] days\n- [ ] floor\n- [ ] freq\n- [x] microseconds\n- [x] milliseconds\n- [x] nanoseconds\n- [ ] round\n- [x] seconds\n- [ ] to_pytimedelta","funding_links":[],"categories":["Libraries"],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdexplo%2Fdexplo","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdexplo%2Fdexplo","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdexplo%2Fdexplo/lists"}