{"id":18672248,"url":"https://github.com/toucantoco/fastexcel","last_synced_at":"2026-01-20T13:00:59.093Z","repository":{"id":60926278,"uuid":"546585573","full_name":"ToucanToco/fastexcel","owner":"ToucanToco","description":"A fast excel reader for Rust and Python","archived":false,"fork":false,"pushed_at":"2026-01-13T10:56:44.000Z","size":50320,"stargazers_count":207,"open_issues_count":25,"forks_count":18,"subscribers_count":10,"default_branch":"main","last_synced_at":"2026-01-13T13:33:17.968Z","etag":null,"topics":["arrow","pandas","polars","python","rust"],"latest_commit_sha":null,"homepage":"http://fastexcel.toucantoco.dev/","language":"Rust","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ToucanToco.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2022-10-06T10:08:35.000Z","updated_at":"2026-01-13T10:30:48.000Z","dependencies_parsed_at":"2025-11-27T09:10:46.760Z","dependency_job_id":null,"html_url":"https://github.com/ToucanToco/fastexcel","commit_stats":{"total_commits":79,"total_committers":6,"mean_commits":"13.166666666666666","dds":0.5569620253164558,"last_synced_commit":"6b29b24340457b488f5f317d52e94fa21326659c"},"previous_names":[],"tags_count":38,"template":false,"template_full_name":null,"purl":"pkg:github/ToucanToco/fastexcel","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ToucanToco%2Ffastexcel","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ToucanToco%2Ffastexcel/t
ags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ToucanToco%2Ffastexcel/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ToucanToco%2Ffastexcel/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ToucanToco","download_url":"https://codeload.github.com/ToucanToco/fastexcel/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ToucanToco%2Ffastexcel/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28603402,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-20T12:01:53.233Z","status":"ssl_error","status_checked_at":"2026-01-20T12:01:46.545Z","response_time":117,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["arrow","pandas","polars","python","rust"],"created_at":"2024-11-07T09:10:10.345Z","updated_at":"2026-01-20T13:00:59.080Z","avatar_url":"https://github.com/ToucanToco.png","language":"Rust","funding_links":[],"categories":[],"sub_categories":[],"readme":"# `fastexcel`\n\nA fast excel file reader for Python and Rust.\n\nDocs:\n * [Python](https://fastexcel.toucantoco.dev/).\n * [Rust](https://docs.rs/fastexcel).\n\n## Stability\n\nThe Python library is considered production-ready. The API is mostly stable, and we avoid breaking changes as much as\npossible. 
v1.0.0 will be released once the [milestone](https://github.com/ToucanToco/fastexcel/milestone/2) is reached.\n\n\u003e ⚠️ The free-threaded build is still considered experimental.\n\nThe Rust crate is still experimental, and breaking changes are to be expected.\n\n## Installation\n\n```bash\n# Lightweight installation (no PyArrow dependency)\npip install fastexcel\n\n# With Polars support only (no PyArrow needed)\npip install fastexcel[polars]\n\n# With Pandas support (includes PyArrow)\npip install fastexcel[pandas]\n\n# With PyArrow support\npip install fastexcel[pyarrow]\n\n# With all integrations\npip install fastexcel[pandas,polars]\n```\n\n## Quick Start\n\n### Modern usage (recommended)\n\nFastExcel supports the [Arrow PyCapsule Interface](https://arrow.apache.org/docs/format/CDataInterface/PyCapsuleInterface.html) for zero-copy data exchange with Polars and any other Arrow-compatible library, without requiring pyarrow as a dependency.\n\n```python\nimport fastexcel\nimport polars as pl\n\n# Load an Excel file\nreader = fastexcel.read_excel(\"data.xlsx\")\nsheet = reader.load_sheet(0)  # Load first sheet\n\n# Use with Polars (zero-copy, no pyarrow needed)\ndf = pl.DataFrame(sheet)  # Direct PyCapsule interface\nprint(df)\n\n# Or use the to_polars() method (also via PyCapsule)\ndf = sheet.to_polars()\nprint(df)\n\n# Or access the raw Arrow data via PyCapsule interface\nschema = sheet.__arrow_c_schema__()\narray_data = sheet.__arrow_c_array__()\n```\n\n### Traditional usage (with pandas/pyarrow)\n\n```python\nimport fastexcel\n\nreader = fastexcel.read_excel(\"data.xlsx\")\nsheet = reader.load_sheet(0)\n\n# Convert to pandas (requires the `pandas` extra)\ndf = sheet.to_pandas()\n\n# Or get a pyarrow RecordBatch directly\nrecord_batch = sheet.to_arrow()\n```\n\n### Working with tables\n\n```python\nimport fastexcel\nimport polars as pl\n\nreader = fastexcel.read_excel(\"data.xlsx\")\n\n# List available tables\ntables = 
reader.table_names()\nprint(f\"Available tables: {tables}\")\n\n# Load a specific table\ntable = reader.load_table(\"MyTable\")\ndf = pl.DataFrame(table)  # Zero-copy via PyCapsule, no pyarrow needed\n```\n\n## Key Features\n\n- **Zero-copy data exchange** via [Arrow PyCapsule Interface](https://arrow.apache.org/docs/format/CDataInterface/PyCapsuleInterface.html)\n- **Flexible dependencies** - use with Polars (no PyArrow needed) or Pandas (includes PyArrow)\n- **Seamless Polars integration** - `pl.DataFrame(sheet)` and `sheet.to_polars()` work without PyArrow via PyCapsule interface\n- **High performance** - written in Rust with [calamine](https://github.com/tafia/calamine) and [Apache Arrow](https://arrow.apache.org/)\n- **Memory efficient** - lazy loading and optional eager evaluation\n- **Type safety** - automatic type inference with manual override options\n\n## Contributing \u0026 Development\n\n### Prerequisites\n\nYou'll need:\n1. **[Rust](https://rustup.rs/)** - Rust stable or nightly\n2. **[uv](https://docs.astral.sh/uv/getting-started/installation/)** - Fast Python package manager (will install Python 3.10+ automatically)\n3. **[git](https://git-scm.com/)** - For version control\n4. **[make](https://www.gnu.org/software/make/)** - For running development commands\n\n**Python Version Management:**\nuv handles Python installation automatically. 
To use a specific Python version:\n```bash\nuv python install 3.13  # Install Python 3.13\nuv python pin 3.13      # Pin project to Python 3.13\n```\n\n### Quick Start\n\n```bash\n# Clone the repository (or your fork)\ngit clone https://github.com/ToucanToco/fastexcel.git\ncd fastexcel\n\n# First-time setup: install dependencies, build debug version, and set up pre-commit hooks\nmake setup-dev\n```\n\nVerify your installation by running:\n\n```bash\nmake\n```\n\nThis runs a full development cycle: formatting, building, linting, and testing.\n\n### Development Commands\n\nRun `make help` to see all available commands, or use these common ones:\n\n```bash\nmake all          # full dev cycle: format, build, lint, test\nmake install      # install with debug build (daily development)\nmake install-prod # install with release build (benchmarking)\nmake test         # run the tests\nmake lint         # run the linter\nmake format       # format Python and Rust code\nmake doc-serve    # serve the documentation locally\n```\n\n### Useful Resources\n\n* [`python/fastexcel/_fastexcel.pyi`](./python/fastexcel/_fastexcel.pyi) - Python API types\n* [`python/tests/`](./python/tests) - Comprehensive usage examples\n\n## Benchmarking\n\nFor benchmarking, use `make benchmarks`, which automatically builds an optimised wheel.\nAn optimised build is required for profiling, as dev mode builds are much slower.\n\n### Speed benchmarks\n```bash\nmake benchmarks\n```\n\n### Memory profiling\n```bash\nmprof run -T 0.01 python python/tests/benchmarks/memory.py python/tests/benchmarks/fixtures/plain_data.xls\n```\n\n## Creating a release\n\n1. Create a PR containing a commit that only updates the version in `Cargo.toml`.\n2. Once it is approved, squash and merge it into main.\n3. Tag the squashed commit and push the tag.\n4. The `release` GitHub action will take care of the rest.\n\n## Dev tips\n\n* Use `cargo check` to verify that your Rust code compiles; there is no need to go through `maturin` every time\n* `cargo clippy` = 💖\n* Be careful with Arrow constructors: they tend to allocate a lot\n* [`mprof`](https://github.com/pythonprofilers/memory_profiler) and `time` go a long way for perf checks;\n  no need to go fancy right from the start\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftoucantoco%2Ffastexcel","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftoucantoco%2Ffastexcel","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftoucantoco%2Ffastexcel/lists"}