https://github.com/hsm207/pandas_row_iteration
Why itertuples() is faster than iterrows()
https://github.com/hsm207/pandas_row_iteration
blog blogging datascience pandas profiling
Last synced: 7 months ago
JSON representation
Why itertuples() is faster than iterrows()
- Host: GitHub
- URL: https://github.com/hsm207/pandas_row_iteration
- Owner: hsm207
- Created: 2019-10-19T10:57:09.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2019-10-19T19:43:27.000Z (almost 6 years ago)
- Last Synced: 2025-01-14T13:36:18.373Z (9 months ago)
- Topics: blog, blogging, datascience, pandas, profiling
- Language: Jupyter Notebook
- Size: 29.3 KB
- Stars: 1
- Watchers: 4
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Introduction
This repo contains code to accompany my blog post titled [Why Pandas itertuples() Is Faster Than iterrows() and How To Make It Even Faster](https://medium.com/@_init_/why-pandas-itertuples-is-faster-than-iterrows-and-how-to-make-it-even-faster-bc50c0edd30d)
# Usage
There are only two notebooks, namely:
* [pandas_row_iteration_analysis](./pandas_row_iteration_analysis.ipynb): This contains all the timings and profiles for the various methods described in the blog post.
* [itertuple_stats](itertuple_stats.ipynb): This contains a more detailed analysis of the profile for the `itertuples()` solution.