Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dilshod/xlsx2csv
Convert xslx to csv, it is fast, and works for huge xlsx files
https://github.com/dilshod/xlsx2csv
Last synced: 5 days ago
JSON representation
Convert xslx to csv, it is fast, and works for huge xlsx files
- Host: GitHub
- URL: https://github.com/dilshod/xlsx2csv
- Owner: dilshod
- License: mit
- Created: 2010-03-20T15:29:38.000Z (almost 15 years ago)
- Default Branch: master
- Last Pushed: 2025-01-06T07:04:50.000Z (13 days ago)
- Last Synced: 2025-01-07T18:44:59.966Z (12 days ago)
- Language: Python
- Homepage:
- Size: 570 KB
- Stars: 1,687
- Watchers: 46
- Forks: 307
- Open Issues: 132
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG
- License: LICENSE.txt
Awesome Lists containing this project
- awesome-data-wrangling - xlsx2csv
- awesome-python-again -
README
# xlsx2csv
> xlsx to csv converter (http://github.com/dilshod/xlsx2csv)
Converts xlsx files to csv format.
Handles large XLSX files. Fast and easy to use.## Tested(supported) python versions:
- 2.4
- 2.7
- 3.4 to 3.13## Installation:
```sh
sudo easy_install xlsx2csv
```
or```sh
pip install xlsx2csv
```Also, works standalone with only the *xlsx2csv.py* script
**Usage:**
```
xlsx2csv.py [-h] [-v] [-a] [-c OUTPUTENCODING] [-s SHEETID]
[-n SHEETNAME] [-d DELIMITER] [-l LINETERMINATOR]
[-f DATEFORMAT] [--floatformat FLOATFORMAT]
[-i] [-e] [-p SHEETDELIMITER]
[--hyperlinks]
[-I INCLUDE_SHEET_PATTERN [INCLUDE_SHEET_PATTERN ...]]
[-E EXCLUDE_SHEET_PATTERN [EXCLUDE_SHEET_PATTERN ...]] [-m]
xlsxfile [outfile]
```
**positional arguments:**
```
xlsxfile xlsx file path, use '-' to read from STDIN
outfile output csv file path, or directory if -s 0 is specified
```
**optional arguments:**
```
-h, --help show this help message and exit
-v, --version show program's version number and exit
-a, --all export all sheets
-c OUTPUTENCODING, --outputencoding OUTPUTENCODING
encoding of output csv ** Python 3 only ** (default: utf-8)
-s SHEETID, --sheet SHEETID
sheet number to convert, 0 for all
-n SHEETNAME, --sheetname SHEETNAME
sheet name to convert
-d DELIMITER, --delimiter DELIMITER
delimiter - columns delimiter in csv, 'tab' or 'x09'
for a tab (default: comma ',')
-l LINETERMINATOR, --lineterminator LINETERMINATOR
line terminator - lines terminator in csv, '\n' '\r\n'
or '\r' (default: os.linesep)
-f DATEFORMAT, --dateformat DATEFORMAT
override date/time format (ex. %Y/%m/%d)
--floatformat FLOATFORMAT
override float format (ex. %.15f)
-i, --ignoreempty skip empty lines
-e, --escape Escape \r\n\t characters
-p SHEETDELIMITER, --sheetdelimiter SHEETDELIMITER
sheet delimiter used to separate sheets, pass '' if
you do not need delimiter, or 'x07' or '\\f' for form
feed (default: '--------')
-q QUOTING, --quoting QUOTING
field quoting, 'none' 'minimal' 'nonnumeric' or 'all' (default: 'minimal')
--hyperlinks, --hyperlinks
include hyperlinks
-I INCLUDE_SHEET_PATTERN [INCLUDE_SHEET_PATTERN ...], --include_sheet_pattern INCLUDE_SHEET_PATTERN [INCLUDE_SHEET_PATTERN ...]
only include sheets named matching given pattern, only
effects when -a option is enabled.
-E EXCLUDE_SHEET_PATTERN [EXCLUDE_SHEET_PATTERN ...], --exclude_sheet_pattern EXCLUDE_SHEET_PATTERN [EXCLUDE_SHEET_PATTERN ...]
exclude sheets named matching given pattern, only
effects when -a option is enabled.
-m, --merge-cells merge cells
```Usage with folder containing multiple `xlxs` files:
```
python xlsx2csv.py /path/to/input/dir /path/to/output/dir
```
will output each file in the input dir converted to `.csv` in the output dir. If omitting the output dir it will output the converted files in the input dirUsage from within Python:
```
from xlsx2csv import Xlsx2csv
Xlsx2csv("myfile.xlsx", outputencoding="utf-8").convert("myfile.csv")
```Expat SAX parser used for xml parsing.
See alternatives:
Perl:
https://metacpan.org/dist/Spreadsheet-Read/view/scripts/xlsx2csv
Bash:
http://kirk.webfinish.com/?p=91Python:
http://github.com/staale/python-xlsx
http://github.com/leegao/pyXLSXRuby:
http://roo.rubyforge.org/Java:
http://poi.apache.org/## Meta
Dilshod Temirkhdojaev – [email protected]
Distributed under the MIT LICENSE. See ``LICENSE`` for more information.
[https://github.com/dilshod](https://github.com/dilshod)