https://github.com/Rogdham/bigxml
Parse big xml files and streams with ease
https://github.com/Rogdham/bigxml
big library python xml xml-parser xml-parsing
Last synced: 3 months ago
JSON representation
Parse big xml files and streams with ease
- Host: GitHub
- URL: https://github.com/Rogdham/bigxml
- Owner: Rogdham
- License: mit
- Created: 2020-08-23T13:21:05.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2024-10-10T18:41:27.000Z (8 months ago)
- Last Synced: 2025-03-05T08:45:35.481Z (4 months ago)
- Topics: big, library, python, xml, xml-parser, xml-parsing
- Language: Python
- Homepage: https://bigxml.rogdham.net/
- Size: 5.58 MB
- Stars: 27
- Watchers: 2
- Forks: 3
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# BigXML
Parse big xml files and streams with ease
[](https://github.com/rogdham/bigxml/actions?query=branch:master)
[](https://pypi.org/project/bigxml/)
[](https://github.com/rogdham/bigxml/search?q=fail+under&type=Code)
[](https://mypy.readthedocs.io/)
[](https://github.com/Rogdham/bigxml/blob/master/LICENSE.txt)---
[📖 Documentation](https://bigxml.rogdham.net/)   |   [📃 Changelog](./CHANGELOG.md)
---
Parsing big XML files in Python is hard. On one hand, regular XML libraries load the
whole file into memory, which will crash the process if the file is too big. Other
solutions such as `iterparse` do read the file as they parse it, but they are complex to
use if you don't want to run out of memory.This is where the _BigXML_ library shines:
- Works with XML files of any size
- No need to do memory management yourself
- Pythonic API
- Any stream can easily be parsed, not just files
- Secure from usual attacks against XML parsers