https://github.com/welfare-state-analytics/pyparlaclarin
Python package for processing Parla-Clarin XML files.
- Host: GitHub
- URL: https://github.com/welfare-state-analytics/pyparlaclarin
- Owner: welfare-state-analytics
- Created: 2021-07-27T12:05:58.000Z (almost 5 years ago)
- Default Branch: main
- Last Pushed: 2024-04-15T13:07:07.000Z (about 2 years ago)
- Last Synced: 2026-01-24T07:30:02.076Z (5 months ago)
- Topics: lxml, parla-clarin, tei-xml, xml
- Language: Python
- Homepage: https://pypi.org/project/pyparlaclarin/
- Size: 728 KB
- Stars: 0
- Watchers: 2
- Forks: 1
- Open Issues: 2
Metadata Files:
- Readme: README.md
# Pyparlaclarin
This module includes functionality for reading, creating, and modifying Parla-Clarin XML files.
For instance, you can loop over all paragraphs in a Parla-Clarin file with a simple function:
```python
from pyparlaclarin.read import paragraph_iterator

# `root` is assumed to be the root element of an already-parsed Parla-Clarin XML tree
for paragraph in paragraph_iterator(root):
    print(paragraph)
```
Or get all speeches by a given speaker:
```python
from pyparlaclarin.read import speeches_with_name

for speech in speeches_with_name(root, name="barack_obama_1961"):
    print(speech)
```
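Both snippets above take a `root` that is not defined in the examples. The following is a minimal, self-contained sketch of what such iteration looks like on a TEI-style document, using only the standard library's `xml.etree.ElementTree` as a stand-in. It is not pyparlaclarin's implementation: the sample document, the helper names `iter_paragraphs` and `iter_speeches_by`, and the bare `who` value are assumptions made for illustration. It relies on the Parla-Clarin convention of marking utterances as `<u who="...">` containing `<seg>` segments in the TEI namespace.

```python
import xml.etree.ElementTree as ET

TEI_NS = "{http://www.tei-c.org/ns/1.0}"

# Made-up miniature document, for illustration only.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <text><body><div>
    <u who="barack_obama_1961">
      <seg>Thank you.</seg>
      <seg>Please be seated.</seg>
    </u>
    <u who="someone_else_1950">
      <seg>Point of order.</seg>
    </u>
  </div></body></text>
</TEI>"""

# In practice the root would come from parsing a file on disk.
root = ET.fromstring(SAMPLE)


def normalize(element):
    """Join all text inside an element and collapse runs of whitespace."""
    return " ".join("".join(element.itertext()).split())


def iter_paragraphs(root):
    """Hypothetical helper: yield the text of every <seg> in document order."""
    for seg in root.iter(TEI_NS + "seg"):
        yield normalize(seg)


def iter_speeches_by(root, name):
    """Hypothetical helper: yield the text of each <u> whose `who` equals `name`."""
    for u in root.iter(TEI_NS + "u"):
        if u.get("who") == name:
            yield normalize(u)


paragraphs = list(iter_paragraphs(root))
speeches = list(iter_speeches_by(root, "barack_obama_1961"))
print(paragraphs)
print(speeches)
```

The library's own functions may return different objects or match speakers differently, so treat this only as a picture of the underlying document structure.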
Further documentation is available on [GitHub Pages](https://welfare-state-analytics.github.io/pyparlaclarin/pyparlaclarin/).