https://github.com/divvun/corpustools
Tools to manage and convert GiellaLT corpus files
https://github.com/divvun/corpustools
corpus linguistics xml
Last synced: 12 months ago
JSON representation
Tools to manage and convert GiellaLT corpus files
- Host: GitHub
- URL: https://github.com/divvun/corpustools
- Owner: divvun
- License: gpl-3.0
- Created: 2021-10-02T17:00:03.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2025-05-09T11:41:47.000Z (about 1 year ago)
- Last Synced: 2025-05-09T12:35:34.663Z (about 1 year ago)
- Topics: corpus, linguistics, xml
- Language: Python
- Homepage: https://giellalt.github.io/CorpusTools/
- Size: 23.5 MB
- Stars: 3
- Watchers: 30
- Forks: 0
- Open Issues: 6
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGES.txt
- License: LICENSE.txt
Awesome Lists containing this project
README
# Corpus Tools
Tools to manipulate a giellalt corpus in different ways.
## Install and update from Apertium nightly
These tools are a part of
[Apertium nightly packages](https://wiki.apertium.org/wiki/Installation/Developers).
For Mac users, running the `install-nightly.sh` suffices, Linux users will have
to run ` install `, as well (search for corpustools in
the package system).
To update the tools on Mac, run `install-nightly.sh`. On Linux, update packages
using the package manager.
## Install and update using pipx
- [Install pipx](https://pypa.github.io/pipx/installation/)
- Run `pipx install --force git+https://github.com/divvun/CorpusTools.git`
[API and manual pages](https://divvun.github.io/CorpusTools/)