https://github.com/ajenhl/tacl
Tool for performing basic text analysis on the CBETA corpus
cbeta n-gram python
- Host: GitHub
- URL: https://github.com/ajenhl/tacl
- Owner: ajenhl
- License: gpl-3.0
- Created: 2012-09-26T19:21:33.000Z (over 13 years ago)
- Default Branch: master
- Last Pushed: 2023-09-06T01:12:10.000Z (almost 3 years ago)
- Last Synced: 2024-08-08T20:56:41.707Z (almost 2 years ago)
- Topics: cbeta, n-gram, python
- Language: Python
- Size: 1.29 MB
- Stars: 29
- Watchers: 9
- Forks: 8
- Open Issues: 11
Metadata Files:
- Readme: README.rst
- Changelog: CHANGES
README
TACL
====
tacl is a tool for performing basic text analysis on the texts
available from the `Chinese Buddhist Electronic Text Association`_
(CBETA). It is largely generic, however, and can operate with minor
modifications on other corpora.

The code is developed at https://github.com/ajenhl/tacl/ and the
documentation is available at http://tacl.readthedocs.io/en/latest/.

Installation
------------
Using `Python 3`_ (minimum version 3.5), either run
``pip install tacl`` or download the code manually and run
``python setup.py install``. The dependencies are installed
automatically when tacl is installed with ``pip``. Note, however, that
on Windows (and perhaps Mac OS X) the dependencies that have
non-Python components are very likely to fail to build because no
compiler is available. In that case, follow the instructions at
https://github.com/ajenhl/tacl/wiki/Installation.
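As a quick sanity check after installing, the following minimal sketch
tests the two preconditions described above: that the interpreter meets
the stated minimum version, and that a module can be found. It assumes
the installed package is importable under the name ``tacl``; the helper
names ``python_is_supported`` and ``tacl_is_installed`` are illustrative
and not part of tacl itself.

```python
import importlib.util
import sys

# Minimum Python version stated in the installation notes above.
MINIMUM_PYTHON = (3, 5)


def python_is_supported(version_info=sys.version_info):
    """Return True if the given interpreter version meets the minimum."""
    return tuple(version_info[:2]) >= MINIMUM_PYTHON


def tacl_is_installed():
    """Return True if a top-level module named ``tacl`` can be found.

    Assumption: the package installs an importable module called ``tacl``.
    """
    return importlib.util.find_spec("tacl") is not None


if __name__ == "__main__":
    print("Python supported:", python_is_supported())
    print("tacl importable:", tacl_is_installed())
```

If the second check reports ``False`` after a ``pip`` run that appeared
to succeed, a failed build of a compiled dependency is one possible
cause worth ruling out.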
Usage
-----
Run ``tacl -h`` for a listing of available subcommands, and
``tacl <subcommand> -h`` for help on a specific subcommand.
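The two help invocations can also be driven from a script. This is a
minimal sketch that only wraps the command line described above; it
assumes the ``tacl`` executable is on ``PATH``, and the helper names
``help_command`` and ``show_help`` are hypothetical. The subcommand
name used in the example call is a placeholder: take real names from
the output of ``tacl -h``.

```python
import shutil
import subprocess


def help_command(subcommand=None):
    """Build the argument list for top-level or per-subcommand help."""
    args = ["tacl"]
    if subcommand is not None:
        args.append(subcommand)
    args.append("-h")
    return args


def show_help(subcommand=None):
    """Run the help command and return its text, or None if tacl is absent."""
    if shutil.which("tacl") is None:
        return None  # the tacl executable is not on PATH
    result = subprocess.run(
        help_command(subcommand),
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        universal_newlines=True,
    )
    return result.stdout


if __name__ == "__main__":
    # "ngrams" is a placeholder subcommand name used for illustration.
    for command in (help_command(), help_command("ngrams")):
        print(" ".join(command))
```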
.. _Chinese Buddhist Electronic Text Association: http://www.cbeta.org/
.. _Python 3: http://www.python.org/
.. _SQLite: http://www.sqlite.org/
.. _biopython: http://biopython.org/
.. _lxml: http://lxml.de/
.. _pandas: http://pandas.pydata.org/