Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/graehl/carmel
finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests
https://github.com/graehl/carmel
Last synced: 3 months ago
JSON representation
finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests
- Host: GitHub
- URL: https://github.com/graehl/carmel
- Owner: graehl
- License: other
- Created: 2010-06-25T14:15:37.000Z (over 14 years ago)
- Default Branch: master
- Last Pushed: 2022-10-14T20:45:04.000Z (over 2 years ago)
- Last Synced: 2024-08-02T06:19:07.224Z (6 months ago)
- Language: C++
- Homepage:
- Size: 43.1 MB
- Stars: 41
- Watchers: 7
- Forks: 13
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
## Carmel finite-state toolkit - J. Graehl
What is this thing? See http://www.isi.edu/licensed-sw/carmel/carmel-tutorial2.pdf
(carmel includes EM and gibbs-sampled (pseudo-Bayesian) training)
(see `carmel/LICENSE` - free for research/non-commercial)
(see `carmel/README` and `carmel/carmel-tutorial`).
## Building from source
Recommended: cmake
```
cmake -B build . -G "Unix Makefiles" && cmake --build build
# recommend -G Ninja instead```
from Makefile (some LDFLAGS CFLAGS customization may be needed to find libs)
# mac
brew install [email protected] # other version are fine too
cd carmel; make -j 4 carmel BOOST_SUFFIX=-mt
# BOOST_SUFFIX= depends on how your boost libraries are installed - ls /usr/lib/libboost*.so# centos 7
sudo yum install boost169-devel
cd carmel; make -j 4 carmel BOOST_INCLUDEDIR=/usr/include/boost169 LDFLAGS+=" -L/usr/lib64/boost169" BOOST_SUFFIX:=```
(prerequisites: GNU Make (3.8) C++17 capable compiler (GCC 8, or
visual studio 2017 will do) and [Boost](http://boost.org), which you
probably already have on your linux system; for Mac, you can get them
from [Homebrew](http://brew.sh/). Native windows builds should work;
you can also use cygwin or mingw.### `make` options
If your system doesn't support static linking, `make NOSTATIC=1`
If you're trying to modify or troubleshoot the build, take a look at
`graehl/shared/graehl.mk` as well as `carmel/Makefile`; you shouldn't need to
manually run `make depend`.## Subdirectories
* `carmel`: finite state transducer toolkit with EM and gibbs-sampled
(pseudo-Bayesian) training* `forest-em`: derivation forests EM and gibbs (dirichlet prior bayesian) training
* `graehl/shared`: utility C++/Make libraries used by carmel and forest-em
* `gextract`: some python bayesian syntax MT rule inference
* `sblm`: some simple pcfg (e.g. penn treebank parses, but preferably binarized)
* `clm`: some class-based LM feature? I forget.
* `cipher`: some word-class discovery and unsupervised decoding of simple
probabilistic substitution cipher (uses carmel, but look to the tutorial in
carmel/ first)* `util`: misc shell/perl scripts