https://github.com/elinorbgr/ai-project


Host: GitHub
URL: https://github.com/elinorbgr/ai-project
Owner: elinorbgr
Created: 2014-10-15T08:09:26.000Z (over 11 years ago)
Default Branch: master
Last Pushed: 2014-11-02T16:47:55.000Z (over 11 years ago)
Last Synced: 2025-09-04T15:49:50.842Z (10 months ago)
Language: Python
Size: 504 KB
Stars: 0
Watchers: 4
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md

README

Proverb Generator
=================

Depends on
----------

- the Python library NLTK: http://www.nltk.org/
- the C++ library lttoolbox: http://wiki.apertium.org/wiki/Lttoolbox

Building
--------

Run the ``build.sh`` script to build the ``libltpy.so`` library, which interfaces our Python program with lttoolbox.

You may need to adjust the script to match the location of your lttoolbox installation.

Using
-----

All functions are defined in the ``main.py`` file. To use them, start a Python shell in this folder and import them with ``from main import *``.

### Building the background graph

We cannot provide the background graph because it is quite large (the file is around 100 MB). To build it, we provide two functions:

```
add_file_to_background_graph(graph_file, input_file)
```

This function adds the entire contents of ``input_file`` to the graph stored in ``graph_file``. If the graph file doesn't exist, it is created.

```
add_words_to_background_graph(graph_file, words)
```

This function adds a list of words to the background graph in ``graph_file``. The word list is expected to be in the same format as the output of ``nltk.word_tokenize()``, that is, a list of token strings. If the graph file doesn't exist, it is created.
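To illustrate the expected shape of the word list, here is a minimal sketch that builds one without NLTK. The helper ``simple_tokenize`` is hypothetical and not part of ``main.py``. It only approximates ``nltk.word_tokenize()``, which handles contractions and other edge cases differently.

```python
import re

def simple_tokenize(text):
    """Rough stand-in for nltk.word_tokenize: returns a flat list of token strings."""
    # \w+ matches a run of word characters; [^\w\s] matches one punctuation mark,
    # so punctuation becomes its own token and whitespace is dropped.
    return re.findall(r"\w+|[^\w\s]", text)

words = simple_tokenize("A rolling stone gathers no moss.")
print(words)  # ['A', 'rolling', 'stone', 'gathers', 'no', 'moss', '.']

# With main.py importable, such a list could then be passed on:
# add_words_to_background_graph("proverbs.bgraph", words)
```

Any iterable of strings with this shape, including the NLTK corpus readers' ``words()`` output, should fit the same argument.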

To build the same graph as the one we used, run:

```
import nltk

# The Brown and Gutenberg corpora must be installed first, e.g. with nltk.download().
add_file_to_background_graph("proverbs.bgraph", "proverbsList.txt")
add_words_to_background_graph("proverbs.bgraph", nltk.corpus.brown.words())
add_words_to_background_graph("proverbs.bgraph", nltk.corpus.gutenberg.words())
```

### Using the generator

Once the background graph is created, you can create a generator object from it:

```
g = ProverbGenerator("proverbs.bgraph")
```

Creating the generator takes some time, because the graph is normalized while it is loaded.

Then, you can generate proverbs by giving the generator an input word:

```
g.generate("darkness")
```

It will output the proverb used as a grammatical basis, as well as the generated proverb.

Note: if the input word is not in the graph, or is linked to too few words, the generated proverb will likely be the same as the one used as a basis.
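To try several input words in one session, a small loop around ``generate`` may be convenient. The sketch below assumes only what is shown above: that the generator object has a ``generate`` method taking a single word. The helper ``generate_many`` is hypothetical and not part of ``main.py``. Since this README does not say whether ``generate`` returns its result or only prints it, the helper simply stores whatever comes back.

```python
def generate_many(generator, words):
    """Call generator.generate(word) for each word and collect the results."""
    results = {}
    for word in words:
        # generate() may print its output and return None; store whatever it returns.
        results[word] = generator.generate(word)
    return results

# Usage, assuming main.py is importable and the graph has been built:
# from main import ProverbGenerator
# g = ProverbGenerator("proverbs.bgraph")
# generate_many(g, ["darkness", "stone", "river"])
```

Reusing one generator object for all words avoids paying the graph loading and normalization cost more than once.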
