https://github.com/time4tea/glossary

Last synced: over 1 year ago
JSON representation

Host: GitHub
URL: https://github.com/time4tea/glossary
Owner: time4tea
License: other
Created: 2014-11-17T23:33:59.000Z (over 11 years ago)
Default Branch: master
Last Pushed: 2017-06-22T21:07:30.000Z (almost 9 years ago)
Last Synced: 2025-01-10T03:58:28.699Z (over 1 year ago)
Language: JavaScript
Size: 32.2 KB
Stars: 1
Watchers: 2
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          
[![Build Status](https://travis-ci.org/time4tea/glossary.svg?branch=master)](https://travis-ci.org/time4tea/glossary)

# Find items in a glossary in some text

Example

-------

Your glossary contains the items:

* computer

* person

* electricity company

You would like to mark up the text:

Is it a computer or a person at the other end. Phoning the electricity company you would never know

```javascript

var glossary = new Glossary();

glossary.add("computer");

glossary.add("person");

glossary.add("electricity company");

var text = "Is it a computer or a person at the other end. Phoning the electricity company you would never know";

var result = glossary.prepare().gloss(text);

var glossarised = "";

result.accept({

  gloss: function (text) {

    glossarised += "[" + text + "]"

  },

  text: function (text) {

    glossarised += text;

  }

});

console.log(glossarised);

```

Gives the output

```

Is it a [computer] or a [person] at the other end. Phoning the [electricity company] you would never know

```

## So what, that's a tiny file! - not very exciting

The implementation, based on the Aho-Corasick text searching algorithm, should support reasonably large glossaries,

and run in a short amount of time for large texts.

See examples/microsoft.js for a more realistic example (on my laptop):

```

Loading 2451 definitions 

Text length 268,647

Parse in 320 ms

Found entries 1797, Text nodes 1709

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/time4tea/glossary

Awesome Lists containing this project

README