https://github.com/groonga/wikipedia-search
Wikipedia search system on Groonga families.
https://github.com/groonga/wikipedia-search
Last synced: about 1 year ago
JSON representation
Wikipedia search system on Groonga families.
- Host: GitHub
- URL: https://github.com/groonga/wikipedia-search
- Owner: groonga
- License: cc0-1.0
- Created: 2014-04-03T06:51:20.000Z (about 12 years ago)
- Default Branch: master
- Last Pushed: 2023-05-18T00:49:13.000Z (about 3 years ago)
- Last Synced: 2025-04-10T01:08:21.842Z (about 1 year ago)
- Language: Shell
- Homepage:
- Size: 105 KB
- Stars: 2
- Watchers: 12
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# Wikipedia Search for Groonga family
It is a sample search system for Groonga family such as Groonga,
Droonga, Mroonga and PGroonga. It searches Wikipedia data.
You can use the search system for the following proposes:
* Understanding how to implement Droonga system
* Running benchmark for PGroonga versus pg\_bigm
## Usage
### For understanding how to implement Droonga system
TODO
### For running benchmark for PGroonga versus pg\_bigm
You need to set up clean CentOS environment. Run the following command
lines to run benchmark for PGroonga versus pg\_bigm:
```text
% sudo yum install -y git
% git clone https://github.com/groonga/wikipedia-search.git
% cd wikipedia-search
% benchmark/centos6/pgroonga.sh |& tee benchmark.log
```
The benchmark script does the followings:
* Installs PostgreSQL.
* Installs PGroonga.
* Installs pg\_bigm.
* Downloads Japanese Wikipedia data.
* Loads the Japanese Wikipedia data into PostgreSQL. (measured)
* Creates index by PGroonga. (measured)
* Creates index by pg\_bigm. (measured)
* Search by some queries by PGroonga. (measured)
* Search by some queries by pg\_bigm. (measured)
## About chef cookbooks
If you use Ubuntu 12.04LTS, you need to install depending library via apt:
% sudo apt-get install libgecode-dev
Then, install "berks":
% cd chef
% bundle install --path vendor/bundle
% bundle exec berks
## License
CC0 1.0 Universal. See LICENSE.txt for details.