https://github.com/georgevasiliadis/greparl
Grep the Greek Parliament
https://github.com/georgevasiliadis/greparl
flask greek-parliament information-retrieval python search-engine web-app
Last synced: about 2 months ago
JSON representation
Grep the Greek Parliament
- Host: GitHub
- URL: https://github.com/georgevasiliadis/greparl
- Owner: GeorgeVasiliadis
- License: mit
- Created: 2021-12-20T00:14:27.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2022-02-04T09:30:51.000Z (over 4 years ago)
- Last Synced: 2025-03-13T08:55:39.720Z (over 1 year ago)
- Topics: flask, greek-parliament, information-retrieval, python, search-engine, web-app
- Language: Python
- Homepage: https://pypi.org/project/GreParl/
- Size: 828 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# GreParl
A simple web-app-based Search Engine built on top of the Greek Parliament Proceedings

## Name
grep + Greek Parliament
## Install - GreParl Package
1. Run (ideally in a fresh venv) `pip install greparl`
1. Download the required data files and, if needed, decompress them in the desired directory (see below)
## Install - Data File Dependencies
The required data files that **are not shipped** along with the package include the Search Engine's indices, the parliament proceedings' file and some other tasty stuff.
- The raw proceedings can be downloaded here: [speeches.csv](https://aristotleuniversity-my.sharepoint.com/personal/papadopo_office365_auth_gr/_layouts/15/onedrive.aspx?id=%2Fpersonal%2Fpapadopo%5Foffice365%5Fauth%5Fgr%2FDocuments%2FGreek%5FParliament%5FProceedings%5F1989%5F2020%2Ezip&parent=%2Fpersonal%2Fpapadopo%5Foffice365%5Fauth%5Fgr%2FDocuments). The extracted file should be renamed to `speeches.csv`
- The Search Engine's core can be downloaded here: [information-retrieval.tar.gz](https://aristotleuniversity-my.sharepoint.com/:u:/g/personal/grammenot_office365_auth_gr/EQEWl2-R7n9EnREy6VUDKesBFRaFfdMKeLiVIBblMMYKcA?e=wxaswN)
> Those files should be decompressed in the same directory from which the user will run the GreParl.
Alternatively, all required files (apart from `speeches.csv`) can be auto-generated.
## Run
1. Activate venv
1. Run `greparl` or `python -m greparl` and wait for signs of life..
1. The default browser should open up automatically, but if not, browse to "http://127.0.0.1:5000/" manually
---
## Features
### Search

You can either search for a specific speech, or preview a random one (totally original..).
### Results

You can preview the speeches in Results page. No pagination is available at time.
### Deep Search

You can perform a deeper search which will return speeches that are _similar_ not _identical_ to query.
### Speech View

You can read a specific speech and/or its metadata. Also, in Speech page, a shortcut is provided for highlighting the current speech.
## Highlights

You can find the most important keywords of a specific speech or set of speeches.

Speech sets can be grouped by parties or parliament members and can be limited using date ranges.

## Similarities

You can compare parliament members to find out who tend to speak about the same topics the most.
## Προβλέψεις

You can predict the party that is likely to have said an arbitrary phrase of choice.

## Important Notes
The author of this package is **not** the creator of Search Engine's core. All credits should go to [Theodoros Grammenos' work](https://github.com/teogramm/ir-proj-priv). This project is just a graphical wrapper, trying to make life easier :D
Also, note that this projects ships a modified version of alup's [greek_stemmer](https://github.com/alup/python_greek_stemmer), which is originally distributed under MIT License.