Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/johnsnook/yii2-parsel
Parses a boolean user query and adds the terms and logic to a provided Query object.
https://github.com/johnsnook/yii2-parsel
Last synced: about 2 months ago
JSON representation
Parses a boolean user query and adds the terms and logic to a provided Query object.
- Host: GitHub
- URL: https://github.com/johnsnook/yii2-parsel
- Owner: johnsnook
- License: other
- Created: 2018-07-19T18:15:36.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2018-08-26T01:16:14.000Z (over 6 years ago)
- Last Synced: 2024-11-15T23:29:47.163Z (about 2 months ago)
- Language: PHP
- Size: 737 KB
- Stars: 3
- Watchers: 2
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
README
# Yii2 Parsel
Allows developers to provide a boolean search query interface, similar to Google or Sphinx search or other full-text search (FTS) engines.
Turns a user query like '`georgia -(atlanta or decatur)`' into '`georgia AND NOT (atlanta or decatur)`' which is then turn into the follow SQL:
```sql
SELECT
"ip", /* ip address */
"visits", /* how many requests they've made */
"city",
"region"
FROM
/* A table similar to apaches access log. See my extension yii2-ipFilter */
"visitor"
WHERE
(
("visitor"."ip" ILIKE '%georgia%')
OR ("visitor"."city" ILIKE '%georgia%')
OR ("visitor"."region" ILIKE '%georgia%')
)
AND ( /** marvel as we efortlessly generate a subquery */
"ip" NOT IN (
SELECT
"ip"
FROM
"visitor"
WHERE
(
("visitor"."ip" ILIKE '%atlanta%')
OR ("visitor"."city" ILIKE '%atlanta%')
OR ("visitor"."region" ILIKE '%atlanta%')
)
OR (
("visitor"."ip" ILIKE '%decatur%')
OR ("visitor"."city" ILIKE '%decatur%')
OR ("visitor"."region" ILIKE '%decatur%')
)
)
)
```
Example results:| Ip | Visits | City | Region |
| -------------- | ------ | ------------- | ------- |
| 107.77.232.216 | 16 | | Georgia |
| 107.77.235.199 | 3 | | Georgia |
| 174.218.142.27 | 1 | Lawrenceville | Georgia |
| 107.77.233.225 | 18 | | Georgia |
| 205.201.132.14 | 42 | Woodstock | Georgia |
| 192.3.160.15 | 4 | Douglas | Georgia |## Installation
The preferred way to install this extension is through [composer](http://getcomposer.org/download/).
Either run
```
php composer.phar require --prefer-dist johnsnook/yii2-parsel "*"
```or add
```
"johnsnook/yii2-parsel": "*"
```to the require section of your `composer.json` file.
## Usage
> *"Look, I didn't know I could speak Parseltongue! What else don't I know about myself? Look. Maybe you can do something, even something horrible and not know you did it."*
Once the extension is installed, simply use it in your code by :
```php
$userQuery = 'good AND plenty -licorice';
$parsel = new ParselQuery([
'userQuery' => $this->userQuery,
'dbQuery' => Script::find()
]);
$parsel->dbQuery->all();
```## Tokens/behavior:
Fields to be search must be either text, varchar or char currently. Future versions may expand to number, dates and maybe even JSON. All search terms, except where specified bye the full match operator are wrapped in your databases wildcard of choice. Searching for "smart" is equivalent to the SQL expression `'%smart%'`. Search is case insensitive as long as your database's `LIKE` operator is. PostgreSQL will use `ILIKE`.
#### Conjunctives:
'AND' is the default behavior. "smart pretty" is the same as "smart AND pretty."
'OR' allows more results in your query: "smart OR pretty."
#### Operators:
| Operator | Type | Description |
| -------- | ------------------ | ------------------------------------------------------------ |
| - | Negation | The user query "smart pretty -judgmental" parses to "smart AND pretty AND NOT judgmental" |
| () | Sub-query | Allows grouping of terms . The user query "-crazy (smart AND pretty)" parses to "NOT crazy AND (smart AND pretty)" |
| * | Wildcard | Fuzzy matches. "butt\*" matches butt, buttery, buttered etc. |
| _ | Character wildcard | Matches one character. "boo\_" matches boot, book, bool, boon, etc. |
| = | Full match | Entire fields must be equal to the term. "=georgia" only matches where one or more fields is exactly equal to the search term. The search term will NOT be bracketed with %, but wildcards can still be used. |
| "" | Double quotes | Phrase. '"Super fun"' searches for the full phrase, space include. Wild cards, negation and exact match operators all work within the phrase. |
| '' | Single quotes | Phrase, no wildcards. The term will not be evaluated for * or _, but will be wrapped in wildcards. If a % or _ is in the term, it will be escaped. 'P%on*' becomes '%P\%on\*%'. |
| : | Field | Specify the field to search. 'name:jo*' will search the name field for 'jo\*.' If no field name matches, all fields will be searched for 'name:jo\*' |#### Examples
See files in /examples. If it's still up, you might also be able to play with an example [here](https://snooky.biz/parsel)
#### Additional Reading
**PostgreSQL**
[Faster PostgreSQL Searches with Trigrams](http://blog.scoutapp.com/articles/2016/07/12/how-to-make-text-searches-in-postgresql-faster-with-trigram-similarity)
[Optimizing databases for fuzzy searching](https://stackoverflow.com/a/13452528)
**MySQL**
[Performance analysis of MySQL's FULLTEXT indexes and LIKE queries for full text search](https://makandracards.com/makandra/12813-performance-analysis-of-mysql-s-fulltext-indexes-and-like-queries-for-full-text-search)
## Acknowledgements
This project was built by heavily modifying the excellent "[Search Query Parser](https://github.com/pimcore/search-query-parser)" project. I de-abstracted the token structure and modified the parser class to better fit my needs. Their licsence file should be found at the root of this project.
Both projects are made possible by the amazing and lightning quick [lexer library](https://github.com/nikic/Phlexy) by Nikita Popov of Berlin. It's work reading his [article on the subject](http://nikic.github.io/2011/10/23/Improving-lexing-performance-in-PHP.html).