https://github.com/webstylestory/figolia

[![Version npm](https://img.shields.io/npm/v/figolia.svg?style=flat-square)](https://www.npmjs.com/package/figolia) [![Build Status](https://img.shields.io/travis/webstylestory/figolia.svg?style=flat-square)](https://travis-ci.org/webstylestory/figolia) [![Coverage Status](https://img.shields.io/coveralls/webstylestory/figolia.svg?style=flat-square)](https://coveralls.io/github/webstylestory/figolia) [![Code climate](https://img.shields.io/codeclimate/github/kabisaict/flow.svg?style=flat-square)](https://codeclimate.com/github/webstylestory/figolia) [![Dependency Status](https://img.shields.io/david/webstylestory/figolia.svg?style=flat-square)](https://david-dm.org/webstylestory/figolia) [![Made by french tech badge](https://img.shields.io/badge/made%20by-french%20%E2%9D%A4%20tech-orange.svg?style=flat-square)](http://webstylestory.com)

# Figolia

⚡️ Keep your Algolia search indexes in sync with your Firebase datasets!

When using [Firebase](http://firebase.com) as a web or mobile backend, [Algolia](http://algolia.com) search-as-a-service is a really nice addition, considering the poor search and querying capabilities of Firebase. Figolia is the perfect companion to make their synchronization happen in no time.

*(Codebase inspired by Scott Smith's work in this [blog post](http://scottksmith.com/blog/2014/12/09/algolia-real-time-search-with-firebase/))*

**2017 note: this year, Google released Firebase Functions, which you can use as triggers to update Algolia indices from your Firebase database. The functions are still in beta and might get a bit pricey if your indexed search data receives a lot of mutations. I am still using `figolia` in production for the moment and keep maintaining this package.**

## Features

* Sync multiple Firebase references
* All CRUD Firebase operations reflected in Algolia
* Restarts from last indexing timestamp the next time it's launched
* Throttle indexing to limit Algolia API calls
* Make middle-of-word (*infix*) search possible

---

* [Install](#install)
* [Usage](#usage)
* [Configuration](#configuration)
    * [Firebase configuration](#firebase-configuration)
    * [Partial word matching (*infix*) search](#partial-word-matching-infix-search)
    * [Reindexing, incremental indexing](#reindexing-incremental-indexing)
* [Release Notes](#release-notes)
* [Known Issues](#known-issues)
* [Developers](#developers)
    * [Logging & Debugging](#logging--debugging)
    * [Testing](#testing)
    * [Contribute](#contribute)
* [License](#license)

## Install

    $ [sudo] npm install -g figolia

Or clone the GitHub repository:

    $ git clone https://github.com/webstylestory/figolia.git
    $ cd figolia
    $ npm install

## Usage

    $ figolia --help

      Usage: figolia [options]

      Options:

        -h, --help                    output usage information
        -V, --version                 output the version number
        -c, --config [path]           Specify configuration (default ~/.figolia.conf.js)
        -l, --live-index              Keep server running to live index Firebase operations (otherwise exit after indexing)
        -r, --reset                   Force index reset (clear & full reindex)
        -t, --timestamp-field [name]  Object field name containing last modification timestamp (default 'updatedAt')
        -d, --throttle-delay [n]      Minimum throttle delay between Algolia API calls (in seconds, default 10)
                                      Note: between each throttle delay, a maximum of
                                      { 3 * number of datasets } API calls can be made (add, update & delete)

*Note: if you cloned the repository from GitHub, run `npm link` first, or call `./bin/figolia` directly.*
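
The throttle note in the help output can be pictured with a small sketch. This is not figolia's actual code: it only assumes that changes are queued per dataset and per operation type, then flushed once per throttle delay, so that a flush makes at most three batched calls per dataset. `createBatcher` and `sendBatch` are hypothetical names.

```javascript
// Hypothetical sketch of throttled batching; not figolia's real implementation.
// Changes are queued per dataset and per operation type, then sent in batches.
const OPERATIONS = ['add', 'update', 'delete'];

function createBatcher(datasetNames, sendBatch) {
  // One queue per (dataset, operation) pair.
  const queues = {};
  for (const name of datasetNames) {
    queues[name] = { add: [], update: [], delete: [] };
  }

  return {
    // Record a change; nothing is sent yet.
    push(dataset, operation, item) {
      queues[dataset][operation].push(item);
    },
    // Send every non-empty queue as a single batched call, then empty it.
    // Returns the number of calls made: at most 3 * datasetNames.length.
    flush() {
      let calls = 0;
      for (const name of datasetNames) {
        for (const operation of OPERATIONS) {
          const items = queues[name][operation];
          if (items.length > 0) {
            sendBatch(name, operation, items.splice(0));
            calls += 1;
          }
        }
      }
      return calls;
    }
  };
}

// Usage with a fake sender that only records what it receives.
const sentBatches = [];
const batcher = createBatcher(['todoLists', 'todoItems'], (dataset, operation, items) => {
  sentBatches.push({ dataset, operation, count: items.length });
});
batcher.push('todoLists', 'add', { id: 1 });
batcher.push('todoLists', 'add', { id: 2 });
batcher.push('todoItems', 'delete', 'abc');
const callCount = batcher.flush(); // two batched calls for three queued changes
```

In a long-running process, `flush()` would be scheduled once per delay, for example with `setInterval(() => batcher.flush(), throttleDelay * 1000)`.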

**Important:** for the moment, there is no way to run the server without a config file, at least to provide the schema you wish to index. See the [configuration](#configuration).

## Configuration

Copy the `defaults.conf.js` and modify it according to your needs, before running the server.
(type `figolia --help` to see the default configuration file location)

    var CONFIG = {
        // Firebase credentials
        firebase: {
            // Firebase project name, as seen in your web config:
            // `databaseUrl: 'https://.firebaseio.com'`
            instance: '',
            // The file below can be downloaded from the Firebase Console in the
            // last tabs of the settings of your project. NEVER SHARE THAT FILE.
            // Note: you can also import `path` and use `__dirname` to refer to this file directory
            // eg. serviceAccountFile: path.join(__dirname, 'serviceAccountFile.json'),
            serviceAccountFile: '/path/to/serviceAccountFile.json',
            // Where to store server metadata
            path: 'algolia',
            // Firebase token will be generated with this uid (to write above path)
            uid: 'algolia'
        },
        // Algolia credentials
        algolia: {
            // Algolia application ID
            applicationId: '',
            // *Admin* API Key
            apiKey: ''
        },
        // Fully reindex all datasets (ERASE PREVIOUS INDEX DATA)
        reset: false,
        // Stay running and live-index all Firebase operations.
        liveIndex: false,
        // Minimum throttle delay between Algolia API calls (in seconds)
        // Note: between each throttle delay, a maximum of
        // [3 * dataset number] calls can be made (add, update & delete)
        throttleDelay: 10,
        // Optional, this field will be checked against last
        // run date to see if reindexing is necessary.
        // Field type must be UNIX timestamp (example Javascript Date.now()).
        // WARNING: Without this field being correctly configured,
        // everything is re-indexed at each rerun.
        timestampField: 'updatedAt',
        // Firebase datasets to index in Algolia (examples)
        schema: {
            todoLists: {
                // Firebase path
                path: 'app/todo',
                // Algolia index (must exist already)
                index: 'dev_todo_lists',
                // Optional, name of ID field (otherwise,
                // the Firebase object key will be used)
                key: 'id',
                // Optional, dataset-specific update time field
                // (default is use global setting above)
                timestampField: 'createdAt',
                // Optional, list of fields to index
                // (otherwise, every field will be indexed)
                includeFields: [
                    'name',
                    'updatedAt'
                ],
                // Optional, list of fields to exclude from index
                // Note: if both are specified, `excludeFields`
                // is applied *after* `includeFields`
                // Nested properties can be accessed using dot notation
                excludeFields: [
                    'passwdHash',
                    'nested.property'
                ],
                // Optional, list of fields for which you need N-Gram tokens
                // ex. "username: 'hermione'", will also create an additional
                // field "usernameNGrams: ['ermione', 'rmione', 'mione', 'ione']"
                // for each word (down to 4 chars), making *infix* search possible
                // in addition to default prefix search available in Algolia out of the box
                // letting users search with keyword "mione"
                // Note: this can be storage-consuming for long fields, use with
                // caution! (preferably on fields with enforced size)
                // Nested properties can be accessed using dot notation
                ngrams: ['username', 'profile.fullName']
            },
            todoItems: {
                // Second example dataset to index, minimal config
                path: 'app/todoItems',
                index: 'dev_todo_items'
            }
        }
    };
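
The ordering rule for `includeFields` and `excludeFields` (exclusion applied *after* inclusion, with dot notation for nested properties) can be illustrated as below. This is a minimal sketch under stated assumptions, not figolia's own code: `getPath`, `deletePath` and `filterFields` are hypothetical helpers, and for simplicity `includeFields` is only matched against top-level names here.

```javascript
// Hypothetical helpers illustrating includeFields / excludeFields semantics;
// not figolia's real implementation.

// Read a (possibly nested) property using dot notation, e.g. 'nested.property'.
function getPath(obj, path) {
  return path.split('.').reduce(
    (value, key) => (value == null ? undefined : value[key]),
    obj
  );
}

// Delete a (possibly nested) property using dot notation.
function deletePath(obj, path) {
  const keys = path.split('.');
  const last = keys.pop();
  const parent = keys.length > 0 ? getPath(obj, keys.join('.')) : obj;
  if (parent != null && typeof parent === 'object') {
    delete parent[last];
  }
}

// Keep only includeFields (when given), then drop excludeFields.
// Simplification: includeFields is matched against top-level names only.
function filterFields(record, includeFields, excludeFields = []) {
  // Deep copy so the original record is left untouched (plain JSON data only).
  let result = JSON.parse(JSON.stringify(record));
  if (includeFields && includeFields.length > 0) {
    const picked = {};
    for (const field of includeFields) {
      if (field in result) picked[field] = result[field];
    }
    result = picked;
  }
  for (const field of excludeFields) {
    deletePath(result, field);
  }
  return result;
}

const userRecord = {
  name: 'Hermione',
  passwdHash: 'secret-hash',
  nested: { property: 'hidden', other: 'kept' }
};
const filteredRecord = filterFields(
  userRecord,
  ['name', 'nested', 'passwdHash'],
  ['passwdHash', 'nested.property']
);
// filteredRecord is { name: 'Hermione', nested: { other: 'kept' } }
```

Even though `passwdHash` is listed in `includeFields` in this example, it is removed because the exclusion step runs last.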

#### Update from v0.3.x to v0.4.x

The `firebase` package is now deprecated on the server, hence the switch to `firebase-admin`. You have to update your figolia configuration: remove the `firebase.secret` entry and replace it with a `firebase.serviceAccountFile` entry pointing to your Firebase key JSON file. This file can be downloaded from the Firebase Console in the last tabs of the settings of your project. NEVER SHARE THAT FILE.

Basic steps (for a global install):

1. `npm i -g figolia@^0.4.x`
2. Download the service account JSON file from your Firebase console
3. In your figolia configuration file, replace `secret` with `serviceAccountFile`, whose value is the path to the file you just downloaded
4. If you store the service account file in a project with versioned source code, do not commit it (for example by adding a line to your `.gitignore` file)
5. Restart your `figolia` services, and you're done!
6. Report any issues ;-)

### Firebase configuration

In order for `figolia` to work properly, it must store
the last known indexing date in Firebase. You can specify the path where you
want this information stored in the config ([see above](#configuration)).
The default is to use the path named `figolia` at the root of your Firebase reference.

### Partial word matching (*infix* search)

[Out of the box, Algolia only supports *prefix* search by design.](https://www.algolia.com/doc/faq/toubleshooting/how-can-i-make-queries-within-the-middle-of-a-word) However, this can be mitigated by generating *N-Grams* of each word,
down to a minimum length of 4 characters (below that, relevance drops too much). Figolia does this for you!

Example: activating `ngrams` on the `username` field will generate, for the
value `hermione`, the following tokens: `['ermione', 'rmione', 'mione', 'ione']`.

As a consequence, users can find Hermione with 'mione' keyword, which would sadly
return no results otherwise.
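
For illustration, the token generation described above can be sketched as follows. This is a minimal sketch, not necessarily how figolia implements it: `suffixNGrams` is a hypothetical name, and the deduplication step is an assumption based on the 0.3.5 release note.

```javascript
// Hypothetical sketch of suffix generation; not figolia's real implementation.
// For each word, emit every proper suffix that is at least minLength characters long.
function suffixNGrams(text, minLength = 4) {
  const tokens = new Set(); // deduplicate tokens shared by several words
  for (const word of String(text).split(/\s+/)) {
    // Start at 1 so the full word (already matched by prefix search) is skipped.
    for (let start = 1; word.length - start >= minLength; start += 1) {
      tokens.add(word.slice(start));
    }
  }
  return [...tokens];
}

const hermioneTokens = suffixNGrams('hermione');
// hermioneTokens is ['ermione', 'rmione', 'mione', 'ione']
```

Words of 4 characters or fewer produce no extra tokens, which keeps the added storage proportional to the number of long words in the field.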

### Reindexing, incremental indexing

This daemon supports a simple mode where every indexed object in Algolia is dropped at runtime,
and then re-indexed from the current Firebase connection. **Any previously indexed data will be lost.**

This is not recommended, as it can lead to many useless operations and wasted bandwidth.

I strongly suggest specifying `path` and `uid` in the `firebase` config field, and giving
write access to that uid in the Firebase Rules. It is also mandatory to set, in the schema
definition, the field holding the last update time of each item (for example,
an `updatedAt` field).

This tracking happens in your app: if you have not implemented a way
to record the last update time of each of your objects, you will have to do so
before you can use this tool efficiently. You can also choose to leave this field
unchanged for minor operations that do not need reindexing.
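
The idea behind incremental indexing can be sketched as a simple comparison between each object's timestamp and the last indexing date. This is an illustration only, not figolia's actual code: `keysToReindex` is a hypothetical function, and treating objects without a numeric timestamp as always changed is an assumption.

```javascript
// Hypothetical sketch of timestamp-based selection; not figolia's real implementation.
// Returns the keys of the objects modified after the last indexing run.
function keysToReindex(objects, lastIndexedAt, timestampField = 'updatedAt') {
  return Object.keys(objects).filter((key) => {
    const timestamp = objects[key][timestampField];
    // Assumption: objects without a usable timestamp are always reindexed.
    return typeof timestamp !== 'number' || timestamp > lastIndexedAt;
  });
}

const lastRun = 1500000000000; // UNIX timestamp in milliseconds, like Date.now()
const todos = {
  a: { name: 'Buy milk', updatedAt: lastRun - 1000 },
  b: { name: 'Write docs', updatedAt: lastRun + 1000 },
  c: { name: 'No timestamp yet' }
};
const changedKeys = keysToReindex(todos, lastRun);
// changedKeys is ['b', 'c']
```

On the app side, this means writing something like `updatedAt: Date.now()` alongside every change that should be reflected in search results.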

To allow `algolia` user to write in your Firebase `algolia` path,
add the following in your Firebase instance security rules:

    // Let figolia daemon keep track of what is in sync
    "algolia": {
        ".read": "auth.uid == 'algolia'",
        ".write": "auth.uid == 'algolia'"
    },
    // Optional, to avoid Firebase warnings when running the tests
    "tests": {
        "testData": {
            ".indexOn": "updatedAt"
        }
    }

### Production setup

For production setup, I strongly encourage the use of a good process manager
like [PM2](https://github.com/Unitech/pm2) or [foreverjs](https://github.com/foreverjs/forever).

To make them work with figolia, which is developed with ES2015 and deployed without pre-compilation,
you need to specify the full path of the executable:

    $ which figolia
    /usr/local/bin/figolia
    $ pm2 start /usr/local/bin/figolia

## Release notes

* 0.4.6 - Display version when starting
* 0.4.5 - Small fixes and compliance with third party tools like TravisCI
* 0.4.0 - Update deps, firebase to firebase-admin, auth by secret to serviceAccount file
* 0.3.5 - Optimize ngrams storage by grouping multiple fields and deduplicating
* 0.3.4 - Throw if key does not exist. `excludeFields` can be nested prop
* 0.3.3 - Object key can be nested prop: `'prop.id'`
* 0.3.2 - NGrams can be nested prop: `'prop.nested'`
* 0.3.1 - Fix - do not clear index when relaunching
* 0.3.0 - Add NGrams generation for middle-of-word (*infix*) search
* 0.2.8 - Change default timestamp field name to 'updatedAt', bug fixes
* 0.2.4 - Fix babel ignore option and config loading
* 0.2.0 - Add throttle option to limit API calls
* 0.1.5 - Add ignore/only in babel-register options as .babelrc switches ignored
* 0.1.1 - Fix commandline issue with missing npm package
* 0.1.0 - Initial release

## Known issues

* [#15](#15) When the figolia server stops running, your Firebase can continue to
  change. Next time you run figolia, it will pick up the changes (provided you
  have the `timestampField` option correctly set up). However, it will not see
  that items have been deleted. **The workaround is to fully reindex the datasets.**
  The fix (TODO) will be to list all object IDs and remove the extraneous ones.
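
The planned fix amounts to a set difference between the IDs stored in the index and the keys still present in Firebase. A rough sketch, assuming both lists have already been fetched (`findStaleObjectIds` is a hypothetical helper, not part of figolia):

```javascript
// Hypothetical sketch of the cleanup described above; not part of figolia.
// Returns the object IDs present in the index but no longer in Firebase.
function findStaleObjectIds(indexedIds, firebaseKeys) {
  const live = new Set(firebaseKeys);
  return indexedIds.filter((id) => !live.has(id));
}

const staleIds = findStaleObjectIds(['a', 'b', 'c'], ['a', 'c', 'd']);
// staleIds is ['b']
```

The resulting IDs are the ones that would need to be deleted from the Algolia index.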

## Developers

### Logging & debugging

By default, basic info is output in the console. Should you need more debug information, you can use the following command line:

    DEBUG=figolia* figolia

...or go full throttle including Babel, Algolia and Firebase debug info:

    DEBUG=* figolia

### Testing

Because Firebase and Algolia accounts are needed for this app, you have to provide
all the necessary credentials as environment variables while running `npm test`:

    FIREBASE_INSTANCE=CHANGE_ME FIREBASE_ACCOUNT=CHANGE_ME ALGOLIA_APP_ID=CHANGE_ME \
    ALGOLIA_API_KEY=CHANGE_ME DEBUG=quiet npm test

Note 1: The tests are pretty slow sometimes, because they wait for all Algolia write
and indexing operations to finish in order to validate the results. You can speed
up your testing of a specific file by appending its name to the command line:

    FIREBASE_INSTANCE=CHANGE_ME FIREBASE_ACCOUNT=CHANGE_ME ALGOLIA_APP_ID=CHANGE_ME \
    ALGOLIA_API_KEY=CHANGE_ME DEBUG=quiet npm test ./test/testfile.spec.js

*Note 2: although the server can work with a read-only access to Firebase, the tests
cannot, because they have to write fixture data, hence the need for a full Firebase
configuration, including `serviceAccountFile` and `uid` ([see configuration](#firebase-configuration))*

### Contribute

PRs are more than welcome! Your PR should not break current usage
and should pass all tests. Even better if you write tests for the added code, and
better still if the new features are documented in this README ;-)

I will have a look at anything you have the time to propose.

## License

MIT © 2016 Aurélien Chivot

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.