Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/model-zoo/shift-ctrl-f
🔎 Search the information available on a webpage using natural language instead of an exact string match.
https://github.com/model-zoo/shift-ctrl-f
bert chrome-extension natural-language question-answering semantic-search tensorflow tensorflowjs unpacked-extension
Last synced: 3 months ago
JSON representation
🔎 Search the information available on a webpage using natural language instead of an exact string match.
- Host: GitHub
- URL: https://github.com/model-zoo/shift-ctrl-f
- Owner: model-zoo
- License: apache-2.0
- Archived: true
- Created: 2020-08-12T21:07:40.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2020-09-01T16:24:04.000Z (over 4 years ago)
- Last Synced: 2024-09-21T17:42:27.349Z (3 months ago)
- Topics: bert, chrome-extension, natural-language, question-answering, semantic-search, tensorflow, tensorflowjs, unpacked-extension
- Language: JavaScript
- Homepage: https://modelzoo.dev
- Size: 87.8 MB
- Stars: 1,104
- Watchers: 18
- Forks: 46
- Open Issues: 6
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
README
# Shift-Ctrl-F: Semantic Search for the Browser
![Shift-Ctrl-F](./assets/chrome-store-banner.png)
[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
[![Chrome Web Store Link](https://developer.chrome.com/webstore/images/ChromeWebStore_BadgeWBorder_v2_340x96.png)](https://chrome.google.com/webstore/detail/shift-ctrl-f-semantic-sea/alnafbdjkoiljlomnbcknehahbjoiped)
Search the information available on a webpage using
natural language instead of an exact string match. Uses
[MobileBERT](https://arxiv.org/abs/2004.02984)
fine-tuned on
[SQuAD](https://rajpurkar.github.io/SQuAD-explorer/)
via [TensorFlowJS](https://www.tensorflow.org/js) to
search for answers and mark relevant elements on the web page.![Shift-Ctrl-F Demo](./assets/demo.gif)
**This extension is an experiment.** Deep learning models like BERT are powerful
but may return unpredictable and/or biased results that are tough to interpret.
Please apply best judgement when analyzing search results.### Why?
Ctrl-F uses exact string-matching to find information within a webpage. String
match is inherently a proxy heuristic for the true content -- in most cases it
works very well, but in some cases it can be a bad proxy.In our example above we search
[https://stripe.com/docs/testing](https://stripe.com/docs/testing), aiming to
understand the **difference between test mode and live mode**. With string
matching, you might search through some relevant phrases `"live mode"`, `"test
mode"`, and/or `"difference"` and scan through results. With semantic search, you
can directly phrase your question `"What is the difference between live mode
and test mode?"`. We see that the model returns a relevant result, even though
the page does not contain the term "`difference`".### How It Works
Every time a user executes a search:
1. The content script collects all `
`, `
- `, and `
- ` elements on the
page and extracts text from each.
2. The background script executes the question-answering model on every
element, using the query as the question and the element's text as the context.
3. If a match is returned by the model, it is highlighted within the page along
with the confidence score returned by the model.
### Architecture
There are three main components that interact via [Message
Passing](https://developer.chrome.com/extensions/messaging) to orchestrate the
extension:
1. Popup (`popup.js`): React application that renders the search bar, controls
searching and iterating through the results.
2. Content Script (`content.js`): Runs in the context of the current tab,
responsible for reading from and manipulating the DOM.
3. Background (`background.js`): Background script that loads and executes the
TensorFlowJS model on question-context pairs.
`src/js/message_types.js` contains the messages used to interact between these
three components.
### Development
Make sure you have these dependencies installed.
1) [Node](https://nodejs.org/en/download/)
2) [Yarn](https://classic.yarnpkg.com/en/docs/install)
3) [Prettier](https://prettier.io/docs/en/install.html)
Then run:
```
make develop
```
The unpacked extension will be placed inside of `build/`. See [Google Chrome
Extension developer
documentation](https://developer.chrome.com/extensions/getstarted) to load the
unpacked extension into your Chrome browser in development mode.
### Publishing
```
make build
```
A zipped extension file ready for upload will be placed inside of `dist/`.