Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/transformerdb/transformerdb
TransformerDB
https://github.com/transformerdb/transformerdb
Last synced: 11 days ago
JSON representation
TransformerDB
- Host: GitHub
- URL: https://github.com/transformerdb/transformerdb
- Owner: transformerdb
- Created: 2021-04-18T17:29:26.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2021-04-22T14:11:34.000Z (over 3 years ago)
- Last Synced: 2024-08-01T16:50:04.962Z (3 months ago)
- Homepage:
- Size: 25.8 MB
- Stars: 19
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# TransformerDB
TransformerDB is a neural network that ingests _all_ your organization's data and allows you to query it using natural language.
It's the punch line in a [keynote presentation](https://icde2021.gr/keynotes/) by Jimmy Lin at [ICDE 2021](https://icde2021.gr/):
**The Attack of the Muppets: Data Management in the Era of Pretrained Transformers**
Natural language processing and data management have historically existed harmoniously, like tigers and sharks, masters of their domains—unstructured text and structured data, respectively.
While there have long been go-betweens, for example, work on relation extraction, knowledge graphs, etc., the recent advent of massively-pretrained transformer models such as BERT and related models (collectively, "muppets") threatens this balance.
Examples of the attack of the muppets include demonstration that pretrained models already know much of what's in knowledge graphs such as Wikidata, pretrained table models, and surprising progress on text-to-SQL parsing.
With models such as GPT-3 grabbing all the headlines, it's not a ridiculous proposition (any longer) to claim that "muppets are all you need".
In this talk, I will explore the veracity and implications of this claim, and what it might mean for NLP and data management research moving forward.[**Slides**](Lin-ICDE2021-keynote.pdf) from the presentation.