https://github.com/drdub/tellandshow
Tell-and-Show is a project for open recommendations that uses the AGPLv3 license to protect *data* and to consider said data as the source for machine learning models.
https://github.com/drdub/tellandshow
machine-learning opendata recommender-engine recommender-system
Last synced: 2 months ago
JSON representation
Tell-and-Show is a project for open recommendations that uses the AGPLv3 license to protect *data* and to consider said data as the source for machine learning models.
- Host: GitHub
- URL: https://github.com/drdub/tellandshow
- Owner: DrDub
- License: agpl-3.0
- Created: 2022-05-14T05:06:38.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2022-05-21T15:34:12.000Z (about 3 years ago)
- Last Synced: 2025-02-10T11:11:20.333Z (4 months ago)
- Topics: machine-learning, opendata, recommender-engine, recommender-system
- Language: Shell
- Homepage:
- Size: 1.94 MB
- Stars: 3
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Tell-and-Show: Community Content-sharing Without Idols Nor Masters
With so many software systems with machine learning these days, the data used to train the models is as important, if not more important, as the source code. While the data comes from the community, it is kept private by the entities that collect it, giving them a competitive advantage impossible to match by the Free Software movement. What can we do about it? Tell-and-Show is an experiment using Free Software licenses for data collection.
Tell-and-Show is a project for open recommendations. By sharing preference data to the community, contributors can help build a preference metric which can then be used to provide private recommendations in the browser.
What is special about Tell-and-Show is the use of the AGPLv3 license to protect **data** and to consider said data as the source for machine learning models. The license thus means that anybody interacting with the model should have access to the original data from which the model is derived. Moreover, if the data was enriched with additional data, the extra data should also be released by virtue of the "virality" of the GPL.
## Current Stage
The `process` folder contains the initial preprocess from a Wikimedia Commons dump to key data for annotation.
The `site` folder contains the current website and the active learning annotation tool.
See the [website](http://tellandshow.org/about_en.html) for further information.
## Roadmap
In the [website](http://tellandshow.org/roadmap_en.html).