Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/datacommonsorg/llm-tools
- Host: GitHub
- URL: https://github.com/datacommonsorg/llm-tools
- Owner: datacommonsorg
- License: apache-2.0
- Created: 2024-08-21T06:19:04.000Z (5 months ago)
- Default Branch: main
- Last Pushed: 2024-09-16T17:36:38.000Z (4 months ago)
- Last Synced: 2024-09-16T21:46:13.083Z (4 months ago)
- Language: Jupyter Notebook
- Size: 117 KB
- Stars: 23
- Watchers: 1
- Forks: 4
- Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
- awesome_ai_agents - Llm-Tools - This repo contains client library code for accessing DataGemma, an open model that helps address the challenges of hallucination by grounding LLMs in the vast, real-world statistical data of Google's Data Commons (Building / Tools)
README
# Data Gemma
This repo contains client library code for accessing DataGemma, an
open model that helps address the challenges of hallucination by grounding LLMs
in the vast, real-world statistical data of Google's Data Commons.

There are two methodologies used to achieve this: Retrieval Interleaved Generation
(RIG) and Retrieval Augmented Generation (RAG). More details can be found in the
[paper](https://datacommons.org/link/DataGemmaPaper) and [blog post](https://research.google/blog/grounding-ai-in-reality-with-a-little-help-from-data-commons/).

The fine-tuned DataGemma models are hosted on Hugging Face
([RIG](https://huggingface.co/google/datagemma-rig-27b-it),
[RAG](https://huggingface.co/google/datagemma-rag-27b-it)) and Kaggle
([RIG](https://www.kaggle.com/models/google/datagemma-rig),
[RAG](https://www.kaggle.com/models/google/datagemma-rag)).

To install the library, run:
```bash
pip install git+https://github.com/datacommonsorg/llm-tools
```

For examples of using this library, see our Colab notebooks for [RIG](https://github.com/datacommonsorg/llm-tools/blob/main/notebooks/data_gemma_rig.ipynb)
and
[RAG](https://github.com/datacommonsorg/llm-tools/blob/main/notebooks/data_gemma_rag.ipynb).

----------
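
As a rough illustration of the interleaving idea behind RIG, the sketch below post-processes a model draft that contains inline lookup markers and swaps in retrieved values where a lookup succeeds. It is not this library's API: the `[DC("...") --> "..."]` marker syntax, the `fetch_statistic` helper, and the `FAKE_STORE` dictionary (with placeholder values, not real statistics) are all assumptions made for this example. The notebooks linked above show the actual usage.

```python
import re

# Hypothetical marker format (an assumption for this sketch):
#   [DC("<natural-language question>") --> "<model's own guess>"]
MARKER = re.compile(
    r'\[DC\("(?P<question>[^"]+)"\)\s*-->\s*"(?P<guess>[^"]*)"\]'
)

# Stand-in for a real statistics lookup. The value is a placeholder,
# not real data, and the place name is fictional.
FAKE_STORE = {
    "population of Examplestan in 2020": "1,234,567",
}


def fetch_statistic(question):
    """Return a stored value for the question, or None if nothing matches."""
    return FAKE_STORE.get(question)


def interleave(text, fetch=fetch_statistic):
    """Replace each marker with a retrieved value, else keep the model's guess."""

    def substitute(match):
        retrieved = fetch(match.group("question"))
        if retrieved is None:
            # No grounded value available: keep the guess, but flag it.
            return f'{match.group("guess")} (unverified)'
        return f"{retrieved} (retrieved)"

    return MARKER.sub(substitute, text)


draft = (
    'Examplestan had [DC("population of Examplestan in 2020") --> "1.2 million"] '
    'residents and [DC("median age in Examplestan") --> "31"] as its median age.'
)
print(interleave(draft))
```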
Disclaimer
----------
You're accessing a very early version of DataGemma. It is meant for trusted tester use (primarily for academic and research use) and not yet ready for commercial or general public use. This version was trained on a very small corpus of examples and may exhibit unintended, and at times controversial or inflammatory behavior. Please anticipate errors and limitations as we actively develop this large language model interface.

Your feedback and evaluations are critical to refining DataGemma's performance and will directly contribute to its training process. Known limitations are detailed in the paper, and we encourage you to consult it for a comprehensive understanding of DataGemma's current capabilities.