https://github.com/chiang-yuan/llamp

A web app and Python API for multi-modal RAG framework to ground LLMs on high-fidelity materials informatics. An agentic materials scientist powered by @materialsproject, @langchain-ai, and @openai
https://github.com/chiang-yuan/llamp

ai4science cheminformatics language-model materials-informatics retrieval-augmented-generation

Last synced: 3 months ago
JSON representation

A web app and Python API for multi-modal RAG framework to ground LLMs on high-fidelity materials informatics. An agentic materials scientist powered by @materialsproject, @langchain-ai, and @openai

Host: GitHub
URL: https://github.com/chiang-yuan/llamp
Owner: chiang-yuan
License: other
Created: 2023-07-01T08:15:34.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2024-10-14T03:45:00.000Z (9 months ago)
Last Synced: 2025-03-31T06:06:01.475Z (3 months ago)
Topics: ai4science, cheminformatics, language-model, materials-informatics, retrieval-augmented-generation
Language: Jupyter Notebook
Homepage: http://ingress.llamp.development.svc.spin.nersc.org/about
Size: 13.7 MB
Stars: 75
Watchers: 1
Forks: 13
Open Issues: 7
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

best-of-atomistic-machine-learning - GitHub - 32% open · ⏱️ 14.10.2024): (Language Models)

README

LLaMP 🦙🔮

Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation

> [!TIP]
> TL;DR: LLaMP is a multimodal retrieval-augmented generation (RAG) framework of hierarchical ReAct agents that can dynamically and recursively interact with [Materials Project](https://materialsproject.org) to ground LLMs on high-fidelity materials informatics.

This repository accompanies our paper [**LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation**](https://arxiv.org/abs/2401.17244). Our codebase is built upon [LangChain](https://github.com/langchain-ai/langchain) and is designed to be modular and extensible, and can be used to reproduce the experiments in the paper, as well as to develop new experiments.

LLaMP is also a homonym of **Large Language model [Materials Project](https://materialsproject.org)**. :wink: It empowers LLMs with large-scale computational materials database to reduce the likelihood of hallucination for materials informatics.

## 🔮 Quick Start

#### Python API

```shell
git clone https://github.com/chiang-yuan/llamp.git
cd llamp/api
pip install -e .
```

After installation, check out [colab notebook chat](http://colab.research.google.com/github/chiang-yuan/llamp/blob/main/experiments/00-notebook-chat.ipynb) or the notebooks in `experiments` to start.

#### (Optional) Atomistic Simulation

You may need to install additional packages to support atomistic simulations:

```shell
pip install ase, atomate2, jobflow, mace-torch
```

#### (Optional) Docker Web Interface

```shell
docker-compose up --build
```

## 👋 Contributing

We understand sometime it is difficult to navigate Materials Project database! We want everyone to be able to access materials informatics through conversational AI. We are looking for contributors to help us build a more powerful and user-friendly LLaMP to support more MP API endpoints or external datastore and agents.

To contirbute to LLaMP, please follow these steps:

1. Fork the repository
2. Set up environment variables
```shell
cp .env.example .env.local
```
3. Deploy local development environment
```shell
docker-compose up
```
4. Make changes and submit a pull request

## 🌟 Authors and Citation

![Alt](https://repobeats.axiom.co/api/embed/75e53e291a07ad8d4b60e5f800726debe01351fb.svg "Repobeats analytics image")

If you use LLaMP, our code and data in your research, please cite our paper:

```bibtex
@article{chiang2024llamp,
title={LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation},
author={Chiang, Yuan and Chou, Chia-Hong and Riebesell, Janosh},
journal={arXiv preprint arXiv:2401.17244},
year={2024}
}
```

## 🤗 Acknowledgements

We thank Matthew McDermott (@mattmcdermott), Jordan Burns in Materials Science and Engineering at UC Berkeley for their valuable feedback and suggestions. We also thank the [Materials Project](https://materialsproject.org) team for their support and for providing the data used in this work. We also thank Dr. Karlo Berket (@kbuma) and Dr. Anubhav Jain (@computron) for their advice and guidance.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/chiang-yuan/llamp

Awesome Lists containing this project

README

LLaMP 🦙🔮

Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation