Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/augus1999/bayesian-flow-network-for-chemistry

ChemBFN: Bayesian Flow Network Framework for Chemistry Tasks.
https://github.com/augus1999/bayesian-flow-network-for-chemistry

bayesian-flow-networks cheminformatics chemistry generative-model machine-learning qsar qspr transformer

Last synced: 6 days ago
JSON representation

ChemBFN: Bayesian Flow Network Framework for Chemistry Tasks.

Awesome Lists containing this project

README

        

# ChemBFN: Bayesian Flow Network for Chemistry

**Abstract**: In this work, we introduce ChemBFN, a language model that handles chemistry tasks based on
Bayesian flow networks working on discrete data. A new accuracy schedule is proposed to improve the sampling quality by significantly reducing the reconstruction loss. We show evidence that our method is appropriate for generating molecules with satisfied diversity even when a smaller number of sampling steps is used. A classifier-free guidance method is adapted for conditional generation. It is also worthwhile to point out that after generative training, our model can be fine-tuned on regression and classification tasks with the state-of-the-art performance, which opens the gate of building all-in-one models in a single module style.

[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-bayesian-flow-network-framework-for/molecular-property-prediction-on-bbbp-1)](https://paperswithcode.com/sota/molecular-property-prediction-on-bbbp-1?p=a-bayesian-flow-network-framework-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-bayesian-flow-network-framework-for/molecular-property-prediction-on-clintox-1)](https://paperswithcode.com/sota/molecular-property-prediction-on-clintox-1?p=a-bayesian-flow-network-framework-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-bayesian-flow-network-framework-for/molecular-property-prediction-on-freesolv)](https://paperswithcode.com/sota/molecular-property-prediction-on-freesolv?p=a-bayesian-flow-network-framework-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-bayesian-flow-network-framework-for/molecular-property-prediction-on-hiv-dataset)](https://paperswithcode.com/sota/molecular-property-prediction-on-hiv-dataset?p=a-bayesian-flow-network-framework-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-bayesian-flow-network-framework-for/molecular-property-prediction-on-esol)](https://paperswithcode.com/sota/molecular-property-prediction-on-esol?p=a-bayesian-flow-network-framework-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-bayesian-flow-network-framework-for/molecular-property-prediction-on)](https://paperswithcode.com/sota/molecular-property-prediction-on?p=a-bayesian-flow-network-framework-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-bayesian-flow-network-framework-for/molecular-property-prediction-on-bace-1)](https://paperswithcode.com/sota/molecular-property-prediction-on-bace-1?p=a-bayesian-flow-network-framework-for)

## News

* [31/07/2024] Paper is available on [arxiv.org](https://arxiv.org/abs/2407.20294).
* [21/07/2024] Paper was submitted to arXiv.

## Usage

You can find example scripts in [πŸ“example](./example) folder.

## Pre-trained Model

You can find pretrained models in [release](https://github.com/Augus1999/bayesian-flow-network-for-chemistry/releases).

## Dataset Format

We provide a Python class [`CSVData`](./bayesianflow_for_chem/data.py) to handle data stored in CSV or similar format containing headers with the following tags:
* __smiles__ or __safe__ or __selfies__ (_mandatory_): the entities under this tag should be molecule SMILES, SAFE or SELFIES strings. Multiple tags are acceptable.
* __value__ (_optional_): entities under this tag should be molecular properties or classes. Multiple tags are acceptable and in this case you can tell `CSVData` which value(s) should be loaded by specifying `label_idx=[...]`. If a property is not defined, leave it empty and the entity will be automatically masked to torch.inf telling the model that this property is unknown.

## Cite This Work

```bibtex
@misc{2024chembfn,
title={A Bayesian Flow Network Framework for Chemistry Tasks},
author={Nianze Tao and Minori Abe},
year={2024},
eprint={2407.20294},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2407.20294},
}
```