https://github.com/sqrtneginf/qsar-to-sql
Tools for converting Biobyte QSAR database to SQL
https://github.com/sqrtneginf/qsar-to-sql
cheminformatics qsar sql
Last synced: 9 months ago
JSON representation
Tools for converting Biobyte QSAR database to SQL
- Host: GitHub
- URL: https://github.com/sqrtneginf/qsar-to-sql
- Owner: SqrtNegInf
- Created: 2018-06-19T23:22:26.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2018-06-21T19:55:03.000Z (almost 8 years ago)
- Last Synced: 2025-06-24T08:04:44.437Z (10 months ago)
- Topics: cheminformatics, qsar, sql
- Language: Perl
- Size: 12.7 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# QSAR database converted to SQL
The custom database format designed for Biobyte's
quantitative structure-activity relationship (*QSAR*) database is not useful
for anyone not running Biobyte software. These scripts show how the data can
be exported to SQL.
## QSAR data
The portions of the raw QSAR data which are exported include:
* informational headers
* compound structure (SMILES)
* biological activity measurement
* regression model
* regression statistics
What is not exported is the underlying table of data (steric, electronic, etc)
upon which the regression model is built, so the exported data is not suitable
for developing new regression models.
## Main data tables:
1. qsar_sets -- descriptive text, regression model
* qsar_id -- integer, primary key
* system_id -- integer, xref 'systems' table
* class_id -- integer, xref 'classes' table
* compound_id -- integer, xref 'compounds' table
* action_id -- integer, xref 'actions' table
* citation_id -- integer, xref 'citations' table
* model -- text, full regression model
* n -- integer, number of data points in model
* r -- float, correlation coefficient
* s -- float, standard deviation
2. structures -- relates compounds to set given by 'qsar_id'
* structure_id -- integer, primary key
* qsar_id -- integer, xref 'qsar_sets' table
* smiles_id -- integer, xref 'smiles' table
* observed -- float, biological activity
3. qsar_parameters -- relates parameters in model to set given by 'qsar_id'
* qsar_param_id -- integer, primary key
* qsar_id -- integer, xref 'qsar_sets' table
* parameter_label -- text
* coefficient -- float, regression coefficient
* confidence -- float, 95% confidence limit
## Indirect data tables:
4. smiles
* smiles_id -- integer, primary key
* smiles -- text, 'unique' SMILES (via Biobyte method)
* mf -- text, molecular formula
* mw -- float, molecular weight
5. systems
* system_id -- integer, primary key
* system -- text
6. classes
* class_id -- integer, primary key
* class -- text
7. compounds
* compound_id -- integer, primary key
* compound -- text
8. actions
* action_id -- integer, primary key
* action -- text
9. citations
* citation_id -- integer, primary key
* citation -- text