Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/mcs07/chemdataextractor
Automatically extract chemical information from scientific documents
https://github.com/mcs07/chemdataextractor
chemistry information-extraction natural-language-processing nlp python text-mining
Last synced: 5 days ago
JSON representation
Automatically extract chemical information from scientific documents
- Host: GitHub
- URL: https://github.com/mcs07/chemdataextractor
- Owner: mcs07
- License: mit
- Created: 2016-10-02T23:50:01.000Z (about 8 years ago)
- Default Branch: master
- Last Pushed: 2023-07-27T18:05:13.000Z (over 1 year ago)
- Last Synced: 2024-10-28T15:39:21.649Z (16 days ago)
- Topics: chemistry, information-extraction, natural-language-processing, nlp, python, text-mining
- Language: Python
- Homepage: http://chemdataextractor.org
- Size: 542 KB
- Stars: 308
- Watchers: 18
- Forks: 113
- Open Issues: 24
-
Metadata Files:
- Readme: README.rst
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
README
ChemDataExtractor
=================.. image:: http://img.shields.io/pypi/v/ChemDataExtractor.svg?style=flat-square
:target: https://pypi.python.org/pypi/ChemDataExtractor.. image:: http://img.shields.io/pypi/l/ChemDataExtractor.svg?style=flat-square
:target: https://github.com/mcs07/ChemDataExtractor/blob/master/LICENSE.. image:: http://img.shields.io/travis/mcs07/ChemDataExtractor.svg?style=flat-square
:target: https://travis-ci.org/mcs07/ChemDataExtractorChemDataExtractor is a toolkit for extracting chemical information from the scientific literature.
Features
--------- HTML, XML and PDF document readers
- Chemistry-aware natural language processing pipeline
- Chemical named entity recognition
- Rule-based parsing grammars for property and spectra extraction
- Table parser for extracting tabulated data
- Document processing to resolve data interdependenciesInstallation
------------To install ChemDataExtractor, simply run::
pip install chemdataextractor
Or if you are an Anaconda user, run::
conda install -c chemdataextractor chemdataextractor
Alternatively, try one of the other `installation options`_.
Documentation
-------------Full documentation is available at http://chemdataextractor.org/docs
License
-------ChemDataExtractor is licensed under the `MIT license`_, a permissive, business-friendly license for open source
software... _`installation options`: http://chemdataextractor.org/docs/install
.. _`MIT license`: https://github.com/mcs07/ChemDataExtractor/blob/master/LICENSE