Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mcs07/chemdataextractor

Automatically extract chemical information from scientific documents
https://github.com/mcs07/chemdataextractor

chemistry information-extraction natural-language-processing nlp python text-mining

Last synced: 5 days ago
JSON representation

Automatically extract chemical information from scientific documents

Awesome Lists containing this project

README

        

ChemDataExtractor
=================

.. image:: http://img.shields.io/pypi/v/ChemDataExtractor.svg?style=flat-square
:target: https://pypi.python.org/pypi/ChemDataExtractor

.. image:: http://img.shields.io/pypi/l/ChemDataExtractor.svg?style=flat-square
:target: https://github.com/mcs07/ChemDataExtractor/blob/master/LICENSE

.. image:: http://img.shields.io/travis/mcs07/ChemDataExtractor.svg?style=flat-square
:target: https://travis-ci.org/mcs07/ChemDataExtractor

ChemDataExtractor is a toolkit for extracting chemical information from the scientific literature.

Features
--------

- HTML, XML and PDF document readers
- Chemistry-aware natural language processing pipeline
- Chemical named entity recognition
- Rule-based parsing grammars for property and spectra extraction
- Table parser for extracting tabulated data
- Document processing to resolve data interdependencies

Installation
------------

To install ChemDataExtractor, simply run::

pip install chemdataextractor

Or if you are an Anaconda user, run::

conda install -c chemdataextractor chemdataextractor

Alternatively, try one of the other `installation options`_.

Documentation
-------------

Full documentation is available at http://chemdataextractor.org/docs

License
-------

ChemDataExtractor is licensed under the `MIT license`_, a permissive, business-friendly license for open source
software.

.. _`installation options`: http://chemdataextractor.org/docs/install
.. _`MIT license`: https://github.com/mcs07/ChemDataExtractor/blob/master/LICENSE