awesome-textmining-materials-science
Collection of papers on text mining for materials science
https://github.com/hhaoyan/awesome-textmining-materials-science
Tools and codes
Plain text
- ChemDataExtractor - Full-fledged toolkit for sentence segmentation, tokenization, chemical NER, and extracting chemical information.
- textacy - Pre-/post-processing of text, used in conjunction with spaCy, such as text normalization, garbage text cleaning, and extraction of n-grams, entities, etc.
PDF files
OCR tools
- Google Cloud OCR
- tesseract - Open-source C++ OCR tool based on LSTM that supports many languages.
Image data extraction
Datasets/databases
On synthesis
- Machine-learned and codified synthesis parameters of oxide materials by Kim et al
- Text-mined dataset of inorganic materials synthesis recipes by Kononova et al
- Annotating and Extracting Synthesis Process of All-Solid-State Batteries from Scientific Literature by Kuniyoshi et al - all-solid-state battery articles.
- An open experimental database for exploring inorganic materials by Zakutayev et al
- Auto-generated materials database of Curie and Néel temperatures via semi-supervised relationship extraction by Court et al
NLP annotations
NLP pipelines
NLP annotations
Named Entity Recognition
- Named Entity Recognition and Normalization Applied to Large-Scale Information Extraction from the Materials Science Literature by Weston et al
- Automated Extraction of Chemical Synthesis Actions from Experimental Procedures by Vaucher et al - Rule-based/ML (Transformer) model to extract synthesis actions from experimental procedures.
- Automatically Extracting Action Graphs from Materials Science Synthesis Procedures by Mysore et al - heuristics-based extraction.
- Using Natural Language Processing Techniques to Extract Information on the Properties and Functionalities of Energetic Materials from Large Text Corpora by Elton et al
Text classification/categorization
Data analysis
Synthesis data analysis/planning
Chemical knowledge base/graph