https://github.com/retospect/acatome-meta
Lightweight metadata lookup for scientific papers
https://github.com/retospect/acatome-meta
crossref doi metadata python scientific-papers
Last synced: 2 months ago
JSON representation
Lightweight metadata lookup for scientific papers
- Host: GitHub
- URL: https://github.com/retospect/acatome-meta
- Owner: retospect
- License: other
- Created: 2026-03-11T19:45:27.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2026-04-01T22:21:30.000Z (3 months ago)
- Last Synced: 2026-04-02T06:34:35.605Z (3 months ago)
- Topics: crossref, doi, metadata, python, scientific-papers
- Language: Python
- Size: 39.1 KB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project
README
# acatome-meta
Lightweight metadata lookup and verification for scientific papers. Resolves PDFs to their DOI, title, authors, and journal using Crossref and Semantic Scholar.
## Features
- **PDF title extraction** — extracts candidate titles from PDF first pages
- **Crossref lookup** — DOI resolution and metadata retrieval
- **Semantic Scholar** — citation counts, abstracts, and supplementary metadata
- **Fuzzy verification** — confirms extracted metadata matches the PDF content
- **Citation parsing** — extracts structured author/year/title from reference strings
- **Zero-config** — works out of the box, optional API keys for higher rate limits
## Installation
```bash
uv pip install -e .
```
## Usage
```python
from acatome_meta import lookup
meta = lookup("/path/to/paper.pdf")
print(meta["doi"], meta["title"], meta["authors"])
```
## CLI
```bash
acatome-meta lookup paper.pdf
acatome-meta verify paper.pdf --doi 10.1234/example
```
## Configuration
Set `SEMANTIC_SCHOLAR_API_KEY` for higher rate limits:
```bash
export SEMANTIC_SCHOLAR_API_KEY=your_key_here
```
## Testing
```bash
uv run python -m pytest tests/ -v
```
## License
LGPL-3.0-or-later — see [LICENSE](LICENSE).