Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/anowell/are-we-learning-yet
How ready is Rust for Machine Learning?
https://github.com/anowell/are-we-learning-yet
machine-learning
Last synced: 4 days ago
JSON representation
How ready is Rust for Machine Learning?
- Host: GitHub
- URL: https://github.com/anowell/are-we-learning-yet
- Owner: anowell
- License: cc-by-4.0
- Created: 2016-06-22T04:11:18.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2024-12-15T23:41:34.000Z (9 days ago)
- Last Synced: 2024-12-16T20:07:26.754Z (8 days ago)
- Topics: machine-learning
- Language: Rust
- Homepage: http://arewelearningyet.com
- Size: 10.3 MB
- Stars: 456
- Watchers: 23
- Forks: 64
- Open Issues: 17
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# are-we-learning-yet
**Rust is a systems programming language, but is it a machine learning language?**
[arewelearningyet.com](http://arewelearningyet.com)
Inspired by [Are We Web Yet?](http://arewewebyet.org/), this project aims to catalog the the Rust ML ecosystem.
## Contributing
Feedback, issues, and pull requests are welcome and appreciated for adding missing crates,
providing additional resources, or improving the content.### Running locally
- Requires [`cobalt >=0.17.5`](https://cobalt-org.github.io/)
- Recommend [`just`](https://github.com/casey/just) as a task runner (or using the commands in the [Justfile](Justfile))```
# GitHub OAuth token avoids 403 rate limiting errors while generated crate data
# Tip: set in `.env` file and `just` will pick it up automatically
export GITHUB_TOKEN=# Scrape crate/repo data for sitegen
just scrape# Start a dev server on port 3000
cobalt serve
```The site should be running on [localhost:3000](http://localhost:3000)
### How it works
The repo consists of 2 key parts:
- **the scraper tool** is responsible for reading [crates.yaml](_data/crates.yaml), fetching
additional metadata about the crate from crates.io and the GitHub API, and generating a score used for ordering crates.
The fetched data is cached in a `_tmp` directory to speed up repeated site generation and avoid abusing the APIs,
and the scraper outputs `_data/crates_generated.yaml` which is used by the cobalt site generation. To force regeneration
of all crate data, remove the cached data with `just clean`. For more details, see the [scraper README](_scraper).
- **the site content** is the rest of the repo which follows the layout established by [cobalt.rs](https://cobalt-org.github.io/).
Cobalt uses this content to generate a static site into a `_site` directory (`cobalt build`) and can also start a local development
server that rebuilds the site anytime content changes (`cobalt serve`). For more details, see
[cobalt.rs](https://cobalt-org.github.io/) documentation.Note: `cobalt serve` does not trigger the scraper to rerun when `crates.yaml` (or the scraper) changes. Currently you must rerun `just scrape`
to update `crates_generated.yaml`.### Publishing
[arewelearningyet.com](arewelearningyet.com) is served by Github Pages.
Every merge into master is automatically published by a [Github Actions job](.github/workflows).Additionally, to ensure crate statistics (download counts and stars)
are regularly updated, the publishing task is also run as a weekly cron job.