Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/neptune-ai/neptune-r
📒 The MLOps stack component for experiment tracking (R interface)
https://github.com/neptune-ai/neptune-r
compare language log management metadata metrics mlops models monitoring organize parameters r store tracker visualization
Last synced: 3 months ago
JSON representation
📒 The MLOps stack component for experiment tracking (R interface)
- Host: GitHub
- URL: https://github.com/neptune-ai/neptune-r
- Owner: neptune-ai
- License: apache-2.0
- Created: 2020-03-01T11:14:49.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2023-10-05T09:58:47.000Z (over 1 year ago)
- Last Synced: 2024-09-23T04:38:22.135Z (4 months ago)
- Topics: compare, language, log, management, metadata, metrics, mlops, models, monitoring, organize, parameters, r, store, tracker, visualization
- Language: R
- Homepage: https://docs.neptune.ai/integrations/r/
- Size: 194 KB
- Stars: 14
- Watchers: 11
- Forks: 6
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
## What is neptune.ai?
Neptune is a lightweight experiment tracker for ML teams that struggle with debugging and reproducing experiments, sharing results, and messy model handover. **It offers a single place to track, compare, store, and collaborate on experiments and models.**
With Neptune, Data Scientists can develop production-ready models faster, and ML Engineers can access model artifacts instantly in order to deploy them to production.
ÂWatch a 3min explainer video →
ÂWatch a 20min product demo →
Â# Getting started
## Register
[Sign up for Neptune](https://neptune.ai).You can use Neptune for free for work, research, and personal projects. Individual accounts are free, and if you're interested in using Neptune as a team, [check available plans](https://neptune.ai/pricing).
## Install Neptune R package
If you don't have Python installed and just want to use Neptune, simply run the following code:
```R
install.packages("reticulate")
library(reticulate)
install_miniconda()install.packages("neptune")
library(neptune)
neptune_install()
```This code will install [miniconda](https://docs.conda.io/en/latest/miniconda.html) (a minimalistic Python environment) and set it up for you. This is a one-time thing and after that, you only need to run the last two lines when you want to work with Neptune:
```R
library(neptune)
neptune_install()
```If you have a Python virtual environment already set up (conda, miniconda, or virtualenv), you can point to it instead of creating a new one:
```R
# If you are using virtualenv
install.packages("neptune")
library(neptune)
neptune_install(method="virtualenv", envname = "PATH/TO/YOUR/VIRTUALENV")# If you are using conda or miniconda
install.packages("neptune")
library(neptune)
neptune_install(method="conda", envname = "PATH/TO/YOUR/CONDA/ENVIRONMENT")
```## Create a tracked run
```R
run <- neptune_init(project="MY_WORKSPACE/MY_PROJECT",
api_token="NEPTUNE_API_TOKEN")
```This code creates a run in the specified project. This will be your gateway to logging metadata to Neptune.
You need to pass your credentials (project and API token) to the `neptune_init()` method. You can also set the API token globally:
```R
neptune_set_api_token(token = "NEPTUNE_API_TOKEN")
```### API token
To find your API token:
* Go to the Neptune UI
* Open the **User menu** toggle in the upper right
* Click **Get your API token**
* Copy your API tokenor [get your API token directly from here](https://app.neptune.ai/get_my_api_token).
### Project
The project argument has the format `WORKSPACE_NAME/PROJECT_NAME`.
To find it:
* Go to the Neptune UI
* Go to your project
* Open **Settings** > **Properties**
* Copy the project name### Stop tracking
Once you are finished with tracking metadata you need to stop the tracking for that particular run:
```R
neptune_stop(run)# Note that you can also use reticulate based syntax:
run$stop()
```If you are running a script it will stop tracking automatically at the end. However, in an interactive environment such as RStudio, you need to stop it explicitly.
## Track metadata
### Log hyperparameters
```R
params <- list(
"dense_units"= 128,
"activation"= "relu",
"dropout"= 0.23,
"learning_rate"= 0.15,
"batch_size"= 64,
"n_epochs"= 30
)
run["parameters"] <- params
```If you have parameters in form of a dictionary you can log them to Neptune in batch. It will create a field with the appropriate type for each dictionary entry.
You can update the hyperparameters or add new ones later in the code:```R
# Add additional parameters
run["model/parameters/seed"] <- .Random.seed# Update parameters e.g. after triggering an early stopping
run["model/parameters/n_epochs"] <- epoch
```### Log training metrics
```R
for (i in 1:epochs) {
[...] # My training loop
neptune_log(run["train/epoch/loss"], loss)
neptune_log(run["train/epoch/accuracy"], acc)
}# Note that you can also use reticulate based syntax:
run["train/epoch/loss"]$log(loss)
```You can log training metrics to Neptune using series fields. In Neptune, there are three types of series - float series, string series, and file series. Each `neptune_append()` will add a new value at the end of the series.
### Log evaluation results
```R
run["evaluation/accuracy"] <- eval_acc
run["evaluation/loss"] <- eval_loss
```To log evaluation metrics simply assign them to a field of your choice. Using the snippet above, both evaluation metrics will be stored in the same evaluation namespace.
```R
neptune_upload(run["evaluation/ROC"], "roc.png")# You can upload ggplot plots directly without saving them to a file
neptune_upload(run["evaluation/ROC"], ggplot_roc)# If you want to control additional parameters like size of the plot you can pass the same arguments as to ggsave
neptune_upload(run["evaluation/ROC"], ggplot_roc, width=20, height=20, units="cm")# Note that you can also use reticulate based syntax:
run["evaluation/ROC"]$upload("roc.png")
run["evaluation/ROC"]$upload(ggplot_roc)
```You can log plots and charts easily using the `neptune_upload()` function. In the case of a ggplot object, it gets converted to an image file and uploaded, but you can also upload images from the local disc.
### Upload model file
You can upload any binary file (e.g. model weights) from disk using the `neptune_upload()` method. If your model is saved as multiple files you can upload a whole folder as a `FileSet` using `neptune_upload_files()`.
```R
# Upload a single fingle sile
neptune_upload(run["model"], "model.Rdata")# You can also upload folders in batch if you don't need access to the separate files
neptune_upload_files(run["model"], "models")# Note that you can also use reticulate based syntax:
run["model"]$upload("model.Rdata")
run["model"]$upload_files("models")
```# Getting help
If you got stuck or simply want to talk to us, here are your options:
* Check the [Neptune R client package documentation](https://docs.neptune.ai/integrations/r/)
* Check our [FAQ page](https://docs.neptune.ai/getting_help)
* You can submit bug reports, feature requests, or contributions directly to the repository.
* Chat! When in the Neptune application, click on the blue message icon in the bottom-right corner and send a message. A real person will talk to you ASAP (typically very ASAP),
* You can just shoot us an email at [email protected]