{"id":15291623,"url":"https://github.com/azure/azuredsvm","last_synced_at":"2025-10-07T04:31:22.634Z","repository":{"id":66003699,"uuid":"81512129","full_name":"Azure/AzureDSVM","owner":"Azure","description":"AzureDSVM is an R package that offers convenient harness of Azure DSVM, remote execution of scalable and elastic data science work, and monitoring of on-demand resource consumption. ","archived":true,"fork":false,"pushed_at":"2017-10-13T07:47:33.000Z","size":4565,"stargazers_count":17,"open_issues_count":10,"forks_count":14,"subscribers_count":77,"default_branch":"master","last_synced_at":"2025-10-06T00:36:00.013Z","etag":null,"topics":["azure","data-science","data-science-virtual-machine","r"],"latest_commit_sha":null,"homepage":"","language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Azure.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2017-02-10T01:18:40.000Z","updated_at":"2025-09-03T17:00:31.000Z","dependencies_parsed_at":"2023-02-28T03:01:12.376Z","dependency_job_id":null,"html_url":"https://github.com/Azure/AzureDSVM","commit_stats":null,"previous_names":[],"tags_count":3,"template":false,"template_full_name":null,"purl":"pkg:github/Azure/AzureDSVM","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Azure%2FAzureDSVM","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Azure%2FAzureDSVM/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Azure%2FAzureDSVM/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Azure%2FAzureDSVM/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Azure","download_url":"https://codeload.github.com/Azure/AzureDSVM/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Azure%2FAzureDSVM/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":278722482,"owners_count":26034458,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-07T02:00:06.786Z","response_time":59,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["azure","data-science","data-science-virtual-machine","r"],"created_at":"2024-09-30T16:13:35.221Z","updated_at":"2025-10-07T04:31:22.316Z","avatar_url":"https://github.com/Azure.png","language":"R","funding_links":[],"categories":[],"sub_categories":[],"readme":"# AzureDSVM\r\n\r\nThe AzureDSVM (Azure Data Science Virtual Machine) is an R Package for Data Scientists\r\nworking with the Azure compute platform as a complement to the\r\nunderlying AzureSMR for controlling [Azure Data Science Virtual Machines](https://docs.microsoft.com/en-us/azure/machine-learning/machine-learning-data-science-provision-vm).\r\n\r\nAzure Data Science Virtual Machine (DSVM) is a powerful data science development environment with pre-installed tools and packages that empower data scientists for convenient data wrangling, model building, and service deployment. \r\n\r\nThe R package of `AzureDSVM` aims at offering functions that can be conveniently used by R data scientists for operating and using Azure Data Science Virtual Machine (DSVM) elastically and economically within local R session. \r\n\r\nTo install the package from github:\r\n\r\n    devtools::install_github(\"Azure/AzureDSVM\")\r\n\r\nHelp pages are also provided for all functions within the\r\npackage. With RStudio for example type AzureDSVM into search when the\r\npackage is loaded to see a list of functions/help pages or else\r\n\r\n    library(help=AzureDSVM)\r\n\r\nNote: The package will work with any open source R Session or with\r\nMicrosoft R extensions.\r\n\r\n# Features\r\n\r\n* Elasiticity\r\n\r\n    * Deployment of a DSVM with customized information such as machine name, machine size (with compute/memory optimized general-purpose CPU, Nvidia K80/M60 GPU, etc.), operating system (Windows Server 2016, Ubunbut 16.04, and CentOS), authentication method (public key based or password based), etc.\r\n    * Enjoy all benefits of a Windows/Linux DSVM. E.g., all tools for data science work such as R/Python/Julia programming languages, SQL Server, Visual Studio with RTVS, etc., remote working environment via RStudio Server or Jupyter Notebook interface, and machine learning \u0026 artificial intelligence packages such as Microsoft CNTK, MXNet, and XGBoost.\r\n    * Execution of R analytics on DSVM(s) with various Microsoft R Server computing contexts such as \"local parallel\" and \"cluster parallel\".\r\n    * Seamless interaction with remote R Server session with `mrsdeploy` functions.\r\n    * Post-deployment installation of extension for customizing system environment, reinstalling/uninstalling software, etc.\r\n\r\n* Scalability \r\n\r\n    * Deployment of a collection of heterogeneous DSVMs for a group of data scientists.\r\n    * Scale up DSVM and form them into a cluster for parallel/distributed computation with Microsoft R Server backend. \r\n    \r\n* Usability\r\n\r\n    * Deploy, start, stop, and delete DSVM(s) on demand.\r\n    * Monitor data consumption and estimate expense of using DSVM(s) with hourly aggregation granularity.\r\n\r\n# Tutorials\r\n\r\nTo get started with this package, see the Vignettes:\r\n\r\n* [Get started](https://github.com/Azure/AzureDSVM/blob/master/vignettes/00Introduction.Rmd)\r\n* [Deployment of a single DSVM](https://github.com/Azure/AzureDSVM/blob/master/vignettes/10Deploy.Rmd)\r\n* [Deployment of multiple DSVMs](https://github.com/Azure/AzureDSVM/blob/master/vignettes/20Multi.Rmd)\r\n* [Do computation on a single DSVM or a cluster of DSVMs](https://github.com/Azure/AzureDSVM/blob/master/vignettes/30Compute.Rmd)\r\n* [Monitor data consumption and expense spent on using DSVM(s)](https://github.com/Azure/AzureDSVM/blob/master/vignettes/40Cost.Rmd)\r\n* Putting all together\r\n    * [Use case - k-means clustering](https://github.com/Azure/AzureDSVM/blob/master/vignettes/60Kmeans.Rmd)\r\n    * [Use case - Hot spots analysis](https://github.com/Azure/AzureDSVM/blob/master/vignettes/70Hotspot.Rmd)\r\n    * [Use case - Binary classification](https://github.com/Azure/AzureDSVM/blob/master/vignettes/80ModelSelect.Rmd)\r\n\r\n# Code of Conduct\r\n\r\nThis project has adopted the [Microsoft Open Source Code of\r\nConduct](https://opensource.microsoft.com/codeofconduct/).\r\nFor more information see the [Code of Conduct\r\nFAQ](https://opensource.microsoft.com/codeofconduct/faq/) or\r\ncontact [opencode@microsoft.com](mailto:opencode@microsoft.com)\r\nwith any additional questions or comments.\r\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fazure%2Fazuredsvm","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fazure%2Fazuredsvm","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fazure%2Fazuredsvm/lists"}