Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/microsoft/sqlmlutils
Utility functions for easier usage of SQL Server Machine Learning Services
https://github.com/microsoft/sqlmlutils
python r sql-server sqlserver
Last synced: 7 days ago
JSON representation
Utility functions for easier usage of SQL Server Machine Learning Services
- Host: GitHub
- URL: https://github.com/microsoft/sqlmlutils
- Owner: microsoft
- License: other
- Created: 2018-09-24T18:57:37.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2024-07-12T20:52:03.000Z (6 months ago)
- Last Synced: 2024-12-30T04:51:58.691Z (21 days ago)
- Topics: python, r, sql-server, sqlserver
- Language: R
- Homepage:
- Size: 4.07 MB
- Stars: 33
- Watchers: 11
- Forks: 32
- Open Issues: 13
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Codeowners: CODEOWNERS
- Security: SECURITY.md
Awesome Lists containing this project
- jimsghstars - microsoft/sqlmlutils - Utility functions for easier usage of SQL Server Machine Learning Services (R)
README
# sqlmlutils
[![BuildAndTest](https://github.com/microsoft/sqlmlutils/actions/workflows/ci.yaml/badge.svg)](https://github.com/microsoft/sqlmlutils/actions/workflows/ci.yaml)
sqlmlutils is a package designed to help users interact with SQL databases (SQL Server and Azure SQL Database) and execute R or Python code in SQL from an R/Python client.
Currently, only the R version of sqlmlutils is supported in Azure SQL Database. Python support will be added later.### Check out the README in each language folder for language-specific details and code examples!
# Installation
To install sqlmlutils, follow the instructions below for Python and R, respectively.
Python:
To install from PyPI:
Run
```bash
pip install sqlmlutils
```
To install from file, download the latest release from https://github.com/microsoft/sqlmlutils/releases:
```bash
pip install sqlmlutils-1.1.0.zip
```R:
Download the latest release from https://github.com/microsoft/sqlmlutils/releases.
Windows:
To obtain the version of R your server is currently using, please use this query:
```tsql
EXEC sp_execute_external_script
@language = N'R',
@script = N'
v = R.version
OutputDataSet = data.frame(rversion=paste0(v$major, ".", v$minor))',
@input_data_1 = N'select 1'
WITH RESULT SETS ((rversion varchar(max)));
```
Get the version of R which the server is using and install it locally. Then, run the following commands with the same version of R.From command prompt, run
```bash
R.exe -e "install.packages('odbc', type='binary')"
R.exe CMD INSTALL sqlmlutils_1.0.0.zip
```
OR
To build a new package file and install, run
```bash
.\buildandinstall.cmd
```Linux
```bash
R.exe -e "install.packages('odbc')"
R.exe CMD INSTALL sqlmlutils_1.0.0.tar.gz
```# Details
sqlmlutils contains 3 main parts:
- Execution of Python/R in SQL databases using sp_execute_external_script
- Creation and execution of stored procedures created from scripts and functions
- Install and manage packages in SQL databasesFor more specifics and examples of how to use each language's API, look at the README in the respective folder.
## Execute in SQL
Execute in SQL provides a convenient way for the user to execute arbitrary Python/R code inside a SQL database using an sp_execute_external_script. The user does not have to know any t-sql to use this function. Function arguments are serialized into binary and passed into the t-sql script that is generated. Warnings and printed output will be printed at the end of execution, and any results returned by the function will be passed back to the client.
## Stored Procedures (Sprocs)
The goal of this utility is to allow users to create and execute stored procedures on their database without needing to know the exact syntax of creating one. Functions and scripts are wrapped into a stored procedure and registered into a database, then can be executed from the Python/R client.
## Package Management
##### R and Python package management with sqlmlutils is supported in SQL Server 2019 CTP 2.4 and later.
With package management users can install packages to a remote SQL database from a client machine. The packages are downloaded on the client and then sent over to SQL databases where they will be installed into library folders. The folders are per-database so packages will always be installed and made available for a specific database. The package management APIs provided a PUBLIC and PRIVATE folders. Packages in the PUBLIC folder are accessible to all database users. Packages in the PRIVATE folder are only accessible by the user who installed the package.