https://github.com/ekzhang/inline-sql
🪄 Inline SQL in any Python program
dsl duckdb olap python query-language sql
- Host: GitHub
- URL: https://github.com/ekzhang/inline-sql
- Owner: ekzhang
- License: mit
- Created: 2022-10-06T15:32:20.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-02-17T00:50:09.000Z (10 months ago)
- Last Synced: 2024-10-13T11:24:57.122Z (2 months ago)
- Topics: dsl, duckdb, olap, python, query-language, sql
- Language: Python
- Homepage:
- Size: 36.1 KB
- Stars: 418
- Watchers: 8
- Forks: 8
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- stars - ekzhang/inline-sql - 🪄 Inline SQL in any Python program (Python)
README
# Inline SQL
[![PyPI - Version](https://img.shields.io/pypi/v/inline-sql.svg)](https://pypi.org/project/inline-sql)
[![PyPI - Python Version](https://img.shields.io/pypi/pyversions/inline-sql.svg)](https://pypi.org/project/inline-sql)

A simple embedded language for running inline SQL in Python programs.

```python
from inline_sql import sql, sql_val

assert sql_val^ "SELECT 1 + 1" == 2

x = 5
assert sql_val^ "SELECT $x * 2" == 10

df = sql^ "SELECT * FROM (VALUES (1, 10), (2, 20)) df (x, y)"
assert sql_val^ "SELECT SUM(x) + SUM(y) FROM df" == 33
```

Operations in the `inline_sql` library run directly inside your process. You can query local datasets (pandas frames), CSV files, and even interpolate variables seamlessly. This is implemented as a small wrapper around [DuckDB](https://duckdb.org/), so it is [extremely fast](https://duckdb.org/2021/05/14/sql-on-pandas.html).

## Installation
Supports Python 3.7+, tested on all major operating systems.
```console
pip install inline-sql
```

## Usage

The exported `sql` and `sql_val` variables are magic objects that can be used to run queries. Queries can read from local dataframes by name, and they can embed parameters using dollar-sign notation.
```python
>>> from inline_sql import sql, sql_val

>>> sql_val^ "SELECT 1 + 1"
2

>>> x = 5
>>> sql_val^ "SELECT 2 * $x"
10

>>> sql^ "SELECT * FROM 'disasters.csv' LIMIT 5"
                  Entity  Year   Deaths
0  All natural disasters  1900  1267360
1  All natural disasters  1901   200018
2  All natural disasters  1902    46037
3  All natural disasters  1903     6506
4  All natural disasters  1905    22758

>>> disasters = sql^ "SELECT * FROM 'disasters.csv'"

>>> def total_deaths(entity: str) -> float:
...     return sql_val^ "SELECT SUM(deaths) FROM disasters WHERE Entity = $entity"
...

>>> total_deaths("Drought")
11731294.0

>>> total_deaths("Earthquake")
2576801.0
```

You can run any SQL query as described in the [DuckDB documentation](https://duckdb.org/docs/guides/).

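To make the "magic object" idea from this section more concrete, here is a minimal sketch of how an object can support the `obj ^ "..."` syntax and `$name` parameters. It is not the library's implementation: it uses the standard library's `sqlite3` instead of DuckDB, returns plain row tuples instead of DataFrames, and the names `InlineQuery`, `q`, and `q_val` are hypothetical. The assumption is that parameters are looked up in the caller's scope and bound as query parameters, not pasted into the SQL text.

```python
import inspect
import re
import sqlite3

# Matches `$name` placeholders. This naive pattern would also match inside
# SQL string literals, which a real implementation would need to handle.
_PARAM = re.compile(r"\$([A-Za-z_][A-Za-z0-9_]*)")


class InlineQuery:
    """Hypothetical stand-in that runs `obj ^ "SELECT ..."` on in-memory SQLite."""

    def __init__(self, scalar: bool) -> None:
        self.scalar = scalar

    def __xor__(self, query: str):
        # Look one frame up to find the variables visible at the call site.
        caller = inspect.currentframe().f_back
        scope = {**caller.f_globals, **caller.f_locals}

        # Rewrite `$name` to SQLite's `:name` placeholder and collect the
        # values, so they are bound as parameters by the database driver.
        params = {name: scope[name] for name in _PARAM.findall(query)}
        bound_sql = _PARAM.sub(r":\1", query)

        conn = sqlite3.connect(":memory:")
        try:
            rows = conn.execute(bound_sql, params).fetchall()
        finally:
            conn.close()

        # Scalar mode returns the first column of the first row.
        return rows[0][0] if self.scalar else rows


q = InlineQuery(scalar=False)      # plays the role of `sql`
q_val = InlineQuery(scalar=True)   # plays the role of `sql_val`

x = 5
doubled = q_val ^ "SELECT $x * 2"
pairs = q ^ "SELECT $x AS a, $x + 1 AS b"
```

Because `^` binds more tightly than `==` in Python, an expression such as `q_val ^ "SELECT 1 + 1" == 2` compares the query result to `2`, which is why the assertions in the examples above need no parentheses.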
## Library Use
You can use `inline_sql` as a library. Since results from queries are ordinary `pandas.DataFrame` objects, they work in functions and application code. Here's a longer example:
```python
import pandas as pd
from inline_sql import sql, sql_val

def head_data(count: int) -> pd.DataFrame:
    return sql^ "SELECT * FROM 'cars.csv' LIMIT $count"

cars = head_data(50)

# Alias the aggregate so that ORDER BY can refer to it by name.
origin_counts = sql^ """
    SELECT origin, COUNT(*) AS count FROM cars
    GROUP BY origin
    ORDER BY count DESC
"""
print(origin_counts)

most_common = origin_counts.origin[0]
print(sql_val^ """
    SELECT AVG(horsepower) FROM cars
    WHERE origin = $most_common
""")
```

In general, `sql_val` is used to run scalar queries, while `sql` is used to run queries that return tables.

## Acknowledgements
Created by Eric Zhang ([@ekzhang1](https://twitter.com/ekzhang1)). Licensed under the [MIT license](LICENSE).