Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/cjauvin/little_pger
🐘 PostgreSQL query building with plain data structures.
https://github.com/cjauvin/little_pger
postgresql psycopg2 python sql sql-query
Last synced: about 2 months ago
JSON representation
🐘 PostgreSQL query building with plain data structures.
- Host: GitHub
- URL: https://github.com/cjauvin/little_pger
- Owner: cjauvin
- License: other
- Created: 2011-09-18T20:35:48.000Z (over 13 years ago)
- Default Branch: master
- Last Pushed: 2018-03-30T16:57:09.000Z (almost 7 years ago)
- Last Synced: 2024-10-28T17:19:39.330Z (2 months ago)
- Topics: postgresql, psycopg2, python, sql, sql-query
- Language: Python
- Homepage:
- Size: 47.9 KB
- Stars: 49
- Watchers: 4
- Forks: 3
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
README
# Little_PGer.py
## What is it?
It's a thin layer just a tad above SQL, for use with Postgres and
[psycopg2](http://www.initd.org/psycopg/), when you want to wrap
queries in a convenient way, using plain data structures (but you
don't feel like using an ORM, for some reason).## Why?
Of course `psycopg2` already does a very fine job on its own, but in
the context of webapp backend development, I often found myself
wanting for an extra-frictionless way of shuffling around Ajax/JSON
data. As composing raw SQL queries quickly induces string-manipulation
fatigue, I gradually evolved `little_pger` for that simple purpose.If you want to know more about it, I have also discussed its use in
some particular contexts, on my blog:*
*
## To install
$ pip install little_pger
or
$ pip install -e [email protected]:cjauvin/little_pger.git#egg=little_pger
Note that `psycopg2` will be automatically installed if it isn't
already.## Testing it with this README document
Note that this `README.md` file can be executed as a test suite. To do
so, simply create a dummy database (that you can destroy afterward):$ createdb little_pger_test -U
and set this variable appropriately for your setup:
```python
>>> pg_user = '' # an empty string works when the OS and PG users share the same name```
Then simply execute the script with:
$ python -m doctest -f -v README.md
Let's go!
```python
>>> from little_pger import LittlePGer```
The first and mandatory parameter to a `LittlePGer` object is a
connection, either as a string or as a `psycopg2` object (resulting
from `psycopg2.connect`). A `LittlePGer` object can be used in two
ways. The first is as a context manager, which implies that the
transaction is encapsulated under the `with` statement (with a
`rollback` or `commit` performed automatically at exit):```python
>>> conn_str = 'dbname=little_pger_test user={}'.format(pg_user)
>>> with LittlePGer(conn=conn_str, commit=False) as pg:
... _ = pg.pg_version # (9, 5, 0) for me, perhaps not for you```
You can also use it without the context manager:
```python
>>> pg = LittlePGer(conn=conn_str, commit=False)
>>> _ = pg.pg_version # (9, 5, 0) for me, perhaps not for you```
in which case you are in charge of managing the transaction
yourself. In this document we will not use the context manager because
it makes things easier on the eyes.## Insert and update
Suppose we have two SQL tables:
```python
>>> pg.sql("""
... create table book (
... book_id serial primary key,
... author_id int,
... title text,
... n_pages int,
... topics text[]
... )
... """)>>> pg.sql("""
... create table author (
... author_id serial primary key,
... name text
... )
... """)```
you can `insert` a new `book`, along with its `author`:
```python
>>> book = pg.insert('book', values={'title': 'PG is Fun!'})
>>> author = pg.insert('author', values={'name': 'Joe Foo', 'author_id': 100})```
and `update` it:
```python
>>> book = pg.update(
... 'book', set={'author_id': author['author_id'], 'n_pages': 200},
... where={'book_id': book['book_id']}
... )
>>> sorted(book.items()) # just to clamp the field order
[('author_id', 100), ('book_id', 1), ('n_pages', 200), ('title', 'PG is Fun!'), ('topics', None)]```
As shown above, `insert` and `update` by default return a `dict`
record. However, `insert` has a convenient `return_id` keyword
argument, which means that the primary key value of the newly
created record should be returned directly:```python
>>> pg.insert(
... 'book', values={'title': 'Python and PG, a Love Story'},
... return_id=True
... )
2```
## Upsert
Even though `upsert` only appeared recently (with PG 9.5),
`little_pger` supports it for every version of PG, with a "fake
implementation" (i.e. check existence, then insert or update
accordingly) in the cases where it is not natively supported (and when
it is, a "real" implementation is used). Both implementations are
simplified versions where the primary key is implicitly used to
determine uniqueness.```python
>>> # does not yet exist, will be created
>>> book_id = pg.upsert('book', set={'title': 'A Boring Story'}, return_id=True)
>>> book_id
3>>> # already exists, will be updated
>>> book = pg.upsert('book', values={'n_pages': 123, 'book_id': book_id})
>>> book_id, book['book_id']
(3, 3)```
`insert`, `update` and `upsert` all have a convenient `filter_values`
parameter which, if used, will remove any item in the `values` dict
that doesn't belong to the target table. Without it here, an exception
would be thrown, as the `book` table does not have a `publisher`
column:```python
>>> _ = pg.upsert(
... 'book', filter_values=True,
... values={'book_id': book_id, 'publisher': 'Joe North'}
... )```
## Select
To `select` all books:
```python
>>> books = pg.select('book')
>>> len(books)
3```
or a particular book:
```python
>>> books = pg.select('book', where={'book_id': book_id})
>>> len(books)
1```
or:
```python
>>> book = pg.select1('book', where={'book_id': book_id})
>>> type(book)```
It's easy to (inner) join books and authors:
```python
>>> book = pg.select1(
... 'book', join='author', where={'book_id': 1}
... )
>>> sorted(book.items()) # just to clamp the field order
[('author_id', 100), ('book_id', 1), ('n_pages', 200), ('name', 'Joe Foo'), ('title', 'PG is Fun!'), ('topics', None)]```
or left join them:
```python
>>> book_author = pg.select1(
... 'book', left_join='author', where={'book_id': 2}
... )
>>> sorted(book_author.items()) # just to clamp the field order
[('author_id', None), ('book_id', 2), ('n_pages', None), ('name', None), ('title', 'Python and PG, a Love Story'), ('topics', None)]```
Using a `tuple` value in the `where` clause:
```python
>>> books = pg.select('book', where={'book_id': (1, 2, 3)})
>>> len(books)
3```
translates to a SQL query using the `in` operator:
```sql
select * from book where book_id in (1, 2, 3)
```Make sure that you do not use `tuple`s and `list`s interchangeably
when working with `psycopg2` and `little_pger`, as they are used for
very different purposes. Python arrays translate into PG arrays (note
that the `book.topics` column has type `text[]`):```python
>>> book = pg.update(
... 'book', set={'topics': ['database', 'programming']},
... where={'book_id': 1}
... )
>>> book['topics']
['database', 'programming']```
You can use operators other than `=`, like this:
```python
>>> books = pg.select('book', where={('book_id', '<='): 2})
>>> len(books)
2```
Using a `set` (instead of a `tuple` or a `list`) will result in a
third type of semantics:```python
>>> pg.select1(
... 'book', where={('title', 'like'): {'%PG%', '%Fun%'}}
... )['title']
'PG is Fun!'```
which translates to:
```sql
select * from book where title like '%PG%' and title like '%Fun%'
```which can be a powerful way to implement an autocomplete mechanism,
[as I explain in more details elsewhere](http://cjauvin.blogspot.ca/2012/10/a-tribute-to-unsung-pattern.html).Until now we have assumed `*` selection, but the `what` keyword allows
for more flexibility:```python
>>> res = pg.select(
... 'book', what={'*':1, 'title is not null': 'has_title'}
... )
>>> [book['has_title'] for book in res]
[True, True, True]```
Similarly:
```python
>>> res = pg.select(
... 'book', left_join='author',
... what=['name', 'count(*)'],
... group_by='name', order_by='count desc'
... )
>>> res[0]['name'], int(res[0]['count'])
(None, 2)
>>> res[1]['name'], int(res[1]['count'])
('Joe Foo', 1)```
## Delete
The `delete` function includes an option to "tighten" the
primary key sequence, to make sure that if you delete a row with some
ID that is the maximum one currently existing, it will be reused the
next time you create a new row (in other words: it prevents "gaps" in
the ID sequences).Without `tighten_sequence`:
```python
>>> pg.delete('book', where={'book_id': 3})
>>> pg.insert('book', return_id=True)
4```
With it:
```python
>>> pg.delete('book', where={'book_id': 4}, tighten_sequence=True)
>>> pg.insert('book', return_id=True)
3```