Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/gvanrossum/patma

Pattern Matching
https://github.com/gvanrossum/patma

Last synced: 30 days ago
JSON representation

Pattern Matching

Awesome Lists containing this project

README

        

Pattern Matching
================

[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/gvanrossum/patma/master?urlpath=lab/tree/playground-622.ipynb)

This repo contains an issue tracker, examples, and early work related to
[PEP 622: Structural Pattern Matching](https://www.python.org/dev/peps/pep-0622). The current version of the proposal is
[PEP 634](https://www.python.org/dev/peps/pep-0634),
which was **accepted** by the Steering Council on February 8, 2021.
The motivation and rationale are written up in
[PEP 635](https://www.python.org/dev/peps/pep-0635),
and a tutorial is in [PEP 636](https://www.python.org/dev/peps/pep-0636).
The tutorial below is also included in PEP 636 as
[Appendix A](https://www.python.org/dev/peps/pep-0636/#appendix-a-quick-intro).

Updates to the PEPs should be made in the
[PEPs repo](https://github.com/python/peps).

Origins
-------

The work has several origins:

- Many statically compiled languages (especially functional ones) have
a `match` expression, for example
[Scala](http://www.scala-lang.org/files/archive/spec/2.11/08-pattern-matching.html),
[Rust](https://doc.rust-lang.org/reference/expressions/match-expr.html),
[F#](https://docs.microsoft.com/en-us/dotnet/fsharp/language-reference/pattern-matching);
- Several extensive discussions on python-ideas, culminating in a
summarizing
[blog post](https://tobiaskohn.ch/index.php/2018/09/18/pattern-matching-syntax-in-python/)
by Tobias Kohn;
- An independently developed [draft
PEP](https://github.com/ilevkivskyi/peps/blob/pattern-matching/pep-9999.rst)
by Ivan Levkivskyi.

Implementation
--------------

A full reference implementation written by Brandt Bucher is available
as a [fork](https://github.com/brandtbucher/cpython/tree/patma) of
the CPython repo. This is readily converted to a [pull
request](https://github.com/brandtbucher/cpython/pull/2).

For those who prefer not to build a CPython binary from source there's
a Binder playground -- click the button at the top of this readme.

Examples
--------

Some [example code](examples/) is available from this repo.

Tutorial
--------

A `match` statement takes an expression and compares it to successive
patterns given as one or more `case` blocks. This is superficially
similar to a `switch` statement in C, Java or JavaScript (and many
other languages), but much more powerful.

The simplest form compares a subject value against one or more literals:

```py
def http_error(status):
match status:
case 400:
return "Bad request"
case 401:
return "Unauthorized"
case 403:
return "Forbidden"
case 404:
return "Not found"
case 418:
return "I'm a teapot"
case _:
return "Something else"
```

Note the last block: the "variable name" `_` acts as a *wildcard* and
never fails to match.

You can combine several literals in a single pattern using `|` ("or"):

```py
case 401|403|404:
return "Not allowed"
```

Patterns can look like unpacking assignments, and can be used to bind
variables:

```py
# The subject is an (x, y) tuple
match point:
case (0, 0):
print("Origin")
case (0, y):
print(f"Y={y}")
case (x, 0):
print(f"X={x}")
case (x, y):
print(f"X={x}, Y={y}")
case _:
raise ValueError("Not a point")
```

Study that one carefully! The first pattern has two literals, and can
be thought of as an extension of the literal pattern shown above. But
the next two patterns combine a literal and a variable, and the
variable *captures* a value from the subject (`point`). The fourth
pattern captures two values, which makes it conceptually similar to
the unpacking assignment `(x, y) = point`.

If you are using classes to structure your data (e.g. data classes)
you can use the class name followed by an argument list resembling a
constructor, but with the ability to capture variables:

```py
from dataclasses import dataclass

@dataclass
class Point:
x: int
y: int

def whereis(point):
match point:
case Point(0, 0):
print("Origin")
case Point(0, y):
print(f"Y={y}")
case Point(x, 0):
print(f"X={x}")
case Point():
print("Somewhere else")
case _:
print("Not a point")
```

We can use keyword parameters too. The following patterns are all
equivalent (and all bind the `y` attribute to the `var` variable):

```py
Point(1, var)
Point(1, y=var)
Point(x=1, y=var)
Point(y=var, x=1)
```

Patterns can be arbitrarily nested. For example, if we have a short
list of points, we could match it like this:

```py
match points:
case []:
print("No points")
case [Point(0, 0)]:
print("The origin")
case [Point(x, y)]:
print(f"Single point {x}, {y}")
case [Point(0, y1), Point(0, y2)]:
print(f"Two on the Y axis at {y1}, {y2}")
case _:
print("Something else")
```

We can add an `if` clause to a pattern, known as a "guard". If the
guard is false, `match` goes on to try the next `case` block. Note
that value capture happens before the guard is evaluated:

```py
match point:
case Point(x, y) if x == y:
print(f"Y=X at {x}")
case Point(x, y):
print(f"Not on the diagonal")
```

Several other key features:

- Like unpacking assignments, tuple and list patterns have exactly the
same meaning and actually match arbitrary sequences. An important
exception is that they don't match iterators or strings.
(Technically, the subject must be an instance of
`collections.abc.Sequence`.)

- Sequence patterns support wildcards: `[x, y, *rest]` and `(x, y,
*rest)` work similar to wildcards in unpacking assignments. The
name after `*` may also be `_`, so `(x, y, *_)` matches a sequence
of at least two items without binding the remaining items.

- Mapping patterns: `{"bandwidth": b, "latency": l}` captures the
`"bandwidth"` and `"latency"` values from a dict. Unlike sequence
patterns, extra keys are ignored. A wildcard `**rest` is also
supported. (But `**_` would be redundant, so it is not allowed.)

- Subpatterns may be captured using the `as` operator:

```py
case (Point(x1, y1), Point(x2, y2) as p2): ...
```

- Patterns may use named constants. These must be dotted names
to prevent them from being interpreted as capture variable:

```py
from enum import Enum
class Color(Enum):
RED = 0
GREEN = 1
BLUE = 2

match color:
case Color.RED:
print("I see red!")
case Color.GREEN:
print("Grass is green")
case Color.BLUE:
print("I'm feeling the blues :(")
```

- The literals `None`, `False` and `True` are treated specially:
comparisons to the subject are done using `is`. This:
```py
match b:
case True:
print("Yes!")
```
is exactly equivalent to this:
```py
if b is True:
print("Yes!")
```

- Classes may override the mapping from positional arguments to
attributes by setting a class variable `__match_args__`.
Read about it in the
[PEP](https://www.python.org/dev/peps/pep-0622/#special-attribute-match-args).