https://github.com/phoe/safe-read

A variant of READ secure against internbombing, excessive input and macro characters.
https://github.com/phoe/safe-read

Last synced: 9 months ago
JSON representation

A variant of READ secure against internbombing, excessive input and macro characters.

Host: GitHub
URL: https://github.com/phoe/safe-read
Owner: phoe
License: bsd-2-clause
Created: 2016-07-30T16:05:23.000Z (over 9 years ago)
Default Branch: master
Last Pushed: 2022-10-10T21:19:14.000Z (about 3 years ago)
Last Synced: 2025-01-06T03:23:32.523Z (10 months ago)
Language: Common Lisp
Size: 31.3 KB
Stars: 44
Watchers: 5
Forks: 4
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

trackawesomelist - safe-read (⭐45) - a variant of READ secure against internbombing, excessive input and macro characters. [BSD\_2Clause](https://directory.fsf.org/wiki/License:BSD_2Clause). (Recently Updated / [Apr 28, 2025](/content/2025/04/28/README.md))

README

          # SAFE-READ

The goal of this project is to create a wrapper around standard Lisp reader to make it able to read input from untrusted sources, such as internet sockets.

Example usage - creating a client-server communication protocol that is based on S-expressions. Using bare READ on both server and client allows the malicious client/server to execute any code on any other networked clients/servers.

As of now, this repository includes variant of READ secure against internbombing, excessive input and macro characters.

### Rough definitions

* **Internbombing** - the behaviour of READ function, which automatically interns every unknown symbol it doesn't know. This can lead to namespace pollution and blowing the heap with input like `(symbol1 symbol2 ... symbol9999999999)`.

* **Excessive input** - causing an out of memory error simply by feeding the READ function a large enough S-expression.

* **Macro characters** - functioning of character macros within Lisp syntax, both standard and implementation-defined. They create simple dangers such as `#.(let (memleak) (loop (setf memleak (cons memleak memleak))))` along with more subtle ones that are not listed here.

### Function SAFE-READ

_`&optional (stream *standard-input*) use-list` **→** `s-expression error-status`_

  * `STREAM` - a stream to read from.

  * `USE-LIST` - a list of package designators. Reading will occur in a temporary package that uses these packages.

  * `S-EXPRESSION` - the read S-expression or NIL if reading was impossible.

  * `ERROR-STATUS` - one of `:INCOMPLETE-INPUT :MALFORMED-INPUT :INPUT-SIZE-EXCEEDED` or NIL in case of success.

### Variable *MAX-INPUT-SIZE*

  * Input will not be read beyond this size; SAFE-READ will return `:INPUT-SIZE-EXCEEDED` as its second value.

  * Defaults to 128 kB.

### Details

  * Only lists are accepted as input. Atoms produce `:MALFORMED-INPUT`.

  * All macro characters other than `#\(` and `#\"` produce `:MALFORMED-INPUT`. (`#\)` is only accepted at the end of a list.)

  * Package-qualified names (including keywords) produce `:MALFORMED-INPUT`.

  * All symbols read are non-interned, with the exceptions of symbols exported from the packages listed in `USE-LIST` which are interned in these respective packages.

  * Reading always stops on EOF and newlines.

  * Reading from each stream is buffered, meaning that subsequent calls to SAFE-READ can produce a valid S-expression if it spans over multiple lines. An example is a list containing a string containing a newline.

  * Limitation: no new S-expression must begin on the line where the previous one ended as they will be ignored.

### TODO

  * `make it possible to signal conditions instead of relying on the second return value.`

  * `make it possible to read multiple expressions without newlines`

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/phoe/safe-read

Awesome Lists containing this project

README