Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/pointlander/peg
Peg, Parsing Expression Grammar, is an implementation of a Packrat parser generator.
https://github.com/pointlander/peg
Last synced: 2 months ago
JSON representation
Peg, Parsing Expression Grammar, is an implementation of a Packrat parser generator.
- Host: GitHub
- URL: https://github.com/pointlander/peg
- Owner: pointlander
- License: bsd-3-clause
- Created: 2010-04-25T21:20:46.000Z (over 14 years ago)
- Default Branch: master
- Last Pushed: 2024-07-08T13:47:56.000Z (6 months ago)
- Last Synced: 2024-08-02T03:08:43.739Z (5 months ago)
- Language: Go
- Homepage:
- Size: 681 KB
- Stars: 997
- Watchers: 30
- Forks: 120
- Open Issues: 33
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Authors: AUTHORS
Awesome Lists containing this project
- awesome-go - peg - Peg, Parsing Expression Grammar, is an implementation of a Packrat parser generator. - ★ 527 (Software Packages)
- awesome-go-extra - peg - 04-25T21:20:46Z|2021-08-22T22:12:48Z| (Go Tools / Other Software)
- awesome-go - peg - Peg, Parsing Expression Grammar, is an implementation of a Packrat parser generator. Stars:`1.0K`. (Software Packages / Other Software)
README
# PEG, an Implementation of a Packrat Parsing Expression Grammar in Go
[![GoDoc](https://godoc.org/github.com/pointlander/peg?status.svg)](https://godoc.org/github.com/pointlander/peg)
[![Go Report Card](https://goreportcard.com/badge/github.com/pointlander/peg)](https://goreportcard.com/report/github.com/pointlander/peg)
[![Coverage](https://gocover.io/_badge/github.com/pointlander/peg)](https://gocover.io/github.com/pointlander/peg)A [Parsing Expression Grammar](http://en.wikipedia.org/wiki/Parsing_expression_grammar) ( hence `peg`) is a way to create grammars similar in principle to [regular expressions](https://en.wikipedia.org/wiki/Regular_expression) but which allow better code integration. Specifically, `peg` is an implementation of the [Packrat](https://en.wikipedia.org/wiki/Parsing_expression_grammar#Implementing_parsers_from_parsing_expression_grammars) parser generator originally implemented as [peg/leg](https://www.piumarta.com/software/peg/) by [Ian Piumarta](https://www.piumarta.com/cv/) in C. A Packrat parser is a "descent recursive parser" capable of backtracking and negative look-ahead assertions which are problematic for regular expression engines .
## See Also
*
*
*## Installing
`go get -u github.com/pointlander/peg`
## Building
### Using Pre-Generated Files
`go install`
### Generating Files Yourself
You should only need to do this if you are contributing to the library, or if something gets messed up.`go run build.go` or `go generate`
With tests:
`go run build.go test`
## Usage
```
peg []...Usage of peg:
-inline
parse rule inlining
-noast
disable AST
-output string
specify name of output file
directly dump the syntax tree
-strict
treat compiler warnings as errors
-switch
replace if-else if-else like blocks with switch blocks
-syntax
print out the syntax tree
-version
print the version and exit
```## Sample Makefile
This sample `Makefile` will convert any file ending with `.peg` into a `.go` file with the same name. Adjust as needed.
```make
.SUFFIXES: .peg .go.peg.go:
peg -noast -switch -inline -strict -output $@ $<all: grammar.go
```Use caution when picking your names to avoid overwriting existing `.go` files. Since only one PEG grammar is allowed per Go package (currently) the use of the name `grammar.peg` is suggested as a convention:
```
grammar.peg
grammar.go
```## PEG File Syntax
First declare the package name and any import(s) required:
```
packageimport
```Then declare the parser:
```
type Peg {
}
```Next declare the rules. Note that the main rules are described below but are based on the [peg/leg rules](https://www.piumarta.com/software/peg/peg.1.html) which provide additional documentation.
The first rule is the entry point into the parser:
```
<-
```The first rule should probably end with `!.` to indicate no more input follows.
```
first <- . !.
```This is often set to `END` to make PEG rules more readable:
```
END <- !.
````.` means any character matches. For zero or more character matches, use:
```
repetition <- .*
```For one or more character matches, use:
```
oneOrMore <- .+
```For an optional character match, use:
```
optional <- .?
```If specific characters are to be matched, use single quotes:
```
specific <- 'a'* 'bc'+ 'de'?
```This will match the string `"aaabcbcde"`.
For choosing between different inputs, use alternates:
```
prioritized <- 'a' 'a'* / 'bc'+ / 'de'?
```This will match `"aaaa"` or `"bcbc"` or `"de"` or `""`. The matches are attempted in order.
If the characters are case insensitive, use double quotes:
```
insensitive <- "abc"
```This will match `"abc"` or `"Abc"` or `"ABc"` and so on.
For matching a set of characters, use a character class:
```
class <- [a-z]
```This will match `"a"` or `"b"` or all the way to `"z"`.
For an inverse character class, start with a caret:
```
inverse <- [^a-z]
```This will match anything but `"a"` or `"b"` or all the way to `"z"`.
If the character class is case insensitive, use double brackets:
```
insensitive <- [[A-Z]]
```(Note that this is not available in regular expression syntax.)
Use parentheses for grouping:
```
grouping <- (rule1 / rule2) rule3
```For looking ahead a match (predicate), use:
```
lookAhead <- &rule1 rule2
```For inverse look ahead, use:
```
inverse <- !rule1 rule2
```Use curly braces for Go code:
```
gocode <- { fmt.Println("hello world") }
```For string captures, use less than and greater than:
```
capture <- <'capture'> { fmt.Println(text) }
```Will print out `"capture"`. The captured string is stored in `buffer[begin:end]`.
## Testing Complex Grammars
Testing a grammar usually requires more than the average unit testing with multiple inputs and outputs. Grammars are also usually not for just one language implementation. Consider maintaining a list of inputs with expected outputs in a structured file format such as JSON or YAML and parsing it for testing or using one of the available options for Go such as Rob Muhlestein's [`tinout`](https://github.com/robmuh/tinout) package.
## Files
* `bootstrap/main.go` - bootstrap syntax tree of peg
* `tree/peg.go` - syntax tree and code generator
* `peg.peg` - peg in its own language## Author
Andrew Snodgrass
## Projects That Use `peg`
Here are some projects that use `peg` to provide further examples of PEG grammars:
* - natural date/time parsing
* - easy date/time formats with duration spans
* - scientific names parsing