{"id":13694410,"url":"https://github.com/tdewolff/parse","last_synced_at":"2025-05-14T01:02:17.651Z","repository":{"id":23867047,"uuid":"27245611","full_name":"tdewolff/parse","owner":"tdewolff","description":"Go parsers for web formats","archived":false,"fork":false,"pushed_at":"2025-03-14T08:58:10.000Z","size":1546,"stargazers_count":432,"open_issues_count":3,"forks_count":65,"subscribers_count":9,"default_branch":"master","last_synced_at":"2025-04-03T13:06:07.369Z","etag":null,"topics":["css","go","html","javascript","js","json","lexer","parse","svg","xml"],"latest_commit_sha":null,"homepage":"","language":"Go","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/tdewolff.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2014-11-28T00:13:13.000Z","updated_at":"2025-03-27T17:45:09.000Z","dependencies_parsed_at":"2023-01-13T23:59:44.894Z","dependency_job_id":"1ffc6feb-c556-47f0-a19d-4336d0f23ff6","html_url":"https://github.com/tdewolff/parse","commit_stats":null,"previous_names":["tdewolff/css"],"tags_count":91,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tdewolff%2Fparse","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tdewolff%2Fparse/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tdewolff%2Fparse/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tdewolff%2Fparse/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/tdewolff","download_url":"https://codeload.github.com/tdewolff/parse/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248261373,"owners_count":21074222,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["css","go","html","javascript","js","json","lexer","parse","svg","xml"],"created_at":"2024-08-02T17:01:31.319Z","updated_at":"2025-04-10T17:11:04.745Z","avatar_url":"https://github.com/tdewolff.png","language":"Go","readme":"# Parse [![API reference](https://img.shields.io/badge/godoc-reference-5272B4)](https://pkg.go.dev/github.com/tdewolff/parse/v2?tab=doc) [![Go Report Card](https://goreportcard.com/badge/github.com/tdewolff/parse)](https://goreportcard.com/report/github.com/tdewolff/parse) [![Coverage Status](https://coveralls.io/repos/github/tdewolff/parse/badge.svg?branch=master)](https://coveralls.io/github/tdewolff/parse?branch=master) [![Donate](https://img.shields.io/badge/patreon-donate-DFB317)](https://www.patreon.com/tdewolff)\n\nThis package contains several lexers and parsers written in [Go][1]. All subpackages are built to be streaming, high performance and to be in accordance with the official (latest) specifications.\n\nThe lexers are implemented using `buffer.Lexer` in https://github.com/tdewolff/parse/buffer and the parsers work on top of the lexers. Some subpackages have hashes defined (using [Hasher](https://github.com/tdewolff/hasher)) that speed up common byte-slice comparisons.\n\n## Buffer\n### Reader\nReader is a wrapper around a `[]byte` that implements the `io.Reader` interface. It is comparable to `bytes.Reader` but has slightly different semantics (and a slightly smaller memory footprint).\n\n### Writer\nWriter is a buffer that implements the `io.Writer` interface and expands the buffer as needed. The reset functionality allows for better memory reuse. After calling `Reset`, it will overwrite the current buffer and thus reduce allocations.\n\n### Lexer\nLexer is a read buffer specifically designed for building lexers. It keeps track of two positions: a start and end position. The start position is the beginning of the current token being parsed, the end position is being moved forward until a valid token is found. Calling `Shift` will collapse the positions to the end and return the parsed `[]byte`.\n\nMoving the end position can go through `Move(int)` which also accepts negative integers. One can also use `Pos() int` to try and parse a token, and if it fails rewind with `Rewind(int)`, passing the previously saved position.\n\n`Peek(int) byte` will peek forward (relative to the end position) and return the byte at that location. `PeekRune(int) (rune, int)` returns UTF-8 runes and its length at the given **byte** position. Upon an error `Peek` will return `0`, the **user must peek at every character** and not skip any, otherwise it may skip a `0` and panic on out-of-bounds indexing.\n\n`Lexeme() []byte` will return the currently selected bytes, `Skip()` will collapse the selection. `Shift() []byte` is a combination of `Lexeme() []byte` and `Skip()`.\n\nWhen the passed `io.Reader` returned an error, `Err() error` will return that error even if not at the end of the buffer.\n\n### StreamLexer\nStreamLexer behaves like Lexer but uses a buffer pool to read in chunks from `io.Reader`, retaining old buffers in memory that are still in use, and re-using old buffers otherwise. Calling `Free(n int)` frees up `n` bytes from the internal buffer(s). It holds an array of buffers to accommodate for keeping everything in-memory. Calling `ShiftLen() int` returns the number of bytes that have been shifted since the previous call to `ShiftLen`, which can be used to specify how many bytes need to be freed up from the buffer. If you don't need to keep returned byte slices around, call `Free(ShiftLen())` after every `Shift` call.\n\n## Strconv\nThis package contains string conversion function much like the standard library's `strconv` package, but it is specifically tailored for the performance needs within the `minify` package.\n\nFor example, the floating-point to string conversion function is approximately twice as fast as the standard library, but it is not as precise.\n\n## CSS\nThis package is a CSS3 lexer and parser. Both follow the specification at [CSS Syntax Module Level 3](http://www.w3.org/TR/css-syntax-3/). The lexer takes an io.Reader and converts it into tokens until the EOF. The parser returns a parse tree of the full io.Reader input stream, but the low-level `Next` function can be used for stream parsing to returns grammar units until the EOF.\n\n[See README here](https://github.com/tdewolff/parse/tree/master/css).\n\n## HTML\nThis package is an HTML5 lexer. It follows the specification at [The HTML syntax](http://www.w3.org/TR/html5/syntax.html). The lexer takes an io.Reader and converts it into tokens until the EOF.\n\n[See README here](https://github.com/tdewolff/parse/tree/master/html).\n\n## JS\nThis package is a JS lexer (ECMA-262, edition 6.0). It follows the specification at [ECMAScript Language Specification](http://www.ecma-international.org/ecma-262/6.0/). The lexer takes an io.Reader and converts it into tokens until the EOF.\n\n[See README here](https://github.com/tdewolff/parse/tree/master/js).\n\n## JSON\nThis package is a JSON parser (ECMA-404). It follows the specification at [JSON](http://json.org/). The parser takes an io.Reader and converts it into tokens until the EOF.\n\n[See README here](https://github.com/tdewolff/parse/tree/master/json).\n\n## SVG\nThis package contains common hashes for SVG1.1 tags and attributes.\n\n## XML\nThis package is an XML1.0 lexer. It follows the specification at [Extensible Markup Language (XML) 1.0 (Fifth Edition)](http://www.w3.org/TR/xml/). The lexer takes an io.Reader and converts it into tokens until the EOF.\n\n[See README here](https://github.com/tdewolff/parse/tree/master/xml).\n\n## License\nReleased under the [MIT license](LICENSE.md).\n\n[1]: http://golang.org/ \"Go Language\"\n","funding_links":["https://www.patreon.com/tdewolff"],"categories":["开源类库","Open source library"],"sub_categories":["解释器","Interpreter"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftdewolff%2Fparse","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftdewolff%2Fparse","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftdewolff%2Fparse/lists"}