https://github.com/tmilos/lexer

PHP lexical analyzer
https://github.com/tmilos/lexer

lexer php

Last synced: over 1 year ago
JSON representation

PHP lexical analyzer

Host: GitHub
URL: https://github.com/tmilos/lexer
Owner: tmilos
License: mit
Created: 2016-12-19T07:17:27.000Z (over 9 years ago)
Default Branch: master
Last Pushed: 2018-11-17T00:00:42.000Z (over 7 years ago)
Last Synced: 2024-10-19T09:04:25.615Z (over 1 year ago)
Topics: lexer, php
Language: PHP
Size: 10.7 KB
Stars: 21
Watchers: 3
Forks: 3
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # PHP lexical analyzer

PHP implementation of Lexical Analyzer.

[![Author](http://img.shields.io/badge/author-@tmilos-blue.svg?style=flat-square)](https://twitter.com/tmilos77)

[![Build Status](https://travis-ci.org/tmilos/lexer.svg?branch=master)](https://travis-ci.org/tmilos/lexer)

[![Coverage Status](https://coveralls.io/repos/github/tmilos/lexer/badge.svg?branch=master)](https://coveralls.io/github/tmilos/lexer?branch=master)

[![License](https://img.shields.io/packagist/l/tmilos/lexer.svg)](https://packagist.org/packages/tmilos/lexer)

[![SensioLabsInsight](https://insight.sensiolabs.com/projects/2027f29b-b950-45ca-a185-54a1e3379bf7/small.png)](https://insight.sensiolabs.com/projects/2027f29b-b950-45ca-a185-54a1e3379bf7)

> **Warning**

> This is not a GENERATOR like classical lex is. It does not produce any php code. It's a simple plain scanner

> of the given input string and tokenizer into given set of tokens by matching regular expressions.

> Thus, at runtime you can change the token definition and use one same code for any token set.

# Token Definition

Tokens are defined with ``TokenDefinition`` class that holds token name and regular expression. Token name can be

empty, and in that case, lexer will ignore/skip such tokens.

# Lexer Configuration

The lexer configuration holds a list of all token definitions. With ``LexerArrayConfig`` it can be easily created from an array

where keys are regular expressions and values are names of tokens.

# Full scan

Lexer's static method ``scan($config, $input)`` can be used to scan given input string and return an array of tokens.

# Lexer with state

Instance of the ``Lexer`` class be used to walk trough scanned tokens with single look-ahead token.

It's similar in API to [doctrine/lexer](https://github.com/doctrine/lexer), just tokens are defined and scanned differently, w/out

the need for recognizing the token type/name from the tokenized value - rather the token type/name is given by the same ``TokenDefn``

that gave the regex to recognize the token.

# Examples

``` php

 '',

            '\\d+' => 'number',

            '\\+' => 'plus',

            '-' => 'minus',

            '\\*' => 'mul',

            '/' => 'div',

        ]);

// static scan method that returns an array of

$tokens = Lexer::scan($config, '2 + 3');

array_map(function ($t) { return $t->getName(); }, $tokens); // ['number', 'plus', 'number']

// lexer instance

$lexer = new Lexer($config);

$lexer->setInput('2 + 3');

$lexer->moveNext();

while ($lexer->getLookahead()) {

    print $lexer->getLookahead()->getName();

    $lexer->moveNext();

}

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/tmilos/lexer

Awesome Lists containing this project

README