https://github.com/akphi/mer-c-less

a simple parser for a simple language
https://github.com/akphi/mer-c-less

lexical-analyzer parser syntax-analyzer

Last synced: 6 months ago
JSON representation

a simple parser for a simple language

Host: GitHub
URL: https://github.com/akphi/mer-c-less
Owner: akphi
Created: 2017-11-06T19:21:54.000Z (almost 8 years ago)
Default Branch: master
Last Pushed: 2017-11-06T19:23:58.000Z (almost 8 years ago)
Last Synced: 2025-03-06T06:04:49.888Z (7 months ago)
Topics: lexical-analyzer, parser, syntax-analyzer
Language: C
Size: 286 KB
Stars: 1
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

          # Mer-C-less

A rudimentary parser concealing within itself a regex-based lexcial analyzer and a LL(1) top-down passer.

## The Good, the Bad, and the Ugly

The grammar rules in EBNF are defined below. Note that [] and {} are not _terminals_. Besides, if any non-space characters are detected after a program end, an error is reported.

	 ::= program  

	 ::= begin  {; } end

	 ::=  | 

	 ::=  |  |  |

			  

	 ::=  := 

	 ::= read (  { ,  } )

	 ::= write (  { ,  } )

 	 ::= # alltext to end of line ignored

 	 ::=  |  | 

 	 ::= if  then  |

 		      if  then  else 

 	 ::= while  do 

 	 ::=  |

 			   

	 ::= [  ]  {   }

	 ::=  {   }

	 ::=  |  | (  )

	 ::= + | -

	 ::= + | -

	 ::= | /

	 ::= = | <> | < | <= | >= | >

	 ::=  {  |  }

	 ::=  {  }

	 ::=  {  |  }

	 ::= A | B | C | ... | Z

	 ::= a | b | c | ... | z | 

	 ::= 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Mer-C-less parser, at syntax analyzing step, will check if the current token matches what is expected, if a match occurs, the next token is requested from the lexcial analyzer, otherwise, it adds a syntax error and keep the current token for the next match, assuming that the current expected token has been satisfied. This behavior is crucial to the parser and must be noted.

## Usage

A `Makefile` is prepared for, of course, making. Several targets are available at your service. Simply `cd` to the source directory and use one of the followings:

```

make

make clean

make re

make debug

make test

make test-debug

```

The first is simply used to compile the source codes. The second is to cleanup all prior compilation. The third runs a `make clean` before running `make`. The third runs `make` with _DEBUG_ flag enabled. In debug mode, detailed messages from lexical analyzer, syntax analyzer and parser are shown. 

![](images/make_debug.png)

The target `make test` compiles the source code, runs prepared test cases in `test/case` and match the output with the expected_output in `test/expected_output`. Note that the filename must match for test to be run, otherwise, it is __SKIPPED__. Target `make test-debug` runs each test in debug mode without outcome matching.

![](images/make_test.png)

As for the token definition list, they are stored in the text file called _token_definition.txt_ together with the POSIC regex pattern which can be used to match them. Note that the order of definition really matters and as you update the token list, remember to update the header of this file as well. The header, i.e. the first 3 lines indicate the number of entries, the maximum length allowed for token name and the maximum length allowed for pattern, respectively. _tl; dr - Edit this file with caution_

If one wishes to only parse a single file, use the following command

```

./parse 

```

## Options

Refer to the setting.h file to see all available options. Most options are rather comprehensible, such as:

```

PARSE_DEBUG_ENABLED

LEX_DEBUG_ENABLED

TAB_SIZE_WARNING_ENABLED

TAB_SIZE

...	

```

You might want to check the `TAB_SIZE` option for more accurate error location information. Mer-C-less hides within itself a hidden gem where it allows error mapping on source, this can be enabled by setting `CODE_DISPLAY_ENABLED` to `1`. This should allow the program to show error-mapped source in _normal_ and _debug_ mode.

![](images/error_mapping_source.png)

Last but not least, you can customize the color of output, they are all in ANSI format. A list of commonly used colors is also included at the bottom of the setting file.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/akphi/mer-c-less

Awesome Lists containing this project

README