Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/clipcrow/xmlp

XML Parser for Deno
https://github.com/clipcrow/xmlp

deno pull-parser sax-parser typescript xml

Last synced: 19 days ago
JSON representation

XML Parser for Deno

Awesome Lists containing this project

README

        

# XML Parser for Deno

[![ci](https://github.com/m-kur/xmlp/workflows/ci/badge.svg)](https://github.com/m-kur/xmlp/actions)

This project is an XML parser implemented for Deno as simply as possible. Currently it supports SAX style and Pull style.
I'm thinking of using it only in applications that run on Deno. However, there is very little code that depends on Deno, so it's easy to make it available in Node (I don't).
If you haven't programmed with Deno yet, give it a try. Very nice. See [Deno official](https://deno.land/).

## SAXParser

When using in SAX style, create an instance of the parser and register the listener in the same way as used in the EventEmitter of Node.
The XML to be parsed is specified by Deno.Reader, UINT8 array, or a character string.

```typescript
import { SAXParser } from 'https://deno.land/x/xmlp/mod.ts';

// create a SAX parser instance
const parser = new SAXParser();

// add SAX event handlers
parser.on('start_prefix_mapping', (ns, uri) => {
console.log(`mapping start ${ns}: ${uri}`);
}).on('text', (text, element) => {
if (element.qName === 'm:comment') {
console.log(`${element.attributes[0].value}: ${text}`);
}
});

// run parser, input source is Deno.Reader or Uint8Array or string
const reader = await Deno.open('parser_test.xml');
await parser.parse(reader, 'shift_jis'); // 2nd param is optional, specifies your XML encoding.
```

SAX event listener register definitions are below.

```typescript
interface SAXEvent {
start_document: () => void;
processing_instruction: (procInst: string) => void;
sgml_declaration: (sgmlDecl: string) => void;
text: (text: string, element: ElementInfo, cdata: boolean) => void;
doctype: (doctype: string) => void;
start_prefix_mapping: (ns: string, uri: string) => void;
start_element: (element: ElementInfo) => void;
comment: (comment: string) => void;
end_element: (element: ElementInfo) => void;
end_prefix_mapping: (ns: string, uri: string) => void;
end_document: () => void;
error: (error: XMLParseError) => void;
}

class SAXParser {
on(event: K, listener: SAXEvent[K]): this {}
}
```

You can use "SAXParser" on Deno's stream i/o because this is a simple "UnderlyingSink" impl.
See the [parser.ts](parser.ts) / SAXParser#parse() -> #getWriter() -> getStream() -> write() chain.

## PullParser

I think it's more interesting to write the Pull style than the SAX. This Pull parser is implemented using the ES6 Generator / Iterator mechanism. However, the basic implementation is shared with that of the SAX parser.

Currently the Pull parser supports Uint8 arrays and strings, not Deno.Reader.

```typescript
import { PullParser } from 'https://deno.land/x/xmlp/mod.ts';

// create a pull parser instance
const parser = new PullParser();

// create an ES6 generator
const uint8Array = await Deno.readFile('parser_test.xml');
const events = parser.parse(uint8Array /*, encoding */);

// pull events, using iterator
const event = events.next();
if (event.value) {
console.log(event.value.name);
}

// using spread operator
console.log([...events].filter(({ name }) => {
return name === 'text';
}).map(({ text, cdata }) => {
return cdata ? `` : text;
}));
```

## DOMParser

Will be realized eventually.

## JSDoc

Usually all you need see is mod.ts, SAXParser class or PullParser class.

https://doc.deno.land/https/deno.land/x/xmlp/mod.ts

## Acknowledgments

The basic logic of this XML parser was obtained by reading the source code of [sax-js](https://github.com/isaacs/sax-js). Thanks.

## License

The scripts and documentation in this project are released under the [MIT License](LICENSE)