Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/0x6675636b796f75676974687562/kchardet

Character set detector library for the JVM
https://github.com/0x6675636b796f75676974687562/kchardet

Last synced: 2 months ago
JSON representation

Character set detector library for the JVM

Awesome Lists containing this project

README

        

[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)

Character set detector library for the JVM — like [vidoss/jchardet](https://github.com/vidoss/jchardet), but with a leading `K`.

# Features

- Plain ASCII detection.
- **UTF-8** detection (with and without BOM).
- **UTF-16** (BE, LE) detection (with and without BOM).
- Chinese detection (**GB 2312**, **GBK**, **GB 18030**, **Big5**).
- [_Mode Line_](https://www.gnu.org/software/emacs/manual/html_node/emacs/Specifying-File-Variables.html)
based detection in source code files of known types, e.g.:

```python
#!/usr/bin/env python3
# -*- coding: ISO-8859-15 -*-

print("Hello, World!")
```