Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/halidodat/latin1-ascii-analysis


https://github.com/halidodat/latin1-ascii-analysis

Last synced: 7 days ago
JSON representation

Awesome Lists containing this project

README

        

# Ratio of latin1 and ASCII strings in common websites

## Language Choise

The choise was taken from these sites, where languages that are more that halve
of the alphabet outside the latin1 range are exclude (like greek, chinese,
etc).

1.
2.
3.
4.
5.

Languages:

1. English
2. Spanish
3. German
4. French
5. Portuguese
6. Turkish
7. Italian
8. Dutch
9. Polish
10. Vietnamese

## Websites

This list is curated to include websites that are commonly visited and used by
native speakers of these languages. Some sites are global platforms with
localized content, while others are region-specific.

The list of [websites by language][./websites-urls.md].

## Data

The following data is collected:

- Ascii string count
- Latin1 string count
- non-latin1/ascii string count