Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dwyl/english-words
:memo: A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion
https://github.com/dwyl/english-words
Last synced: 15 days ago
JSON representation
:memo: A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion
- Host: GitHub
- URL: https://github.com/dwyl/english-words
- Owner: dwyl
- License: unlicense
- Created: 2014-07-13T22:20:45.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2024-06-16T11:20:30.000Z (5 months ago)
- Last Synced: 2024-10-15T09:22:14.135Z (29 days ago)
- Language: Python
- Homepage:
- Size: 24.2 MB
- Stars: 10,570
- Watchers: 207
- Forks: 1,838
- Open Issues: 113
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE.md
Awesome Lists containing this project
- awesomeLibrary - english-words - π A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion (θ―θ¨θ΅ζΊεΊ / books)
- awesome-starts - dwyl/english-words - :memo: A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion (Python)
- turkish-nlp-resources - 466k English Words.
README
List Of English Words
=============A text file containing over 466k English words.
While searching for a list of english words (for an auto-complete tutorial)
I found: https://stackoverflow.com/questions/2213607/how-to-get-english-language-word-database which refers to [https://www.infochimps.com/datasets/word-list-350000-simple-english-words-excel-readable](https://web.archive.org/web/20131118073324/https://www.infochimps.com/datasets/word-list-350000-simple-english-words-excel-readable) (archived).No idea why infochimps put the word list inside an excel (.xls) file.
I pulled out the words into a simple new-line-delimited text file.
Which is more useful when building apps or importing into databases etc.Copyright still belongs to them.
Files you may be interested in:
- [words.txt](words.txt) contains all words.
- [words_alpha.txt](words_alpha.txt) contains only [[:alpha:]] words (words that only have letters, no numbers or symbols). If you want a quick solution choose this.
- [words_dictionary.json](words_dictionary.json) contains all the words from words_alpha.txt as json format.
If you are using Python, you can easily load this file and use it as a dictionary for faster performance. All the words are assigned with 1 in the dictionary.See [read_english_dictionary.py](read_english_dictionary.py) for example usage.