https://github.com/calebwin/frequent

A utility for crawling websites and building frequency lists of words

frequency-lists python web-crawler web-crawler-python word-frequency


# Frequent
frequent is a utility for crawling websites and building word frequency lists. I mainly made it because I wanted to be able to find the top n most common words on different websites, but I imagine there might be more useful applications. Or not.

```python
import frequent

# get most frequent words from the w3schools website
# limit crawl depth to 25
word_frequencies = frequent.word_frequencies("https://www.w3schools.com", 25)

# get the top 50 words
top_words = word_frequencies.most_common(50)

# print the top 50 most frequent words
print(top_words)
```
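
For a rough idea of what the counting step might look like, here is a minimal sketch that does not use frequent at all. It assumes the result behaves like a `collections.Counter` (suggested by the `most_common` call above, but not confirmed here), and the `count_words` helper and its tokenization rule are hypothetical, made up for illustration.

```python
import re
from collections import Counter


def count_words(text: str) -> Counter:
    """Count lowercase word occurrences in a plain-text string.

    Hypothetical helper, not part of frequent.
    """
    # Assumption: a word is any run of letters or apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words)


sample = "the quick brown fox jumps over the lazy dog and the cat"
frequencies = count_words(sample)

# print the single most frequent word with its count
print(frequencies.most_common(1))  # [('the', 3)]
```

A real crawler would also need to fetch pages, strip markup, and follow links up to the chosen depth before counting.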