Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Projects in Awesome Lists by brucewlee

A curated list of projects in awesome lists by brucewlee.

https://github.com/brucewlee/lftk

[BEA @ ACL 2023] General-purpose tool for linguistic feature extraction; tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.

bea-workshop feature-extraction handcrafted-features linguistic-features natural-language-processing python readability-scores reading-time spacy text-analysis word-difficulty

Last synced: 14 Nov 2024

https://github.com/brucewlee/wiki-text-summarizer-keyword-extractor

Uses Beautiful Soup to read Wikipedia pages, Gensim to summarize, and NLTK to process text, then extracts keywords based on entropy, all in one piece of code. A simple but effective approach to extractive text summarization.

gensim gensim-model keyword-extraction keyword-identification nltk simple-summarizer text-mining text-summarization text-summarizer wikipedia-summarizer

Last synced: 19 Oct 2024

https://github.com/brucewlee/coca-wordfrequency

COCA, Top 5000 Word Frequency List

Last synced: 19 Oct 2024

https://github.com/brucewlee/prompt-learning-readability

[EACL 2023] Uses text-to-text models (BART, T5) for readability assessment

bart readability readability-scores t5

Last synced: 31 Oct 2024

https://github.com/brucewlee/lama-music-genre-dataset

.wav files, a training dataset (MFCC), and graph plots (FFTs, MFCCs, waveforms) for music from Latin America, Asia, the Middle East, and Africa

africa asia audio-processing classification dataset genre genre-classification genre-suggestion genres-classification harvard-dataverse lama mfcc music music-library signal-processing sound

Last synced: 31 Oct 2024

https://github.com/brucewlee/h-test

[ACL 2024] Language Models Don't Learn the Physical Manifestation of Language

benchmark evaluation language-model

Last synced: 19 Oct 2024

https://github.com/brucewlee/textreader

Readability Formulas and Reading Time Statistics

readability readability-formulas readability-metrics readability-scores reading-time

Last synced: 31 Oct 2024

https://github.com/brucewlee/nlppandas

Basic preprocessing for NLP datasets in pandas DataFrames.

Last synced: 31 Oct 2024

https://github.com/brucewlee/values

Last synced: 31 Oct 2024

https://github.com/brucewlee/accuracy

There is more than one way to skin a cat

Last synced: 31 Oct 2024

https://github.com/brucewlee/webscraper

Naver, Amazon, YouTube

Last synced: 31 Oct 2024

https://github.com/brucewlee/music-genre-classification

Straightforward starter code for music genre classification using: LSTM-RNN, CNN, and just plain Neural Networks.

cnn deep-learning keras-tensorflow lstm music-genre-classifier

Last synced: 31 Oct 2024

https://github.com/brucewlee/brucewlee

Last synced: 31 Oct 2024