Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by brucewlee
A curated list of projects in awesome lists by brucewlee .
https://github.com/brucewlee/lingfeat
[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment
discourse feature-extraction flesch-kincaid lexical-analysis linguistic-analysis natural-language-processing nlp readability-metrics readability-scores semantic-analysis spacy syntactic-analysis text-classification text-simplification
Last synced: 14 Oct 2024
https://github.com/brucewlee/lftk
[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.
bea-workshop feature-extraction handcrafted-features linguistic-features natural-language-processing python readability-scores reading-time spacy text-analysis word-difficulty
Last synced: 14 Nov 2024
https://github.com/brucewlee/wiki-text-summarizer-keyword-extractor
Uses Beautiful Soup to read Wiki pages, Gensim to summarize, NLTK to process, and extracts keywords based on entropy: everything in one beautiful code. A simple but effective solution to extractive text summarization.
gensim gensim-model keyword-extraction keyword-identification nltk simple-summarizer text-mining text-summarization text-summarizer wikipedia-summarizer
Last synced: 19 Oct 2024
https://github.com/brucewlee/coca-wordfrequency
COCA, Top 5000 Word Frequency List
Last synced: 19 Oct 2024
https://github.com/brucewlee/prompt-learning-readability
[EACL 2023] use text-to-text models (BART, T5) for readability assessment
bart readability readability-scores t5
Last synced: 31 Oct 2024
https://github.com/brucewlee/lama-music-genre-dataset
.wav files, training dataset (MFCC), and graph plots (FFTs, MFCCs, Waveforms) from Latin America, Asia, MiddleEast, and Africa
africa asia audio-processing classification dataset genre genre-classification genre-suggestion genres-classification harvard-dataverse lama mfcc music music-library signal-processing sound
Last synced: 31 Oct 2024
https://github.com/brucewlee/h-test
[ACL 2024] Language Models Don't Learn the Physical Manifestation of Language
benchmark evaluation language-model
Last synced: 19 Oct 2024
https://github.com/brucewlee/textreader
Readability Formulas and Reading Time Statistics
readability readability-formulas readability-metrics readability-scores reading-time
Last synced: 31 Oct 2024
https://github.com/brucewlee/nlppandas
Basic preprocessing for NLP datasets in Pandas dataframe.
Last synced: 31 Oct 2024
https://github.com/brucewlee/low-cost-cryogenic-temperature-measurement-system
Last synced: 31 Oct 2024
https://github.com/brucewlee/accuracy
There are more than one way to skin a cat
Last synced: 31 Oct 2024
https://github.com/brucewlee/music-genre-classification
Straightforward starter code for music genre classification using: LSTM-RNN, CNN, and just plain Neural Networks.
cnn deep-learning keras-tensorflow lstm music-genre-classifier
Last synced: 31 Oct 2024
https://github.com/brucewlee/conference_cheatsheet
publication-related stuff, largely for myself
conference machine-learning natural-language-processing nlp publication
Last synced: 31 Oct 2024