An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with sentences

A curated list of projects in awesome lists tagged with sentences .

https://github.com/neurosnap/sentences

A multilingual command line sentence tokenizer in Golang

cli sentence-tokenizer sentences tokenizer

Last synced: 16 May 2025

https://github.com/hyunwoongko/kss

KSS: Korean String processing Suite

korean korean-nlp kss nlp sentences split-sentences

Last synced: 13 Nov 2025

https://github.com/hopto-dot/japanese-conjugation-helper

Conjugates, downloads audio files, brings up detailed word and kanji information, creates tests and more. Useful for quickly making Anki cards and searching definitions of words.

anki audio card conjugate conjugation conjugation-practice conjugator information japanese jisho kanji language language-learning lookup practice search sentences test verb word

Last synced: 30 Apr 2025

https://github.com/1j01/nonsensical

Generate grammatical sentences https://1j01.itch.io/nonsensical

dummy-data dummy-text english english-grammar lorem-ipsum placeholder-text sentence sentence-generator sentences

Last synced: 26 Oct 2025

https://github.com/joshuakgoldberg/sentences-per-line

Contributed markdownlint rule for limiting sentences per line. 📐

lint markdown markdownlint markdownlint-rule sentences

Last synced: 19 Oct 2025

https://github.com/jonschlinkert/intl-segmenter

A high-performance wrapper around Intl.Segmenter for efficient text segmentation. This class resolves memory handling issues seen with large strings and "maximum call stack exceeded" exceptions that occur when strings exceed 40-50k characters. Enhances performance by 50-500x. Only ~70 loc (with comments) and no dependencies.

graphemes intl intl-segmenter processing segment segmenter sentences splitter text words

Last synced: 15 Apr 2025

https://github.com/mideind/greynircorpus

A large treebank of parsed Icelandic text

corpus icelandic natural-language-processing nlp parsing sentences treebank

Last synced: 03 Jan 2026

https://github.com/louis030195/big-talks

Collaborative list of questions to trigger interesting conversations, thinking ... and, obviously, avoid small talks.

conversations friends life philosophy psychology sentences social

Last synced: 02 Jul 2025

https://github.com/carlosplanchon/tokenizesentences

Python3 module to tokenize english sentences.

carlosplanchon opensource python python3 sentences tokenize

Last synced: 15 Sep 2025

https://github.com/ahmedkhalf/arabic-keyword-scraper

Stop wasting your time! And obtain Arabic definitions without having to look it up.

arabic data definitions scraper sentences wordsearch

Last synced: 12 Mar 2025

https://github.com/kavgan/micropinion-generation-dataset

Dataset for Micropinion Generation. Dataset is based on user reviews from CNET. The reviews are on products from various categories like tv, cell phones, gps etc.

cnet dataset micropinion-generation-dataset sentence sentences user-reviews

Last synced: 27 Aug 2025

https://github.com/knightron0/topicgeneration

Get current topic from captions using Latent Dirichlet Allocation.

captions lda sentences topic

Last synced: 26 Feb 2025

https://github.com/1j01/babble

sentence generator (oh hey look I made a better sentence generator over here: https://github.com/1j01/nonsensical)

javascript library lorem-ipsum nonsense random-generation random-text sentence sentence-generator sentences

Last synced: 25 Feb 2025

https://github.com/vgupta123/Unsupervised-SAS

This repo contains the source code of the AMR (Abstract Meaning Representation) based approach for abstractive summarization. (ACL-SRW 2018)

abstractive-text-summarization acl2018 amr amr-generator amr-library amr-parser datasets nlp-machine-learning paper rouge sentences summarization

Last synced: 28 Apr 2025

https://github.com/astoilkov/segmenter

Work with grapheme, words, and sentences with small, simple, and fast API using Intl.Segmenter

grapheme sentences words

Last synced: 14 Apr 2025

https://github.com/abhigyantrips/gouwu

A Go package for transforming text into uwuspeak.

golang seeded sentences uwuifier

Last synced: 01 Jul 2025

https://github.com/bgokden/veri-python-text-search-demo

Text Search Demo Using Veri And Universal Sentence Encoders

dataset machine-learning news-articles sentences universal-sentence-encoders

Last synced: 07 Oct 2025