An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by cldellow

A curated list of projects in awesome lists by cldellow.

https://github.com/cldellow/sqlite-parquet-vtable

A SQLite vtable extension to read Parquet files

apache-arrow apache-parquet parquet sqlite sqlite3

Last synced: 09 Apr 2025

https://github.com/cldellow/csv2parquet

Convert a CSV file to a Parquet file.

apache-arrow apache-parquet csv parquet

Last synced: 21 Aug 2025

https://github.com/cldellow/datasette-scraper

Add website scraping abilities to Datasette

datasette datasette-plugin scraping

Last synced: 17 Aug 2025

https://github.com/cldellow/datasette-parquet

Add DuckDB, Parquet, CSV and JSON lines support to Datasette

datasette datasette-plugin duckdb parquet

Last synced: 13 Apr 2025

https://github.com/cldellow/real-estate-prices-cc

Source real estate prices from the Common Crawl.

Last synced: 06 May 2025

https://github.com/cldellow/datasette-ui-extras

Add editing UI and other power-user features to Datasette.

datasette datasette-io datasette-plugin

Last synced: 07 May 2025

https://github.com/cldellow/parquet-metadata

Dump metadata about a Parquet file.

apache-arrow apache-parquet parquet

Last synced: 07 May 2025

https://github.com/cldellow/datasette-rewrite-sql

A Datasette hook to inspect/rewrite the SQL users run.

datasette datasette-io datasette-plugin

Last synced: 13 May 2025

https://github.com/cldellow/ballero

Last synced: 13 Mar 2025

https://github.com/cldellow/mapt

An opinionated workflow for building OSM tiles and styles.

openstreetmap tilemaker vector-tiles

Last synced: 13 May 2025

https://github.com/cldellow/stackexchange-to-sqlite

Export a Stack Exchange dump to a SQLite3 database.

sqlite stackexchange stackexchange-dump

Last synced: 23 Apr 2025

https://github.com/cldellow/manu

Mostly archived, not updated.

Last synced: 23 Aug 2025

https://github.com/cldellow/datasette-mutable-downloads

Enable downloading mutable databases from Datasette

Last synced: 21 Jan 2026

https://github.com/cldellow/dux-demo

A repo for publishing a demo Datasette instance with datasette-ui-extras

Last synced: 18 Mar 2026

https://github.com/cldellow/datasette-ersatz-table-valued-functions

Enable a limited form of table-valued functions in Datasette

datasette datasette-io datasette-plugin sqlite

Last synced: 06 May 2026

https://github.com/cldellow/segmenter

Segment short strings into words.

nlp

Last synced: 18 Oct 2025

https://github.com/cldellow/bayesky

Bluesky firehose classifier.

Last synced: 20 May 2026

https://github.com/cldellow/iem2parquet

Export Iowa Environmental Mesonet data to Parquet files.

Last synced: 31 Mar 2025

https://github.com/cldellow/gzip

A fork of java.util.zip.GZIPInputStream that emits the offsets of nested streams.

compression gzip warc

Last synced: 14 Jul 2025

https://github.com/cldellow/datasette-current-actor

Adds a `current_actor()` function to SQLite

Last synced: 31 Mar 2025

https://github.com/cldellow/wiki-actors

Some random exploration of Oscars data

Last synced: 31 Mar 2025

https://github.com/cldellow/url-cache

Read a URL from the Internet, fetching from local cache if available.

Last synced: 14 Oct 2025

https://github.com/cldellow/snpsrt

Last synced: 15 Aug 2025

https://github.com/cldellow/cdx

Scala code to interact with the Common Crawl CDX index

Last synced: 21 Oct 2025

https://github.com/cldellow/hash-matcher

Implementation of http://pzemtsov.github.io/2016/10/16/custom-hash-function.html in Scala

Last synced: 21 Oct 2025

https://github.com/cldellow/cldellow.github.io

Blarg.

Last synced: 08 Jan 2026

https://github.com/cldellow/warc-compression

Scripts to experiment with different compression choices for WARCs.

Last synced: 31 Mar 2025

https://github.com/cldellow/cldellow

README profile

Last synced: 11 Jan 2026

https://github.com/cldellow/snpsrt2

A SQL approach to the Snapsort challenge

Last synced: 21 Oct 2025

https://github.com/cldellow/tweets

cldellow's Twitter archive

Last synced: 15 May 2026

https://github.com/cldellow/sparky

Hackathon project to show realtime Spark metrics

Last synced: 21 Oct 2025

https://github.com/cldellow/libfailmalloc

Patched version of http://www.nongnu.org/failmalloc/ that compiles under modern g++

Last synced: 10 Jun 2025

https://github.com/cldellow/brics-lambda

An AWS lambda deployment to validate and test brics.dk automata.

Last synced: 21 Oct 2025

https://github.com/cldellow/ts-debug-failure

Repro of a TS compiler crash

Last synced: 21 Oct 2025

https://github.com/cldellow/vimfiles

Dotfiles for vim

Last synced: 29 Jan 2026