Projects in Awesome Lists by cldellow
A curated list of projects in awesome lists by cldellow .
https://github.com/cldellow/sqlite-parquet-vtable
A SQLite vtable extension to read Parquet files
apache-arrow apache-parquet parquet sqlite sqlite3
Last synced: 09 Apr 2025
https://github.com/cldellow/csv2parquet
Convert a CSV to a parquet file.
apache-arrow apache-parquet csv parquet
Last synced: 21 Aug 2025
https://github.com/cldellow/datasette-scraper
Add website scraping abilities to Datasette
datasette datasette-plugin scraping
Last synced: 17 Aug 2025
https://github.com/cldellow/datasette-parquet
Add DuckDB, Parquet, CSV and JSON lines support to Datasette
datasette datasette-plugin duckdb parquet
Last synced: 13 Apr 2025
https://github.com/cldellow/real-estate-prices-cc
Source real estate prices from the Common Crawl.
Last synced: 06 May 2025
https://github.com/cldellow/datasette-ui-extras
Add editing UI and other power-user features to Datasette.
datasette datasette-io datasette-plugin
Last synced: 07 May 2025
https://github.com/cldellow/parquet-metadata
Dump metadata about a Parquet file.
apache-arrow apache-parquet parquet
Last synced: 07 May 2025
https://github.com/cldellow/datasette-rewrite-sql
A Datasette hook to inspect/rewrite the SQL users run.
datasette datasette-io datasette-plugin
Last synced: 13 May 2025
https://github.com/cldellow/mapt
An opinionated workflow for building OSM tiles and styles.
openstreetmap tilemaker vector-tiles
Last synced: 13 May 2025
https://github.com/cldellow/stackexchange-to-sqlite
Export a Stack Exchange dump to a SQLite3 database.
sqlite stackexchange stackexchange-dump
Last synced: 23 Apr 2025
https://github.com/cldellow/datasette-mutable-downloads
Enable downloading mutable databases from Datasette
Last synced: 21 Jan 2026
https://github.com/cldellow/dux-demo
A repo for publishing a demo Datasette instance with datasette-ui-extras
Last synced: 18 Mar 2026
https://github.com/cldellow/datasette-ersatz-table-valued-functions
Enable a limited form of table-valued functions in Datasette
datasette datasette-io datasette-plugin sqlite
Last synced: 06 May 2026
https://github.com/cldellow/iem2parquet
Export Iowa Environmental Mesonet data to Parquet files.
Last synced: 31 Mar 2025
https://github.com/cldellow/gzip
A fork of java.util.zip.GZIPInputStream that emits the offsets of nested streams.
Last synced: 14 Jul 2025
https://github.com/cldellow/datasette-current-actor
Adds a `current_actor()` function to SQLite
Last synced: 31 Mar 2025
https://github.com/cldellow/wiki-actors
Some random exploration of Oscars data
Last synced: 31 Mar 2025
https://github.com/cldellow/url-cache
Read a URL from the Internet, fetching from local cache if available.
Last synced: 14 Oct 2025
https://github.com/cldellow/cdx
Scala code to interact with the Common Crawl CDX index
Last synced: 21 Oct 2025
https://github.com/cldellow/hash-matcher
Implementation of http://pzemtsov.github.io/2016/10/16/custom-hash-function.html in Scala
Last synced: 21 Oct 2025
https://github.com/cldellow/warc-compression
Scripts to experiment with different compression choices for WARCs.
Last synced: 31 Mar 2025
https://github.com/cldellow/snpsrt2
A SQL approach to the Snapsort challenge
Last synced: 21 Oct 2025
https://github.com/cldellow/sparky
Hackathon project to show realtime Spark metrics
Last synced: 21 Oct 2025
https://github.com/cldellow/libfailmalloc
Patched version of http://www.nongnu.org/failmalloc/ that compiles under modern g++
Last synced: 10 Jun 2025
https://github.com/cldellow/brics-lambda
An AWS lambda deployment to validate and test brics.dk automata.
Last synced: 21 Oct 2025