An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-manipulation

A curated list of projects in awesome lists tagged with data-manipulation .

https://github.com/gchq/cyberchef

The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis

compression data-analysis data-manipulation encoding encryption hashing parsing

Last synced: 12 May 2025

https://github.com/gchq/CyberChef

The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis

compression data-analysis data-manipulation encoding encryption hashing parsing

Last synced: 13 Mar 2025

https://gchq.github.io/CyberChef/

The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis

compression data-analysis data-manipulation encoding encryption hashing parsing

Last synced: 18 Mar 2025

https://github.com/tidyverse/dplyr

dplyr: A grammar of data manipulation

data-manipulation grammar r

Last synced: 17 Dec 2025

https://github.com/javascriptdata/danfojs

Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.

danfojs data-analysis data-analytics data-manipulation data-science dataframe javascript pandas plotting-charts stream-data stream-processing table tensorflow tensors

Last synced: 14 May 2025

https://github.com/vlandham/js_data

Data manipulation and processing in JavaScript

data-manipulation javascript js-data

Last synced: 04 Feb 2026

https://github.com/nathaneastwood/poorman

A poor man's dependency free grammar of data manipulation

base-r data-manipulation grammar r

Last synced: 08 Sep 2025

https://github.com/has2k1/plydata

A grammar for data manipulation in Python

data-manipulation pandas python

Last synced: 04 Apr 2025

https://github.com/pwwang/datar

A Grammar of Data Manipulation in python

data-manipulation dplyr forcats groupby pandas tibble tidyr tidyverse tribble

Last synced: 04 Apr 2025

https://github.com/fastverse/fastverse

An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R

c cpp data-aggregation data-manipulation data-science data-transformation high-performance low-dependency panel-data r rstats statistical-computing time-series weights

Last synced: 12 Dec 2025

https://github.com/pewresearch/pewmethods

Pew Research Center Methods team R package of miscellaneous functions

data-manipulation r sampling survey survey-analysis survey-data surveys weighting

Last synced: 16 Jan 2026

https://github.com/k0retux/fuddly

Fuzzing and Data Manipulation Framework (for GNU/Linux)

data-manipulation framework fuzzing python security

Last synced: 20 Apr 2025

https://github.com/sergiocorreia/ftools

Fast Stata commands for large datasets

collapse data-manipulation egen factor mata merge stata

Last synced: 12 Feb 2026

https://github.com/juliadata/indexedtables.jl

Flexible tables with ordered indices

data-analysis data-manipulation indexedtables julia juliadb

Last synced: 06 Apr 2025

https://github.com/tanyuqian/learning-data-manipulation

NeurIPS 2019 - Learning Data Manipulation for Augmentation and Weighting

bert data-augmentation data-manipulation meta-learning

Last synced: 26 Oct 2025

https://github.com/yulab-smu/treedata-book

:books: a complete reference book for treeio, tidytree and ggtree packages

data-import data-manipulation ggtree phylogeny tidytree treeio visualization

Last synced: 22 Feb 2026

https://github.com/TomasBeuzen/python-programming-for-data-science

Content from the University of British Columbia's Master of Data Science course DSCI 511.

data-manipulation data-science numpy pandas programming python teaching

Last synced: 18 Jul 2025

https://github.com/noraj/ctf-party

:triangular_flag_on_post: A CLI tool & library to enhance and speed up script/exploit writing with string conversion/manipulation.

ctf ctf-framework ctf-tools data-manipulation decoding encoding hacktoberfest hashing library security-tools string-manipulation

Last synced: 16 May 2025

https://github.com/TomFevrier/kiwis

A Pandas-inspired data wrangling toolkit in JavaScript

data data-manipulation data-wrangling pandas

Last synced: 15 Mar 2025

https://github.com/tomfevrier/kiwis

A Pandas-inspired data wrangling toolkit in JavaScript

data data-manipulation data-wrangling pandas

Last synced: 05 Apr 2026

https://github.com/daranzolin/hacksaw

Extra tidyverse-like functionality

data-manipulation r

Last synced: 17 Jan 2026

https://github.com/ssi-anik/dataset

Data set is PHP package for importing & exporting data within CSV & Database with data manipulation

csv csv-export csv-exporter csv-parsing csv-postgres csv-to-mysql data-manipulation database-csv-import php

Last synced: 26 Apr 2025

https://github.com/JonnyTran/OpenOmics

A bioinformatics API to interface with public multi-omics bio databases for wicked fast data integration.

data-integration data-manipulation genomics multi-omics python

Last synced: 18 Mar 2025

https://github.com/jonnytran/openomics

A bioinformatics API to interface with public multi-omics bio databases for wicked fast data integration.

data-integration data-manipulation genomics multi-omics python

Last synced: 16 Mar 2025

https://github.com/rubydamodar/the-ultimate-pandas-bootcamp

Welcome to the Pandas for Data Science repository! This course is designed to take you from beginner to proficient in using Pandas, the powerful data manipulation library in Python. Whether you're just starting your data science journey or looking to sharpen your skills, this repository contains all the resources

beginner-friendly csv-data data-analysis data-cleaning data-manipulation data-science data-visualization dataframe exploratory-data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python python-pandas series statistical-analysis time-series titanic-dataset

Last synced: 19 Apr 2025

https://github.com/juliaaplavin/flexijoins.jl

Flexible joining operations for tabular and non-tabular datasets with wide range of join conditions including distance, interval, and comparison predicates.

asof-join catalog-matching data-manipulation distance-join equi-join joins matching

Last synced: 02 Aug 2025

https://github.com/juliaaplavin/fleximaps.jl

Generalize `map`: make it lazy, filtering, flattening, ...

data-manipulation filtering lazy mappings

Last synced: 29 Dec 2025

https://github.com/clojure-finance/datajure

Clojure data manipulation DSL — composable query syntax built on tech.ml.dataset

clojure data-manipulation data-science dataframe dsl empirical-research query-dsl tech-ml-dataset

Last synced: 20 Apr 2026

https://github.com/mikeqfu/pyhelpers

PyHelpers: An open-source toolkit for facilitating Python users' data manipulation tasks

data-manipulation data-preprocessing py-utils python python-utilities python-utility python-utils utilities

Last synced: 10 Feb 2026

https://github.com/shukkkur/john-snows-ghost-map

Recreate John Snow's famous map of the 1854 cholera outbreak in London.

case-study data-manipulation data-visualization importing-and-cleaning-data

Last synced: 08 Sep 2025

https://github.com/shukkkur/exploring-cryptocurrency-market

To better understand the growth and impact of Bitcoin and other cryptocurrencies I explore the market capitalization of different cryptocurrencies.

data-manipulation data-visualization importing-and-cleaning-data

Last synced: 04 Oct 2025

https://github.com/shukkkur/analyzing-netflix-data

EDA, manipulating raw data, drawing conclusions from plots on Netflix data.

data-manipulation data-visualization data-vizualisation matplotlib netflix pandas

Last synced: 08 Sep 2025

https://github.com/shukkkur/do-left-handed-people-die-young

Pandas + Bayesian Statistics - to see if left-handed people actually die earlier than righties.

bayesian-statistics data-manipulation data-visualization importing-and-cleaning-data probability probability-and-statistics statistics

Last synced: 08 Sep 2025

https://github.com/data-forge/data-forge-fs

This library contains the file system extensions to Data-Forge that allow it to directly read and write CSV and JSON files in Node.js

csv data data-analysis data-cleaning data-cleansing data-forge data-management data-manipulation data-munging data-visualization data-wrangling javascript json linq nodejs pandas visualization

Last synced: 04 Sep 2025

https://github.com/shukkkur/exploring-the-history-of-lego

Using variety of data manipulation techniques to explore different aspects of Lego's history.

bricks data-analysis data-manipulation data-visualization history jupyter-notebook lego python rebrickable-database

Last synced: 08 Sep 2025

https://github.com/juliaaplavin/datapipes.jl

The most convenient piping syntax for generic data manipulation in Julia.

data-manipulation macro

Last synced: 22 Apr 2025

https://github.com/public-health-scotland/scotpho-indicator-production

Code used to prepare data for indicators in ScotPHO's profiles

data-manipulation health-data public-health

Last synced: 14 Apr 2025

https://github.com/shukkkur/analyzing-tv-data

Using data manipulation and visualization techniques to explore one of two different television broadcast datasets: The Super Bowl and hit sitcom The Office!

data-manipulation data-visualization pandas python

Last synced: 04 Oct 2025

https://github.com/bevry/sortobject

Deeply sort an object by its keys without mangling any arrays inside of it

client-side data-manipulation nodejs object-sort

Last synced: 15 Nov 2025

https://github.com/shukkkur/predict-species-from-images

Building a model that can automatically detect honey bees and bumble bees in images

data-manipulation data-visualization importing-and-cleaning-data machine-learning

Last synced: 07 Mar 2026

https://github.com/shukkkur/gender-prediction-using-sound

Analyzing the gender distribution of children's book writers and use sound to match names to gender.

case-study data-manipulation fuzzy matplotlib numpy pandas prediction

Last synced: 14 Oct 2025

https://github.com/ghurtchu/parcsv

:open_file_folder::newspaper: Manipulate CSV dataframes.

csv data-manipulation dataframe etl scala

Last synced: 28 Apr 2025

https://github.com/bevry/getsetdeep

Get or set nested variables of an object

client-side data-manipulation model nodejs

Last synced: 09 Apr 2025

https://github.com/juliaaplavin/datamanipulation.jl

General and composable utilities for manipulating tabular, quasi-tabular, and non-tabular datasets.

data-manipulation filtering grouping joins lazy mapping tables

Last synced: 13 Mar 2025

https://github.com/bevry/arrangekeys

Returns a copy of a JavaScript object with the keys arranged in a specified order. Useful for formatting JSON files.

client-side data-manipulation nodejs object-sort

Last synced: 12 Oct 2025

https://github.com/iloveitaly/funcy-pipe

If Funcy and Pipe had a baby. Decorates all Funcy methods with Pipe superpowers.

data-manipulation functional-programming funcy pipe python

Last synced: 24 Apr 2025

https://github.com/erictleung/tutorial-tidyverse

:milky_way: Presentation on the tidyverse in R to clean and manipulate data

data-cleaning data-manipulation data-science manipulate-data presentation programming r tidyverse tutorial

Last synced: 25 Mar 2025

https://github.com/criccomini/twister

A handy tool that converts Avro and Protobuf data to and from Java POJOs

avro data-conversion data-inference data-interoperability data-manipulation java json open-source pojo proto serialization

Last synced: 24 Jul 2025

https://github.com/petrbouchal/purrrow

R package for iteratively collating Arrow datasets out of memory

arrow data-manipulation data-storage r rstats

Last synced: 14 Oct 2025

https://github.com/chaitanyac22/lending-club-project---data-analysis-for-a-consumer-finance-company

Lending Club is a consumer finance company that specializes in lending various types of loans to urban customers. When the company receives a loan application, the company has to make a decision for loan approval based on the applicant’s profile. The project work aims to help the company in understanding the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default. The company can utilize this knowledge for its portfolio and risk assessment.

banking business-intelligence data-analysis data-cleaning data-manipulation data-visualization exploratory-data-analysis feature-engineering finance portfolio-management python3 risk-assessment statistics

Last synced: 23 Aug 2025

https://github.com/tuliosg/cdp

Repositório do curso "Ciência de Dados para Pesquisa".

data-analysis data-manipulation data-science data-visualization google-colab jupyter-notebook python

Last synced: 03 Mar 2026

https://github.com/enjirouz/win_api

The scariest thing I've ever written. Лабораторные по дисциплине "Операционные системы" под Windows

data-manipulation file-manipulation multithreading winapi winapi-application

Last synced: 15 Apr 2025

https://github.com/m-clark/data-manipulation-in-r

Set of slides for a workshop introducing dplyr related functionality, piping and related.

data-manipulation dplyr magrittr piping tidyverse workshop

Last synced: 30 Apr 2025

https://github.com/devxt-llc/convertanything

Convert your data to Pydantic models with help from language models.

ai artificial-intelligence data-manipulation

Last synced: 11 Jul 2025

https://github.com/flynnzac/genvar

An R package implementing an imperative data manipulation and regression framework (Stata-like)

data-manipulation imperative-data-manipulation regression

Last synced: 11 Jan 2026

https://github.com/bouldercodehub/rwdataplyr

R package to read and manipulate data from RiverWareTM

data-manipulation r riverware

Last synced: 16 Mar 2026

https://github.com/astrodynamic/retailanalitycs-in-postgresql

Develop a SQL script to create a database with tables, views, roles, and functions. Form personalized offers to increase average check, frequency of visits, and cross-selling.

bd csv data-analysis data-export data-input data-manipulation data-validation database-management functions git margin offers postgresql retail role-permission-management selling sql transaction tsv views

Last synced: 06 Apr 2026

https://github.com/oswaldobapvicjr/jep-data-extension

An extension of the Java Expression Parser library with useful functions for data handling

data-manipulation java jep jsonpath regular-expression xpath

Last synced: 08 May 2025

https://github.com/darekf77/lodash-walk-object

Iterate all properties of object or array

data-manipulation iterator-pattern

Last synced: 07 Apr 2025

https://github.com/chaitanyac22/house-price-prediction-project-for-a-us-based-housing-company

The goal of this project is to garner data insights using data analytics to purchase houses at a price below their actual value and flip them on at a higher price. This project aims at building an effective regression model using regularization (i.e. advanced linear regression: Ridge and Lasso regression) in order to predict the actual values of prospective housing properties and decide whether to invest in them or not.

advanced-linear-regression business-analytics data-analysis data-cleaning data-manipulation data-visualization exploratory-data-analysis feature-engineering lasso-regression linear-regression machine-learning model-building model-evaluation prediction-model python3 regularization rfe ridge-regression statistics

Last synced: 30 Apr 2026

https://github.com/martinboller/cc-build

Builds latest version of CyberChef and install it with NGINX on another system. CyberChef is a simple, intuitive web app for analyzing and decoding data without having to deal with complex tools or programming languages.

analysis blueteam compression cyberchef data-analysis data-manipulation decode encode encryption hashing parsing virtual-machine

Last synced: 15 Feb 2026

https://github.com/mynenik/xyplot-32

Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux

cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows

Last synced: 02 Aug 2025

https://github.com/nafisalawalidris/analyzing-nobel-prize-dataset-demographics-and-trends

This project analyses a Nobel Prize dataset using Python and data analysis libraries. It explores the distribution of winners by category and country, examines the proportion of female winners over time, investigates the age of winners when they received the prize and identifies the oldest and youngest recipients.

age-at-award country-distribution data-analysis data-manipulation dataset demographics filtering gender-balance grouping nobel-prize notable-laureates python trends visualisation winners

Last synced: 19 May 2026

https://github.com/0xibra/fluxify

A micro python library that retrieves and organizes data from a yaml mapping.

csv data-flow data-flow-control data-manipulation data-mapper data-structure json mapping python xml yaml yaml-mapping

Last synced: 14 Apr 2026

https://github.com/vaibhavacharya/json-mason

A library for performing structured modifications on JSON data. It provides a safe, predictable way to transform JSON objects through a series of operations.

data-manipulation json json-manipulation json-operations json-transformation

Last synced: 15 May 2026

https://github.com/rudeboybert/math116

2017-02 Middlebury Intro to Statistical & Data Sciences

data-manipulation data-science data-visualization statistical-inference statistics

Last synced: 18 Jan 2026