Projects in Awesome Lists tagged with data-manipulation
A curated list of projects in awesome lists tagged with data-manipulation.
https://github.com/gchq/cyberchef
The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis
compression data-analysis data-manipulation encoding encryption hashing parsing
Last synced: 12 May 2025
https://gchq.github.io/CyberChef/
The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis
compression data-analysis data-manipulation encoding encryption hashing parsing
Last synced: 18 Mar 2025
https://github.com/javascriptdata/danfojs
Danfo.js is an open-source JavaScript library providing high-performance, intuitive, and easy-to-use data structures for manipulating and processing structured data.
danfojs data-analysis data-analytics data-manipulation data-science dataframe javascript pandas plotting-charts stream-data stream-processing table tensorflow tensors
Last synced: 14 May 2025
https://github.com/mattnotmax/cyberchef-recipes
A list of cyber-chef recipes and curated links
cyberchef cyberchef-recipes data-manipulation dfir incident-response malware regular-expression
Last synced: 02 Apr 2025
https://github.com/data-forge/data-forge-ts
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
csv data data-analysis data-cleaning data-cleansing data-forge data-management data-manipulation data-munging data-visualization data-wrangling javascript json linq nodejs pandas visualization
Last synced: 13 May 2025
https://github.com/fastverse/collapse
Advanced and Fast Data Transformation in R
cran data-aggregation data-analysis data-manipulation data-processing data-science data-transformation econometrics high-performance panel-data r rstats scientific-computing statistics time-series weighted weights
Last synced: 11 Jan 2026
https://github.com/sebkrantz/collapse
Advanced and Fast Data Transformation in R
cran data-aggregation data-analysis data-manipulation data-processing data-science data-transformation econometrics high-performance panel-data r rstats scientific-computing statistics time-series weighted weights
Last synced: 14 May 2025
https://github.com/stefmolin/Hands-On-Data-Analysis-with-Pandas-2nd-edition
Materials for following along with Hands-On Data Analysis with Pandas – Second Edition
data-analysis data-analysis-pandas data-analysis-python data-manipulation data-science data-wrangling machine-learning pandas visualizing-data
Last synced: 07 Sep 2025
https://github.com/vlandham/js_data
Data manipulation and processing in JavaScript
data-manipulation javascript js-data
Last synced: 04 Feb 2026
https://github.com/programminghistorian/jekyll
Jekyll-based static site for The Programming Historian
api data-management data-manipulation data-mining dh digital-humanities exhibits linked-open-data mapping multi-lingual network-analysis open-educational-resources open-source pedagogy programming-historian python r-studio scraping text-analysis web-scraping
Last synced: 14 Mar 2025
https://github.com/JuliaDataScience/JuliaDataScience
Book on Julia for Data Science
book data data-manipulation data-science data-visualization julia julia-language
Last synced: 20 Jul 2025
https://github.com/nathaneastwood/poorman
A poor man's dependency-free grammar of data manipulation
base-r data-manipulation grammar r
Last synced: 08 Sep 2025
https://github.com/aplus-framework/database
Aplus Framework Database Library
aplus aplus-framework composer data-definition data-manipulation database framework full-stack gitlab library mariadb mysql mysqli php php8 phpoop phpstorm vscode
Last synced: 16 May 2025
https://github.com/has2k1/plydata
A grammar for data manipulation in Python
data-manipulation pandas python
Last synced: 04 Apr 2025
https://github.com/pwwang/datar
A Grammar of Data Manipulation in Python
data-manipulation dplyr forcats groupby pandas tibble tidyr tidyverse tribble
Last synced: 04 Apr 2025
https://github.com/przemek83/volbx
Graphical tool for data manipulation written in C++/Qt.
c-plus-plus c-plus-plus-17 cpp cpp17 csv data data-analysis data-export data-filtering data-import data-manipulation data-visualization dynamic graphical ods plots qt spreadsheet statistical-analysis xlsx
Last synced: 08 Apr 2025
https://github.com/fastverse/fastverse
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
c cpp data-aggregation data-manipulation data-science data-transformation high-performance low-dependency panel-data r rstats statistical-computing time-series weights
Last synced: 12 Dec 2025
https://github.com/pewresearch/pewmethods
Pew Research Center Methods team R package of miscellaneous functions
data-manipulation r sampling survey survey-analysis survey-data surveys weighting
Last synced: 16 Jan 2026
https://github.com/k0retux/fuddly
Fuzzing and Data Manipulation Framework (for GNU/Linux)
data-manipulation framework fuzzing python security
Last synced: 20 Apr 2025
https://github.com/sergiocorreia/ftools
Fast Stata commands for large datasets
collapse data-manipulation egen factor mata merge stata
Last synced: 12 Feb 2026
https://github.com/ndleah/ibm-data-analyst-professional
Capstone projects of the IBM Data Analyst Professional
analyzing-data data-analysis data-analyst data-manipulation data-science data-visualization data-visualizations ibm-datascience-certification pandas python
Last synced: 05 May 2025
https://github.com/sl-solution/inmemorydatasets.jl
Multithreaded package for working with tabular data in Julia
data-manipulation data-wrangling dataframes dataset efficient high-performance in-memory join julia multithreaded tabular-data
Last synced: 17 Aug 2025
https://github.com/juliadata/indexedtables.jl
Flexible tables with ordered indices
data-analysis data-manipulation indexedtables julia juliadb
Last synced: 06 Apr 2025
https://github.com/tanyuqian/learning-data-manipulation
NeurIPS 2019 - Learning Data Manipulation for Augmentation and Weighting
bert data-augmentation data-manipulation meta-learning
Last synced: 26 Oct 2025
https://github.com/yulab-smu/treedata-book
📚 A complete reference book for the treeio, tidytree and ggtree packages
data-import data-manipulation ggtree phylogeny tidytree treeio visualization
Last synced: 22 Feb 2026
https://github.com/ohare93/brain-brew
Automated Anki flashcard creation and extraction to/from CSV
anki anki-flashcards brain-brew collaboration crowdanki csv csv-converter data-manipulation learning open-source python python37 yamale
Last synced: 29 Oct 2025
https://github.com/galliaproject/gallia-core
A schema-aware Scala library for data transformation
data-engineering data-manipulation data-science data-transformation etl feature-engineering json nesting scala spark
Last synced: 12 Feb 2026
https://github.com/TomasBeuzen/python-programming-for-data-science
Content from the University of British Columbia's Master of Data Science course DSCI 511.
data-manipulation data-science numpy pandas programming python teaching
Last synced: 18 Jul 2025
https://github.com/noraj/ctf-party
🚩 A CLI tool & library to enhance and speed up script/exploit writing with string conversion/manipulation.
ctf ctf-framework ctf-tools data-manipulation decoding encoding hacktoberfest hashing library security-tools string-manipulation
Last synced: 16 May 2025
https://github.com/kevinadhiguna/dqlab-career-track
A collection of scripts written to complete DQLab Data Analyst Career Track 📊
career-track data-analysis data-analyst data-manipulation data-quality data-visualization dqlab dqlab-career-track exploratory-data-analysis machine-learning python sql
Last synced: 21 Mar 2025
https://github.com/Koushikphy/Interactive_Data_Editor
Software for interactively editing data in a graphical manner
3d-plot computational-chemistry data data-analysis data-fitting data-manipulation data-visualization dataset electron-app electronjs fitting graph graphical griddata plotting regression-analysis smoothing snap surface-plot
Last synced: 18 Jul 2025
https://github.com/TomFevrier/kiwis
A Pandas-inspired data wrangling toolkit in JavaScript
data data-manipulation data-wrangling pandas
Last synced: 15 Mar 2025
https://github.com/data-datum/learning_R
List of resources for learning R
books data-manipulation data-visualization datatable dplyr functions ggplot2 purrr r r-programming r-spatial reproducible-research rmarkdown rstudio shiny shiny-apps strings strings-manipulation tidyr webinars
Last synced: 29 Jul 2025
https://github.com/matheusfelipeog/fordev
Generate and validate random data with fordev 🎲
4devs 4devs-api 4devs-module api data-generator data-manipulation data-validation fake-data fake-data-generator fordev fourthdev python random-data scrapping
Last synced: 29 Jun 2025
https://github.com/ssi-anik/dataset
Dataset is a PHP package for importing and exporting data between CSV files and databases, with data manipulation
csv csv-export csv-exporter csv-parsing csv-postgres csv-to-mysql data-manipulation database-csv-import php
Last synced: 26 Apr 2025
https://github.com/JonnyTran/OpenOmics
A bioinformatics API to interface with public multi-omics bio databases for wicked fast data integration.
data-integration data-manipulation genomics multi-omics python
Last synced: 18 Mar 2025
https://github.com/tony-aw/broadcast
R Package Broadcast: Broadcasted Array Operations Like ‘NumPy’
cran cran-r data-manipulation fastverse high-performance multidimensional-arrays numpy r r-package rstats rstats-package scientific-computing
Last synced: 22 Feb 2026
https://github.com/renkun-ken/r-data-practice
R data manipulation practice
data-analysis data-manipulation practice r
Last synced: 29 Oct 2025
https://github.com/rubydamodar/the-ultimate-pandas-bootcamp
Welcome to the Pandas for Data Science repository! This course is designed to take you from beginner to proficient in using Pandas, the powerful data manipulation library in Python. Whether you're just starting your data science journey or looking to sharpen your skills, this repository contains all the resources
beginner-friendly csv-data data-analysis data-cleaning data-manipulation data-science data-visualization dataframe exploratory-data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python python-pandas series statistical-analysis time-series titanic-dataset
Last synced: 19 Apr 2025
https://github.com/juliaaplavin/flexijoins.jl
Flexible joining operations for tabular and non-tabular datasets with wide range of join conditions including distance, interval, and comparison predicates.
asof-join catalog-matching data-manipulation distance-join equi-join joins matching
Last synced: 02 Aug 2025
https://github.com/juliaaplavin/fleximaps.jl
Generalize `map`: make it lazy, filtering, flattening, ...
data-manipulation filtering lazy mappings
Last synced: 29 Dec 2025
https://github.com/clojure-finance/datajure
Clojure data manipulation DSL — composable query syntax built on tech.ml.dataset
clojure data-manipulation data-science dataframe dsl empirical-research query-dsl tech-ml-dataset
Last synced: 20 Apr 2026
https://github.com/mikeqfu/pyhelpers
PyHelpers: An open-source toolkit for facilitating Python users' data manipulation tasks
data-manipulation data-preprocessing py-utils python python-utilities python-utility python-utils utilities
Last synced: 10 Feb 2026
https://github.com/shukkkur/john-snows-ghost-map
Recreate John Snow's famous map of the 1854 cholera outbreak in London.
case-study data-manipulation data-visualization importing-and-cleaning-data
Last synced: 08 Sep 2025
https://github.com/shukkkur/exploring-cryptocurrency-market
To better understand the growth and impact of Bitcoin and other cryptocurrencies, I explore the market capitalization of different cryptocurrencies.
data-manipulation data-visualization importing-and-cleaning-data
Last synced: 04 Oct 2025
https://github.com/shukkkur/analyzing-netflix-data
EDA, manipulating raw data, drawing conclusions from plots on Netflix data.
data-manipulation data-visualization data-vizualisation matplotlib netflix pandas
Last synced: 08 Sep 2025
https://github.com/shukkkur/do-left-handed-people-die-young
Pandas + Bayesian Statistics - to see if left-handed people actually die earlier than righties.
bayesian-statistics data-manipulation data-visualization importing-and-cleaning-data probability probability-and-statistics statistics
Last synced: 08 Sep 2025
https://github.com/data-forge/data-forge-fs
This library contains the file system extensions to Data-Forge that allow it to directly read and write CSV and JSON files in Node.js
csv data data-analysis data-cleaning data-cleansing data-forge data-management data-manipulation data-munging data-visualization data-wrangling javascript json linq nodejs pandas visualization
Last synced: 04 Sep 2025
https://github.com/shukkkur/exploring-the-history-of-lego
Using a variety of data manipulation techniques to explore different aspects of Lego's history.
bricks data-analysis data-manipulation data-visualization history jupyter-notebook lego python rebrickable-database
Last synced: 08 Sep 2025
https://github.com/juliaaplavin/datapipes.jl
The most convenient piping syntax for generic data manipulation in Julia.
Last synced: 22 Apr 2025
https://github.com/juliadatascience/juliadatascience-pt
Book on Julia for Data Science (Portuguese Edition)
book data data-manipulation data-science data-visualization julia julia-language
Last synced: 24 Jun 2025
https://github.com/public-health-scotland/scotpho-indicator-production
Code used to prepare data for indicators in ScotPHO's profiles
data-manipulation health-data public-health
Last synced: 14 Apr 2025
https://github.com/sl-solution/inmemorydatasetstutorial
A tutorial for working with InMemoryDatasets.jl.
data data-manipulation data-science data-wrangling dataset flight-data-analysis inmemorydatasets julia jupyter-notebook tutorial
Last synced: 22 Apr 2025
https://github.com/shukkkur/analyzing-tv-data
Using data manipulation and visualization techniques to explore one of two different television broadcast datasets: The Super Bowl and hit sitcom The Office!
data-manipulation data-visualization pandas python
Last synced: 04 Oct 2025
https://github.com/bevry/sortobject
Deeply sort an object by its keys without mangling any arrays inside of it
client-side data-manipulation nodejs object-sort
Last synced: 15 Nov 2025
https://github.com/juliaaplavin/sqlcollections.jl
data-manipulation sql tabular-data
Last synced: 23 Mar 2025
https://github.com/shukkkur/which-debts-are-worth-the-bank-s-efforts
Using regression discontinuity to determine which debts are worth collecting.
data-manipulation data-visualization importing-and-cleaning-data probability probability-and-statistics
Last synced: 13 Feb 2026
https://github.com/chaitanyac22/car-price-prediction-model-for-an-automobile-consulting-company
The goal of this project is to build multiple linear regression models for the prediction of car prices.
business-analytics data-analytics data-cleaning data-manipulation data-visualization exploratory-data-analysis feature-engineering machine-learning model-building model-evaluation prediction-model python3 residual-analysis statistics
Last synced: 13 Apr 2025
https://github.com/shukkkur/predict-species-from-images
Building a model that can automatically detect honey bees and bumble bees in images
data-manipulation data-visualization importing-and-cleaning-data machine-learning
Last synced: 07 Mar 2026
https://github.com/amey-thakur/kaggle
Kaggle Courses - All Exercises of the respective courses.
amey ameythakur courses data-cleaning data-manipulation data-science data-visualization deep-learning feature-engineering intro-to-ml kaggle machine-learning machine-learning-explainability python
Last synced: 25 Aug 2025
https://github.com/shukkkur/gender-prediction-using-sound
Analyzing the gender distribution of children's book writers and using sound to match names to gender.
case-study data-manipulation fuzzy matplotlib numpy pandas prediction
Last synced: 14 Oct 2025
https://github.com/shukkkur/classify-song-genres-from-audio-data
Python ML Methods to classify songs into genres.
classification data-manipulation data-science data-visualization importing-and-cleaning-data machine-learning sklearn
Last synced: 08 Sep 2025
https://github.com/ghurtchu/parcsv
📂📰 Manipulate CSV dataframes.
csv data-manipulation dataframe etl scala
Last synced: 28 Apr 2025
https://github.com/shukkkur/analyzing-the-discovery-of-handwashing
Analyzing the medical data Dr. Semmelweis collected.
data-manipulation data-visualization importing-and-cleaning-data probability statistics
Last synced: 08 Oct 2025
https://github.com/bevry/getsetdeep
Get or set nested variables of an object
client-side data-manipulation model nodejs
Last synced: 09 Apr 2025
https://github.com/juliaaplavin/datamanipulation.jl
General and composable utilities for manipulating tabular, quasi-tabular, and non-tabular datasets.
data-manipulation filtering grouping joins lazy mapping tables
Last synced: 13 Mar 2025
https://github.com/bevry/arrangekeys
Returns a copy of a JavaScript object with the keys arranged in a specified order. Useful for formatting JSON files.
client-side data-manipulation nodejs object-sort
Last synced: 12 Oct 2025
https://github.com/siarheidudko/firebase-admin-cli
CLI for Firebase
authentication cli data-manipulation data-migratation data-science firebase firebase-admin firebase-auth firebase-authentication firebase-database firebase-firestore firebase-firestore-database firebase-realtime-database firebase-storage firebase-tools firebase-ui firestore google-cloud-storage rtdb storage
Last synced: 27 Feb 2026
https://github.com/iloveitaly/funcy-pipe
If Funcy and Pipe had a baby. Decorates all Funcy methods with Pipe superpowers.
data-manipulation functional-programming funcy pipe python
Last synced: 24 Apr 2025
https://github.com/erictleung/tutorial-tidyverse
🌌 Presentation on the tidyverse in R to clean and manipulate data
data-cleaning data-manipulation data-science manipulate-data presentation programming r tidyverse tutorial
Last synced: 25 Mar 2025
https://github.com/criccomini/twister
A handy tool that converts Avro and Protobuf data to and from Java POJOs
avro data-conversion data-inference data-interoperability data-manipulation java json open-source pojo proto serialization
Last synced: 24 Jul 2025
https://github.com/petrbouchal/purrrow
R package for iteratively collating Arrow datasets out of memory
arrow data-manipulation data-storage r rstats
Last synced: 14 Oct 2025
https://github.com/chaitanyac22/lending-club-project---data-analysis-for-a-consumer-finance-company
Lending Club is a consumer finance company that specializes in lending various types of loans to urban customers. When the company receives a loan application, the company has to make a decision for loan approval based on the applicant’s profile. The project work aims to help the company in understanding the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default. The company can utilize this knowledge for its portfolio and risk assessment.
banking business-intelligence data-analysis data-cleaning data-manipulation data-visualization exploratory-data-analysis feature-engineering finance portfolio-management python3 risk-assessment statistics
Last synced: 23 Aug 2025
https://github.com/tuliosg/cdp
Repository for the course "Ciência de Dados para Pesquisa" (Data Science for Research).
data-analysis data-manipulation data-science data-visualization google-colab jupyter-notebook python
Last synced: 03 Mar 2026
https://github.com/enjirouz/win_api
The scariest thing I've ever written. Lab assignments for the "Operating Systems" course, on Windows
data-manipulation file-manipulation multithreading winapi winapi-application
Last synced: 15 Apr 2025
https://github.com/m-clark/data-manipulation-in-r
Set of slides for a workshop introducing dplyr-related functionality, piping, and related topics.
data-manipulation dplyr magrittr piping tidyverse workshop
Last synced: 30 Apr 2025
https://github.com/devxt-llc/convertanything
Convert your data to Pydantic models with help from language models.
ai artificial-intelligence data-manipulation
Last synced: 11 Jul 2025
https://github.com/flynnzac/genvar
An R package implementing an imperative data manipulation and regression framework (Stata-like)
data-manipulation imperative-data-manipulation regression
Last synced: 11 Jan 2026
https://github.com/bouldercodehub/rwdataplyr
R package to read and manipulate data from RiverWare™
Last synced: 16 Mar 2026
https://github.com/astrodynamic/retailanalitycs-in-postgresql
Develop a SQL script to create a database with tables, views, roles, and functions. Form personalized offers to increase average check, frequency of visits, and cross-selling.
bd csv data-analysis data-export data-input data-manipulation data-validation database-management functions git margin offers postgresql retail role-permission-management selling sql transaction tsv views
Last synced: 06 Apr 2026
https://github.com/oswaldobapvicjr/jep-data-extension
An extension of the Java Expression Parser library with useful functions for data handling
data-manipulation java jep jsonpath regular-expression xpath
Last synced: 08 May 2025
https://github.com/darekf77/lodash-walk-object
Iterate all properties of object or array
data-manipulation iterator-pattern
Last synced: 07 Apr 2025
https://github.com/chaitanyac22/house-price-prediction-project-for-a-us-based-housing-company
The goal of this project is to garner data insights using data analytics to purchase houses at a price below their actual value and flip them on at a higher price. This project aims at building an effective regression model using regularization (i.e. advanced linear regression: Ridge and Lasso regression) in order to predict the actual values of prospective housing properties and decide whether to invest in them or not.
advanced-linear-regression business-analytics data-analysis data-cleaning data-manipulation data-visualization exploratory-data-analysis feature-engineering lasso-regression linear-regression machine-learning model-building model-evaluation prediction-model python3 regularization rfe ridge-regression statistics
Last synced: 30 Apr 2026
https://github.com/martinboller/cc-build
Builds the latest version of CyberChef and installs it with NGINX on another system. CyberChef is a simple, intuitive web app for analyzing and decoding data without having to deal with complex tools or programming languages.
analysis blueteam compression cyberchef data-analysis data-manipulation decode encode encryption hashing parsing virtual-machine
Last synced: 15 Feb 2026
https://github.com/mynenik/xyplot-32
Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux
cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows
Last synced: 02 Aug 2025
https://github.com/nafisalawalidris/analyzing-nobel-prize-dataset-demographics-and-trends
This project analyses a Nobel Prize dataset using Python and data analysis libraries. It explores the distribution of winners by category and country, examines the proportion of female winners over time, investigates the age of winners when they received the prize and identifies the oldest and youngest recipients.
age-at-award country-distribution data-analysis data-manipulation dataset demographics filtering gender-balance grouping nobel-prize notable-laureates python trends visualisation winners
Last synced: 19 May 2026
https://github.com/0xibra/fluxify
A micro Python library that retrieves and organizes data from a YAML mapping.
csv data-flow data-flow-control data-manipulation data-mapper data-structure json mapping python xml yaml yaml-mapping
Last synced: 14 Apr 2026
https://github.com/vaibhavacharya/json-mason
A library for performing structured modifications on JSON data. It provides a safe, predictable way to transform JSON objects through a series of operations.
data-manipulation json json-manipulation json-operations json-transformation
Last synced: 15 May 2026
https://github.com/rudeboybert/math116
2017-02 Middlebury Intro to Statistical & Data Sciences
data-manipulation data-science data-visualization statistical-inference statistics
Last synced: 18 Jan 2026
https://github.com/crazywolf132/wql
WQL -- Data Manipulation Language like GraphQL
data-manipulation graphql interpreter javascript javascript-library json language parser server-side
Last synced: 05 Apr 2025