Projects in Awesome Lists by data-integrations
A curated list of projects in awesome lists by data-integrations .
https://github.com/data-integrations/wrangler
Wrangler Transform: A DMD system for transforming Big Data
avro big-data cdap cdap-plugin data-cleansing data-prep data-science data-transform data-transformation manipulate-data parsing preparation project transform transform-data wrangle
Last synced: 25 Oct 2025
https://github.com/data-integrations/google-cloud
A collection of Google Cloud Platform (GCP) plugins
bigquery cdap cdap-plugin gcs google pubsub
Last synced: 04 Oct 2025
https://github.com/data-integrations/salesforce
Salesforce plugins
cdap cdap-plugin salesforce salesforce-api
Last synced: 25 Oct 2025
https://github.com/data-integrations/bigquery-delta-plugins
BigQuery Delta Replicator Plugins
Last synced: 25 Oct 2025
https://github.com/data-integrations/database-delta-plugins
Delta plugins for databases
Last synced: 25 Oct 2025
https://github.com/data-integrations/salesforce-marketing
Plugins for Salesforce Marketing Cloud
Last synced: 25 Oct 2025
https://github.com/data-integrations/hive-plugins
Hive Export/Import plugins
apache-hive cask-marketplace cdap cdap-plugin hive hive-export
Last synced: 25 Oct 2025
https://github.com/data-integrations/hubspot
A collection of hubspot connector and plugins
Last synced: 14 Jan 2026
https://github.com/data-integrations/amazon-s3-plugins
Amazon S3 source and sink
aws aws-s3 cask-marketplace cdap cdap-plugin
Last synced: 25 Oct 2025
https://github.com/data-integrations/kafka-plugins
Kafka Source/Sink for reading/writing to kafka topic
cask-marketplace cdap cdap-plugin kafka kafka-plugins kafka-sink kafka-source plugin sink spark-streaming
Last synced: 25 Oct 2025
https://github.com/data-integrations/google-analytics-360
A collection of Google Analytics 360 connectors and plugins.
cdap cdap-plugin google-analytics-360
Last synced: 25 Oct 2025
https://github.com/data-integrations/google-ads
Collection of Google Ads connectors and plugins
Last synced: 25 Oct 2025
https://github.com/data-integrations/azure
File Source to read data from distributed file system
adls azure azure-blob azure-blob-storage cask-marketplace cdap cdap-plugin source
Last synced: 25 Oct 2025
https://github.com/data-integrations/loaddatatosnowflake-action
Load data files from internal/external(amazon S3) location to Snowflake table.
aws cask-marketplace cdap cdap-plugin snowflake
Last synced: 25 Oct 2025
https://github.com/data-integrations/dynamic-spark
Dynamic Spark CDAP Plugins
cask-marketplace cdap cdap-plugin scala-spark spark
Last synced: 25 Oct 2025
https://github.com/data-integrations/sftp-actions
cask-marketplace cdap cdap-plugin sftp
Last synced: 25 Oct 2025
https://github.com/data-integrations/http
Sink plugin to send the messages from the pipeline to an external http endpoint
cask-marketplace cdap cdap-plugin http
Last synced: 25 Oct 2025
https://github.com/data-integrations/condition-plugins
A control flow plugin for making decisions based on runtime arguments and tokens
cdap cdap-plugin condition control-flow
Last synced: 25 Oct 2025
https://github.com/data-integrations/mqtt
MQTT Broker ingestion.
cask-marketplace cdap cdap-plugin iot mqtt
Last synced: 25 Oct 2025
https://github.com/data-integrations/facebook
A collection of facebook connectors and plugins
Last synced: 25 Oct 2025
https://github.com/data-integrations/nlp
A collection of directives and plugins for Natural Language Processing
cdap directives natural-language-processing nlp plugin
Last synced: 25 Oct 2025
https://github.com/data-integrations/ftp-plugins
FTP source and sink plugins
Last synced: 20 Jan 2026
https://github.com/data-integrations/httptohdfs-action
Action Plugin: Fetches data from an external HTTP webservice or URL
cask-marketplace cdap cdap-plugin http
Last synced: 25 Oct 2025
https://github.com/data-integrations/sendgrid
A collection of sendgrid plugins
cdap cdap-plugin sendgrid sendgrid-api
Last synced: 25 Oct 2025
https://github.com/data-integrations/data-profiler
Profiles the fields to generate statistics on each column specified.
cdap cdap-plugin data-quality profiler
Last synced: 25 Oct 2025
https://github.com/data-integrations/to-utf8-action
An action plugin to convert files created in other character sets into UTF-8 format
cask-marketplace cdap cdap-plugin character-set utf-8
Last synced: 25 Oct 2025
https://github.com/data-integrations/switch
A collection of plugins that provide switch...case transforms
cdap cdap-plugins switch switchcase transforms
Last synced: 25 Oct 2025
https://github.com/data-integrations/mongodb-plugins
Plugins for MongoDB integrations
Last synced: 20 Jan 2026
https://github.com/data-integrations/xml-directives
Collection of XML directives
cask-marketplace cdap cdap-plugin dataprep directory udd xml
Last synced: 25 Oct 2025
https://github.com/data-integrations/window-aggregation
Plugins for performing window aggregation
Last synced: 20 Jan 2026
https://github.com/data-integrations/dynamodb-sink
Plugin to write data to DynamoDB.
aws aws-dynamodb cask-marketplace cdap cdap-plugin data-integration dynamodb
Last synced: 25 Oct 2025
https://github.com/data-integrations/azure-delete-action
delete entity from Azure Blob service
azure azure-blob-storage cask-marketplace cdap cdap-plugin
Last synced: 25 Oct 2025
https://github.com/data-integrations/fast-filter-transform
A filter that only allows records through that pass the specified criteria.
cask-marketplace cdap cdap-plugin filtering
Last synced: 25 Oct 2025
https://github.com/data-integrations/multi-table-plugins
Pipeline plugins that allow reading from multiple DB tables with the same source
cask-marketplace cdap cdap-filesets cdap-plugin database
Last synced: 20 Jan 2026
https://github.com/data-integrations/record-splitter-transform
Given a field and a delimiter, this splits the value into multiple records.
cask-marketplace cdap cdap-plugin split
Last synced: 20 Jan 2026
https://github.com/data-integrations/splunk
A collection of Splunk connectors and plugins
Last synced: 25 Oct 2025
https://github.com/data-integrations/zendesk
A collection of Zendesk connector and plugin
Last synced: 20 Jan 2026
https://github.com/data-integrations/s3toredshift-action
Loads the data from S3 into Redshift
aws aws-s3 cask-marketplace cdap cdap-plugin redshift
Last synced: 25 Oct 2025
https://github.com/data-integrations/example-microservice
An example implementation of a microservice
Last synced: 25 Oct 2025
https://github.com/data-integrations/example-action
This example action plugin can be used as a starting point for developing new Action plugins.
cdap cdap-example cdap-plugin example
Last synced: 20 Jan 2026
https://github.com/data-integrations/image-directives
A set of directives for working with images
cask-marketplace cdap cdap-dataprep cdap-udds data-prep directives
Last synced: 25 Oct 2025
https://github.com/data-integrations/sqlserver-delta-plugins
SQLServer delta pipeline plugins
Last synced: 25 Oct 2025
https://github.com/data-integrations/state-restore-action
State Store Action for tracking the states
cask-marketplace cdap cdap-plugin database pipeline-state state-restoration state-store
Last synced: 25 Oct 2025
https://github.com/data-integrations/trash
Trash sink that eats up all the records passed to it.
batch cask-marketplace cdap cdap-plugin dev-null devops real-time trash
Last synced: 25 Oct 2025
https://github.com/data-integrations/confluent
A collection of Confluent and Confluent Cloud
cdap cdap-plugin confluent confluent-cloud
Last synced: 20 Jan 2026
https://github.com/data-integrations/sample-spark-app
Sample Vanilla Spark App
Last synced: 25 Oct 2025
https://github.com/data-integrations/repartitioner
Repartitions a spark RDD
cask-marketplace cdap cdap-plugin rdd repartition spark spark-streaming
Last synced: 25 Oct 2025
https://github.com/data-integrations/google-search-console
Repository for Google search Console
Last synced: 25 Oct 2025