Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with emr

A curated list of projects in awesome lists tagged with emr.

https://github.com/aws/aws-sdk-pandas

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

amazon-athena amazon-sagemaker-notebook apache-arrow apache-parquet athena aws aws-glue aws-lambda data-engineering data-science emr etl glue-catalog lambda modin mysql pandas python ray redshift

Last synced: 16 Dec 2024

https://github.com/fastenhealth/fasten-onprem

Fasten is an open-source, self-hosted, personal/family electronic medical record aggregator, designed to integrate with hundreds of thousands of insurers, hospitals, and clinics

electronic-health-record electronic-medical-record emr healthcare open-source personal-health-record

Last synced: 19 Dec 2024

https://github.com/earthians/marley

Open Source Health Information System

emr erp erpnext health healthcare hims opd patient-management

Last synced: 03 Nov 2024

https://github.com/lynnlangit/learning-hadoop-and-spark

Companion to Learning Hadoop and Learning Spark courses on LinkedIn Learning

apache-spark dataproc emr hadoop learning-hadoop mapreduce spark wordcount

Last synced: 15 Dec 2024

https://github.com/lynnlangit/hello-aws-data-services

AWS Data/ML Services sample code & notes for my LinkedIn Learning courses

athena aws aws-cli aws-sdk dynamodb emr kinesis rds redshift sagemaker

Last synced: 28 Oct 2024

https://github.com/lynnlangit/Hello-AWS-Data-Services

AWS Data/ML Services sample code & notes for my LinkedIn Learning courses

athena aws aws-cli aws-sdk dynamodb emr kinesis rds redshift sagemaker

Last synced: 03 Sep 2024

https://github.com/sensu-plugins/sensu-plugins-aws

This plugin provides native AWS instrumentation for monitoring and metrics collection, including: health and metrics for various AWS services, such as EC2, RDS, ELB, and more, as well as handlers for EC2, SES, and SNS.

autoscaling aws aws-monitoring aws-networking certificate-manager cloudfront cloudwatch ebs ec2 elasticache emr load-balancer metrics monitoring rds sensu sensu-handler sensu-plugins sns sqs

Last synced: 20 Dec 2024

https://github.com/nellore/rail

Scalable RNA-seq analysis

alignments emr ipython mapreduce rail-rna rna-seq-analysis

Last synced: 12 Oct 2024

https://github.com/beda-software/fhir-emr

EMR based on FHIR

emr fhir

Last synced: 17 Dec 2024

https://github.com/datitran/emr-bootstrap-pyspark

Quickstart PySpark with Anaconda on AWS/EMR

aws emr python3

Last synced: 22 Oct 2024

https://github.com/basin-etl/basin

Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser

emr etl hadoop informatica odi pipeline pyspark spark

Last synced: 09 Nov 2024

https://github.com/abdullahkhawer/aws-auto-terminate-idle-emr

An AWS-based solution that uses AWS CloudWatch and a Python-based AWS Lambda function to automatically terminate AWS EMR clusters that have been idle for a specified period of time.

amazon-web-services automation aws aws-cloudformation aws-cloudwatch aws-emr aws-lambda bigdata boto3 cft cloudformation cloudwatch datalake emr etl idle python python-3-7 serverless terminate

Last synced: 27 Oct 2024

https://github.com/project-monai/monai-deploy-informatics-gateway

MONAI Deploy Informatics Gateway facilitates integration with DICOM-compliant systems, enables ingestion of imaging data, helps trigger workflows with the MONAI Deploy Workflow Manager, and pushes the output to PACS systems.

ai csharp dicom dicomweb-client dotnet ehr emr fhir fhir-client fo-dicom healthcare medical-imaging

Last synced: 14 Nov 2024

https://github.com/jaehyeon-kim/dbt-on-aws

dbt (data build tool) projects targeting AWS analytics services (Redshift, Glue, EMR, Athena) and open table formats

athena dbt delta-lake emr glue hudi iceberg redshift

Last synced: 17 Dec 2024

https://github.com/fastenhealth/fasten-sources

Fasten Sources is a library that defines medical provider metadata (definitions - OpenID Metadata documents) and HTTP clients (OAuth2/Smart-on-FHIR clients) which can be used to retrieve data from various medical providers (clients).

emr fhir-client healthcare personal-health-record smart-on-fhir

Last synced: 21 Nov 2024

https://github.com/canvas-medical/mobile-patient-app

FHIR-based cross-platform mobile app for patients. Supports scheduling, messaging, payments, medical records, and more. Fork, customize, contribute!

ehr emr healthcare healthtech patient patient-portal

Last synced: 19 Nov 2024

https://github.com/camposvinicius/aws-etl

An ETL application on AWS that uses open sales and customer data, available at https://github.com/camposvinicius/data/blob/main/AdventureWorks.zip: a zipped file of CSVs to which transformations are applied.

airflow argocd athena aws catalog data data-engineer database emr emr-cluster etl glue kubernetes pipeline postgres pyspark rds spark

Last synced: 04 Dec 2024

https://github.com/dermatologist/hephaestus

Hephaestus - ETL and ML tools for OHDSI - OMOP CDM

datawarehouse emr etl health-data health-informatics

Last synced: 09 Nov 2024

https://github.com/sparkfish/scanray

Scanner event monitor for web-based scanning of PDF417 barcodes in healthcare settings

barcode-scanner ehr electronic-health-records electronic-medical-records emr healthcare pdf417

Last synced: 17 Dec 2024

https://github.com/dermatologist/oscar-latest-docker

This is a script to deploy OSCAR EMR in a Docker container.

devops-tools docker-compose emr oscar

Last synced: 09 Nov 2024

https://github.com/AuFeld/Data_Engineering_Projects

A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousing, containerization, and a dashboard to monitor data pipeline KPIs

airflow aws cassandra data-engineering data-lake data-warehouse docker emr etl-pipeline infrastructure-as-code infrastructure-setup postgresql python redshift s3 spark

Last synced: 04 Dec 2024

https://github.com/hearthsim/articles

Analysis of Hearthstone replays

emr hearthstone mrjob replays

Last synced: 08 Nov 2024

https://github.com/xianwill/spark-boilerplate

A boilerplate for Spark projects with Docker support for local development and scripts for EMR support.

apache-spark boilerplate docker emr emr-cluster spark

Last synced: 14 Oct 2024

https://github.com/bluishglc/emr-edgenode-maker

This tool can easily build an EMR cluster edge node (also called a client node or gateway node)

client edgenode emr gateway

Last synced: 05 Nov 2024

https://github.com/motasimfoad/emr

“EMR” is a platform built using leading-edge web technologies and APIs to help doctors, patients, hospitals, and pharmacies better deal with medical documentation.

apollo data doctor emr graphcool graphql hospital medical patients pharmacy reactjs record yarn

Last synced: 24 Nov 2024

https://github.com/gizmodata/spark-connect-proxy

A reverse proxy server which allows secure connectivity to a Spark Connect server

apache-spark aws emr gizmodata ibis jwt-authentication proxy pyspark spark spark-connect spark-connect-server tls-support

Last synced: 08 Nov 2024

https://github.com/bluishglc/ranger-emr-cli-installer

This is a powerful CLI tool for automated installation and integration of Apache Ranger and AWS EMR with OpenLDAP and Windows AD. It supports both open-source Ranger and EMR-native Ranger, supports both OpenLDAP and Windows AD, and works in all AWS regions (including the China regions).

ad emr install integrate ldap ranger shell tool

Last synced: 05 Nov 2024

https://github.com/devopscorner/iac-terraform-emr

AWS Summit 2022 ASEAN --- COM203 Using IaC with Terraform to provision Big Data Platform on Amazon EMR

airflow aws cicd cloud9 codebuild codedeploy codepipeline container devops devopscorner docker docker-compose ecr emr iac infrastructure-as-code mwaa rds terraform

Last synced: 10 Nov 2024

https://github.com/elexis/elexis-environment

An integrated Elexis environment

bookstack elexis elexis-server emr nextcloud rocketchat

Last synced: 05 Nov 2024

https://github.com/vitalibo/spark-aws-orchestration

Deployment/Orchestration of Apache Spark applications on Amazon EMR.

aws cloudformation emr spark step-functions

Last synced: 07 Nov 2024

https://github.com/dermatologist/openmrs-module-skinhelpdesk

An OpenMRS module for dermatology to map lesions on a body image. Video: https://youtu.be/wy5JPX6AWoM

dermatology emr health-informatics healthcare-application openmrs

Last synced: 28 Oct 2024

https://github.com/us8945/aws_emr_pysparkling

Set up a Python environment on an AWS EMR cluster with H2O Sparkling Water (PySparkling)

aws emr h2o jupyter-notebook pyspark pysparkling spark sparkling-water

Last synced: 06 Dec 2024

https://github.com/chrisekelley/zeprs

ZEPRS EMR

emr hmis java struts1

Last synced: 19 Nov 2024

https://github.com/nicor88/aws-infrastructure

AWS infrastructure: Cloudformation, Terraform...

aws boto3 cloudformation ec2 emr firehose iam kinesis lambda s3 vpc

Last synced: 06 Dec 2024

https://github.com/jaehyeon-kim/iceberg-etl-demo

Data Warehousing ETL Demo with Apache Iceberg on EMR Local Environment

datawarehousing emr etl iceberg scd spark

Last synced: 17 Dec 2024

https://github.com/davealdon/hl7-hero-api

Support and library resources for HL7 Hero, a mobile app that parses HL7 2.X Schemas.

7 emr health hl7 hl7-message hl7-parser hl7-parsing hl7v2 level medical medicine msh parsing schema xamarin

Last synced: 14 Dec 2024

https://github.com/ev2900/emr_studio_hudi

Apache Hudi examples designed to be run on AWS Elastic MapReduce (EMR) via EMR Studio or EMR Notebooks

apache-hudi aws elastic-map-reduce emr hudi hudi-examples

Last synced: 05 Nov 2024

https://github.com/maur1th/task_diff

Create Terraform AWS Container Definition diffs

aws diff ecs emr iam-policy terraform

Last synced: 14 Oct 2024

https://github.com/tmusabbir/glue-utils

A few AWS Glue utility scripts

amazon-web-services aws emr etl glue lakeformation

Last synced: 04 Dec 2024

https://github.com/principlebrothers/yarysa

Yarysa EMR is a robust and efficient operating system for medical facilities and patients. We are using blockchain technology to accelerate the transition of the health sector to a digitized system.

emr hooks react

Last synced: 07 Dec 2024

https://github.com/pomadchin/vlm-performance

GeoTrellis RasterSources Ingest benchmark

aws emr geotrellis gis raster spark

Last synced: 16 Nov 2024

https://github.com/st3v3nmw/elixir-backend

Backend for a distributed electronic health records system

ehr electronic-health-records electronic-medical-records emr personal-health-record single-sign-on

Last synced: 29 Nov 2024

https://github.com/dermatologist/openmrs-owa-react-boilerplate

An OpenMRS OWA Template Using React and Redux.

emr openmrs react-redux reactjs

Last synced: 21 Dec 2024

https://github.com/jaehyeon-kim/emr-local-dev

Spark Local Development Environment Using Docker (and vscode)

aws docker emr spark vscode

Last synced: 30 Oct 2024

https://github.com/ev2900/emr_studio_deployment

Example Jupyter notebook for EMR Studio

aws emr emr-studio spark

Last synced: 05 Nov 2024

https://github.com/ev2900/emr_studio_delta_lake

Delta Lake examples designed to be run on AWS Elastic MapReduce (EMR) via EMR Studio or EMR Notebooks

aws databricks deltalake elastic-map-reduce emr

Last synced: 05 Nov 2024

https://github.com/ev2900/emr_studio_iceberg

Apache Iceberg examples designed to be run on AWS Elastic MapReduce (EMR) via EMR Studio or EMR Notebooks

apache-iceberg aws elastic-map-reduce emr iceberg

Last synced: 05 Nov 2024

https://github.com/rkr2017/emr-slack-notify

AWS Lambda function to send EMR events to Slack via SNS

aws aws-lambda cloudwatch-events emr emr-cluster lambda-functions slack

Last synced: 19 Nov 2024

https://github.com/twentyone24/patient-medical-portal

Patient Medical Portal (PMP), an open-source patient medical portal built on top of the MedPlum SDK

ehr emr fhir healthcare medical open-source

Last synced: 16 Nov 2024

https://github.com/ev2900/iceberg_emr_athena

Resources from a virtual tech talk / workshop - Set Up and Use Apache Iceberg Tables on Your Data Lake

apache-iceberg athena aws emr spark

Last synced: 05 Nov 2024

https://github.com/jaehyeon-kim/emr-on-eks-terraform

Manage EMR on EKS with Terraform

aws eks emr emroneks terraform

Last synced: 17 Dec 2024

https://github.com/twentyone24/patient-wellcare-dashboard

Patient-Wellcare-Dashboard (PWD), an open-source patient medical portal built on top of the MedPlum SDK

ehr emr fhir healthcare healthcare-data medical oss patient-management typescript wellcare

Last synced: 16 Nov 2024

https://github.com/ev2900/emr_studio_stock_price_demo

Demo EMR Studio notebook using PySpark to explore Stock Price Data

aws emr emr-studio spark

Last synced: 05 Nov 2024

https://github.com/cloudposse-archives/terraform-aws-spotinst-mrscaler

Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS using a Spotinst AWS MrScaler resource

cluster emr emr-cluster hcl2 map-reduce spot-instances spotinst

Last synced: 07 Nov 2024

https://github.com/snowplow/emr-etl-runner

Run Snowplow's enrichments on Amazon Elastic MapReduce with minimum fuss

aws-emr emr snowplow snowplowanalytics

Last synced: 09 Nov 2024

https://github.com/st3v3nmw/elixir-frontend

Frontend for a distributed electronic health records system

ehr electronic-health-records electronic-medical-records emr personal-health-record single-sign-on

Last synced: 29 Nov 2024

https://github.com/raz-mon/dsp_ass2

Assignment 2 of the course 'Distributed Systems Programming' by Meni Adler. The assignment builds an application that calculates the probability of each word following a given pair of words, for every pair of words in the Google n-gram corpus.

aws distributed-systems ec2 emr hadoop n-gram s3

Last synced: 16 Dec 2024

https://github.com/morazow/exasol-emr-terraform

The Terraform modules to create Exasol and EMR clusters on AWS

aws emr exasol terraform

Last synced: 11 Dec 2024

https://github.com/dermatologist/medpromptjs

Collection of LLM prompts, tools, chains and agents for healthcare using LangChain & FHIR. (JS Version)

emr healthcare langchain-js large-language-models medical summarization

Last synced: 15 Dec 2024

https://github.com/dermatologist/openmrs-owa-vue-boilerplate

An OpenMRS OWA Template Using Vue.

emr openmrs vue

Last synced: 15 Dec 2024

https://github.com/aws-cloudformation/aws-cloudformation-resource-providers-emrwal

cfn resource provider package for AWS EMRWAL

aws-resources emr resources

Last synced: 08 Nov 2024

https://github.com/bluevertex/emr-bridge

Base package for connecting to various EMR systems via Laravel

emr laravel laravel-5-package php

Last synced: 15 Dec 2024

https://github.com/matbragan/emr-airflow

Developing a Flow with EMR and Airflow

airflow aws aws-emr-clusters emr emr-cluster spark

Last synced: 16 Nov 2024

https://github.com/haabiy/emrrunner

A powerful CLI tool and API for managing Spark jobs on Amazon EMR clusters.

apache-spark api cloud-computing distributed-systems emr flask software-engineering venv-bootstrap

Last synced: 16 Nov 2024

https://github.com/ericlondon/python-aws-emr-status-arduino-leds

Python AWS EMR Status Arduino LEDs

arduino aws boto3 emr led python serial usb

Last synced: 12 Nov 2024

https://github.com/andrearettaroli/simulated-transactions-big-data

The goal of this notebook is to analyze and extract useful information from the Kaggle simulated-transactions dataset

emr notebook scala spark tableau

Last synced: 09 Nov 2024

https://github.com/ramitsurana/emr-ml

AWS EMR info, including Hadoop, MapReduce, and Hive, along with machine learning

emr hadoop map-reduce

Last synced: 09 Nov 2024

https://github.com/ineerav/tfidf-map-reduce

Running TF-IDF using Spark Streaming on Hillary Clinton's infamous leaked email data set https://www.kaggle.com/datasets/kaggle/hillary-clinton-emails

aws emr maven pig-latin shell spark spring-boot tf-idf

Last synced: 17 Nov 2024

https://github.com/mikeacosta/data-lake-spark

Data lake ETL pipeline in Apache Spark

apache-spark aws emr s3

Last synced: 09 Nov 2024

https://github.com/yo-mah-ya/aws_emr

AWS EMR

aws emr

Last synced: 20 Nov 2024