Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists by joyceannie

A curated list of projects in awesome lists by joyceannie .

https://github.com/joyceannie/data-modeling-with-postgres

The main focus of the project is data modeling with Postgres and build an ETL pipeline using Python. The first step is to define fact and dimension tables for a star schema for a particular analytic focus. The second step is to write an ETL pipeline that transfers data from files in different directories into these tables in Postgres using Python and SQL.

data-engineering data-modeling postgresql python

Last synced: 01 Dec 2024

https://github.com/joyceannie/reddit_data_pipeline

The purpose of the project is to create a data pipeline to extract data from Reddit API and create a dashboard to analyse the data. The data is extracted from the subreddit r/Python. The data is extracted daily and uploaded to S3 buckets, and copied to Redshift. The dashboard is created using Google Data Studio.

airflow aws etl python redshift s3 terraform

Last synced: 01 Dec 2024

https://github.com/joyceannie/medical_chatbot

The objective of this project is to create a chatbot that can be used to communicate with users to provide answers to their health issues. This is a RAG implementation using open source stack.

chatbot fastapi generative-ai llm nlp python qdrant rag vector-database

Last synced: 01 Dec 2024

https://github.com/joyceannie/us-immigrations-data-warehouse

A data warehouse to perform analytics on the immigration trends in the US.

airflow data-engineering etl pyspark redshift s3 spark

Last synced: 01 Dec 2024

https://github.com/joyceannie/tomatoleafdiseaseprediction

The objective of the project is to classify the images of tomato plant leaves to identify whether the plant is healthy or not. A classification model is trained using Tensorflow and Keras. The application runs locally on Flask server.

classification-algorithm flask-application keras machine-learning multiclass-classification python tensorflow2

Last synced: 01 Dec 2024

https://github.com/joyceannie/web-crawler

The crawler downloads textual content from a particular domain and removes noise using site agnostic techniques.

java

Last synced: 01 Dec 2024

https://github.com/joyceannie/sql-data-with-danny-case-studies

Case study solutions for #8WeekSQLChallenge at https://8weeksqlchallenge.com

8-week-sql-challenge 8weeksqlchallenge case-study data-analysis data-analytics postgresql sql

Last synced: 01 Dec 2024

https://github.com/joyceannie/tic-tac-toe

Implemented in React

Last synced: 01 Dec 2024

https://github.com/joyceannie/todoapp

A simple ToDo Android App developed using Andriod Studio.

android android-app android-studio java

Last synced: 01 Dec 2024

https://github.com/joyceannie/portfolio

Portfolio website using HTML, CSS and Bootstrap. Implemented as part of CS50.

Last synced: 01 Dec 2024

https://github.com/joyceannie/pig-game

This is a simple version of Pig Game developed using HTML, CSS and JavaScript

Last synced: 01 Dec 2024

https://github.com/joyceannie/network-security

A set of programming assignments done as part of the course Network Security

Last synced: 01 Dec 2024

https://github.com/joyceannie/leetcode-contests

Leetcode contest solutions

Last synced: 01 Dec 2024

https://github.com/joyceannie/natours

Implemented as part of Udemy course to learn advanced features in CSS.

Last synced: 01 Dec 2024

https://github.com/joyceannie/moviespider

This project is used to crawl movie data from IMDb. Scrapy framework is used to extract relevant information like movie title, datePublished, summary, genres, director etc.

crawler datascience python scrapy spider webscraper

Last synced: 01 Dec 2024

https://github.com/joyceannie/fake-news-detection

The objective of this project is to classify news as fake or real ones. This project deals with data analysis, visualization and classification.

classification machinelearning nlp python

Last synced: 01 Dec 2024

https://github.com/joyceannie/kaggle

This repo consists of all my relevant python notebooks in Kaggle. Some of them have a detailed article on Medium.

Last synced: 01 Dec 2024

https://github.com/joyceannie/moody-s-analytics-women-in-engineering-hackathon

My solutions to the Hackathon competition on Hackerrank

Last synced: 01 Dec 2024

https://github.com/joyceannie/leetcode

Last synced: 01 Dec 2024

https://github.com/joyceannie/credit-card-fraud-detection

The objective of the project is to identify fraudulent credit card activities. As the dataset is highly imbalanced, sampling techniques like oversampling, undersampling, SMOTE etc are used.

classification datascience machine-learning python

Last synced: 01 Dec 2024

https://github.com/joyceannie/computer-networks

A set of programming assignments done as part of the course Computer Networks

Last synced: 01 Dec 2024

https://github.com/joyceannie/data-warehouse-aws

A music streaming startup, Sparkify, has grown their user base and song database and want to move their processes and data onto the cloud. The data resides in S3, in a directory of JSON logs on user activity on the app, as well as a directory with JSON metadata on the songs in their app. The objective of the project is to create an ETL pieline to build a datawarehouse . We extract data from S3, stage them in Redshift, and transform data into a set of dimensional tables for the analytics team to continue finding insights into what songs their users are listening to.

aws aws-s3 data-warehouse python3 redshift redshift-cluster

Last synced: 01 Dec 2024

https://github.com/joyceannie/fitness_assistant

A RAG application to answer fitness user queries.

Last synced: 01 Dec 2024

https://github.com/joyceannie/ecommerce_product_search_assistant

This is a RAG application that can be used to discover ecommerce products. The application uses ElasticSearch to index the documents, and OpenAI to answer the user queries.

elasticsearch llm llmops ml posgresql python rag

Last synced: 15 Nov 2024