Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by joyceannie
A curated list of projects in awesome lists by joyceannie .
https://github.com/joyceannie/data-modeling-with-postgres
The main focus of the project is data modeling with Postgres and build an ETL pipeline using Python. The first step is to define fact and dimension tables for a star schema for a particular analytic focus. The second step is to write an ETL pipeline that transfers data from files in different directories into these tables in Postgres using Python and SQL.
data-engineering data-modeling postgresql python
Last synced: 01 Dec 2024
https://github.com/joyceannie/reddit_data_pipeline
The purpose of the project is to create a data pipeline to extract data from Reddit API and create a dashboard to analyse the data. The data is extracted from the subreddit r/Python. The data is extracted daily and uploaded to S3 buckets, and copied to Redshift. The dashboard is created using Google Data Studio.
airflow aws etl python redshift s3 terraform
Last synced: 01 Dec 2024
https://github.com/joyceannie/medical_chatbot
The objective of this project is to create a chatbot that can be used to communicate with users to provide answers to their health issues. This is a RAG implementation using open source stack.
chatbot fastapi generative-ai llm nlp python qdrant rag vector-database
Last synced: 01 Dec 2024
https://github.com/joyceannie/us-immigrations-data-warehouse
A data warehouse to perform analytics on the immigration trends in the US.
airflow data-engineering etl pyspark redshift s3 spark
Last synced: 01 Dec 2024
https://github.com/joyceannie/tomatoleafdiseaseprediction
The objective of the project is to classify the images of tomato plant leaves to identify whether the plant is healthy or not. A classification model is trained using Tensorflow and Keras. The application runs locally on Flask server.
classification-algorithm flask-application keras machine-learning multiclass-classification python tensorflow2
Last synced: 01 Dec 2024
https://github.com/joyceannie/web-crawler
The crawler downloads textual content from a particular domain and removes noise using site agnostic techniques.
Last synced: 01 Dec 2024
https://github.com/joyceannie/sql-data-with-danny-case-studies
Case study solutions for #8WeekSQLChallenge at https://8weeksqlchallenge.com
8-week-sql-challenge 8weeksqlchallenge case-study data-analysis data-analytics postgresql sql
Last synced: 01 Dec 2024
https://github.com/joyceannie/todoapp
A simple ToDo Android App developed using Andriod Studio.
android android-app android-studio java
Last synced: 01 Dec 2024
https://github.com/joyceannie/portfolio
Portfolio website using HTML, CSS and Bootstrap. Implemented as part of CS50.
Last synced: 01 Dec 2024
https://github.com/joyceannie/pig-game
This is a simple version of Pig Game developed using HTML, CSS and JavaScript
Last synced: 01 Dec 2024
https://github.com/joyceannie/network-security
A set of programming assignments done as part of the course Network Security
Last synced: 01 Dec 2024
https://github.com/joyceannie/natours
Implemented as part of Udemy course to learn advanced features in CSS.
Last synced: 01 Dec 2024
https://github.com/joyceannie/moviespider
This project is used to crawl movie data from IMDb. Scrapy framework is used to extract relevant information like movie title, datePublished, summary, genres, director etc.
crawler datascience python scrapy spider webscraper
Last synced: 01 Dec 2024
https://github.com/joyceannie/fake-news-detection
The objective of this project is to classify news as fake or real ones. This project deals with data analysis, visualization and classification.
classification machinelearning nlp python
Last synced: 01 Dec 2024
https://github.com/joyceannie/kaggle
This repo consists of all my relevant python notebooks in Kaggle. Some of them have a detailed article on Medium.
Last synced: 01 Dec 2024
https://github.com/joyceannie/moody-s-analytics-women-in-engineering-hackathon
My solutions to the Hackathon competition on Hackerrank
Last synced: 01 Dec 2024
https://github.com/joyceannie/credit-card-fraud-detection
The objective of the project is to identify fraudulent credit card activities. As the dataset is highly imbalanced, sampling techniques like oversampling, undersampling, SMOTE etc are used.
classification datascience machine-learning python
Last synced: 01 Dec 2024
https://github.com/joyceannie/computer-networks
A set of programming assignments done as part of the course Computer Networks
Last synced: 01 Dec 2024
https://github.com/joyceannie/data-warehouse-aws
A music streaming startup, Sparkify, has grown their user base and song database and want to move their processes and data onto the cloud. The data resides in S3, in a directory of JSON logs on user activity on the app, as well as a directory with JSON metadata on the songs in their app. The objective of the project is to create an ETL pieline to build a datawarehouse . We extract data from S3, stage them in Redshift, and transform data into a set of dimensional tables for the analytics team to continue finding insights into what songs their users are listening to.
aws aws-s3 data-warehouse python3 redshift redshift-cluster
Last synced: 01 Dec 2024
https://github.com/joyceannie/fitness_assistant
A RAG application to answer fitness user queries.
Last synced: 01 Dec 2024
https://github.com/joyceannie/ecommerce_product_search_assistant
This is a RAG application that can be used to discover ecommerce products. The application uses ElasticSearch to index the documents, and OpenAI to answer the user queries.
elasticsearch llm llmops ml posgresql python rag
Last synced: 15 Nov 2024