Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/xihajun/projects

Project Pool: how to survive 🏊‍♀️
https://github.com/xihajun/projects

Last synced: about 2 months ago
JSON representation

Project Pool: how to survive 🏊‍♀️

Awesome Lists containing this project

README

        

# Project Pool
_I cannot swim, but I survived the project pool_

## NLP Project
- Type corretion using Transformer & reinforcement learning (working on)
- http://angocair.garg.ed.ac.uk/
- About the Data
- [Writing: Students are better than the teacher](https://extreme-yacht-07f.notion.site/Students-are-better-than-the-teacher-3d631ce17803439986338dedda2ed48e)
- [ ] LLM (finetune llama)
- [ ] Prune_BERT

## Computer Vision
- Footprints AutoCrop: [code](https://github.com/xihajun/footwear-project), [slides](https://github.com/xihajun/footwear-project/blob/master/Footwear%20Project.pdf), [app](https://share.streamlit.io/xihajun/streamlit-footwear/main)
- Counterfeit Image Detection Project (CNN + AutoEncoder - 97%)
- Art vs History Image Style Transfer (Hackathon): [code](https://github.com/xihajun/Style-Transfer-Art), [demo](https://xihajun.github.io/Style-Transfer-Art/Demo/)

## Cambridge Univeristy Samarajiwa's Lab - [WebPage](https://www.samarajiwa-lab.org/people)

Projects

_I'd like to open source everything but sometimes I cannot :(_

- ISGverse
- Information Theroy, Bayesian Optimisation, List Similarity
- [app](https://isgverse.org) (sslab:123456ss)
- COBRA: **an excellent TF targets hunter**
- [Docs - take cobra home](http://cobrajf.readthedocs.io)
- Success: DBSCAN, Test Statistic (with mean and variance unknow), BO, GMM
- Failed: Bayesian Model, AutoEncoder, [FsNet](https://github.com/singh-ml/fsnet)
- FROGS
- Linear Model
- IFNscape
- Deep Learning, Integer Programming, Word2Vec, GNN
- ChIPseq-pipeline (Docker, bpipe, MACS2, MACS3)
- Automation (GitHub Action)
- R/Shiny: [template](https://github.com/xihajun/shiny-template)

Teaching

- CRUK Bioinformatics Summer School 2021 (Docker): [school](https://bioinformatics-core-shared-training.github.io/cruk-summer-school-2021/), [slides1](https://bioinformatics-core-shared-training.github.io/cruk-summer-school-2021/Introduction/slides/L1-summerSchool.pdf), [slides2](https://bioinformatics-core-shared-training.github.io/cruk-summer-school-2021/ChIPSeq/slides/EvaluatingChIPseqData.pdf)
- CompBio MPhil RegulatoryGenomics Practical 2021 (Docker): [code](https://github.com/ss-lab-cancerunit/CompBio_MPhil_RegulatoryGenomics_Practical)

## Machine Learning & Data Science
_Some of them are naive, but they are the learning paths I have gone through XD_
- **Machine Learning for Automated Vulnerability Detection in Source Code** (Inductive Logic Programming, code representation, AST, adjacency matrix, CNN, Word2Vec, Node2Vec, Graph2Vec): [code](https://github.com/dj311/uob-summer-project), [Review & Proposal](https://drive.google.com/file/d/1-V5WlDSV37ibEYlLbbpglJ0E5KnWojHk/view?usp=sharing), [slides](https://docs.google.com/presentation/d/1_pNo1vaU5wb1Hn49rrq8Qn3soBKf3c3h/edit?usp=sharing&ouid=111119790381783443776&rtpof=true&sd=true), [report](https://github.com/xihajun/Projects/blob/main/docs/Summer%20Project.pdf)
- **Google Summer of Code - OWASP-Seraphimdroid** (SVM, LSTM, AutoEncoder): [code](https://github.com/xihajun/OWASP-Seraphimdroid), [report](https://docs.google.com/document/d/1WzNZed2Et8eRn7xLYvWI_Wb-FUqpKK88eBp7XNqt20I/edit), [paper](https://www.research.manchester.ac.uk/portal/files/159895029/1910.10660v1.pdf)
- **Coursework**
- Abnormal Traffic Prediction (KNN, KMeans, Random Forest): [code](https://github.com/samanthawise/dtsassignment2), [report](https://drive.google.com/file/d/1F95pBroSd_HHI6o1iai1h2fhHCF2gln_/view?usp=sharing)
- Decision Trees and Unbalanced Data Sets - An Investigation (SMOTE): [code](https://github.com/dj311/data-science-toolbox-3), [report](https://drive.google.com/file/d/1GQEhy3d57Vz0uX-t9rxrvp-walCXj0bM/view?usp=sharing)
- Topic Modelling with Password (topic model): [code](https://github.com/xihajun/data-science-toolbox-kate-syd-jun), [report](https://drive.google.com/file/d/1zYensniVHABBIuV2AudHADzug5E8TMTP/view?usp=sharing)
- What is a Neural Network (tensorboard): [code](https://github.com/xihajun/Data-Science-Deep-learning-Sam-Jun), [report](https://drive.google.com/file/d/1gfzsOsl7ISKRwyrgA5CqoNZN0niC_R3H/view?usp=sharing)
- Machine Learning Gaussian Process: [report](https://drive.google.com/file/d/1AIbMpF3ds5FZYAwbPvbrdHk2mzgTWKlN/view?usp=sharing)
- Machine Learning Image: [report](https://drive.google.com/file/d/1N4WoRZyhkA2H9lb24HhhUYM2vSMtuaiz/view?usp=sharing)

## Other
- Complex Networks (**What a Small World**): [code](https://github.com/xihajun/Complexnets), [slides](https://docs.google.com/presentation/d/1FUb7EH0h11YyfNRaWhvd1o2b47_njb1A/edit?usp=sharing&ouid=111119790381783443776&rtpof=true&sd=true)
- Bitcoin & Blockchain + implementation: [code](https://github.com/xihajun/bitcoin), [slides](https://docs.google.com/presentation/d/1Cb9l7nvtER74Uw5hbyzpB4uIkYYyuS5g/edit?usp=sharing&ouid=111119790381783443776&rtpof=true&sd=true)
- Leetcode Typing: [code](https://github.com/xihajun/typecode), [demo](https://xihajun.github.io/typecode/), [typeracer](https://play.typeracer.com/?universe=lang_zh)
- SVGmerge Template (automation): [code](https://github.com/ss-lab-cancerunit/SVGmerging)

## TODO
- [x] NestTimetable: [code](https://github.com/xihajun/NestTimeManager)
- [ ] Self-Driving Implementation
- [ ] PyTorch Implementation (footwear marks project)