https://github.com/pranavarora1895/proteintypeprediction
Data Analysis on Protein Type Prediction
https://github.com/pranavarora1895/proteintypeprediction
bioinformatics data-analysis supervised-learning
Last synced: 2 months ago
JSON representation
Data Analysis on Protein Type Prediction
- Host: GitHub
- URL: https://github.com/pranavarora1895/proteintypeprediction
- Owner: pranavarora1895
- Created: 2023-01-13T23:03:26.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-01-14T00:03:49.000Z (over 3 years ago)
- Last Synced: 2025-12-27T02:52:55.574Z (6 months ago)
- Topics: bioinformatics, data-analysis, supervised-learning
- Language: Jupyter Notebook
- Homepage:
- Size: 8.44 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Prediction of Membrane Protein Types
> ### Members:
> #### 1. Parham Hajishafiezahramini
> #### 2. Pranav Arora
### Project at a glance
>In this project, we have used a real dataset of membrane proteins with five different classifications(Labels). The Initial dataset consists of 2059 and 2625 (train and test dataset) membrane proteins' amino acids sequences. We concatenated the test and train dataset and generated 4684 membrane protein sequences. Since protein sequences are just a series of alphabets (amino acid abbreviations), we implemented several procedures described below.
>
The critical point is that we implemented this project from scratch, and we did not copy any part from other works. This work was presented at the 10th international Bioinformatics conference in 2022 under Parham Hajishafiezahramini's name(First Author) (https://icb10.ut.ac.ir/paper?manu=43325).
In this project, we decided to use the same dataset and apply all the project's tasks. We added a lot of descriptions about all the data columns, how we achieved and extracted them, how we checked the outliers, how we added the missing outliers manually, how we handled the outliers and missing values, and prepared clean data.
### Demonstration Video
https://user-images.githubusercontent.com/48170643/212436102-3dcb9a1a-0330-4606-b2dd-8e0ff3217b1f.mp4