https://github.com/prajakta1321/exoplanet-atmospheric-characterization-1

A machine learning project to classify exoplanets using light curve image data. Developed as part of the ML4SCI GSoC 2025 Test Task. Includes data processing, CNN-based model, and full report.
https://github.com/prajakta1321/exoplanet-atmospheric-characterization-1

classification colab-notebook dbscan gsoc-2025 machine-learning-algorithms matplotlib-python ml numpy open-source pca-analysis python3 seaborn

Last synced: 2 months ago
JSON representation

A machine learning project to classify exoplanets using light curve image data. Developed as part of the ML4SCI GSoC 2025 Test Task. Includes data processing, CNN-based model, and full report.

Host: GitHub
URL: https://github.com/prajakta1321/exoplanet-atmospheric-characterization-1
Owner: prajakta1321
Created: 2025-04-08T07:40:27.000Z (2 months ago)
Default Branch: main
Last Pushed: 2025-04-08T07:46:37.000Z (2 months ago)
Last Synced: 2025-04-08T08:40:25.664Z (2 months ago)
Topics: classification, colab-notebook, dbscan, gsoc-2025, machine-learning-algorithms, matplotlib-python, ml, numpy, open-source, pca-analysis, python3, seaborn
Language: Jupyter Notebook
Homepage:
Size: 0 Bytes
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Exoplanet-atmospheric-characterization-1

# 🌌 Exoplanet Classification using Machine Learning

This project was developed as part of the GSoC 2025 Test Task for the ML4SCI (Machine Learning for Science) organization. The goal is to build a machine learning pipeline to classify exoplanets based on their light curve image data using supervised learning techniques.

## 📌 Project Objective

To explore, build, and evaluate an efficient ML model that can accurately classify whether a given celestial body is an exoplanet or not, using image-based data derived from light curves.

## 🛠️ Tools & Technologies Used

- Python
- Pandas & NumPy
- Matplotlib & Seaborn
- TensorFlow / Keras
- Scikit-learn
- Google Colab

## 📂 Project Structure as follows

## 🚀 Steps Performed

1. **Data Preparation**:
- Converted light curve CSVs into grayscale image data.
- Stored images in a local folder for manual upload.

2. **Image Preprocessing**:
- Resized images, normalized pixel values, and converted to NumPy arrays.

3. **Model Building**:
- Used Convolutional Neural Networks (CNNs) for classification.
- Achieved a satisfactory training and validation accuracy.

4. **Model Saving**:
- Trained model saved as `model.h5` for reuse and deployment.

5. **Report Generation**:
- Includes insights, challenges, learnings, and future scope.

## 📊 Results

- Achieved high accuracy on the validation dataset.
- Demonstrated effective use of image-based ML in scientific tasks.

## 🧠 What I Learned

- Converting CSV-based signal data to image form.
- Building and training CNNs on image datasets.
- Handling limited data using augmentation techniques.
- Structuring and documenting ML projects.

## 🤝 Contributions

If accepted, I look forward to contributing further to the ML4SCI initiative by refining this pipeline and extending it to more complex datasets.

---

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/prajakta1321/exoplanet-atmospheric-characterization-1

Awesome Lists containing this project

README