https://github.com/shaheennabi/shaheennabi

aspiring research engineer focused on reasoning, thinking models and reinforcement learning

Host: GitHub
URL: https://github.com/shaheennabi/shaheennabi
Owner: shaheennabi
Created: 2024-08-08T06:31:17.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2026-05-13T14:38:46.000Z (about 2 months ago)
Last Synced: 2026-05-13T16:38:38.757Z (about 2 months ago)
Topics: ai-engineer, aws, data-scientist, devops, engineer, generative-ai-engineer, large-language-models, machine-learning-engineer, mlops, personal-readme, reasoning, reinforcement-learning, research, thinking-model
Homepage:
Size: 297 KB
Stars: 0
Watchers: 1
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md

README

# Thanks for tuning in 👋

---

# Who I am

```╔════════════════════════════════════════╗```
```║ research -- thinking, reasoning models ║```
```╚════════════════════════════════════════╝```

I study how large language models perform multi-step reasoning and how training and post-training methods can improve their reliability, efficiency, and scalability.

My work focuses on the post-training stack for LLMs: supervised fine-tuning (SFT), preference optimization, reinforcement learning methods such as RLVR, and inference-time compute strategies that improve reasoning without requiring larger models.

I’m also interested in the interpretability of reasoning models: understanding the internal mechanisms that support multi-step reasoning and diagnosing failures such as shortcut reasoning, reward hacking, and unfaithful chain-of-thought.

I am currently building and open-sourcing implementations of reasoning-focused training pipelines, and contributing to LLM infrastructure and post-training frameworks.

*I love SpaceX rockets*
