An open API service indexing awesome lists of open source software.

https://github.com/shaheennabi/shaheennabi

aspiring research engineer focused on reasoning, thinking models and reinforcement learning
https://github.com/shaheennabi/shaheennabi

ai-engineer aws data-scientist devops engineer generative-ai-engineer large-language-models machine-learning-engineer mlops personal-readme reasoning reinforcement-learning research thinking-model

Last synced: 24 days ago
JSON representation

aspiring research engineer focused on reasoning, thinking models and reinforcement learning

Awesome Lists containing this project

README

          

# Thanks for tuning hereπŸ‘‹


Typing SVG


---










# Who I am

```╔════════════════════╗```
```β•‘ research -- thinking, reasoning models β•‘```
```β•šβ•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•```




I study how large language models perform multi-step reasoning and how training and post-training methods can improve their reliability, efficiency, and scalability.

My work focuses on the post-training stack for LLMs β€” supervised fine-tuning (SFT), preference optimization, reinforcement learning methods such as RLVR, and inference-time compute strategies that improve reasoning without requiring larger models.

I’m also interested in the interpretability of reasoning models: understanding the internal mechanisms that support multi-step reasoning and diagnosing failures such as shortcut reasoning, reward hacking, and unfaithful chain-of-thought.

Currently building and open-sourcing implementations of reasoning-focused training pipelines and contributing to LLM infrastructure and post-training frameworks.



* I love SpaceX rockets *

![](https://hit.yhype.me/github/profile?account_id=84982228)