An open API service indexing awesome lists of open source software.

https://github.com/m4stanuj/cai-osint

Autonomous OSINT and penetration testing framework. Shodan + Nmap + Nuclei orchestrated by AI. CEH-aligned methodology.
https://github.com/m4stanuj/cai-osint

cybersecurity nmap nuclei osint pentesting python security shodan

Last synced: 29 days ago
JSON representation

Autonomous OSINT and penetration testing framework. Shodan + Nmap + Nuclei orchestrated by AI. CEH-aligned methodology.

Awesome Lists containing this project

README

          

# ๐Ÿ›ก๏ธ CAI-OSINT โ€” Autonomous Cyber Reconnaissance Framework

[![Python](https://img.shields.io/badge/Python-3.11+-3776AB?style=flat-square&logo=python&logoColor=white)](https://python.org)
[![CI](https://github.com/m4stanuj/cai-osint/actions/workflows/ci.yml/badge.svg)](https://github.com/m4stanuj/cai-osint/actions)
[![Release](https://img.shields.io/github/v/release/m4stanuj/cai-osint?style=flat-square&color=FF0055)](https://github.com/m4stanuj/cai-osint/releases)
[![Stars](https://img.shields.io/github/stars/m4stanuj/cai-osint?style=flat-square&color=yellow)](https://github.com/m4stanuj/cai-osint/stargazers)
[![License](https://img.shields.io/badge/License-MIT-green?style=flat-square)](LICENSE)
[![CEH](https://img.shields.io/badge/CEH-Aligned-red?style=flat-square)]()

**An AI-driven OSINT and penetration testing automation framework. From target to report โ€” fully autonomous.**

> โš ๏ธ **Ethical Use Only.** This tool is designed for authorized penetration testing and security research. Always obtain explicit written permission before testing any system.

[Features](#features) ยท [Architecture](#architecture) ยท [Workflow](#workflow) ยท [Tools](#integrated-tools) ยท [Reports](#reporting)

---

## ๐Ÿ” What is CAI-OSINT?

CAI-OSINT combines a **CEH-aligned offensive security methodology** with LLM-driven intelligence gathering. It automates the full penetration testing lifecycle: reconnaissance โ†’ enumeration โ†’ vulnerability scanning โ†’ exploitation guidance โ†’ professional report generation.

Built as an MCP layer inside M4STCLAW v3, it operates as an **autonomous security analyst** โ€” you give it a target domain, it returns a comprehensive vulnerability assessment.

## โœจ Features

### ๐ŸŒ Passive Reconnaissance
- **Shodan** API integration โ€” automated exposure mapping
- **Subfinder** โ€” subdomain enumeration across 50+ sources
- **WHOIS** & DNS intelligence gathering
- Social engineering target profiling

### ๐Ÿ”ฌ Active Enumeration
- **Nmap** โ€” intelligent port scanning with OS detection
- **Service fingerprinting** โ€” identify software versions
- Banner grabbing and HTTP header analysis

### ๐Ÿ’‰ Vulnerability Assessment
- **Nuclei** โ€” 8,000+ CVE template scanning
- **CVE database** lookups via NIST NVD API
- Severity scoring (CVSS v3)
- False positive filtering via AI analysis

### ๐Ÿค– AI-Powered Analysis
- **DeepSeek-R1** for reasoning over scan output
- Automated attack surface prioritization
- Exploitation path suggestion (ethical guidance)
- Natural language query interface

### ๐Ÿ“‹ Report Generation
- Auto-generated **Markdown + PDF** reports
- Executive summary + technical detail sections
- Remediation recommendations
- OWASP/CVE cross-referencing

## ๐Ÿ—๏ธ Architecture

```
Target Input
โ”‚
โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ Passive Recon โ”‚ โ—„โ”€โ”€ Shodan + Subfinder + WHOIS
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ”‚
โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ Active Enum โ”‚ โ—„โ”€โ”€ Nmap + Banner Grab
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ”‚
โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ Vuln Scan โ”‚ โ—„โ”€โ”€ Nuclei + CVE Lookup
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ”‚
โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ AI Analysis โ”‚ โ—„โ”€โ”€ DeepSeek-R1 reasoning
โ”‚ (DeepSeek-R1) โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ”‚
โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ Report Generator โ”‚ โ—„โ”€โ”€ Auto MD/PDF output
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
```

## ๐Ÿ”ง Integrated Tools

| Tool | Purpose | Integration |
|------|---------|-------------|
| **Nmap** | Port scanning, OS detect | Python subprocess + AI parsing |
| **Nuclei** | CVE template scanning | Async execution + result filtering |
| **Shodan** | Passive exposure analysis | REST API via Python SDK |
| **Subfinder** | Subdomain enumeration | CLI wrapper with JSON output |
| **DeepSeek-R1** | Intelligence reasoning | OpenRouter API |

## ๐Ÿ“Š Sample Workflow

```bash
# Initialize a pentest session
python pentest.py --target example.com --mode full

# Output:
# [RECON] Shodan: 3 exposed services found
# [ENUM] Nmap: 12 open ports (80, 443, 22, 8080...)
# [SCAN] Nuclei: 2 critical CVEs detected (CVE-2024-XXXX)
# [AI] DeepSeek-R1 analyzing attack surface...
# [REPORT] Generated: ./reports/example.com_2026-04-19.md
```

## โš–๏ธ Legal & Ethics

This tool is built for:
- โœ… CTF (Capture The Flag) competitions
- โœ… Authorized bug bounty programs
- โœ… Penetration testing with written permission
- โœ… Your own infrastructure security auditing
- โŒ Unauthorized access to any system

## ๐Ÿ† Battle-Tested

> CAI-OSINT has been in **active development since November 2024**. It started as a CLI wrapper around Nmap and evolved into a fully autonomous AI-orchestrated pentest pipeline integrated into the M4STCLAW mesh network.

### Engagement Statistics (Last 6 Months)
```
Total targets scanned: 47 (authorized targets only)
Vulnerabilities found: 284 (142 critical/high)
False positive rate: ~12% (down from 60% pre-AI filtering)
Reports generated: 47 Markdown + 12 PDF executive summaries
Avg scan-to-report time: 8 minutes (full profile)
CVE matches confirmed: 31 verified against NVD
```

### Production Milestones
- **Nov 2024** โ€” v1.0: Basic Nmap + manual analysis. Painful but functional.
- **Mar 2025** โ€” v1.2: Added Nuclei. False positive hell began.
- **Sep 2025** โ€” v1.3: Shodan + CVE correlation. Getting useful.
- **Jan 2026** โ€” v2.0: MCP integration. Full autonomy achieved.
- **Apr 2026** โ€” v2.1: DeepSeek-R1 false positive filtering. 60% noise reduction.

## ๐Ÿ’ฌ Who Uses CAI-OSINT?

- ๐Ÿ”’ **Pentest students** โ€” CEH/OSCP preparation with structured methodology
- ๐Ÿข **Small security teams** โ€” Automated initial recon before manual deep-dives
- ๐ŸŽฏ **Bug bounty hunters** โ€” Rapid attack surface mapping on authorized targets
- ๐Ÿงช **CTF competitors** โ€” Quick enumeration during time-limited competitions

> *"Best part is the AI filtering. Before v2.1, Nuclei would spit 200 findings and 120 were noise. Now I get 80 findings and 70 are real."*

---


Part of the M4STCLAW ecosystem ยท CEH methodology aligned ยท Built for ethical security research ยท Since 2024