https://github.com/radioastronomyio/rbh1-validation-reanalysis
Independent validation and reanalysis of the RBH-1 hypervelocity SMBH candidate (van Dokkum et al. 2025)
https://github.com/radioastronomyio/rbh1-validation-reanalysis
a4000 analysis-ready-dataset bayesian-inference black-hole-physics black-holes dragonflydb gpu-computing jackknife kinematics postgresql rbh-1
Last synced: 3 months ago
JSON representation
Independent validation and reanalysis of the RBH-1 hypervelocity SMBH candidate (van Dokkum et al. 2025)
- Host: GitHub
- URL: https://github.com/radioastronomyio/rbh1-validation-reanalysis
- Owner: radioastronomyio
- License: mit
- Created: 2025-12-21T16:27:03.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2025-12-24T20:43:19.000Z (6 months ago)
- Last Synced: 2025-12-26T10:35:05.881Z (6 months ago)
- Topics: a4000, analysis-ready-dataset, bayesian-inference, black-hole-physics, black-holes, dragonflydb, gpu-computing, jackknife, kinematics, postgresql, rbh-1
- Language: Python
- Homepage:
- Size: 5.98 MB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# π RBH-1 Validation & Reanalysis
> Independent validation of the RBH-1 hypervelocity SMBH candidate β testing whether the 62 kpc supersonic bow shock interpretation survives expanded methodological freedom.
[](https://github.com/radioastronomyio/rbh1-validation-reanalysis)
[](https://archive.stsci.edu/proposal_search.php?id=17301)
[](https://www.stsci.edu/jwst/science-execution/program-information?id=3149)
[](https://www.python.org/)
[](LICENSE)

RBH-1 Validation & Reanalysis is an independent validation study of the extraordinary RBH-1 linear feature (van Dokkum et al. 2025), interpreted as a 62 kpc supersonic bow shock from a hypervelocity supermassive black hole ejected at ~1000 km/s. This project contributes open tools, reproducible methods, and a complete Analysis-Ready Dataset (ARD) to the community discussion around this remarkable object.
---
## π― Quick Start
### For Researchers
- [π Validation Methodology](docs/README.md) - Complete methodology documentation
- [π Data Sources](#-data-sources) - HST and JWST observation details
- [π¬ Validation Criteria](#validation-criteria) - What "validation" means for this project
### For Developers
- [ποΈ Repository Structure](#-repository-structure) - Navigate the codebase
- [π» Scripts](scripts/README.md) - Phase-organized execution scripts
- [π¦ Source Code](src/README.md) - Reusable library modules
### For Validators
- [π Work Logs](work-logs/README.md) - Complete development history
- [β
Validation Outputs](validation/README.md) - Data integrity and QA results
- [π Notebooks](notebooks/README.md) - Exploratory analysis
---
## π Project Overview
RBH-1 represents one of the most extraordinary astronomical claims in recent years: a supermassive black hole ejected from its host galaxy, now plowing through the circumgalactic medium at supersonic velocities and leaving a 62 kiloparsec ionized wake. If confirmed, this would be the first direct observation of gravitational wave recoil in action.
This project provides independent validation through two inseparable deliverables:
| Deliverable | Description |
|-------------|-------------|
| Validation Paper | Bayesian reanalysis testing whether the bow shock interpretation survives expanded methodological freedom |
| Analysis-Ready Dataset | Complete ARD on Zenodo enabling community reanalysis without climbing the compute wall |
### Why This Project Exists
Our team has a deep interest in extreme objects. Contributing to the scientific discussion around RBH-1βthrough open methods, transparent analysis, and reusable data productsβis not just scientifically valuable, it's an honor. We approach this as neutral validation: either strengthening the extraordinary claim or identifying where additional work is needed. Both outcomes advance the field.
---
## π¬ Scientific Context
### The Object
RBH-1 is a highly linear feature extending ~62 kpc from a compact source at redshift z β 0.96. JWST NIRSpec IFU spectroscopy reveals:
- Kinematic discontinuity: ~600 km/s velocity change across ~1 kpc at the "tip"
- Shock-consistent line ratios: Emission diagnostics matching fast radiative shock models
- Surface brightness truncation: Factor >40Γ intensity drop at the tip over one resolution element
### The Interpretation

Van Dokkum et al. (2025) interpret these observations as a supersonic bow shock from an ejected SMBH. The competing "edge-on galaxy" hypothesis (SΓ‘nchez Almeida et al. 2023) proposes the feature is a serendipitously aligned, extremely thin bulgeless dwarf galaxy.
### Our Approach
We validate the bow shock interpretation through:
1. Expanded Bayesian inference on shock parameters with proper marginalization
2. Stress-testing the background subtraction methodology
3. Quantitative falsification of the edge-on galaxy alternative through forward modeling
### Validation Criteria
Confirmation conditions β the interpretation is validated if:
- 95% CI of shock velocity from line ratios overlaps spatial gradient measurement
- Difference-of-pointings artifacts account for <20% of wake flux
- Inferred metallicity consistent with CGM (Z < 0.5 Zβ)
- Velocity discontinuity persists across all jackknife subsets
Tension conditions β tension is declared if:
- Marginalizing over magnetic parameter broadens v_s posterior to include virial velocities
- Standard Level 3 extraction significantly diminishes wake signal
- Jackknife analysis shows exposure/region dependence
---
## π¦ The Analysis-Ready Dataset
### Compute Artifacts as Deliverables
Astronomical reanalysis has a hidden cost: the weeks of CPU time between downloading calibrated data and having usable posteriors. Every researcher who wants to revisit a result faces this compute wall, often regenerating artifacts that previous researchers already produced and discarded. This is institutionalized waste.
So, we're keeping our dataset, and we're sharing it.
The effort to package computed artifactsβchains, synthetic cubes, likelihood interfacesβis negligible compared to the effort to generate them. If sharing them saves even one researcher from repeating weeks of work, the cost-benefit is obvious.
This isn't a methodology innovation. It's just not throwing away what we already computed.

### What Gets Materialized

| Layer | Contents | Purpose |
|-------|----------|---------|
| Likelihood Interface | Observed cube, weights, masks, instrument kernels | Drop-in forward model testing |
| Inference Layer | Full MCMC chains (~10M samples) | Statistical reanalysis, prior reweighting |
| Representative Sample | ~30 synthetic cubes + intrinsic velocity fields | Visual inspection, residual analysis |
| Validation Layer | Jackknife distributions, rejection surfaces | Robustness verification |
### What This Enables
A researcher receiving the RBH-1 ARD can:
- Skip weeks of compute β Posterior samples already exist
- Test alternative priors β Reweight existing chains instead of rerunning
- Run their own forward model β Likelihood interface provides the contract
- Validate against new data β Future observations plug directly in
- Reproduce exactly β Full provenance chain from MAST to final measurement
---
## π Data Sources

### HST WFC3/UVIS (Program GO-17301)
| Product | Type | Purpose |
|---------|------|---------|
| `*_flc.fits` | CTE-corrected exposures | Custom error analysis |
| `*_drc.fits` | Drizzled mosaics | Astrometric alignment |
Filters: F200LP (~15ks), F350LP (~15ks) β 6 visits, 146 products, ~44 GB
### JWST NIRSpec IFU (Program GO-3149)
| Product | Type | Purpose |
|---------|------|---------|
| `*_cal.fits` | Level 2b calibrated | Sensitivity testing |
| `*_s3d.fits` | Level 3 cubes | Primary extraction |
| `*_x1d.fits` | 1D extracted | Sanity checks |
Configuration: G140M/F100LP, ~7ks total β 52 products, ~5 GB
### MAPPINGS V Shock Models
Pre-computed grids from 3MdB database for radiative shock parameter inference.
---
## ποΈ Repository Structure
```markdown
rbh1-validation-reanalysis/
βββ π¨ assets/ # Hero images and diagrams
βββ πΎ data/ # Data manifest and staging
β βββ 01_raw/ # Raw observation pointers
β βββ 02_reduced/ # Extracted spectra
β βββ 03_inference/ # MCMC outputs
β βββ 04_ard/ # ARD package staging
βββ π docs/ # Methodology documentation
β βββ documentation-standards/ # Templates and conventions
βββ π notebooks/ # Exploratory analysis
βββ π¦ rbh1-ard-v1/ # ARD release package
βββ π» scripts/ # Phase-organized execution
β βββ 00-ideation-and-setup/
β βββ 01-data-acquisition/
β βββ 02-standard-extraction/
β βββ 03-cube-differencing/
β βββ 04-noise-model/
β βββ 05-kinematic-fitting/
β βββ 06-mappings-inference/
β βββ 07-robustness-tests/
β βββ 08-galaxy-falsification/
β βββ 09-ard-materialization/
βββ π§ src/ # Reusable library code
β βββ extraction/ # Spectral extraction pipeline
β βββ inference/ # MCMC and likelihood
β βββ falsification/ # Galaxy hypothesis testing
β βββ visualization/ # Figure generation
βββ β
validation/ # Data validation outputs
βββ π work-logs/ # Phase documentation
βββ 00-ideation-and-setup/
βββ 01-data-acquisition/
βββ ...
```
### Navigation Guide
| Directory | Purpose | README |
|-----------|---------|--------|
| `assets/` | Hero images and diagrams | β |
| `data/` | Data manifest and staging areas | [README](data/README.md) |
| `docs/` | Methodology and specifications | [README](docs/README.md) |
| `notebooks/` | Exploratory Jupyter analysis | [README](notebooks/README.md) |
| `rbh1-ard-v1/` | ARD release package | [README](rbh1-ard-v1/README.md) |
| `scripts/` | Phase-organized execution scripts | [README](scripts/README.md) |
| `src/` | Reusable library modules | [README](src/README.md) |
| `validation/` | Data integrity and QA outputs | [README](validation/README.md) |
| `work-logs/` | Phase worklogs and development history | [README](work-logs/README.md) |
---
## π₯οΈ Compute Infrastructure
This project uses dedicated VMs from the Proxmox Astronomy Lab cluster.
### Allocated Resources
| Node | CPU | RAM | GPU | Role |
|------|-----|-----|-----|------|
| proj-gpu01 | AMD 5950X (16c) | 48 GB | NVIDIA A4000 16GB | GPU inference, task broker |
| proj-cpu01 | Intel 12900K (12c) | 48 GB | β | MCMC walker pool |
| proj-cpu02 | Intel 12900K (12c) | 48 GB | β | MCMC walker pool |
| proj-cpu03 | Intel 12900K (12c) | 48 GB | β | MCMC walker pool |
### Pipeline Architecture
The falsification campaign uses a Producer-Consumer pattern optimized for heterogeneous hardware:
- CPU Nodes (Producers): Propose parameters, compute galactic dynamics via `galpy`, apply Toomre Q stability filter
- GPU Node (Consumer): JAX-based 3D cube generation, PSF/LSF convolution, likelihood evaluation
- DragonFlyDB: High-throughput task broker (ephemeral)
- PostgreSQL: ACID-compliant chain storage (durable)
---
## π Project Status
| Phase | Description | Status |
|-------|-------------|--------|
| Phase 0 | Ideation and Setup | β
Complete |
| Phase 1 | Data Acquisition + Validation | β
Complete |
| Phase 2 | Standard Extraction Baseline | β³ Pending |
| Phase 3 | Cube-Space Differencing | β³ Pending |
| Phase 4 | Empirical Noise Model | β³ Pending |
| Phase 5 | Tied Kinematic Fitting | β³ Pending |
| Phase 6 | MAPPINGS V Inference | β³ Pending |
| Phase 7 | Robustness Tests | β³ Pending |
| Phase 8 | Edge-on Galaxy Falsification | β³ Pending |
| ARD | Dataset Materialization | β³ Pending |
**Current Milestone:** Phase 01 complete. HST and JWST data acquired (198 files, ~45 GB), validated against van Dokkum et al. (2025) claims, and QA visualizations generated. All validation checks pass. See [Phase 01 Worklog](work-logs/01-data-acquisition/README.md) for details.
**Next:** Phase 02 β Standard extraction baseline from JWST S3D cubes and HST DRC photometry.
**ARD Target:** v1.0 release alongside paper submission
---
## π Related Projects
This project is part of the [radioastronomyio](https://github.com/radioastronomyio) research program:
| Project | Description |
|---------|-------------|
| [proxmox-astronomy-lab](https://github.com/radioastronomyio/proxmox-astronomy-lab) | Platform documentation, VM inventory, network architecture |
| [desi-cosmic-void-galaxies](https://github.com/radioastronomyio/desi-cosmic-void-galaxies) | Galaxy populations in cosmic voids using DESI DR1 |
| [desi-quasar-outflows](https://github.com/radioastronomyio/desi-quasar-outflows) | AGN outflow spectral fitting and Cloudy modeling |
| [desi-qso-anomaly-detection](https://github.com/radioastronomyio/desi-qso-anomaly-detection) | ML anomaly detection for quasar spectra |
### External Resources
| Resource | Link |
|----------|------|
| van Dokkum et al. (2025) | [arXiv preprint](https://arxiv.org/) |
| HST GO-17301 | [MAST Archive](https://archive.stsci.edu/) |
| JWST GO-3149 | [MAST Archive](https://archive.stsci.edu/) |
| MAPPINGS V | [3MdB Database](https://3mdb.astro.unam.mx/) |
---
## π Open Science Commitment
This project is committed to full transparency and reproducibility:
| Artifact | Platform | Contents |
|----------|----------|----------|
| Code | GitHub | All analysis scripts, pipeline code |
| Work Logs | GitHub | Decision documentation, methodology notes |
| ARD | Zenodo | Complete Analysis-Ready Dataset with DOI |
| Paper | arXiv | Preprint with full methodology |
The ARD will include everything needed to reproduce our results or test alternative hypotheses without access to HPC resources.
---
## π€ Contributing
This project implements systematic AI-human collaboration through transparent documentation and open science practices.
### Development Principles
| Principle | Implementation |
|-----------|----------------|
| Documentation First | All methodology documented before execution |
| Reproducibility | Full provenance chain, versioned artifacts |
| Open Science | Code, logs, and data publicly released |
| Neutral Stance | Validation, not advocacy |
### Getting Started
1. Review the [Validation Specification](docs/README.md)
2. Check the [Work Logs](work-logs/README.md) for current status
3. See phase-specific READMEs for implementation details
---
## π License
This project is licensed under the MIT License β see the [LICENSE](LICENSE) file for details.
---
## π Acknowledgments
### Scientific Foundation
- van Dokkum et al. β Original RBH-1 discovery and interpretation
- STScI/MAST β HST and JWST data archive
- MAPPINGS V Team β Shock model grids
### Infrastructure
- Proxmox Astronomy Lab β Compute cluster
- JAX/Google β GPU-accelerated inference
- ArviZ β Bayesian analysis ecosystem
---
*Contributing to the discussion of extraordinary objects | Part of [radioastronomyio](https://github.com/radioastronomyio)*
Last Updated: December 23, 2025 | Current Phase: Phase 02 β Standard Extraction