toksearch-based scripts (+ a few OMFIT ones) for pulling data; plotting scripts for Joe and Rory's 2021 control experiment stuff

https://github.com/plasmacontrol/data-fetching
To run on iris and saga (General Atomics clusters):

`module load toksearch`

The first time you run, after `module load toksearch`, also run `pip install h5py==3.6.0`. This shouldn't be necessary, but if you have further difficulties, consider things like `module load defaults`, `module load gcc7`, or `module load hdf5/gnu`.

For the basic case:

In dump_shots.py, edit min_shots and max_shots to whatever range you want; for testing, use e.g. 163300 to 163310. You can also edit the file to take only shots from certain run days. Then dump the shots you want to collect (only those with plasma) via

`python dump_shots.py`
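If you want to sanity-check the downstream steps without querying the database, note that the shot list is just a NumPy array saved to data/shots.npy. A minimal stand-in (the shot range here is the testing example from above; the real script filters for plasma shots, which this sketch does not) looks like:

```python
import os
import numpy as np

# Hypothetical stand-in for dump_shots.py's output: a flat array of shot
# numbers saved to the path that the example config reads from.
min_shots, max_shots = 163300, 163310
shots = np.arange(min_shots, max_shots + 1)

os.makedirs("data", exist_ok=True)
np.save("data/shots.npy", shots)

# Reload to confirm the round trip.
loaded = np.load("data/shots.npy")
print(loaded[0], loaded[-1], len(loaded))  # 163300 163310 11
```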

Then collect the signals from those shots. By default, path_to_config below will be configs/example.yaml, which includes all the signals, reads the shot list from data/shots.npy (which dump_shots.py writes to), and writes to output_file (MAKE SURE TO EDIT THIS). You can take out signals, e.g. for testing:
- set sql_sig_names, scalar_sig_names, stability_sig_names, nb_sig_names, efit_profile_sig_names, efit_scalar_sig_names, thomson_sig_names, and zipfit_sig_names to empty arrays
- set include_radiation, include_full_ech_data, include_full_nb_data, include_gas_valve_info, and include_log_info to False

You can also increase max_shots_per_run, which controls how often the run checkpoints and reports how long each batch takes. Setting num_processes greater than 1 should work automatically on Saga and parallelizes the collection.
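Putting those testing edits together, a stripped-down config might look roughly like the sketch below. The field names are taken from the description above, but the exact schema is defined by configs/example.yaml, so treat this as an illustration rather than a drop-in file:

```yaml
# Hypothetical minimal test config; check configs/example.yaml for the
# real schema and any fields not mentioned here (e.g. the shot list path).
output_file: /path/to/output.h5   # MAKE SURE TO EDIT THIS

sql_sig_names: []
scalar_sig_names: []
stability_sig_names: []
nb_sig_names: []
efit_profile_sig_names: []
efit_scalar_sig_names: []
thomson_sig_names: []
zipfit_sig_names: []

include_radiation: False
include_full_ech_data: False
include_full_nb_data: False
include_gas_valve_info: False
include_log_info: False

max_shots_per_run: 100
num_processes: 1
```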

When the config file is ready, run

`python new_database_maker.py path_to_config`

For large runs, use `launch_parallel_jobs.py`, which manually dumps shots and splits them into cases that run in parallel (toksearch can theoretically do this under the hood, but in my experience it doesn't speed things up and is not robust). You can then modify `combine_shots.py` to combine the h5 files it dumps into one.
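The splitting that `launch_parallel_jobs.py` performs can be illustrated with a pure-Python sketch (the function and variable names here are illustrative, not the script's actual API):

```python
def split_into_cases(shots, num_cases):
    """Split a shot list into at most num_cases roughly equal chunks,
    one per parallel job (illustrative, not the script's actual logic)."""
    chunk = -(-len(shots) // num_cases)  # ceiling division
    return [shots[i:i + chunk] for i in range(0, len(shots), chunk)]

shots = list(range(163300, 163311))  # the 11 testing shots from above
cases = split_into_cases(shots, 4)
for i, case in enumerate(cases):
    print(f"case {i}: shots {case[0]}..{case[-1]} ({len(case)} shots)")
```

Each case would then be handed to its own toksearch run, and the resulting h5 files merged afterwards.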

As a side note, omfit_run_dump.py, an OMFIT script, can be used for grabbing text data (see the top of the file for how to use it).