mx/Jungfraujoch

T

leonarski_fandClaude Opus 4.8 dd25de461d RotationScaleMerge: GPU partial-scaling loop (CUDA port, phase 1)

First stage of moving the rotation scale/merge onto the GPU. The per-frame partial-scaling loop
(inverse-variance group-mean reduction -> robust per-frame IRLS G -> corr update, x scaling_iter)
now runs in RotationScaleMergeGPU (.cu) when a GPU is present; the CPU loops remain the fallback.

The host keeps the one-time raw-hkl sort and the per-space-group gemmi ASU keying, and hands the
GPU a group-ordered permutation + CSR so the per-group reduction is a DETERMINISTIC segmented
reduction (one thread per group, fixed order, no atomics) - preserving the run-to-run determinism
just won on the CPU path (a float atomicAdd reduction would have re-introduced jitter). Reduction is
one-thread-per-group (groups average tens of obs, so a block-per-group wastes threads); the IRLS is
one block per frame with a deterministic shared-memory reduction.

Validated: bit-identical to the CPU path and deterministic run-to-run on lyso/cytC/Ins_H/pding
(P41212 ISa 7.8 CC1/2 99.7%, etc.). The scaling kernels are ~7x faster than the CPU compute
(~36 ms for 3 iters vs ~0.28 s); end-to-end scale/merge ~2.0 -> ~1.5 s. The remaining gap to the
<1 s target is the per-pass host round-trip (corr down/upload for the CPU combine + per-SG group-CSR
rebuild); phase 2 keeps the data resident by moving the 3D combine and the merge/error-model onto
the GPU too, so nothing round-trips.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-07-02 22:26:29 +02:00

.gitea/workflows

ci: drop requests dependency and use PowerShell for the Windows release upload

2026-07-01 21:51:54 +02:00

acquisition_device

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

scan_result: short "ice" field + global Bravais lattice type

2026-07-02 21:00:12 +02:00

scan_result: short "ice" field + global Bravais lattice type

2026-07-02 21:00:12 +02:00

Compression: add BSHUF_ZSTD_RLE_HUFF (RLE runs + Huffman literals)

2026-06-27 14:41:46 +02:00

detector_control

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

jfjoch_process: azimuthal-integration CLI + default 0.01 1/A q-spacing

2026-07-02 17:24:05 +02:00

v1.0.0-rc.115 (#22 )

2025-12-04 11:56:14 +01:00

VERSION: 1.0.0-rc.156

2026-07-01 21:33:37 +02:00

frame_serialize

ice score: per-image ice-ring indicator plumbed through all layers

2026-07-02 16:37:12 +02:00

scan_result: short "ice" field + global Bravais lattice type

2026-07-02 21:00:12 +02:00

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

RotationScaleMerge: GPU partial-scaling loop (CUDA port, phase 1)

2026-07-02 22:26:29 +02:00

v1.0.0-rc.155 (#65 )

2026-06-25 22:01:48 +02:00

v1.0.0-rc.154 (#64 )

2026-06-25 18:12:00 +02:00

v1.0.0-rc.133

2026-03-26 20:50:33 +01:00

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

v1.0.0-rc.155 (#65 )

2026-06-25 22:01:48 +02:00

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

rotation scaling: dedicated allocate-once RotationScaleMerge, ~3.4x faster scale/merge

2026-07-02 21:00:12 +02:00

ice score: read back from stored HDF5 (viewer) + write to the NXmx master

2026-07-02 16:42:17 +02:00

scan_result: short "ice" field + global Bravais lattice type

2026-07-02 21:00:12 +02:00

bragg_integration: GPU box + profile-fit integrator (standalone engine)

2026-07-02 20:59:45 +02:00

jfjoch_process: azimuthal-integration CLI + default 0.01 1/A q-spacing

2026-07-02 17:24:05 +02:00

ice score: per-image ice-ring indicator plumbed through all layers

2026-07-02 16:37:12 +02:00

ice score: read back from stored HDF5 (viewer) + write to the NXmx master

2026-07-02 16:42:17 +02:00

v1.0.0-rc.149 (#59 )

2026-06-13 21:27:41 +02:00

.gitattributes

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

.gitignore

Python client is built in CI (no new release)

2024-10-23 21:13:22 +02:00

.readthedocs.yaml

version 1.0.0-rc.25

2024-11-22 21:25:20 +01:00

CLAUDE.md

docs: complete the per-image-quantity recipe with the HDF5 read-back path

2026-07-02 16:43:02 +02:00

CMakeLists.txt

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

CONTRIBUTING.md

version 1.0.0-rc.27

2024-12-02 21:17:14 +01:00

gen_python_client.sh

v1.0.0-rc.125 (#32 )

2026-02-18 16:17:21 +01:00

gitea_create_release.py

v1.0.0-rc.94

2025-10-25 22:05:47 +02:00

gitea_upload_file.py

ci: drop requests dependency and use PowerShell for the Windows release upload

2026-07-01 21:51:54 +02:00

LICENSE

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

make_doc.sh

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

README.md

version 1.0.0-rc.27

2024-12-02 21:17:14 +01:00

THIRD_PARTY_NOTICES.md

v1.0.0-rc.153 (#63 )

2026-06-23 20:29:49 +02:00

update_version.sh

docs: publish third-party notices as a documentation page

2026-07-01 21:42:22 +02:00

VERSION

VERSION: 1.0.0-rc.156

2026-07-01 21:33:37 +02:00

README.md

Jungfraujoch

Application to receive data from the PSI JUNGFRAU and EIGER detectors.

All documentation is now placed in docs/ subdirectory and for the current version hosted on Jungfraujoch Read The Docs page.