# Large reference test datasets (git-LFS) The `[large]` Catch tests in `tests/` run the full analysis/processing pipeline over real JUNGFRAU datasets that are too big to keep as ordinary git blobs. They are tracked with **git-LFS** (see the `tests/data/*.h5` rule in the top-level `.gitattributes`). These files are **not required** to build or to run the normal test suite: every test that needs them resolves the path through `jfjoch_test::LargeDataFile()` (`tests/TestData.h`) and `SKIP()`s when the file is absent or is still an unfetched LFS pointer. `jfjoch_test` also prints, at start-up, whether this directory is populated. ## Fetching ``` git lfs install git lfs pull # or: git lfs pull --include "tests/data/*.h5" ``` ## Datasets | File | Dataset | Shipped | |----------------------------|-----------------------------------------|-----------| | `lyso_rotation_master.h5` | lysozyme rotation series (~1800 images) | yes (LFS) | `lyso_rotation_master.h5` (plus its `_data_NNNNNN.h5` files) is fetched by `git lfs pull` and drives `JFJochProcess_LysoRotation`. A separate serial dataset is intentionally **not** shipped to keep the repository small — the rotation series can be run in serial mode (full analysis without rotation indexing) to exercise that path. To add your own dataset, drop the master + its data files here as real files (not symlinks, if you intend to commit them via LFS); the master references its data files by relative name, so keep them side by side.