Commit Graph

35 Commits

Author SHA1 Message Date
b643eb2d26 Refactored command line interface, generate_species_flags, and created functions to improve performance and code clarity. 2025-03-13 14:17:46 +01:00
97f253074b Add in try-except block a finally statement to close hdf5 file for secure file closing. This was done before in the try case which is not good practice 2025-03-13 14:06:01 +01:00
d6fa51effc Fix typo and missing key. 1) Update allowed yaml files in load_project_yaml_files by adding pipelines/params/calibration_factors.yaml. 2) Correct filename access after os.path.split operation. 2025-03-10 13:02:38 +01:00
4ca99f4f04 Rewrote load_calib_file() in terms pipelines.steps.utils.load_project_yaml_files(). In the future we will replace it completely so we can validate and manage all yaml files in the chain in a more centralized manner. 2025-03-10 11:46:24 +01:00
ecdf3687ba Remove/replace def load_parameters(flag_type) with steps.utils.load_project_yaml_files(.). This is to have a more centralized treatment of yaml files inside the data chain. 2025-03-10 11:38:51 +01:00
2eb88e444c WIP: finished last commit, and tested the whole step. Now we need to implement the command line interface. 2025-03-07 16:51:06 +01:00
eb8a36dc8e Update pipelines/steps/apply_calibration_factors.py to generate error table and cleaned up command line interface. 2025-03-07 16:49:32 +01:00
78340464aa Add functions: generate_error_dataframe() with missing values, metadata_dict_to_dataframe(), and load_project_yaml_files() to easily access data from yaml files in the project. 2025-03-07 16:45:33 +01:00
f035afbf3b Fix duplicate time stamp problems and added code to check for NaTs before ebas submission files.) 2025-03-04 09:02:37 +01:00
6eccbb5018 Moved get_metadata() from pipelines/steps/prepare_ebas_submission.py to utils.py 2025-03-03 18:55:46 +01:00
def67a0562 Added correct calibration parameters to calibration factors table 2025-03-03 14:23:45 +01:00
d3a491ea85 Renamed pipeline step join_tables.py to prepare_ebas_submission.py for accuracy. This step performs more than joining tables, it also does renaming using dictionaries and generates the ebas data submission package 2025-03-03 10:48:03 +01:00
877c0178d7 WIP: Added first version of join_tables and tested txt generation. We should probaly make this step encompasse name renaming for EBAS and rename step to be more generic. Change join_tables.py to generate_ebas_converter_input_files 2025-02-28 17:23:00 +01:00
a9533ae3e8 Refine function. Abstract out load parameters as function to simplify code. 2025-02-28 17:18:13 +01:00
6890377134 Remove dataset_name parameter from command line interface. Instead i now infer it assuming there is a consistent input filename in the hdf5 file that can be found using hard coded keywords. 2025-02-28 17:17:33 +01:00
33f29a8080 WIP: Completed flag generation for species, command line interface is now clear but still does not consider potentially existing flags. Review TODO.md, for efficiency we need to simplify flag assignment and reconciliation for the species flag type. 2025-02-26 15:11:12 +01:00
4edd6809e9 WIP: Redesigned command line functionality. options --species is half baked :). It runs without error but second part needs validation. 2025-02-22 16:00:27 +01:00
e24d451542 Updated thirdparty submodule to latest version 2025-02-22 15:02:17 +01:00
0911260f26 Save changes. 2025-02-15 18:20:58 +01:00
a516479036 Change data lineage format. Now, data lineage information is accessed using name of the file as key. 2025-02-15 17:57:50 +01:00
ca08044569 Update keys of data_lineage_metadata.yaml. keys are now the filenames in the folder. 2025-02-11 13:14:14 +01:00
0623c2462b Adjust command line interface of steps/apply_... and steps/compute_aut... 2025-02-11 13:07:29 +01:00
f2854d4ca0 Implement pipelines/steps/visualize_datatable_vars.py 2025-02-11 13:03:46 +01:00
b966151d9c Update with template for compute_cpc_flags(). This still needs to be completed. 2025-02-10 10:54:18 +01:00
ca2b8b6787 Rename apply_diagnostic_flags.py to compute_automated_flags.py because the function also computes both cpc and diagnostic based flags 2025-02-10 10:45:55 +01:00
d93003f5c4 Add import statement and adjust dependencies. 2025-02-06 16:57:46 +01:00
ad95d11a0d Implement utils module for reusable functions across steps in the pipelines 2025-02-06 16:54:36 +01:00
6ed9fe1eb1 Rename pipeline step to apply_diagnositc_flags, it perform a check based on specified thresholds and then assigns I, V, or M labels producing a labeling of a variable. 2025-02-06 16:25:40 +01:00
f8b900422b Rename function 2025-02-06 16:15:19 +01:00
2c2b154528 Update to account for yaml file attribute renamings. 2025-02-05 18:15:43 +01:00
48e2127fef Reimplement apply_calibration_factors() in terms of load_calibration_file() and compute_calibration_factors() to reduce complexity and improve modularity. 2025-02-04 18:12:42 +01:00
95f13d05b2 Save changes to calibration factors and apply_calibration_factors.py after git pull operation 2025-01-30 13:33:44 +01:00
f3ec8c6acf Fix typo, i was not properly accessing the key variables in the yaml dictionary 2025-01-29 19:27:18 +01:00
23e83134d2 Restructure calibration file to make more userfriendly 2025-01-29 18:01:52 +01:00
9bb56cd3da Implement first version of processing pipeline. 2025-01-29 09:16:26 +01:00