230 Commits

Author SHA1 Message Date
10554fc41e Renamed make_dtype_yaml_compatible func to to_serializable_dtype func 2024-09-25 16:36:50 +02:00
1e1499c28a Made the definition of the path_to_input_dir arg or parameter more robust by ensuring it is always defined using forward slashes and then normalized to the OS specification. Improved dry run = True of copy directory func. 2024-09-25 15:12:19 +02:00
9eeb9d6380 Moved src/metadata_review_lib.py -> pipelines/metadata_revision.py 2024-09-17 16:55:22 +02:00
07401c895f Moved src/data_integration_lib.py -> pipelines/data_integration.py 2024-09-17 15:32:23 +02:00
2dd033bcb3 Refactored code into functions to parse and validate the yaml config file and to perform the specified data integration task using a pipeline-like software structure. 2024-09-17 15:28:11 +02:00
d63f522588 Major update. Removed file filtering option and outputname input arg. The output name is now the same as the path_to_input_dir + .h5. By default, the hdf5 writer preserves second-level subdirectories and the rest are flattened. Dir filtering is outsourced to copy_dir_with_constraints from utils- 2024-09-16 16:35:09 +02:00
7a9f7a8c59 Renamed parameter 'input_file_system_path' to 'path_to_input_directory' for clarity. 2024-09-16 14:24:55 +02:00
cc0adfca62 Fixed import statement 2024-09-13 15:11:25 +02:00
4974246522 Moved def get_parent_child_relationships(file: h5py.File) from ..._vis.py to ..._ops.py 2024-09-13 14:59:11 +02:00
b42482069c src/hdf5_data_extraction.py -> src/hdf5_ops.py 2024-09-13 14:55:12 +02:00
e8e2473ebe Added new method to retrieve metadata from h5file at a given obj path 2024-09-13 14:52:07 +02:00
96a2e96b6a Fixed import statement after module's relocation 2024-08-23 16:23:57 +02:00
e4b04b4484 Modified to use filereader_registry.py. 2024-08-23 16:10:23 +02:00
d985115125 Integrated copy h5 file into group functionality, imported from g5505_file_reader 2024-08-23 15:47:04 +02:00
18165eca1a Modified import statements to account for reader module's relocation. 2024-08-23 13:27:26 +02:00
a0f44a1f4b Moved src/g5505_file_reader.py -> instruments/readers/g5505_file_reader.py to increase modularity with respect to new instrument additions. 2024-08-23 10:11:29 +02:00
1112a214e9 Moved src/g5505_utils.py to utils/g5505_utils.py 2024-08-23 07:27:39 +02:00
d7fc38abd9 Moved get_parent_relationships func into hdf5_vis.py and cleaned up unused import statements 2024-08-22 09:50:26 +02:00
05d1133e32 Moved get_parent_child_relationships() func from hdf5_lib.py into hdf5_vis.py to avoid a circular dependency between the lower-level and higher-level modules. Thus also removed the src.hdf5_lib.py import statement. 2024-08-22 09:47:57 +02:00
d7c7808400 Implemented method for appending new attributes to a specific object. 2024-08-16 09:32:58 +02:00
bb250e9940 Implemented method to reformat a given column in a data table holding datetime info into a desired datetime format. During data integration this will serve to normalize datetime formats across data tables. 2024-08-16 08:08:28 +02:00
062a688f47 Added method to reformat columns containing datetime byte strings into a desired datetime-formatted object 2024-08-14 16:22:28 +02:00
c876e925a7 Modified code to point to the new instruments folder location. Also upgraded code to accept either a user-specified location or the default location 2024-08-12 13:40:01 +02:00
7f0e5384ea Moved instruments folder outside src/. 2024-08-12 10:09:21 +02:00
18aba8d0d3 Implemented dataset append method in HDF5DatOpsAPI 2024-08-09 15:25:09 +02:00
5fe7fc4b70 Developed a class to manage data operations on a given hdf5 file 2024-08-09 13:23:54 +02:00
8f7f14ab68 Removed time stamp configuration attributes from ACSM_TOFWARE, because they can be messy in a configuration file. 2024-08-08 11:24:41 +02:00
74db800e01 Updated file with new instrument configuration ACSM. 2024-08-07 16:38:52 +02:00
ae1e3bfc23 Moved ext_to_reader_dict to g5505_file_reader.py and replaced reader selection with one based on g5505_reader.select_file_reader(hdf5_file_path). 2024-08-07 16:30:36 +02:00
4e669b3eee Moved hdf5_file_path to file reader mapping and extension definitions to g5505_file_reader_module.py. Created functions to compute file_reader key from path to file in the hdf5 file and select the reader based on the key. This should enable more modular file reader selection. 2024-08-07 16:21:22 +02:00
3430627494 Modified reader to output table_preamble as a dataset as opposed to attributes of the file. I believe this is better for readability of the metadata given that those preambles can sometimes contain large amounts of text. 2024-08-02 14:37:06 +02:00
938e8e50a1 Changed names of expected root level metadata attributes. 2024-07-17 08:48:47 +02:00
a06e28291c Added attribute insertion order tracking at the root level and reorganized a few import statements. 2024-07-17 08:41:40 +02:00
2ebe5f3220 Made edits to documentation 2024-07-11 13:42:38 +02:00
f04f5eaaf9 Made column name to description assignment more robust; however, it may be a bit slower than before. 2024-07-10 13:31:47 +02:00
5c6fcabf91 Updated the yaml instrument descriptions. 2024-07-10 13:29:14 +02:00
73beb83278 Moved parse_attribute() from ..review_lib.py into ...utils.py and propagated the (refactored) changes to the respective modules. 2024-07-10 11:32:00 +02:00
2ce925735d Modified the returned datetime output to a format without colons, since colons could be problematic in file names. 2024-07-10 09:47:56 +02:00
0a0b4ac41d Moved a few functions from ...reader.py and hdf5_lib.py into ..utils.py, and refactored accordingly. 2024-07-10 09:19:30 +02:00
0c74c52e09 Removed smogchamber reader because its functionality is now integrated into g5505_file_reader.py. 2024-07-09 16:13:01 +02:00
afc6c93823 Removed unused code. 2024-07-08 15:29:13 +02:00
cb7d914908 Cleaned code and modified def create_hdf5_file_from_dataframe to create the group hierarchy implicitly from the path rather than recursively. 2024-07-08 15:24:48 +02:00
92eca4d79e Moved remaining git operations in metadata_review_lib.py to git_ops.py and refactored accordingly 2024-07-05 15:46:20 +02:00
77386432f8 Merge branch 'main' of https://gitlab.psi.ch/5505/dima 2024-07-02 16:50:08 +02:00
177a5aa2a1 Updated documentation. 2024-07-02 16:49:48 +02:00
ba6d89d8e1 Modified created-at function to output datetime and time zone 2024-07-01 16:19:28 +02:00
c074e45892 Renamed script_name to processing_file. 2024-07-01 16:17:25 +02:00
cedfe614e7 Implemented input argument to enable appending information to existing attributes, whose values must be either strings or lists. 2024-06-20 15:32:33 +02:00
106795ae59 Added a few lines to detect the existence of the file and change the file mode from 'w' to 'a' based on that information. 2024-06-20 09:03:47 +02:00
498a51cbc6 Updated function to add project level metadata at the root group of the hdf5 file. 2024-06-19 18:31:11 +02:00