|
06c5c6d84b
|
Incorporated method to MetadataHarvester class to collect project level metadata.
|
2024-06-19 18:30:02 +02:00 |
|
|
04558e7785
|
Added code to parse dict attributes.
|
2024-06-18 14:42:51 +02:00 |
|
|
a6868d985d
|
Fixed bug regarding datetime to str column conversion in dataframe by using .map(srt) (element wise operation) as opposed to .apply(str)
|
2024-06-18 09:21:46 +02:00 |
|
|
b66dc11a62
|
Replaced applymap to .apply because the former is being depricated
|
2024-06-17 13:47:54 +02:00 |
|
|
ed1641af55
|
Created function to save dataframes with annotations in hdf5 format
|
2024-06-17 13:36:05 +02:00 |
|
|
0eba80db41
|
Added metadata printer method and rewrote slightly a few class terms.
|
2024-06-17 08:44:44 +02:00 |
|
|
c68e800967
|
Incorporated dataframe_to_np_structured_array(df: pd.DataFrame) from another module.
|
2024-06-16 18:39:30 +02:00 |
|
|
e4de4edf28
|
Incorporated dataframe_to_np_structured_array(df: pd.DataFrame) from another module.
|
2024-06-16 18:26:12 +02:00 |
|
|
2d4ecec806
|
Moved dataframe_to_np_structured_array(df: pd.DataFrame) to src/g5505_utils.py. This is a more generic function that can be used more broadly accross modules.
|
2024-06-16 18:25:08 +02:00 |
|
|
6f5c49dc64
|
Removed lenthy example.
|
2024-06-13 16:03:04 +02:00 |
|
|
a301d42ad5
|
Replaced add_data_level_info to add_dataset.
|
2024-06-13 16:01:27 +02:00 |
|
|
0fb14b7c6c
|
Developed a metadata harvesting object to facilitate metadata collection throught the code.
|
2024-06-13 15:47:02 +02:00 |
|
|
9ab9aa49c4
|
Abstracted a code snippet from def create_hdf5_file_from_filesystem_path(..) as transfer_file_dict_to_hdf5() so that it can be reusable.
|
2024-06-13 15:44:01 +02:00 |
|
|
cbca94390f
|
Modified hardcoded paths to adapt with respect to the parent directory
|
2024-06-11 17:30:58 +02:00 |
|
|
e7ed6145f0
|
Implemented a data extraction module to access data from an hdf5 file in the form of dataframes.
|
2024-06-11 10:38:04 +02:00 |
|
|
a410bde23e
|
Removed data table split into categorical and numerical variables and numering is only introduce to disambiguate repeated columns.
|
2024-06-10 16:18:51 +02:00 |
|
|
1ec7ad76ff
|
Removed additional numbering from some intrument specifications. These are now only added if the column names are ambigous.
|
2024-06-10 16:14:13 +02:00 |
|
|
726e9b3503
|
Fixed bug in the case where data_integration_mode = 'collection'.
|
2024-06-07 16:45:00 +02:00 |
|
|
dba5bc9ea7
|
Updated instrument names from ICAD/HONO and ICAD/NO2 to HONO and NO2.
|
2024-06-07 16:41:41 +02:00 |
|
|
197ad0288a
|
Updated file reader and data integration with datastart and dataend properties.
|
2024-06-04 13:37:20 +02:00 |
|
|
9dcc757acc
|
renamed folder src/instrument_descriptions/ to src/intruments/ and moved text_data_sources.yaml in there.
|
2024-06-04 10:54:09 +02:00 |
|
|
a6ddb24eeb
|
Added .strip to column names to remove unwanted characters (\r|\t|\n) and included units description to timestamps.
|
2024-06-04 09:57:37 +02:00 |
|
|
fa2990527e
|
Simplified and documented parse_attribute function.
|
2024-06-04 09:51:12 +02:00 |
|
|
014bd14fcd
|
Modified temperature units from °C to Celcius for simpler string encoding. It seems ascii codec cannot encode such a character
|
2024-06-04 09:44:09 +02:00 |
|
|
385267a98f
|
Updated treemap visualization to select only root metadata, which is of string type.
|
2024-06-03 14:17:42 +02:00 |
|
|
560481610c
|
Updated root metadata display in treemaps
|
2024-06-02 16:43:54 +02:00 |
|
|
c74b6c1a91
|
Updated instrument attributes with datetime_format and desired_format.
|
2024-06-02 16:14:30 +02:00 |
|
|
1054367f12
|
Modified annotate_root_dir function.
|
2024-06-02 16:02:48 +02:00 |
|
|
d335836a7d
|
Updated reader to standardize timestamps to a desired format when possible. The desired format is set in text_data_sources.yaml.
|
2024-06-02 15:59:01 +02:00 |
|
|
69f3857936
|
Implemented functions for data extraction from hdf5 files.
|
2024-05-31 12:39:10 +02:00 |
|
|
922bb3ca64
|
Updated YAML config file parsing logic to account for changes in config file description.
|
2024-05-30 12:16:54 +02:00 |
|
|
3a9aede909
|
Made def third_update_hdf5_file_with_review more modular by separating data update and git operations, resulting new functions that can be reused in less restrictive matadata annotation contexts.
|
2024-05-29 15:26:48 +02:00 |
|
|
ef7c6c9efb
|
Implemented a git operations module for automated git ops, based on subprocess.
|
2024-05-29 15:17:09 +02:00 |
|
|
a86fc97605
|
Refactored due to updates in the file reader function.
|
2024-05-28 14:41:34 +02:00 |
|
|
3de6abce50
|
added the feature to activate or deactivate data copying before reading the input file. This is to avoid redundant copying when we are already working on file copies.
|
2024-05-28 14:40:14 +02:00 |
|
|
fd1c6461bb
|
Updated some of the raname_as metadata for all instruments so that it is much machine readable and perhpas be used as an alternative to the original name in future releases.
|
2024-05-28 14:37:43 +02:00 |
|
|
804ea52583
|
Modified function to return list of paths when config_file.yaml integration mode = experimental step.
|
2024-05-28 11:29:32 +02:00 |
|
|
f6a46168ec
|
Improved parsing from HDF5 attr dict to yaml compatible dict. Now we can parse HDF5 compound attributes (structured np arrays).
|
2024-05-28 11:27:44 +02:00 |
|
|
41c7660be3
|
Enhanced data transfer progress visualization and logging
|
2024-05-28 08:59:29 +02:00 |
|
|
08d58557df
|
Fixed bug that didnot allowed analythical_methods composite keywords (e.g., ICAD/HONO) to be matched in intrument configurations.
|
2024-05-28 08:57:57 +02:00 |
|
|
3270ce5ed7
|
Implemented reader file compatibility check.
|
2024-05-27 18:22:16 +02:00 |
|
|
2911416431
|
Improved modularity of hdf5_file creation by creating a function that copies the intput directory file and applies directory, files, and extensions constraints before regular directory to hdf5 transfer. See [200~def copy_directory_with_contraints(input_dir_path, output_dir_path, select_dir_keywords, select_file_keywords, allowed_file_extensions):
|
2024-05-27 18:15:08 +02:00 |
|
|
24a2d5d37e
|
Refactored list to array conversion using metadata_rewiew_lib
|
2024-05-26 15:04:07 +02:00 |
|
|
77afbbbf8f
|
Added function to convert list of strings into a np.array of bytes. This is useful to create list-valued attributes in HDF5.
|
2024-05-26 14:56:36 +02:00 |
|
|
88572b44b1
|
Fixed buggy statement. import datetime ... followed by datetime.now() was fixed as datetime.datetime.now().
|
2024-05-26 12:26:54 +02:00 |
|
|
37071945f5
|
Removed hdf5 file creation redundancy by creating a helper function create_HDF5_file(date_str,select_file_keywords), which handles variations in date_str and keywords.
|
2024-05-26 12:24:15 +02:00 |
|
|
4dc09339b5
|
Replaced lambda function with regular function and fstring for better readability and debugging
|
2024-05-26 11:39:40 +02:00 |
|
|
b7f9bfe149
|
Replaced print statement with logging and raise exception for better error handling and managment
|
2024-05-26 11:34:20 +02:00 |
|
|
ac37235072
|
Added function setup_logging to configure logger to record logs in specified output directory.
|
2024-05-26 11:19:54 +02:00 |
|
|
c7051bfe69
|
updated readme and reader to handle ignore ascii character errors
|
2024-05-24 15:55:15 +02:00 |
|