timesead.data.preprocessing.tep
Functions
|
|
|
Preprocess TEP dataset for experiments |
Module Contents
- timesead.data.preprocessing.tep.make_chunks(liste, chunksize)
- timesead.data.preprocessing.tep.preprocess_tep_data(dataset_dir: str, out_dir: str, chunksize: int = 100000, fault_chunksize: int = 7)
Preprocess TEP dataset for experiments
- Parameters:
dataset_dir (str) – Path to the dataset folder
out_dir (str) – Directory where the preprocessed data should be saved. This directory should exist already.
chunksize (int) – The processed csv is saved using this chunksize value.
fault_chunksize (int) – The processed TEP faults as saved in chunks specified by this.