timesead.data.preprocessing.tep

Functions

make_chunks(liste, chunksize)

preprocess_tep_data(dataset_dir, out_dir[, chunksize, ...])

Preprocess TEP dataset for experiments

Module Contents

timesead.data.preprocessing.tep.make_chunks(liste, chunksize)
timesead.data.preprocessing.tep.preprocess_tep_data(dataset_dir: str, out_dir: str, chunksize: int = 100000, fault_chunksize: int = 7)

Preprocess TEP dataset for experiments

Parameters:
  • dataset_dir (str) – Path to the dataset folder

  • out_dir (str) – Directory where the preprocessed data should be saved. This directory should exist already.

  • chunksize (int) – The processed csv is saved using this chunksize value.

  • fault_chunksize (int) – The processed TEP faults as saved in chunks specified by this.