Benchmark Dataset

Module Introduction

The "Benchmarking Datasets" module serves as a repository for rigorous validation, hosting three distinct reference libraries: the Real IPD Dataset, the Published KM Dataset, and the Synthetic KM Dataset.

The complete benchmarking datasets are publicly accessible via Zenodo at https://zenodo.org/records/18320575.

Each dataset serves a specific validation purpose:

  • Real IPD Dataset: Designed to assess distributional fidelity. It validates the concordance of the Weibull distribution parameters between the reconstructed IPD and the ground-truth IPD.
  • Published KM Dataset & Synthetic KM Dataset: Focused on statistical precision. These datasets are utilized to evaluate the accuracy of survival statistics derived from the reconstructed IPD by comparing them against the ground-truth values calculated from the original data.