ChallengeResults#

class pepbench.evaluation.ChallengeResults(agg_mean_std: DataFrame, agg_total: DataFrame, single: DataFrame, per_sample: DataFrame)[source]#

Container for the results produced by a PEP evaluation challenge.

This named tuple contains the main result tables produced by the evaluation pipeline.

Attributes:
agg_mean_stdpandas.DataFrame

The aggregated metrics (mean and std) across the dataset.

agg_totalpandas.DataFrame

The total counts (e.g. number of PEPs valid/invalid) for the dataset.

singlepandas.DataFrame

The single (non-aggregated) results for each datapoint / test instance.

per_samplepandas.DataFrame

The per-sample results (detailed per-estimation metrics) for each datapoint.

Methods

count(value, /)

Return number of occurrences of value.

index(value[, start, stop])

Return first index of value.

__init__(*args, **kwargs)#
agg_mean_std: DataFrame#

Alias for field number 0

agg_total: DataFrame#

Alias for field number 1

single: DataFrame#

Alias for field number 2

per_sample: DataFrame#

Alias for field number 3

count(value, /)#

Return number of occurrences of value.

index(value, start=0, stop=sys.maxsize, /)#

Return first index of value.

Raises ValueError if the value is not present.