ChallengeResults#

class pepbench.evaluation.ChallengeResults(agg_mean_std: DataFrame, agg_total: DataFrame, single: DataFrame, per_sample: DataFrame)[source]#

Container for the results produced by a PEP evaluation challenge.

This named tuple contains the main result tables produced by the evaluation pipeline.

Attributes:

agg_mean_stdpandas.DataFrame: The aggregated metrics (mean and std) across the dataset.
agg_totalpandas.DataFrame: The total counts (e.g. number of PEPs valid/invalid) for the dataset.
singlepandas.DataFrame: The single (non-aggregated) results for each datapoint / test instance.
per_samplepandas.DataFrame: The per-sample results (detailed per-estimation metrics) for each datapoint.

Methods

`count`(value, /)	Return number of occurrences of value.
`index`(value[, start, stop])	Return first index of value.

__init__(*args, **kwargs)#

agg_mean_std: DataFrame#: Alias for field number 0

agg_total: DataFrame#: Alias for field number 1

single: DataFrame#: Alias for field number 2

per_sample: DataFrame#: Alias for field number 3

count(value, /)#: Return number of occurrences of value.

index(value, start=0, stop=sys.maxsize, /)#

Return first index of value.

Raises ValueError if the value is not present.

ChallengeResults#

This Page