semsynth.downstream_fidelity
Downstream fidelity comparison between real and synthetic data.
Functions
|
|
|
|
|
Add dunder methods based on the fields defined in the class. |
|
Return the first non-empty column name found in a JSON-LD column entry. |
|
Return a simplified role label for modeling contexts. |
|
Recompute downstream metrics from existing real/synthetic CSVs. |
|
Decorate a function as a build rule with automatic provenance. |
Classes
|
Special type indicating an unconstrained type. |
|
Type for categorical data with the categories and orderedness. |
|
Base configuration container with TOML application helpers. |
|
|
|
Marker for input paths where |
|
Logistic Regression (aka logit, MaxEnt) classifier. |
|
Marker for output paths where |
|
PurePath subclass that can make system calls. |
|
Univariate imputer for completing missing values with simple strategies. |
- class semsynth.downstream_fidelity.DownstreamConfig(m: 'int' = 20, burnin: 'int' = 5, max_interactions: 'int' = 5, cv: 'int' = 5)
Bases:
Config- burnin: int = 5
- cv: int = 5
- m: int = 20
- max_interactions: int = 5
- semsynth.downstream_fidelity.auto_formula(df: DataFrame, meta: Mapping[str, Any], cfg: DownstreamConfig) str
- semsynth.downstream_fidelity.compute_downstream(df_real: DataFrame, df_synth: DataFrame, meta: Mapping[str, Any] | None = None, *, cfg: DownstreamConfig | None = None, target: str | None = None) Mapping[str, Any]
- semsynth.downstream_fidelity.recompute_downstream(metrics: OutPath = OutPath('{metrics}'), *, real: InPath | None = None, synth: InPath | None = None, meta: InPath | None = None, target: str | None = None, verbose: bool = False) None
Recompute downstream metrics from existing real/synthetic CSVs.