fontr.pipelines.data_processing package

Submodules

fontr.pipelines.data_processing.nodes module

get_label2idx_mapping(idx2label)
Return type:

dict
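
The name and signature suggest that this function inverts an index-to-label mapping into a label-to-index dict. The following is a minimal sketch of that idea, not the package's implementation: the helper name `invert_idx2label` is hypothetical, and it assumes `idx2label` is either a dict keyed by index or a plain sequence of labels, since the input type is not documented here.

```python
from collections.abc import Mapping


def invert_idx2label(idx2label) -> dict:
    """Hypothetical stand-in: build a label -> index dict."""
    # Assumption: the input is either a mapping {index: label}
    # or a sequence of labels whose position is the index.
    if isinstance(idx2label, Mapping):
        pairs = idx2label.items()
    else:
        pairs = enumerate(idx2label)

    label2idx = {}
    for idx, label in pairs:
        # A duplicate label would make the inverse mapping ambiguous.
        if label in label2idx:
            raise ValueError(f"duplicate label: {label!r}")
        label2idx[label] = idx
    return label2idx


example = invert_idx2label({0: "Arial", 1: "Helvetica"})
```

Rejecting duplicate labels is a design choice of this sketch; the actual function may handle them differently.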

labeled_images_split(data, parameters)

Split the labeled image list into train, validation, and test datasets.

Parameters:
  • data (pd.DataFrame) – list of images

  • parameters (dict[str, Any]) – pipeline parameters

Returns:

the train, validation, and test splits

Return type:

tuple[pd.DataFrame, pd.DataFrame, pd.DataFrame]
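
A three-way split of this kind can be sketched as a shuffle followed by two cuts. The example below is an illustration under stated assumptions, not the package's code: the function name `three_way_split` and the parameter keys `valid_frac`, `test_frac`, and `random_state` are hypothetical, because the contents of `parameters` are not documented here.

```python
import pandas as pd


def three_way_split(data: pd.DataFrame, parameters: dict):
    """Hypothetical sketch: shuffle rows, then cut into train/valid/test."""
    # Assumed parameter keys; the real pipeline parameters may differ.
    valid_frac = parameters.get("valid_frac", 0.1)
    test_frac = parameters.get("test_frac", 0.1)
    seed = parameters.get("random_state", 0)

    # Shuffle all rows reproducibly, then slice by position.
    shuffled = data.sample(frac=1.0, random_state=seed).reset_index(drop=True)
    n_valid = int(len(shuffled) * valid_frac)
    n_test = int(len(shuffled) * test_frac)
    n_train = len(shuffled) - n_valid - n_test

    train = shuffled.iloc[:n_train]
    valid = shuffled.iloc[n_train:n_train + n_valid]
    test = shuffled.iloc[n_train + n_valid:]
    return train, valid, test


# Toy image list with a single hypothetical "path" column.
images = pd.DataFrame({"path": [f"img_{i}.png" for i in range(10)]})
train, valid, test = three_way_split(
    images, {"valid_frac": 0.2, "test_frac": 0.2}
)
```

The three frames are disjoint and together cover every input row. A stratified split by label would need extra logic that this sketch omits.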

unlabeled_images_split(data, parameters)

Split the unlabeled image list into train and test datasets.

Parameters:
  • data (pd.DataFrame) – list of images

  • parameters (dict[str, Any]) – pipeline parameters

Returns:

the split datasets

Return type:

tuple[pd.DataFrame, pd.DataFrame, pd.DataFrame]

fontr.pipelines.data_processing.pipeline module

create_pipeline(**kwargs)
Return type:

Pipeline