lib.sedna.datasources

Subpackages

Package Contents

Classes

BaseDataSource

An abstract class representing a BaseDataSource.

TxtDataParse

txt file which contain image list parser

CSVDataParse

csv file which contain Structured Data parser

class lib.sedna.datasources.BaseDataSource(data_type='train', func=None)[source]

An abstract class representing a BaseDataSource.

All datasets that represent a map from keys to data samples should subclass it. All subclasses should overwrite parse`, supporting get train/eval/infer data by a function. Subclasses could also optionally overwrite __len__, which is expected to return the size of the dataset.overwrite x for the feature-embedding, y for the target label.

Parameters
  • data_type (str) – define the datasource is train/eval/test

  • func (function) – function use to parse an iter object batch by batch

num_examples(self) int[source]
__len__(self)[source]
abstract parse(self, *args, **kwargs)[source]
property is_test_data(self)[source]
save(self, output='')[source]
class lib.sedna.datasources.TxtDataParse(data_type, func=None)[source]

Bases: BaseDataSource, abc.ABC

txt file which contain image list parser

parse(self, *args, **kwargs)[source]
class lib.sedna.datasources.CSVDataParse(data_type, func=None)[source]

Bases: BaseDataSource, abc.ABC

csv file which contain Structured Data parser

static parse_json(lines: dict, **kwargs) pandas.DataFrame[source]
parse(self, *args, **kwargs)[source]