lib.sedna.datasources

Subpackages

Package Contents

Classes

BaseDataSource

An abstract class representing a BaseDataSource.

TxtDataParse

txt file which contain image list parser

CSVDataParse

csv file which contain Structured Data parser

JSONDataParse

json file which contain Structured Data parser

class lib.sedna.datasources.BaseDataSource(data_type='train', func=None)[source]

An abstract class representing a BaseDataSource.

All datasets that represent a map from keys to data samples should subclass it. All subclasses should overwrite parse`, supporting get train/eval/infer data by a function. Subclasses could also optionally overwrite __len__, which is expected to return the size of the dataset.overwrite x for the feature-embedding, y for the target label.

Parameters:
  • data_type (str) – define the datasource is train/eval/test

  • func (function) – function use to parse an iter object batch by batch

property is_test_data[source]
num_examples() int[source]
__len__()[source]
abstract parse(*args, **kwargs)[source]
save(output='')[source]
class lib.sedna.datasources.TxtDataParse(data_type, func=None)[source]

Bases: BaseDataSource, abc.ABC

txt file which contain image list parser

parse(*args, **kwargs)[source]
class lib.sedna.datasources.CSVDataParse(data_type, func=None)[source]

Bases: BaseDataSource, abc.ABC

csv file which contain Structured Data parser

static parse_json(lines: dict, **kwargs) pandas.DataFrame[source]
parse(*args, **kwargs)[source]
class lib.sedna.datasources.JSONDataParse(data_type, func=None)[source]

Bases: BaseDataSource, abc.ABC

json file which contain Structured Data parser

parse(*args, **kwargs)[source]
load_anno_from_ids(id_)[source]