tau_eval.tasks package
Submodules
Module contents
- class tau_eval.tasks.CustomTask(dataset: datasets.arrow_dataset.Dataset = None, name: str = '', s1: str = 'text', s2: str = '')
Bases: object
- dataset: Dataset = None
- name: str = ''
- s1: str = 'text'
- s2: str = ''
- class tau_eval.tasks.DeIdentification(dataset: datasets.arrow_dataset.Dataset = None, name: str = '', s1: str = 'text', s2: str = '', max_rows: int = None)
Bases: CustomTask
- dataset: Dataset = None
- max_rows: int = None
- name: str = ''
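The signatures above can be read as plain data containers with defaults. The following is a minimal sketch using hypothetical stand-in classes (`CustomTaskSketch`, `DeIdentificationSketch`), not the tau_eval classes themselves. It assumes that `s1` and `s2` name the columns holding the first and optional second text input, and that `max_rows` caps the number of rows used; neither behavior is stated in this reference. A list of dicts stands in for the `datasets.Dataset` that the real classes accept.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class CustomTaskSketch:
    """Hypothetical stand-in mirroring the documented CustomTask fields."""

    # The real class takes a datasets.Dataset; a list of dicts stands in here.
    dataset: Optional[Sequence[dict]] = None
    name: str = ""
    s1: str = "text"  # assumed: column holding the first text input
    s2: str = ""      # assumed: column holding an optional second text input


@dataclass
class DeIdentificationSketch(CustomTaskSketch):
    """Hypothetical stand-in adding the documented max_rows field."""

    max_rows: Optional[int] = None

    def rows(self) -> list:
        # Assumption: max_rows caps how many rows are used; None means no cap.
        data = list(self.dataset or [])
        return data if self.max_rows is None else data[: self.max_rows]


records = [{"text": f"note {i}"} for i in range(5)]
task = DeIdentificationSketch(dataset=records, name="deid-demo", max_rows=3)
kept = task.rows()  # the first three records under the assumed semantics
```

The subclass inherits `dataset`, `name`, `s1`, and `s2` unchanged, which matches the documented `Bases: CustomTask` relationship.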
- class tau_eval.tasks.IMDBAuthorshipClassification(n_authors: int = 10, min_docs_per_author: int = 1000, random_seed: int = 0, **kwargs)
Bases: Classification
A classification task for authorship attribution using the IMDb-62 dataset.
Inherits from tasknet.Classification and automatically processes the dataset to select authors with sufficient documents for classification.
- property author_info: dict
Get information about selected authors and their document counts.
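The author selection described above might look roughly like the sketch below. This reference does not state how the library picks authors, so the logic here is an assumption: authors with at least `min_docs_per_author` documents are eligible, and up to `n_authors` of them are sampled with a seeded random generator. The function `select_authors` is a hypothetical helper, not part of tau_eval; its return value imitates what `author_info` is described as reporting (selected authors and their document counts).

```python
import random
from collections import Counter


def select_authors(author_ids, n_authors=10, min_docs_per_author=1000, random_seed=0):
    """Return {author: doc_count} for up to n_authors eligible authors.

    Hypothetical helper: the selection rule is assumed, not taken from tau_eval.
    """
    counts = Counter(author_ids)
    # Keep only authors with enough documents; sort for reproducibility.
    eligible = sorted(a for a, c in counts.items() if c >= min_docs_per_author)
    # Seeded sampling, so the same seed yields the same authors.
    rng = random.Random(random_seed)
    chosen = rng.sample(eligible, min(n_authors, len(eligible)))
    return {a: counts[a] for a in sorted(chosen)}


# Toy data: one author label per document.
labels = ["a"] * 5 + ["b"] * 2 + ["c"] * 4 + ["d"] * 1
author_info = select_authors(labels, n_authors=2, min_docs_per_author=4)
print(author_info)  # {'a': 5, 'c': 4}
```

Only authors `a` and `c` meet the threshold of four documents in this toy data, so both are selected regardless of the seed.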