tau_eval.tasks.imdb_authorship_classification module

class tau_eval.tasks.imdb_authorship_classification.IMDBAuthorshipClassification(n_authors: int = 10, min_docs_per_author: int = 1000, random_seed: int = 0, **kwargs)[source]

Bases: Classification

A classification task for authorship attribution using the IMDb-62 dataset.

Inherits from tasknet.Classification and automatically processes the dataset to select authors with sufficient documents for classification.

property author_info: dict

Get information about selected authors and their document counts.