DomainAwareDataset str repr edge case handling #49

YanisLalou · 2024-01-11T09:25:09Z

Issue #12

Handling the str output of a DomainAwareDataset if the number of domin_names_ is too large
Print the number of datapoints in the dataset

…in_names_ is too large + Print the number of datapoints in the dataset

YanisLalou · 2024-01-11T09:27:24Z

The goal is to have a similar string representation as torchvision for datasets:
Dataset CIFAR10 Number of datapoints: 50000 Root location: datasets/CIFAR10 Split: Train

However rn in the DomainAwareDataset object we don't have the Dataset name + Root location

rflamary · 2024-01-11T09:27:46Z

skada/datasets/_base.py


    def __repr__(self) -> str:
-        return self.__str__()
+        head = self.__str__()
+        body = [f"Number of datapoints: {sum(len(tup[0]) for tup in self.domains_)}"]


you shoudl also print teh numer of domains and call it total sizes (giving both total n and d)

kachayev · 2024-01-11T11:24:05Z

skada/datasets/_base.py

+        output = "\n".join([head] + body)
+        return output
+
+    def get_domain_representation(self, max_domains=5, max_length=50):


I suggest renaming the function to _get_domain_representation to emphasize its role as an internal function. This change will help clarify that it should not be considered part of the class's public API.

rflamary

This seems OK but you need to do some testing (add at leats one test) with different types of datasets and check that nothing breaks

codecov · 2024-01-12T13:22:11Z

Codecov Report

Attention: 1 lines in your changes are missing coverage. Please review.

Comparison is base (1a1de71) 83.98% compared to head (865aac2) 84.16%.

Files	Patch %	Lines
skada/datasets/_base.py	93.33%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #49      +/-   ##
==========================================
+ Coverage   83.98%   84.16%   +0.18%     
==========================================
  Files          35       35              
  Lines        2135     2160      +25     
==========================================
+ Hits         1793     1818      +25     
  Misses        342      342

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

kachayev · 2024-01-12T14:58:14Z

@YanisLalou Flake8 is not happy here as well.

Handling the str output of a DomainAwareDataset if the number of doma…

19ce369

…in_names_ is too large + Print the number of datapoints in the dataset

rflamary reviewed Jan 11, 2024

View reviewed changes

Add number of domains

e71cc73

kachayev reviewed Jan 11, 2024

View reviewed changes

Renaming get_domain_representation to _get_domain_representation

3e12e12

rflamary reviewed Jan 12, 2024

View reviewed changes

Add test for str() and repr() functions of DomainAwareDataset

2bd92df

YanisLalou added the enhancement New feature or request label Jan 12, 2024

YanisLalou requested a review from kachayev January 12, 2024 13:38

YanisLalou self-assigned this Jan 12, 2024

Merge branch 'main' into dataset_repr_branch

ea0ee49

Flake8 warnings

865aac2

rflamary merged commit 803ba26 into scikit-adaptation:main Jan 17, 2024
4 checks passed

YanisLalou deleted the dataset_repr_branch branch January 18, 2024 09:16

kachayev mentioned this pull request Jan 24, 2024

__str__ and __repr__ for DomainAwareDataset #12

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DomainAwareDataset str repr edge case handling #49

DomainAwareDataset str repr edge case handling #49

YanisLalou commented Jan 11, 2024

YanisLalou commented Jan 11, 2024

rflamary Jan 11, 2024

kachayev Jan 11, 2024

rflamary left a comment

codecov bot commented Jan 12, 2024 •

edited

Loading

kachayev commented Jan 12, 2024

DomainAwareDataset str repr edge case handling #49

DomainAwareDataset str repr edge case handling #49

Conversation

YanisLalou commented Jan 11, 2024

YanisLalou commented Jan 11, 2024

rflamary Jan 11, 2024

Choose a reason for hiding this comment

kachayev Jan 11, 2024

Choose a reason for hiding this comment

rflamary left a comment

Choose a reason for hiding this comment

codecov bot commented Jan 12, 2024 • edited Loading

Codecov Report

kachayev commented Jan 12, 2024

codecov bot commented Jan 12, 2024 •

edited

Loading