Add more FOR_ITER specialization stats #32151

sweeneyde · 2022-03-28T08:15:47Z

No description provided.

markshannon · 2022-03-28T15:55:34Z

How did you choose the categories? What do the stats look like?

They look quite tailored to the benchmark suite. Would it possible to generalize them a bit?
Certain builtin classes, like zip and enumerate are worth checking for. But beyond that it is probably only worth categorizing into broader categories like: "iterator for underlying sequence", "computed iterator", "implemented in C", "implemented in Python".
That sort of thing.

sweeneyde · 2022-03-29T01:41:34Z

I chose the categories by running the test suite and pyperformance with a printf at the end of _PySpecialization_ClassifyIterator until there stopped being so many prints. I can pare down the number of cases to ones we care about, but here are some of the results with all of those details:

markshannon · 2022-03-30T11:56:49Z

Ok, we can always change the categories later, if we need to.
In the meantime, this is useful information.

markshannon · 2022-03-30T12:01:10Z

Python/specialize.c

@@ -452,6 +452,12 @@ initial_counter_value(void) {
 #define SPEC_FAIL_COMPARE_OP_EXTENDED_ARG 24

 /* FOR_ITER */
+#define SPEC_FAIL_FOR_ITER_REVERSED 4


The values below 8 are common, in the section marked /* Common */ above.
You can always raise SPECIALIZATION_FAILURE_KINDS if you need.

sweeneyde · 2022-06-13T04:57:15Z

Results from python -m test, now with the new ascii string iterator:

Failure kind	Count	Ratio
list	40313806	30.7%
range	37450044	28.5%
itertools	10394906	7.9%
map	9850377	7.5%
tuple	7643483	5.8%
generator	7477618	5.7%
enumerate	`3882317`	3.0%
ascii string	3266112	2.5%
dict items	3198088	2.4%
callable	`1707168`	1.3%
dict keys	1115718	0.8%
bytes	1105824	0.8%
zip	984525	0.7%
seq iter	974430	0.7%
other	801038	0.6%
set	678627	0.5%
dict values	222521	0.2%
reversed list	184092	0.1%
string	72383	0.1%

I think this is useful information, so I'll go ahead and merge. As was said, we can always adjust things more later.

Add more FOR_ITER stats

77c059e

bedevere-bot added the awaiting core review label Mar 28, 2022

the-knights-who-say-ni added the CLA signed label Mar 28, 2022

sweeneyde added skip issue skip news labels Mar 28, 2022

sweeneyde requested a review from markshannon March 28, 2022 08:16

Check for PyReversed_Type

64f77df

sweeneyde closed this Mar 28, 2022

sweeneyde reopened this Mar 28, 2022

Merge branch 'main' into for_iter_stats

943360f

markshannon reviewed Mar 30, 2022

View reviewed changes

sweeneyde and others added 4 commits March 30, 2022 08:18

remove a few failure kinds

dfc25f9

Merge branch 'main' into for_iter_stats

ed64be1

Merge branch 'main' into for_iter_stats

3b1f909

check for ascii string iterator

3083f3d

sweeneyde mentioned this pull request Jun 12, 2022

Optimizing FOR_ITER bytecode #91432

Closed

Merge branch 'main' into for_iter_stats

19fd145

sweeneyde merged commit c5d0517 into python:main Jun 13, 2022

bedevere-bot removed the awaiting core review label Jun 13, 2022

sweeneyde deleted the for_iter_stats branch June 13, 2022 05:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add more FOR_ITER specialization stats #32151

Add more FOR_ITER specialization stats #32151

sweeneyde commented Mar 28, 2022

markshannon commented Mar 28, 2022

sweeneyde commented Mar 29, 2022

markshannon commented Mar 30, 2022

markshannon Mar 30, 2022

sweeneyde commented Jun 13, 2022

Add more FOR_ITER specialization stats #32151

Add more FOR_ITER specialization stats #32151

Conversation

sweeneyde commented Mar 28, 2022

markshannon commented Mar 28, 2022

sweeneyde commented Mar 29, 2022

markshannon commented Mar 30, 2022

markshannon Mar 30, 2022

Choose a reason for hiding this comment

sweeneyde commented Jun 13, 2022