
This issue was moved to a discussion.



Merge co_names and co_consts into a single array. #92

Closed
markshannon opened this issue Oct 7, 2021 · 13 comments


@markshannon
Member

Merging co_names and co_consts would have some significant benefits:

  1. It would make code objects smaller. There is unlikely to be much sharing between names and constants, but saving two tuples per code object is significant.
  2. It would free up a register in the interpreter which should be worth a few percent speed up (in the interpreter, less overall).
  3. By using an array, not a tuple, some lazy initialization of constants becomes possible. For example, a deep-frozen module might not be able to include frozensets due to hashing; tuples could be stored instead and converted to frozensets when the code object is initialized.

The obvious downside is for functions where the total number of names and constants exceeds 256. I hypothesize that those functions are rare, but don't have data to confirm that.
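The combined count is easy to check from Python: `co_names` and `co_consts` are attributes of any code object, and their total length is what a merged table would have to index with a one-byte oparg. A minimal sketch (the helper name is mine, not part of the proposal):

```python
def total_name_const_slots(func):
    """Return len(co_names) + len(co_consts) for a function's code
    object -- the number of entries a merged names/consts table
    would hold. With a one-byte oparg, totals above 256 would
    require EXTENDED_ARG prefixes."""
    code = func.__code__
    return len(code.co_names) + len(code.co_consts)

def example(x):
    return max(x, 0) + 1

total = total_name_const_slots(example)
print(total, total <= 256)
```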

This is probably best left until https://bugs.python.org/issue36521 has been resolved.

@iritkatriel
Collaborator

Are there no backwards compatibility issues? They are both part of the API.

Also, would this make the consts not immutable from user code? That would probably be an API change.

@gvanrossum
Collaborator

> The obvious downside is for functions where the total number of names and constants exceeds 256. I hypothesize that those functions are rare, but don't have data to confirm that.

That would be easy to validate by counting these numbers in the top 100 or 5000 packages on GitHub.

@pxeger

pxeger commented Oct 7, 2021

Using the top 1000 PyPI packages, I found that 0.14% of code objects have 256 or more total names and constants combined.

```
names    proportion >=256:   0.000237037
consts   proportion >=256:   0.000597600
total    proportion >=256:   0.001360792

names    proportion >=65536: 0.000000000
consts   proportion >=65536: 0.000000000
total    proportion >=65536: 0.000000000
```

https://github.com/pxeger/pycstats


Edit: updated data with top 1000 packages

@gvanrossum
Collaborator

But how often are they >= 255 together?

Also, it looks like, to minimize EXTENDED_ARG opcodes, we need to place the names first. (The dynamic profile could theoretically be different, but that feels unlikely.)

@pxeger

pxeger commented Oct 8, 2021

I'm not sure what you mean by "But how often are they >= 255 together?".

The 0.12% number is already the proportion for which `len(co_consts) + len(co_names) >= 256`, not just the sum of the separate proportions for names and consts, if that's what you meant.

@gvanrossum
Collaborator

Oh, d’oh. Of course. Sorry!

@markshannon
Member Author

> Are there no backwards compatibility issues? They are both part of the API.
>
> Also, would this make the consts not immutable from user code? That would probably be an API change.

The array would be internal, so could not be mutated. co_consts and co_names would be lazily created if needed.

@gvanrossum
Collaborator

> The array would be internal, so could not be mutated. co_consts and co_names would be lazily created if needed.

Could you update the initial comment to clarify that (a lot)? This wasn't obvious to me at all (though I'm sure it was in your mind when you wrote it down :-).

@markshannon
Member Author

I didn't want to shackle whoever was going to implement it :)

Now that I've given it a bit of thought, here's another way to implement it:

The consts and names go in the same array (names first, but it doesn't really matter), and we store a pointer to the first const.
That way consts can be indexed forwards and names backwards.
We can still access 256 consts and 255 names before needing extended instructions. We already do this for the quickened code (instructions forwards, caches backwards).

E.g. names: `"spam"`, `"eggs"`; consts: `None`, `7`.

```
array = { "spam", "eggs", None, 7 };
pointer = &array[2];
```
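A toy Python model of that two-direction layout (illustrative only — the real structure would be a C array with pointer arithmetic, and the exact backwards-indexing convention below is an assumption):

```python
class MergedTable:
    """Toy model of the proposed single array: names stored first,
    consts after, with a base index pointing at the first const.
    Consts are indexed forwards from the base and names backwards,
    mirroring the pointer = &array[2] example above."""

    def __init__(self, names, consts):
        self.array = list(names) + list(consts)
        self.base = len(names)  # index of the first const

    def const(self, oparg):
        # forwards from the base
        return self.array[self.base + oparg]

    def name(self, oparg):
        # backwards from just below the base
        return self.array[self.base - 1 - oparg]

table = MergedTable(["spam", "eggs"], [None, 7])
print(table.const(0))  # None
print(table.const(1))  # 7
print(table.name(0))   # eggs
print(table.name(1))   # spam
```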

@iritkatriel
Collaborator

When I experimented with MAKE_INT there was a noticeable difference in micro benchmarks between the opcode doing `small_ints[oparg]` and `small_ints[oparg - 5]`.

@markshannon
Member Author

@iritkatriel OOI what was the difference between the two?

@iritkatriel
Collaborator

> @iritkatriel OOI what was the difference between the two?

Start here: #65 (comment)

@gvanrossum
Collaborator

Presumably `small_ints[oparg - 5]` was slower because of the extra subtract?

