String and byte[] converters using partial reads/writes #112129

PranavSenthilnathan · 2025-02-04T15:39:57Z

No description provided.

dotnet-policy-service · 2025-02-04T15:40:35Z

Tagging subscribers to this area: @dotnet/area-system-text-json, @gregsdennis
See info in area-owners.md if you want to be subscribed.

eiriktsarpalis · 2025-02-04T15:48:10Z

...ibraries/System.Text.Json/src/System/Text/Json/Serialization/JsonHybridResumableConverter.cs

+    /// This is used when the Stream-based serialization APIs are used.
+    /// </summary>
+    /// <typeparam name="T"></typeparam>
+    internal abstract class JsonHybridResumableConverter<T> : JsonConverter<T>


How is this different compared to the existing resumable converter base?

It exposes WriteWithoutStackFrame, which by default does not push or pop a stack frame. Depending on the value (if it is long, or if serialization is done via streaming), the converter can call a method that pushes and pops the frame. This keeps the common case of short strings and byte[] values fast, since no stack frame is needed, while still allowing large values to opt into stack management.
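As a rough illustration of the trade-off described above, here is a minimal sketch. Everything in it is hypothetical: ToyWriteStack, ToyStringConverter and the SegmentThreshold value are stand-ins invented for this example and are not the types or the cutoff used in the PR.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical stand-in for the serializer's write stack (not the real WriteStack).
public sealed class ToyWriteStack
{
    public bool IsStreaming { get; set; }
    public int Depth { get; private set; }
    public int PushCount { get; private set; }

    public void Push() { Depth++; PushCount++; }
    public void Pop() { Depth--; }
}

public static class ToyStringConverter
{
    // Assumed cutoff for illustration only; the PR does not state a value here.
    public const int SegmentThreshold = 16;

    public static void Write(List<string> output, string value, ToyWriteStack state)
    {
        if (!state.IsStreaming && value.Length <= SegmentThreshold)
        {
            // Fast path: a short value is written in one piece with no stack frame.
            output.Add(value);
            return;
        }

        // Slow path: opt into stack management and emit the value in segments.
        state.Push();
        try
        {
            for (int offset = 0; offset < value.Length; offset += SegmentThreshold)
            {
                int length = Math.Min(SegmentThreshold, value.Length - offset);
                output.Add(value.Substring(offset, length));
            }
        }
        finally
        {
            state.Pop();
        }
    }
}
```

With this shape, only long or streamed values pay for the Push/Pop pair; short values never touch the stack.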

eiriktsarpalis · 2025-02-04T15:51:44Z

src/libraries/System.Text.Json/src/System/Text/Json/Reader/Utf8JsonReader.cs

@@ -46,6 +46,8 @@ public ref partial struct Utf8JsonReader
        private SequencePosition _currentPosition;
        private readonly ReadOnlySequence<byte> _sequence;

+        internal bool _hasPartialStringValue;


Like with Utf8JsonWriter, couldn't this just use the same special JsonTokenType value to reflect segmented reads?

eiriktsarpalis · 2025-02-04T15:54:25Z

src/libraries/System.Text.Json/src/System/Text/Json/Reader/Utf8JsonReader.cs

@@ -1300,6 +1302,7 @@ private bool ConsumeString()
                    ValueIsEscaped = false;
                    _tokenType = JsonTokenType.String;
                    _consumed += idx + 2;
+                    _hasPartialStringValue = false;


This flag introduces a new state for Utf8JsonReader that could break user-defined converters that do not expect it. We should make absolutely sure that instances with this flag set never leak via public APIs. I would recommend adding a few debug asserts, e.g. in the Read() method, checking that the flag is always unset.
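The kind of debug assert suggested here could look roughly like the sketch below. ToyReader and its members are hypothetical and only mimic the shape of the problem; the real Utf8JsonReader is a ref struct with very different internals.

```csharp
using System.Diagnostics;

// Hypothetical reader used only to show where the assert would sit.
public sealed class ToyReader
{
    private readonly string[] _tokens;
    private int _index = -1;

    // Hypothetical mirror of the internal _hasPartialStringValue flag.
    internal bool HasPartialStringValue { get; private set; }

    public ToyReader(params string[] tokens) => _tokens = tokens;

    // Only valid after Read() has returned true.
    public string Current => _tokens[_index];

    public bool Read()
    {
        // Public entry point: the partial-string state must never be visible here.
        Debug.Assert(!HasPartialStringValue,
            "Partial-string state leaked into the public Read() path.");

        if (_index + 1 >= _tokens.Length)
        {
            return false;
        }

        _index++;
        return true;
    }

    // Internal-only transitions, as used by a segmenting converter.
    internal void BeginPartialString() => HasPartialStringValue = true;
    internal void CompleteString() => HasPartialStringValue = false;
}
```

Because Debug.Assert is compiled out of release builds, a check like this adds no cost to the shipped hot path.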

eiriktsarpalis · 2025-02-04T15:57:02Z

...s/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/ByteArrayConverter.cs

@@ -29,6 +33,176 @@ public override void Write(Utf8JsonWriter writer, byte[]? value, JsonSerializerO
            }
        }

+        internal override bool OnTryRead(ref Utf8JsonReader reader, Type typeToConvert, JsonSerializerOptions options, scoped ref ReadStack state, out byte[]? value)
+        {
+            if (state.Current.ObjectState < StackFrameObjectState.CreatedObject)


Like with existing resumable converters, there should be a fast path that runs when SupportContinuation is set to false. In this case we can assume that the string has been fully populated and therefore we can read it using the non-segmenting APIs.

This is handled in JsonConverter.TryRead, where we check ConverterStrategy == ConverterStrategy.SegmentableValue && !state.IsContinuation && !reader._hasPartialStringValue. That check guarantees that only partially populated buffers go through this path, since a fully populated buffer has reader._hasPartialStringValue == false. It is too late to do the check here because the stack frame has already been pushed at this point, so the check needs to be in JsonConverter.TryRead or earlier.

JsonConverter.TryRead is a hot-path method, and historically, changes to it have caused a great many performance regressions. I would recommend moving as much of the logic as possible inside the resumable converters themselves. I would also recommend keeping as much consistency as possible with the existing resumable converters, since their code is extremely nontrivial and difficult to follow.
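For readers following the thread, the fast-path idea being discussed (skip all continuation handling when the input is known to be fully buffered) can be sketched as follows. ToyReadState and ToySegmentedStringReader are hypothetical names made up for this example; this is not the code in the PR and it does not reflect where the check ends up living.

```csharp
using System.Text;

// Hypothetical per-operation state; stands in for the serializer's ReadStack.
public sealed class ToyReadState
{
    public bool SupportContinuation { get; set; }
    public StringBuilder Pending { get; } = new StringBuilder();
}

public static class ToySegmentedStringReader
{
    // Returns true once the complete value is available in 'value'.
    public static bool TryRead(string segment, bool isFinalSegment, ToyReadState state, out string value)
    {
        if (!state.SupportContinuation)
        {
            // Fast path: the whole payload is buffered, so the segment is the
            // full value and no continuation state is touched.
            value = segment;
            return true;
        }

        // Resumable path: accumulate segments across calls.
        state.Pending.Append(segment);
        if (!isFinalSegment)
        {
            value = string.Empty;
            return false;
        }

        value = state.Pending.ToString();
        state.Pending.Clear();
        return true;
    }
}
```

The sketch keeps the branch inside the converter-like helper, in the spirit of the recommendation above to avoid adding logic to the shared hot path.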

PranavSenthilnathan added 2 commits February 4, 2025 07:30

String and byte[] converters using segmented reads/writes

c732bd0

cleanup

672672f

dotnet-issue-labeler bot added the area-System.Text.Json label Feb 4, 2025

dotnet-policy-service bot assigned PranavSenthilnathan Feb 4, 2025

eiriktsarpalis reviewed Feb 4, 2025

build-analysis bot mentioned this pull request Feb 4, 2025

Intermittent build failure in AfterSourceBuild: "Could not write state file" #76488

Open
