Parse incoming JSON in POST body #377

Changaco · 2014-08-07T16:42:51Z

I'm implementing a simplate to receive webhook callbacks, which are POST requests with JSON data in the body (and Content-Type: application/json in the headers). Adding support for JSON in Aspen's Body._parse() function could make it slightly easier to write such simplates. What do you think?

chadwhitacre · 2014-08-07T16:52:35Z

Sounds good to me.

pjz · 2014-08-07T18:52:51Z

I... don't understand your requirements. Why is this not just:

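# assumes `import json`; the headers live on the request object
if request.line.method == 'POST' and request.headers.get('Content-Type', None):
    data = json.loads(request.body.raw)  # `data`, so the json module isn't shadowed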

Changaco · 2014-08-07T19:07:51Z

@pjz json.loads(request.body.raw) is what I'm doing, but Aspen could do it for me, like it does for application/x-www-form-urlencoded and multipart/form-data.

pjz · 2014-08-07T21:11:05Z

JSON isn't really web-spec. It's common, but not specified in any part of HTTP anywhere. I'd be against special-casing it.

OTOH, I'd be all about figuring out some way to set up a general-case handler for arbitrary content-types. A way to specify "if the content-type of the POST body is application/json, run it through this handler and store the result in this variable." Hm, probably want that to be site-wide instead of per-endpoint, eh? So it could be an algorithm hook:

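import json

def deserialize_POSTed_json(request):
    # Exact match only: a header such as 'application/json; charset=utf-8' is not handled here.
    if request.line.method == 'POST' and request.headers.get('Content-Type', None) == 'application/json':
        request.json = json.loads(request.body.raw)
    else:
        request.json = None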

Put the above into your algorithm stack somewhere in the request-handling sequence and you should be GTG.

chadwhitacre · 2014-08-07T21:38:13Z

@pjz I believe that application/x-www-form-urlencoded and multipart/form-data are also not specified in HTTP, but rather in HTML, so per your rationale those should also not be special-cased in our request object, eh?

chadwhitacre · 2014-08-07T21:42:25Z

P.S. I'd be all for implementing body parsing in an algorithm function (that'd be part of "flatten algorithm further" in our roadmap; #357). In that case it seems that we should do the same with the existing special cases. Then the question is what body parsers we ship in our stock algorithm, and I think we should include an application/json body parser along with application/x-www-form-urlencoded and multipart/form-data.

pjz · 2014-08-07T21:42:39Z

Agreed! I think the request object should probably have a little registry where you can register content-type handlers or something.

pjz · 2014-08-07T21:43:55Z

Well, there should be a registry somewhere, at any rate. Maybe the request object? Oh, no, it's site wide. Probably the website object.

Changaco · 2014-08-08T09:18:24Z

A registry seems like a simple and performant method to me. Something like:

website.body_parsers['application/json'] = json.loads
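To make the registry idea concrete, here is a minimal sketch. It is not Aspen's actual API: `body_parsers` as a module-level dict and the `parse_body` helper are hypothetical names, and stripping parameters such as `charset` from the header is an added assumption.

```python
import json
from urllib.parse import parse_qs

# Hypothetical site-wide registry: media type -> callable taking the raw body (bytes).
body_parsers = {
    "application/json": lambda raw: json.loads(raw.decode("utf-8")),
    "application/x-www-form-urlencoded": lambda raw: parse_qs(raw.decode("utf-8")),
}


def parse_body(content_type, raw):
    """Look up a parser by media type, ignoring parameters such as charset."""
    media_type = content_type.split(";", 1)[0].strip().lower()
    parser = body_parsers.get(media_type)
    if parser is None:
        # No parser registered for this media type.
        raise KeyError(media_type)
    return parser(raw)
```

Registering another format would then be a one-line dict assignment, as in the snippet above.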

pjz · 2014-08-08T14:09:23Z

That sounds okay... but where should the result be put? That's going to require some management to avoid collisions in the website object's main namespace.

chadwhitacre · 2014-08-08T16:27:06Z

> [W]here should the result be put?

What do we do now? Don't we use request.body for both of the cases we currently support?

pjz · 2014-08-08T19:30:26Z

Ah, hmm. Yeah, that could work, though it looks like it may require some metaclass hacking or something to replicate the current functionality.

pjz · 2014-08-15T18:08:21Z

Actually we'll have to change the current API a little; instead of request.body.raw we'll have request.body and request.body_raw (though I'm open to other names for the second one: request.raw_body? request.rawbody?).

Changaco · 2014-08-16T11:10:33Z

I think I prefer raw_body. However, since you're changing the API, now might be a good time to expose the fact that the body is a stream, by giving access to the socket instead of copying it all into a buffer.

pjz · 2014-08-18T02:01:23Z

Hmm. I like giving access, but what happens if a body_parser doesn't set .raw_body? Or is that a requirement? Though admittedly more memory hungry, it's certainly easier on any parser to work off of a string instead of a socket (worst case they turn it back into a socket with StringIO). I was thinking that it would basically:

set raw_body
call appropriate body_parser if any
set body to result of parser, or to raw_body if none
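The three steps above could look roughly like the sketch below. The `Request` class, its constructor arguments, and the `parsers` dict are hypothetical stand-ins for illustration, not Aspen's real objects; an in-memory stream plays the role of the socket.

```python
import io
import json


class Request:
    """Hypothetical stand-in for a request object; not Aspen's real class."""

    def __init__(self, content_type, stream, body_parsers):
        # Step 1: read the whole body once so it stays available afterwards.
        self.raw_body = stream.read()
        # Step 2: pick the parser registered for this media type, if any.
        media_type = content_type.split(";", 1)[0].strip().lower()
        parser = body_parsers.get(media_type)
        # Step 3: body is the parsed result, or the raw bytes when no parser matches.
        self.body = parser(self.raw_body) if parser else self.raw_body


# Usage with in-memory streams standing in for the socket.
parsers = {"application/json": json.loads}
req = Request("application/json; charset=utf-8", io.BytesIO(b'{"ok": true}'), parsers)
other = Request("text/plain", io.BytesIO(b"hello"), parsers)
```

The trade-off is the one mentioned above: the whole body is buffered in memory up front, whether or not anything ever looks at it.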

Changaco · 2014-08-18T07:59:53Z

> I like giving access, but what happens if a body_parser doesn't set .raw_body?

Why would a body parser set .raw_body ? It seems to me that by definition .raw_body is what you have before going through any parser.

> It's certainly easier on any parser to work off of a string instead of a socket

Not really, the parser just has to call .read() if it wants a bytestring.

Here's what I'm thinking:

have the socket accessible as .body_sock or .raw_body or whatever
turn .body into a lazy property
when .body is accessed look up the appropriate body parser in website.body_parsers and run it, or raise Response(415) if there is no parser for that Content-Type
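A minimal sketch of that lazy variant follows. Both classes are hypothetical stand-ins rather than Aspen's real `Request` and `Response`, and here the parsers are assumed to take a file-like object.

```python
import io
import json


class Response(Exception):
    """Hypothetical minimal HTTP error carrying a status code."""

    def __init__(self, code):
        super().__init__(code)
        self.code = code


class Request:
    """Hypothetical request object; not Aspen's real class."""

    _UNPARSED = object()  # sentinel, because a parsed body may legitimately be None

    def __init__(self, content_type, raw_body, body_parsers):
        self.content_type = content_type
        self.raw_body = raw_body  # file-like object, e.g. the socket's read side
        self.body_parsers = body_parsers
        self._body = self._UNPARSED

    @property
    def body(self):
        # Parse on first access only; later accesses reuse the cached result.
        if self._body is self._UNPARSED:
            media_type = self.content_type.split(";", 1)[0].strip().lower()
            parser = self.body_parsers.get(media_type)
            if parser is None:
                raise Response(415)  # Unsupported Media Type
            self._body = parser(self.raw_body)
        return self._body
```

With this shape, a simplate that never touches `.body` never pays for parsing, and a request with an unknown Content-Type only fails if the body is actually used.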

pjz · 2014-08-19T13:13:58Z

If the parser reads the socket then the user can't, and vice versa. By pre-emptively reading the socket and storing the result in .raw_body, both can access the raw data, so .raw_body and .body can both be accessible to the programmer. I'm not sure why that's useful, but I hate taking away options.

I do like the Response(415) idea, though.

Changaco · 2014-08-19T13:42:59Z

> I'm not sure why that's useful

Me neither, but it is possible to have both, by first reading .raw_body and then using a BytesIO hack to get .body.
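For illustration, the BytesIO approach might be sketched like this. `read_then_parse` is a hypothetical helper name, and the parser is assumed to accept a file-like object.

```python
import io
import json


def read_then_parse(stream, parser):
    """Return (raw_body, body): read the stream once, then re-wrap the bytes for the parser."""
    raw_body = stream.read()  # consumes the original stream
    body = parser(io.BytesIO(raw_body))  # parser sees a fresh file-like object
    return raw_body, body


# Usage: an in-memory stream stands in for the socket.
raw_body, body = read_then_parse(io.BytesIO(b'{"n": 3}'), json.load)
```

The raw bytes stay available after parsing, at the cost of holding the whole body in memory.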

request.body is now the result of running the appropriate body_parser (based on content-type header) over request.raw_body. Inlined the functionality of context.py into the dynamic_resource object.

pjz · 2014-08-24T00:44:24Z

Like that, @Changaco ?

Changaco · 2014-08-24T08:37:16Z

#379 isn't exactly what I proposed:

.body isn't a lazy property, it's parsed even if the simplate doesn't use it
Response(415) isn't raised when the Content-Type is unknown

but aside from that it looks okay.

pjz · 2014-08-25T17:43:27Z

Should the Response(415) be raised lazily or immediately?

Changaco · 2014-08-25T17:47:38Z

If .body becomes lazy then I think it makes sense to raise the exception lazily.

pjz · 2014-08-25T17:58:08Z

okay, fixed in the latest version of #379

Fix #377: Reify body_parsers and make request.body dynamic

ravitejabadisa · 2015-11-30T15:02:17Z

Hi, I am facing an issue after migrating my code from my local system to AWS:

RawPostDataException at /users/
You cannot access body after reading from request's data stream

It seems I can't access request.method and request.data in views.py simultaneously.
Is this specific to AWS? Do I need to make some specific config changes to resolve this issue?

It would be great if you could suggest a workaround for this issue.

pjz · 2015-11-30T16:23:54Z

If you meant to post this as an issue, you should open a new one, not revive one that's over a year old. Also, you should include more detail, up to and including simplified code exemplifying your problem.

Changaco · 2015-11-30T16:29:53Z

A quick web search shows that RawPostDataException is a Django exception.

pjz closed this as completed in ab377fc Sep 2, 2014

chadwhitacre added a commit that referenced this issue Sep 2, 2014

Merge pull request #379 from gratipay/issue377

3a2a03d

Fix #377: Reify body_parsers and make request.body dynamic

Changaco pushed a commit that referenced this issue Mar 11, 2016

Merge pull request #379 from gratipay/issue377

df08f7f

Fix #377: Reify body_parsers and make request.body dynamic
