Can we simplify io::copy? #365

ghost · 2019-10-17T20:01:02Z

The current signature of async_std::io::copy is:

pub async fn copy<R, W>(reader: &mut R, writer: &mut W) -> io::Result<u64>
where
    R: Read + Unpin + ?Sized,
    W: Write + Unpin + ?Sized;

Both reader and writer need to be mutable references because we're just mimicking std::io::copy.

Unfortunately, this API is annoying to use when we want to share TcpStreams. Just look at this convoluted &mut (&stream, &stream) pattern:

async fn process(stream: TcpStream) -> io::Result<()> {
    let (reader, writer) = &mut (&stream, &stream);
    io::copy(reader, writer).await?;
    Ok(())
}

I think we can do better. What if we didn't follow the API from std and had the following instead?

pub async fn copy<R, W>(reader: R, writer: W) -> io::Result<u64>
where
    R: Read + Unpin,
    W: Write + Unpin;

This might look like a less powerful APIs than the previous one, but I believe it is functionally the same. Note that we have these blanket impls of Read and Write:

impl<T: Read + Unpin + ?Sized> Read for &mut T {}
impl<T: Write + Unpin + ?Sized> Write for &mut T {}

That means if we can pass &mut T into the previous API, then it should be totally fine to pass it into the new one too! In other words, the new API is fully compatible with the previous one (unless I'm missing something here).

So the cool thing is that while it might seem we're deviating from the std APIs, we kind of aren't. :)

Here's how we can write an echo TCP server using the new async_std::io::copy:

async fn process(stream: TcpStream) -> io::Result<()> {
    io::copy(&stream, &stream).await?;
    Ok(())
}

And here's how we do it on Arc<TcpStream>:

async fn process(stream: Arc<TcpStream>) -> io::Result<()> {
    io::copy(&*stream, &*stream).await?;
    Ok(())
}

This change could make sharing streams a lot easier, and we've seen plenty of people struggle with this problem. Wdyt?

The text was updated successfully, but these errors were encountered:

ghost · 2019-10-17T20:01:14Z

cc @skade @yoshuawuyts

ghost · 2019-10-17T20:16:06Z

I believe we could make the same change in the standard library. scratches head

yoshuawuyts · 2019-10-17T20:36:35Z

@stjepang I'd be in favor of trying this out — it seems you've thought this through and this seems good. It might be interesting to open an issue for this in std also!

skade · 2019-10-18T18:59:01Z

I think the main motivation behind forcibly taking a reference is to show intended use, I don't see much use for an API where both parameters are passed owned if they are streams.

I'm not sure if this is a good motivation for such an API, as accidentally passing the stream owned will be caught by the borrow checker in all cases it is a problem. I'm fine with trying out this change.

ghost · 2019-11-07T12:17:04Z

It's interesting how the futures crate only went halfway there:

pub fn copy<R, W>(reader: R, writer: &mut W) -> Copy<R, W> where
    R: AsyncRead,
    W: AsyncWrite + Unpin + ?Sized;

Source: https://docs.rs/futures/0.3.0/futures/io/fn.copy.html

So reader is owned, but writer still isn't. This still doesn't make the pattern we want possible but makes some others a bit easier.

ghost · 2019-11-07T12:20:24Z

What are your feelings on this? People are repeatedly complaining about &mut (&stream, &stream) and I must admit -- it is is an odd pattern.

Tokio has the .split() method that returns back a reader and writer pair, but that's a lot of API surface which doesn't exist in the standard library.

With the change proposed here, everything that works in std would also work here, except we'd also be a bit more general and accept things like io::read(&stream, &stream).

yoshuawuyts · 2019-11-07T12:24:34Z

Still happy to try this change out!

edit: we should probably mark it as "unstable" though, at least for one release cycle to allow us to change our minds.

gsquire · 2019-11-15T00:20:25Z

Tokio has the .split() method that returns back a reader and writer pair, but that's a lot of API surface which doesn't exist in the standard library.

I saw another issue mentioning a try_clone method for TcpStream but the author solved it in another way. I think it would be nice to have a function that can "split" a TcpStream into a read half and a write half. My use case would be something like this snippet:

let conn = TcpStream::connect(&self.addr).await?;
let (r, w) = conn.split(); // Or some variation of this function.
self.reader = BufReader::new(r);
self.writer = BufWriter::new(w);

This way I can have a bi-directional flow between my client and server that is buffered.

yoshuawuyts · 2019-11-15T17:39:28Z

@gsquire you can put TcpStream inside of an Arc and it should mostly work the way you want it to already.

gsquire · 2019-11-15T19:07:09Z

@yoshuawuyts I ran into borrowing issues since this was inside of a function that was making a new type. I'll re-evaluate and see where I end up. Maybe the bufstream crate will support async-std at some point.

sebastiencs · 2019-11-16T07:28:17Z

@gsquire you can put TcpStream inside of an Arc and it should mostly work the way you want it to already.

Could you elaborate @yoshuawuyts ?
I also want both a BufReader and Bufwriter from a TcpStream and I can't find a way to achieve this currently.
Having a try_clone or split method would be nice.

yoshuawuyts · 2019-11-16T09:51:18Z

I believe Arc::new(TcpStream::connect(addr)) should just about do what you want it to; can then freely use the stream and clone it around. Because most methods use inner mutability only the signature is &self, so it's very flexible.

On phone now, so haven't had a chance to test this out. But afaik people have done this successfully before.

sebastiencs · 2019-11-16T10:53:05Z

@yoshuawuyts The issue is that BufReader::new and BufWriter::new take their argument by value, not by ref.
So we can make either a BufReader or a BufWriter, not both

ghost · 2019-11-16T15:08:58Z

This problem has a straightforward solution. :) We just need to implement Read and Write for Arc<TcpStream> and then you'll be able to do:

let conn = Arc::new(TcpStream::connect(&self.addr).await?);
self.reader = BufReader::new(conn.clone());
self.writer = BufWriter::new(conn);

sebastiencs · 2019-11-17T11:31:55Z

@stjepang Is that possible with the orphan rules ? Read, Write and Arc don't belong to async-std

yoshuawuyts · 2019-11-18T11:24:49Z

edit: this is wrong, figured it out in #365 haha

@sebastiencs just tried this out, and it seems this is indeed not possible.

Target program

use async_std::io::{self, BufReader, BufWriter};
use async_std::net::TcpStream;
use async_std::sync::Arc;
use async_std::task;

fn main() -> io::Result<()> {
    task::block_on(async {
        let addr = "localhost:8080";
        let conn = Arc::new(TcpStream::connect(&addr).await?);
        let reader = BufReader::new(conn.clone());
        let writer = BufWriter::new(conn);
        Ok(())
    })
}

async-std impl

I added this to src/net/tcp/stream.rs

impl Read for std::sync::Arc<TcpStream> {
    fn poll_read(
        self: Pin<&mut Self>,
        cx: &mut Context<'_>,
        buf: &mut [u8],
    ) -> Poll<io::Result<usize>> {
        self.watcher.poll_read_with(cx, |mut inner| inner.read(buf))
    }
}

Error

error[E0117]: only traits defined in the current crate can be implemented for arbitrary types
   --> src/net/tcp/stream.rs:367:1
    |
367 | impl Read for std::sync::Arc<TcpStream> {
    | ^^^^^^^^^^^^^^-------------------------
    | |             |
    | |             `std::sync::Arc` is not defined in the current crate
    | impl doesn't use only types from inside the current crate
    |
    = note: define and implement a trait or new type instead
error: aborting due to previous error
For more information about this error, try `rustc --explain E0117`.
error: could not compile `async-std`.
To learn more, run the command again with --verbose.

yoshuawuyts · 2019-11-18T11:27:31Z

Oh, ugh nevermind. Figured it out; needed to do &* haha. Needed to trigger the deref, and then take ownership again. This works, no extra impls needed:

use async_std::io::{self, BufReader, BufWriter};
use async_std::net::TcpStream;
use async_std::sync::Arc;
use async_std::task;

fn main() -> io::Result<()> {
    task::block_on(async {
        let addr = "localhost:8080";
        let conn = Arc::new(TcpStream::connect(&addr).await?);
        let reader = BufReader::new(&*conn.clone());
        let writer = BufWriter::new(&*conn);
        Ok(())
    })
}

ghost · 2019-11-18T12:00:06Z

Indeed, it seems coherence rules don't allow the impl for Arc<TcpStream> :(

@yoshuawuyts BufWriter::new(&*conn) works, but now BufWriter is constrained by the lifetime of this temporary reference, whereas we'd prefer BufWriter to take ownership of the Arc.

ghost · 2019-11-18T12:07:21Z

Perhaps we should implement Clone for TcpStream?

let conn = TcpStream::connect(&addr).await?;
let reader = BufReader::new(conn.clone());
let writer = BufWriter::new(conn);

In the standard library we have TcpStream::try_clone(), which calls dup() on its file descriptor. The reason why this method is try_clone() is because dup() might fail.

I believe we could've also had a regular clone() method on std::net::TcpStream, but the standard library chose not to do this because the inner stream would then have to be stored inside an Arc.

However, in async-std, our TcpStream already contains an Arc<Entry> so we probably shouldn't worry about the performance impact of having another Arc. I believe we could even introduce clone() without any adding extra performance penalties.

yoshuawuyts · 2019-11-18T12:14:50Z

@stjepang ah yeah that'd be great. Filed an issue for it here: #553

yoshuawuyts · 2019-12-12T10:27:33Z

This has been implemented; only thing left is stabilization now but we should track that separately.

gsquire · 2019-12-12T18:53:35Z

Does this also close #553?

Edit: Oops, I forgot that was split off from the original ask. Disregard.

ghost added question/feedback A question or user feedback api design Open design questions labels Oct 17, 2019

This was referenced Nov 5, 2019

Change copy_into/copy_buf_into to free functions for consistency with the standard library rust-lang/futures-rs#1948

Merged

TcpListener echo example is confusing #468

Open

yoshuawuyts mentioned this issue Nov 7, 2019

0.99.12 #469

Merged

ghost mentioned this issue Nov 7, 2019

Unstable feature: copy takes arguments by value #471

Merged

yoshuawuyts mentioned this issue Nov 18, 2019

Implement Clone for TcpStream #553

Closed

yoshuawuyts mentioned this issue Nov 20, 2019

what should we do in async with ownership with tcpstream? #563

Closed

yoshuawuyts closed this as completed Dec 12, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can we simplify io::copy? #365

Can we simplify io::copy? #365

ghost commented Oct 17, 2019

ghost commented Oct 17, 2019

ghost commented Oct 17, 2019

yoshuawuyts commented Oct 17, 2019

skade commented Oct 18, 2019

ghost commented Nov 7, 2019 •

edited by ghost

Loading

ghost commented Nov 7, 2019

yoshuawuyts commented Nov 7, 2019 •

edited

Loading

gsquire commented Nov 15, 2019

yoshuawuyts commented Nov 15, 2019

gsquire commented Nov 15, 2019

sebastiencs commented Nov 16, 2019

yoshuawuyts commented Nov 16, 2019

sebastiencs commented Nov 16, 2019

ghost commented Nov 16, 2019

sebastiencs commented Nov 17, 2019

yoshuawuyts commented Nov 18, 2019 •

edited

Loading

yoshuawuyts commented Nov 18, 2019

ghost commented Nov 18, 2019

ghost commented Nov 18, 2019

yoshuawuyts commented Nov 18, 2019

yoshuawuyts commented Dec 12, 2019

gsquire commented Dec 12, 2019 •

edited

Loading

Can we simplify io::copy? #365

Can we simplify io::copy? #365

Comments

ghost commented Oct 17, 2019

ghost commented Oct 17, 2019

ghost commented Oct 17, 2019

yoshuawuyts commented Oct 17, 2019

skade commented Oct 18, 2019

ghost commented Nov 7, 2019 • edited by ghost Loading

ghost commented Nov 7, 2019

yoshuawuyts commented Nov 7, 2019 • edited Loading

gsquire commented Nov 15, 2019

yoshuawuyts commented Nov 15, 2019

gsquire commented Nov 15, 2019

sebastiencs commented Nov 16, 2019

yoshuawuyts commented Nov 16, 2019

sebastiencs commented Nov 16, 2019

ghost commented Nov 16, 2019

sebastiencs commented Nov 17, 2019

yoshuawuyts commented Nov 18, 2019 • edited Loading

Target program

async-std impl

Error

yoshuawuyts commented Nov 18, 2019

ghost commented Nov 18, 2019

ghost commented Nov 18, 2019

yoshuawuyts commented Nov 18, 2019

yoshuawuyts commented Dec 12, 2019

gsquire commented Dec 12, 2019 • edited Loading

ghost commented Nov 7, 2019 •

edited by ghost

Loading

yoshuawuyts commented Nov 7, 2019 •

edited

Loading

yoshuawuyts commented Nov 18, 2019 •

edited

Loading

gsquire commented Dec 12, 2019 •

edited

Loading