Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Share support #585

Open
jscheid opened this issue Nov 11, 2024 · 4 comments
Open

Share support #585

jscheid opened this issue Nov 11, 2024 · 4 comments

Comments

@jscheid
Copy link

jscheid commented Nov 11, 2024

Hi, thanks for maintaining this package. I'd like to add support for the share interface - that is,

  • a Share struct representing a share handle, with functions to configure it,
  • the ability to add a Share instance to an Easy instance, or remove it,
  • a mechanism to ensure share handles are kept around as long as they're referenced by any Easy handles.

Would you accept a PR for this, and are you interested in discussing API design and implementation details?

@sagebind
Copy link
Collaborator

The Share API is pretty unsafe, as you have to bring your own mutexes and manually keep track of references to avoid use-after-free. It might be doable, but would be tricky I think. It would have to be implemented in a way that:

  • Does not require the user to know about or use the Share API
  • Doesn't add unnecessary mutexes if the user isn't using the Share API
  • Doesn't require any backwards-incompatible changes
  • Is totally safe, of course

Given those constraints I don't see any reason why we wouldn't accept this to be added to the API, as it could be useful in some scenarios.


For design, I messed with this API several years ago, though I never finished it. Probably the most realistic approach would be to use an Arc internally in our Share wrapper. The API we present looks like you add Easy handles to it, but under the hood you're really adding a clone of the Share reference to the Easy handle. So each Easy2 struct would have an Option<Share> of some kind, to prevent the Share from being dropped until all Easy handles are.

Both CURLSHOPT_LOCKFUNC and CURLSHOPT_UNLOCKFUNC must be set by our Share wrapper unconditionally, since presumably users will want to continue to use their Easy handles as Send.

If users want a lock-free Share for single-threaded use, it may be better to create a totally separate LocalShare type that has different constraints. It would have to act more like Multi does today, taking ownership of the Easy handles and returning a reduced version that is !Send. I don't know if there's much of a use-case for this though.

The docs are not clear whether it is safe to set CURLSHOPT_SHARE after some Easy handles have already started using it. It may be better to require the aspects to share be set once in the constructor and not allow it to be changed again.

As a user, I might expect using the API to perhaps look something like this:

let share = Share::builder()
    .dns()
    .psl()
    .build();

let easy1 = Easy::new();
let easy2 = Easy::new();

easy1.set_share(&share);
easy2.set_share(&share);

Note that the docs do indicate that the following can not be used in a multi-threaded context, even if locking is configured:

  • CURL_LOCK_DATA_COOKIE
  • CURL_LOCK_DATA_SSL_SESSION
  • CURL_LOCK_DATA_CONNECT
  • CURL_LOCK_DATA_HSTS

I guess that does make me wonder if actually a LocalShare would be more interesting...

@jscheid
Copy link
Author

jscheid commented Nov 12, 2024

Thanks for getting back to me so quickly, and for sharing your thoughts. The API design looks very similar to what I had in mind.

I guess that does make me wonder if actually a LocalShare would be more interesting...

Yes - after opening this issue I came across this message which actually makes it sounds like offering LocalShare is more useful (or rather, less misleading) than a thread-safe share.

Given this, let me first run some tests to see how much value this adds over the caching built into multi. Specifically I'm wondering if DNS and SSL contexts are cached by multi - the documentation isn't entirely clear on that if I remember correctly.

@sagebind
Copy link
Collaborator

sagebind commented Nov 13, 2024

Yes, certain elements are already shared between handles that are added to the same multi, which IMO also limits the usefulness of the Share API if Multi does some of the same things. The Everything Curl book says:

All easy handles added to the same multi handle automatically share connection cache and dns cache.

This does start to refresh my memory of the last time I interacted with this API, and decided it actually wasn't all that useful.

@ppearson
Copy link

ppearson commented Jan 1, 2025

As someone who is also interested in getting access to the Share interface, given I'm in the process of re-writing a C++ app which uses curl into Rust, the share interface also allows for curl to automatically share and handle cookies between multiple handles, which isn't possible otherwise...

My particular use-case is having background worker threads download CSS/JS/Image files after the first initial perform() call on the first handle and HTML parsing from that, along with the initial main thread then assisting. That's what the C++ code was doing, as some of the subsequent HTTP requests need things like session cookies that were only set by the first main request response from the server to work, and the curl C share interface for Cookies worked perfectly for allowing that to happen, and curl would manage all the cookie processing and sharing itself.

Currently, the only way I can see of doing it with curl-rust (at least using the Easy interface, I guess using the Multi interface might be another way) is saving out a cookie jar file in the first request, and then have subsequent requests try and use that output file as the cookie file, which at some point in the next month I'll prototype up to see how well it works, but I suspect it'll be a bit messy and not as nice as native Share support.

I'd be happy to try and help out in this area if possible (I'm relatively new to Rust however).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants