-
Notifications
You must be signed in to change notification settings - Fork 413
Description
In our project, we use the GenericFileSystem's _cp_file function to (asynchronously) copy files from a remote SSH filesystem to our local filesystem (as suggested in this comment).
Before the release of 2023.1.0, this function accepted the parameters fs and fs2, which could be used to pass in existing filesystem instances between which the copy operation should take place. Since 2023.1.0 however, these parameters have been removed (silently and without reference in the changelog, though perhaps the 'experimental' note in the docs means changes won't get a notice?).
With the removal of these parameters, this function now only uses filesystem instances as resolved by the _resolve_fs function based on the url parameter. But, since the resolve function returns the result of a call to registry.filesystem() with only the inferred protocol as parameter (and no **storage_options), this fundamentally breaks for filesystems that require some parameter for their __init__ function, like the SFTPFileSystem (the default implementation for the SSH protocol, and which requires a host parameter).
This issue can easily be reproduced by calling the _cp_file function on a GenericFileSystem where one of the url parameters starts with "ssh://" as protocol, or any other protocol whose FileSystem implementation requires additional parameters.
Of course, if there is another workflow to get the _cp_file function working for pre-defined filesystems that is the 'intended' way, I would very much like to hear it and this issue should be closed.