-
-
Notifications
You must be signed in to change notification settings - Fork 1.2k
[pydap backend] enables downloading/processing multiple arrays within single http request #10629
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
Mikejmnez
wants to merge
25
commits into
pydata:main
Choose a base branch
from
Mikejmnez:pydap4_scale
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+172
−26
Open
Changes from all commits
Commits
Show all changes
25 commits
Select commit
Hold shift + click to select a range
2e6c9d6
update PydapArrayWrapper to support backend batching
Mikejmnez dbaa889
rebase
Mikejmnez 6b9c604
pydap-server it not necessary
Mikejmnez f07e6e2
set `batch=False` as default
Mikejmnez 38d2bee
set `batch=False` as default in datatree
Mikejmnez 0a2ac02
set `batch=False` as default in open groups as dict
Mikejmnez 0a4f0e7
for flaky, install pydap from repo for now
Mikejmnez eddd6bb
initial tests - quantify cached url
Mikejmnez cb0261f
adds tests to datatree backend to assert multiple dimensions download…
Mikejmnez c69ab80
update testing to show number of download urls
Mikejmnez 6668948
simplified logic
Mikejmnez 4300483
specify cached session debug name to actually cache urls
Mikejmnez e5f80ff
fix for mypy
Mikejmnez b95aaf6
user visible changes on `whats-new.rst`
Mikejmnez 80a74ac
impose sorted to `get_dimensions` method
Mikejmnez 5c37145
reformat `whats-new.rst`
Mikejmnez 8024813
revert to install pydap from conda and not from repo
Mikejmnez 070d9bc
expose checksum as user kwarg
Mikejmnez e705e60
include `checksums` optional argument in `whats-new`
Mikejmnez 45db12f
update to newest release of pydap via pip until conda install is avai…
Mikejmnez bca8a98
use requests_cache session with retry-params when 500 errors occur
Mikejmnez 3a15909
update env yml file to use new pydap release via conda
Mikejmnez 23cf53b
turn on testing on datatree from test.opendap.org
Mikejmnez 31d1be2
rebase with main
Mikejmnez 796ec7a
update what`s new
Mikejmnez File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you help me undestand -- why would a user not want to enable
batchmode if they are using a new enough version of pydap?Uh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That is a good question. The route map for pydap is for
batch=Truealways fordap4. So adap4-urlwould automatically do thisbatch=truethingy. But right now, batch=true or false follow slightly different pathways internally within pydap (some of it very old, as you may know). In the roadmap for pydap, there is definitely is a major refactoring in sight.FYI: Even when
batch=Falseby default, dimension data is alwaysbatched(downloaded at once) whenprotocol= dap4, as long as they are using a "new enough version of pydap (>=3.5.6)" (see new line 177 onbackends.pydap_). I think this is the first step to making thisdap4 <--> batch=Truein the future.But enabling
batch=Truefor streaming/downloading data that is being subset by Xarray (e.g.ds.isel(lat=slice1, lon=slice2).to_netcdfcan become rapidly complex (in particular in the presence of hierarchies). I done plenty of testing (with both hierarchical data, and data on staggered grids), and while things work well so far, I think "soft launching" this feature on xarray makes the most sense to me. It seems safer to me at least.