Support sharding through config and raster_write_kwargs#1106

Draft
melonora wants to merge 15 commits into scverse:main from melonora:support_sharding

@melonora
Collaborator

No description provided.

@melonora
Collaborator Author

melonora commented Apr 14, 2026

Failing at the moment because the required ome-zarr version has not been released yet. You can test locally with ome-zarr-py from main.

Also, we still need to add support for zarrs to improve the speed of shard I/O.
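
For context on why sharding matters for I/O, here is a minimal sketch in plain Python (no zarr dependency). It only counts how many stored objects a raster would need with and without shards. The image, chunk, and shard shapes are hypothetical and are not values used by this PR, and `count_blocks` is an illustrative helper, not part of spatialdata or zarr.

```python
import math


def count_blocks(array_shape, block_shape):
    """Count the blocks needed to tile array_shape with block_shape.

    Partial blocks at the array edge count as whole blocks.
    """
    return math.prod(math.ceil(a / b) for a, b in zip(array_shape, block_shape))


# Hypothetical (c, y, x) raster and layout, chosen only for illustration.
image_shape = (4, 20_000, 20_000)
chunk_shape = (1, 500, 500)
shard_shape = (1, 5_000, 5_000)  # a whole multiple of chunk_shape on every axis

# Without sharding, every chunk is written as its own stored object.
n_chunks = count_blocks(image_shape, chunk_shape)

# With sharding, chunks are grouped so that each shard is one stored object.
n_shards = count_blocks(image_shape, shard_shape)
chunks_per_shard = count_blocks(shard_shape, chunk_shape)

print(f"objects without sharding: {n_chunks}")
print(f"objects with sharding:    {n_shards} ({chunks_per_shard} chunks each)")
```

Under these assumed shapes the number of stored objects drops by a factor of 100, while reads can still address individual chunks inside a shard.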

@codecov

codecov bot commented Apr 14, 2026

Codecov Report

❌ Patch coverage is 86.66667% with 12 lines in your changes missing coverage. Please review.
✅ Project coverage is 91.86%. Comparing base (cf91ad5) to head (6334fa8).

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/spatialdata/config.py | 86.11% | 10 Missing ⚠️ |
| src/spatialdata/_io/io_raster.py | 80.00% | 2 Missing ⚠️ |

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1106      +/-   ##
==========================================
- Coverage   91.93%   91.86%   -0.07%     
==========================================
  Files          51       51              
  Lines        7772     7858      +86     
==========================================
+ Hits         7145     7219      +74     
- Misses        627      639      +12     

| Files with missing lines | Coverage Δ |
|---|---|
| src/spatialdata/__init__.py | 96.00% <100.00%> (+0.34%) ⬆️ |
| src/spatialdata/_core/spatialdata.py | 91.98% <100.00%> (+0.05%) ⬆️ |
| src/spatialdata/_io/io_raster.py | 91.35% <80.00%> (-0.74%) ⬇️ |
| src/spatialdata/config.py | 87.34% <86.11%> (-12.66%) ⬇️ |

Only these versions are supported because they use the zarr API correctly inside dask and also
make it possible to configure the tune optimization. The latter is required to prevent errors caused by dask partitions collapsing
when data is read back in from parquet.

@Mr-Milk

Mr-Milk commented Apr 15, 2026

Should we also allow control over sharding for anndata?

2 participants