278 KiB
Change Log
Unreleased
Major Changes
wgpu::Instance::enumerate_adapters is now async & available on WebGPU
BREAKING CHANGE: enumerate_adapters is now async:
- pub fn enumerate_adapters(&self, backends: Backends) -> Vec<Adapter> {
+ pub fn enumerate_adapters(&self, backends: Backends) -> impl Future<Output = Vec<Adapter>> {
This yields (kek) two benefits:
- This method is now implemented on non-native using the standard
Adapter::request_adapter(…), makingenumerate_adaptersa portable surface. This was previously a nontrivial pain point when an application wanted to do some of its own filtering of adapters. - This method can now be implemented in custom backends.
By @R-Cramer4 in #8230
New LoadOp::DontCare
In the case where a renderpass unconditionally writes to all pixels in the rendertarget,
Load can cause unnecessary memory traffic, and Clear can spend time unnecessarily
clearing the rendertargets. DontCare is a new LoadOp which will leave the contents
of the rendertarget undefined. Because this could lead to undefined behavior, this API
requires that the user gives an unsafe token to use the api.
While you can use this unconditionally, on platforms where DontCare is not available,
it will internally use a different load op.
load: LoadOp::DontCare(unsafe { wgpu::LoadOpDontCare::enabled() })
By @cwfitzgerald in #8549
MipmapFilterMode is split from FilterMode
This is a breaking change that aligns wgpu with spec.
SamplerDescriptor {
...
- mipmap_filter: FilterMode::Nearest
+ mipmap_filter: MipmapFilterMode::Nearest
...
}
Multiview on all major platforms and support for multiview bitmasks
Multiview is a feature that allows rendering the same content to multiple layers of a texture. This is useful primarily in VR where you wish to display almost identical content to 2 views, just with a different perspective. Instead of using 2 draw calls or 2 instances for each object, you can use this feature.
Multiview is also called view instancing in DX12 land or vertex amplification in Metal land.
Multiview has been reworked, adding support for Metal and DX12, and adding testing and validation to wgpu itself.
This change also introduces a view bitmask, a new field in RenderPassDescriptor that allows a render pass to render to multiple non-adjacent layers
when using the SELECTIVE_MULTIVIEW feature. Note that this also influences apps that don't use multiview, as they have to set this mask to None.
- wgpu::RenderPassDescriptor {
- label: None,
- color_attachments: &color_attachments,
- depth_stencil_attachment: None,
- timestamp_writes: None,
- occlusion_query_set: None,
- }
+ wgpu::RenderPassDescriptor {
+ label: None,
+ color_attachments: &color_attachments,
+ depth_stencil_attachment: None,
+ timestamp_writes: None,
+ occlusion_query_set: None,
+ multiview_mask: NonZero::new(3),
+ }
One other breaking change worth noting is that in WGSL @builtin(view_index) now requires a type of u32, where previously it required i32.
By @SupaMaggie70Incorporated in #8206.
Error scopes now use guards and are thread-local.
- device.push_error_scope(wgpu::ErrorFilter::Validation);
+ let scope = device.push_error_scope(wgpu::ErrorFilter::Validation);
// ... perform operations on the device ...
- let error: Option<Error> = device.pop_error_scope().await;
+ let error: Option<Error> = scope.pop().await;
Device error scopes now operate on a per-thread basis. This allows them to be used easily within multithreaded contexts, without having the error scope capture errors from other threads.
When the std feature is not enabled, we have no way to differentiate between threads, so error scopes return to be
global operations.
By @cwfitzgerald in #8685
Log Levels
We have received complaints about wgpu being way too log spammy at log levels info/warn/error. We have
adjusted our log policy and changed logging such that info and above should be silent unless some exceptional
event happens. Our new log policy is as follows:
- Error: if we can’t (for some reason, usually a bug) communicate an error any other way.
- Warning: similar, but there may be one-shot warnings about almost certainly sub-optimal.
- Info: do not use
- Debug: Used for interesting events happening inside wgpu.
- Trace: Used for all events that might be useful to either
wgpuor application developers.
By @cwfitzgerald in #8579.
subgroup_{min,max}_size renamed and moved from Limits -> AdapterInfo
To bring our code in line with the WebGPU spec, we have moved information about subgroup size from limits to adapter info. Limits was not the correct place for this anyway, and we had some code special casing those limits.
Additionally we have renamed the fields to match the spec.
- let min = limits.min_subgroup_size;
+ let min = info.subgroup_min_size;
- let max = limits.max_subgroup_size;
+ let max = info.subgroup_max_size;
By @cwfitzgerald in #8609.
New Features
- Added support for transient textures on Vulkan and Metal. By @opstic in #8247
- Implement shader triangle barycentric coordinate builtins. By @atlv24 in #8320.
- Added support for binding arrays of storage textures on Metal. By @msvbg in #8464
- Added support for multisampled texture arrays on Vulkan through adapter feature
MULTISAMPLE_ARRAY. By @LaylBongers in #8571. - Added
get_configurationtowgpu::Surface, that returns the current configuration ofwgpu::Surface. By @sagudev in #8664.
Changes
General
- Require new enable extensions when using ray queries and position fetch (
wgpu_ray_query,wgpu_ray_query_vertex_return). By @Vecvec in #8545. - Texture now has
from_custom. By @R-Cramer4 in #8315. - Using both the wgpu command encoding APIs and
CommandEncoder::as_hal_muton the same encoder will now result in a panic. - Allow
include_spirv!andinclude_spirv_raw!macros to be used in constants and statics. By @clarfonthey in #8250. - Added support for rendering onto multi-planar textures. By @noituri in #8307.
- Validation errors from
CommandEncoder::finish()will report the label of the invalid encoder. By @kpreid in #8449. - Corrected documentation of the minimum alignment of the end of a mapped range of a buffer (it is 4, not 8). By @kpreid in #8450.
util::StagingBeltnow takes aDevicewhen it is created instead of when it is used. By @kpreid in #8462.wgpu_hal::vulkan::TextureAPI changes to handle externally-created textures and memory more flexibly. By @s-ol in #8512, #8521.
Metal
- Add support for mesh shaders. By @SupaMaggie70Incorporated in #8139
Naga
- Prevent UB with invalid ray query calls on spirv. By @Vecvec in #8390.
- Update the set of binding_array capabilities. In most cases, they are set automatically from
wgpufeatures, and this change should not be user-visible. By @andyleiserson in #8671.
Bug Fixes
naga
- Fix a bug that resulted in the Metal error
program scope variable must reside in constant address spacein some cases. By @teoxoy in #8311.
General
- Fixed a bug where mapping sub-ranges of a buffer on web would fail with
OperationError: GPUBuffer.getMappedRange: GetMappedRange range extends beyond buffer's mapped range. By @ryankaplan in #8349 - Reject fragment shader output
locations >max_color_attachmentslimit. By @ErichDonGubler in #8316. - WebGPU device requests now support the required limits
maxColorAttachmentsandmaxColorAttachmentBytesPerSample. By @evilpie in #8328 - Reject binding indices that exceed
wgpu_types::Limits::max_bindings_per_bind_groupwhen deriving a bind group layout for a pipeline. By @jimblandy in #8325. - Removed three features from
wgpu-halwhich did nothing useful:"cargo-clippy","gpu-allocator", and"rustc-hash". By @kpreid in #8357. wgpu_types::PollErrornow always implements theErrortrait. By @kpreid in #8384.- The texture subresources used by the color attachments of a render pass are no longer allowed to overlap when accessed via different texture views. By @andyleiserson in #8402.
- The
STORAGE_READ_ONLYtexture usage is now permitted to coexist with other read-only usages. By @andyleiserson in #8490. - Validate that buffers are unmapped in
write_buffercalls. By @ErichDonGubler in #8454. - Add WGSL parsing for mesh shaders. By @inner-daemons in #8370.
- Shorten critical section inside present such that the snatch write lock is no longer held during present, preventing other work happening on other threads. By @cwfitzgerald in #8608.
naga
- The
||and&&operators now "short circuit", i.e., do not evaluate the RHS if the result can be determined from just the LHS. By @andyleiserson in #7339.
DX12
- Align copies b/w textures and buffers via a single intermediate buffer per copy when
D3D12_FEATURE_DATA_D3D12_OPTIONS13.UnrestrictedBufferTextureCopyPitchSupportedisfalse. By @ErichDonGubler in #7721. - Fix detection of Int64 Buffer/Texture atomic features. By @cwfitzgerald in #8667.
Vulkan
- Fixed a validation error regarding atomic memory semantics. By @atlv24 in #8391.
WebGPU
- Fixed a bug where the texture aspect was not passed through when calling
copy_texture_to_bufferin WebGPU, causing the copy to fail for depth/stencil textures. By @Tim-Evans-Seequent in #8445.
GLES
- Fix race when downloading texture from compute shader pass. By @SpeedCrash100 in #8527
- Fix double window class registration when dynamic libraries are used. By @Azorlogh in #8548
- Fix context loss on device initialization on GL3.3-4.1 contexts. By @cwfitzgerald in #8674.
hal
DropCallbacks are now called after dropping all other fields of their parent structs. By @jerzywilczek in #8353
v27.0.4 (2025-10-23)
This release includes wgpu-hal version 27.0.4. All other crates remain at their previous versions.
Bug Fixes
General
- Remove fragile dependency constraint on
ordered-floatthat prevented semver-compatible changes above5.0.0. By @kpreid in #8371.
Vulkan
- Work around extremely poor frame pacing from AMD and Nvidia cards on Windows in
FifoandFifoRelaxedpresent modes. This is due to the drivers implicitly using a DXGI (Direct3D) swapchain to implement these modes and it having vastly different timing properties. See https://github.com/gfx-rs/wgpu/issues/8310 and https://github.com/gfx-rs/wgpu/issues/8354 for more information. By @cwfitzgerald in #8420.
v27.0.3 (2025-10-22)
This release includes naga, wgpu-core and wgpu-hal version 27.0.3. All other crates remain at their previous versions.
Bug Fixes
naga
- Fix a bug that resulted in the Metal error
program scope variable must reside in constant address spacein some cases. Backport of #8311 by @teoxoy.
General
- Remove an assertion that causes problems if
CommandEncoder::as_hal_mutis used. By @andyleiserson in #8387.
DX12
- Align copies b/w textures and buffers via a single intermediate buffer per copy when
D3D12_FEATURE_DATA_D3D12_OPTIONS13.UnrestrictedBufferTextureCopyPitchSupportedisfalse. By @ErichDonGubler in #7721, backported in #8374.
v27.0.2 (2025-10-03)
Bug Fixes
DX12
- Fix device creation failures for devices that do not support mesh shaders. By @vorporeal in #8297.
v27.0.1 (2025-10-02)
Bug Fixes
- Fixed the build on docs.rs. By @cwfitzgerald in #8292.
v27.0.0 (2025-10-01)
Major Changes
Deferred command buffer actions: map_buffer_on_submit and on_submitted_work_done
You may schedule buffer mapping and a submission-complete callback to run automatically after you submit, directly from encoders, command buffers, and passes.
// Record some GPU work so the submission isn't empty and touches `buffer`.
encoder.clear_buffer(&buffer, 0, None);
// Defer mapping until this encoder is submitted.
encoder.map_buffer_on_submit(&buffer, wgpu::MapMode::Read, 0..size, |result| { .. });
// Fires after the command buffer's work is finished.
encoder.on_submitted_work_done(|| { .. });
// Automatically calls `map_async` and `on_submitted_work_done` after this submission finishes.
queue.submit([encoder.finish()]);
Available on CommandEncoder, CommandBuffer, RenderPass, and ComputePass.
By @cwfitzgerald in #8125.
Builtin Support for DXGI swapchains on top of of DirectComposition Visuals in DX12
By enabling DirectComposition support, the dx12 backend can now support transparent windows.
This creates a single IDCompositionVisual over the entire window that is used by the mfSurface. If a user wants to manage the composition tree themselves, they should create their own device and composition, and pass the relevant visual down into wgpu via SurfaceTargetUnsafe::CompositionVisual.
let instance = wgpu::Instance::new(&wgpu::InstanceDescriptor {
backend_options: wgpu::BackendOptions {
dx12: wgpu::Dx12BackendOptions {
presentation_system: wgpu::Dx12SwapchainKind::DxgiFromVisual,
..
},
..
},
..
});
By @n1ght-hunter in #7550.
EXPERIMENTAL_RAY_TRACING_ACCELERATION_STRUCTURE has been merged into EXPERIMENTAL_RAY_QUERY
We have merged the acceleration structure feature into the RayQuery feature. This is to help work around an AMD driver bug and reduce the feature complexity of ray tracing. In the future when ray tracing pipelines are implemented, if either feature is enabled, acceleration structures will be available.
- Features::EXPERIMENTAL_RAY_TRACING_ACCELERATION_STRUCTURE
+ Features::EXPERIMENTAL_RAY_QUERY
By @Vecvec in #7913.
New EXPERIMENTAL_PRECOMPILED_SHADERS API
We have added Features::EXPERIMENTAL_PRECOMPILED_SHADERS, replacing existing passthrough types with a unified CreateShaderModuleDescriptorPassthrough which allows passing multiple shader codes for different backends. By @SupaMaggie70Incorporated in #7834
Difference for SPIR-V passthrough:
- device.create_shader_module_passthrough(wgpu::ShaderModuleDescriptorPassthrough::SpirV(
- wgpu::ShaderModuleDescriptorSpirV {
- label: None,
- source: spirv_code,
- },
- ))
+ device.create_shader_module_passthrough(wgpu::ShaderModuleDescriptorPassthrough {
+ entry_point: "main".into(),
+ label: None,
+ spirv: Some(spirv_code),
+ ..Default::default()
})
This allows using precompiled shaders without manually checking which backend's code to pass, for example if you have shaders precompiled for both DXIL and SPIR-V.
Buffer mapping apis no longer have lifetimes
Buffer::get_mapped_range(), Buffer::get_mapped_range_mut(), and Queue::write_buffer_with() now return guard objects without any lifetimes. This
makes it significantly easier to store these types in structs, which is useful for building utilities that build the contents of a buffer over time.
- let buffer_mapping_ref: wgpu::BufferView<'_> = buffer.get_mapped_range(..);
- let buffer_mapping_mut: wgpu::BufferViewMut<'_> = buffer.get_mapped_range_mut(..);
- let queue_write_with: wgpu::QueueWriteBufferView<'_> = queue.write_buffer_with(..);
+ let buffer_mapping_ref: wgpu::BufferView = buffer.get_mapped_range(..);
+ let buffer_mapping_mut: wgpu::BufferViewMut = buffer.get_mapped_range_mut(..);
+ let queue_write_with: wgpu::QueueWriteBufferView = queue.write_buffer_with(..);
By @sagudev in #8046 and @cwfitzgerald in #8070.
EXPERIMENTAL_* features now require unsafe code to enable
We want to be able to expose potentially experimental features to our users before we have ensured that they are fully sound to use.
As such, we now require any feature that is prefixed with EXPERIMENTAL to have a special unsafe token enabled in the device descriptor
acknowledging that the features may still have bugs in them and to report any they find.
adapter.request_device(&wgpu::DeviceDescriptor {
features: wgpu::Features::EXPERIMENTAL_MESH_SHADER,
experimental_features: unsafe { wgpu::ExperimentalFeatures::enabled() }
..
})
By @cwfitzgerald in #8163.
Multi-draw indirect is now unconditionally supported when indirect draws are supported
We have removed Features::MULTI_DRAW_INDIRECT as it was unconditionally available on all platforms.
RenderPass::multi_draw_indirect is now available if the device supports downlevel flag DownlevelFlags::INDIRECT_EXECUTION.
If you are using spirv-passthrough with multi-draw indirect and gl_DrawID, you can know if MULTI_DRAW_INDIRECT is being emulated
by if the Feature::MULTI_DRAW_INDIRECT_COUNT feature is available on the device, this feature cannot be emulated efficicently.
By @cwfitzgerald in #8162.
wgpu::PollType::Wait has now an optional timeout
We removed wgpu::PollType::WaitForSubmissionIndex and added fields to wgpu::PollType::Wait in order to express timeouts.
Before/after for wgpu::PollType::Wait:
-device.poll(wgpu::PollType::Wait).unwrap();
-device.poll(wgpu::PollType::wait_indefinitely()).unwrap();
+device.poll(wgpu::PollType::Wait {
+ submission_index: None, // Wait for most recent submission
+ timeout: Some(std::time::Duration::from_secs(60)), // Previous behavior, but more likely you want `None` instead.
+ })
+ .unwrap();
Before/after for wgpu::PollType::WaitForSubmissionIndex:
-device.poll(wgpu::PollType::WaitForSubmissionIndex(index_to_wait_on))
+device.poll(wgpu::PollType::Wait {
+ submission_index: Some(index_to_wait_on),
+ timeout: Some(std::time::Duration::from_secs(60)), // Previous behavior, but more likely you want `None` instead.
+ })
+ .unwrap();
⚠️ Previously, both wgpu::PollType::WaitForSubmissionIndex and wgpu::PollType::Wait had a hard-coded timeout of 60 seconds.
To wait indefinitely on the latest submission, you can also use the wait_indefinitely convenience function:
device.poll(wgpu::PollType::wait_indefinitely());
New Features
General
- Added mesh shader support to
wgpu, with examples. Requires passthrough. By @SupaMaggie70Incorporated in #7345. - Added support for external textures based on WebGPU's
GPUExternalTexture. These allow shaders to transparently operate on potentially multiplanar source texture data in either RGB or YCbCr formats via WGSL'stexture_externaltype. This is gated behind theFeatures::EXTERNAL_TEXTUREfeature, which is currently only supported on DX12. By @jamienicol in #4386. wgpu::Device::pollcan now specify a timeout viawgpu::PollType::Wait. By @wumpf in #8282 & #8285
naga
- Expose
naga::front::wgsl::UnimplementedEnableExtension. By @ErichDonGubler in #8237.
Changes
General
- Command encoding now happens when
CommandEncoder::finishis called, not when the individual operations are requested. This does not affect the API, but may affect performance characteristics. By @andyleiserson in #8220. - Prevent resources for acceleration structures being created if acceleration structures are not enabled. By @Vecvec in #8036.
- Validate that each
push_debug_grouppairs with exactly onepop_debug_group. By @andyleiserson in #8048. set_viewportnow requires that the supplied minimum depth value is less than the maximum depth value. By @andyleiserson in #8040.- Validation of
copy_texture_to_buffer,copy_buffer_to_texture, andcopy_texture_to_textureoperations more closely follows the WebGPU specification. By @andyleiserson in various PRs.- Copies within the same texture must not overlap.
- Copies of multisampled or depth/stencil formats must span an entire subresource (layer).
- Copies of depth/stencil formats must be 4B aligned.
- For texture-buffer copies,
bytes_per_rowon the buffer side must be 256B-aligned, even if the transfer is a single row.
- The offset for
set_vertex_bufferandset_index_buffermust be 4B aligned. By @andyleiserson in #7929. - The offset and size of bindings are validated as fitting within the underlying buffer in more cases. By @andyleiserson in #7911.
- The function you pass to
Device::on_uncaptured_error()must now implementSyncin addition toSend, and be wrapped inArcinstead ofBox. In exchange for this, it is no longer possible for callingwgpufunctions while in that callback to cause a deadlock (not that we encourage you to actually do that). By @kpreid in #8011. - Make a compacted hal acceleration structure inherit a label from the base BLAS. By @Vecvec in #8103.
- The limits requested for a device must now satisfy
min_subgroup_size <= max_subgroup_size. By @andyleiserson in #8085. - Improve errors when buffer mapping is done incorrectly. Allow aliasing immutable [
BufferViews]. By @cwfitzgerald in #8150. - Require new
F16_IN_F32downlevel flag forquantizeToF16,pack2x16float, andunpack2x16floatin WGSL input. By @aleiserson in #8130. - The error message for non-copyable depth/stencil formats no longer mentions the aspect when it is not relevant. By @reima in #8156.
- Track the initialization status of buffer memory correctly when
copy_texture_to_bufferskips over padding space between rows or layers, or when the start/end of a texture-buffer transfer is not 4B aligned. By @andyleiserson in #8099.
naga
- naga now requires that no type be larger than 1 GB. This limit may be lowered in the future; feedback on an appropriate value for the limit is welcome. By @andyleiserson in #7950.
- If the shader source contains control characters, naga now replaces them with U+FFFD ("replacement character") in diagnostic output. By @andyleiserson in #8049.
- Add f16 IO polyfill on Vulkan backend to enable SHADER_F16 use without requiring
storageInputOutput16. By @cryvosh in #7884. - For custom Naga backend authors:
naga::proc::Namernow accepts reserved keywords using two new dedicated types,proc::{KeywordSet, CaseInsensitiveKeywordSet}. By @kpreid in #8136. - BREAKING: Previously the WGSL storage-texture format
rg11b10floatwas incorrectly accepted and generated by naga, but now only accepts the the correct namerg11b10ufloatinstead. By @ErikWDev in #8219. - The
source()method ofShaderErrorno longer reports the error as its own source. By @andyleiserson in #8258. - naga correctly ingests SPIR-V that use descriptor runtime indexing, which in turn is correctly converted into WGSLs binding array. By @hasenbanck in 8256.
- naga correctly ingests SPIR-V that loads from multi-sampled textures, which in turn is correctly converted into WGSLs texture_multisampled_2d and load operations. By @hasenbanck in 8270.
- naga implement OpImageGather and OpImageDrefGather operations when ingesting SPIR-V. By @hasenbanck in 8280.
DX12
- Allow disabling waiting for latency waitable object. By @marcpabst in #7400
- Add mesh shader support, including to the example. By @SupaMaggie70Incorporated in #8110
Bug Fixes
General
- Validate that effective buffer binding size is aligned to 4 when creating bind groups with buffer entries.. By @ErichDonGubler in 8041.
DX12
- Create an event per wait to prevent 60 second hangs in certain multithreaded scenarios. By @Vecvec in #8273.
- Fixed a bug where access to matrices with 2 rows would not work in some cases. By @andyleiserson in #7438.
EGL
- Fixed unwrap failed in context creation for some Android devices. By @uael in #8024.
Vulkan
- Fixed wrong color format+space being reported versus what is hardcoded in
create_swapchain(). By @MarijnS95 in #8226.
naga
- [wgsl-in] Allow a trailing comma in
@blend_src(…)attributes. By @ErichDonGubler in #8137. - [wgsl-in] Allow a trailing comma in the list of
casevalues inside aswitch. By @reima in #8165. - Escape, rather than strip, identifiers with Unicode. By @ErichDonGubler in 7995.
Documentation
General
- Clarify that subgroup barriers require both the
SUBGROUPandSUBGROUP_BARRIERfeatures / capabilities. By @andyleiserson in #8203.
v26.0.6 (2025-10-23)
This release includes wgpu-hal version 26.0.6. All other crates remain at their previous versions.
Bug Fixes
Vulkan
- Work around extremely poor frame pacing from AMD and Nvidia cards on Windows in
FifoandFifoRelaxedpresent modes. This is due to the drivers implicitly using a DXGI (Direct3D) swapchain to implement these modes and it having vastly different timing properties. See https://github.com/gfx-rs/wgpu/issues/8310 and https://github.com/gfx-rs/wgpu/issues/8354 for more information. By @cwfitzgerald in #8420.
v26.0.5 (2025-10-21)
This release includes wgpu-hal version 26.0.5. All other crates remain at their previous versions.
Bug Fixes
DX12
- Align copies b/w textures and buffers via a single intermediate buffer per copy when
D3D12_FEATURE_DATA_D3D12_OPTIONS13.UnrestrictedBufferTextureCopyPitchSupportedisfalse. By @ErichDonGubler in #7721, backported in #8375.
v26.0.4 (2025-08-07)
Bug Fixes
Vulkan
- Fix
STATUS_HEAP_CORRUPTIONcrash when concurrently callingcreate_sampler. By @atlv24 in #8043, #8056.
v26.0.3 (2025-07-30)
Bug Fixes
- Fixed memory leak in vulkan backend. By @cwfitzgerald in #8031.
Bug Fixes
naga
- Fix empty
ifstatements causing errors on spirv 1.6+. By @Vecvec in #7883.
v26.0.2 (2025-07-23)
Bug Fixes
- Fixed vulkan validation error regarding the swapchain in latest SDK. By @cwfitzgerald in #7971.
- Fixed flickering on AMD devices and crashes inside Renderdoc due to incorrect caching of
VkFramebuffers when the driver re-used image view handles. By @cwfitzgerald in #7972.
Warning
There is formally a breaking change in
wgpu_hal::vulkan::Device::texture_from_rawas there is now a&selfreceiver where there previously wasn't one. This will not affect you unless you explicitly use this api. We have gone ahead with the release as the bug was pervasive and made wgpu unusable for the affected people on v26.
v26.0.1 (2025-07-10)
Bug Fixes
- Fixed build error inside
wgpu::util::initialize_adapter_from_envwhenstdfeature is not enabled. By @kpreid in #7918. - Fixed build error occurring when the
profilingdependency is configured to have profiling active. By @kpreid in #7916. - Emit a validation error instead of panicking when a query set index is OOB. By @ErichDonGubler in #7908.
v26.0.0 (2025-07-09)
Major Features
New method TextureView::texture
You can now call texture_view.texture() to get access to the texture that
a given texture view points to.
By @cwfitzgerald and @Wumpf in #7907.
as_hal calls now return guards instead of using callbacks.
Previously, if you wanted to get access to the wgpu-hal or underlying api types, you would call as_hal and get the hal type as a callback. Now the function returns a guard which dereferences to the hal type.
- device.as_hal::<hal::api::Vulkan>(|hal_device| {...});
+ let hal_device: impl Deref<Item = hal::vulkan::Device> = device.as_hal::<hal::api::Vulkan>();
By @cwfitzgerald in #7863.
Enabling Vulkan Features/Extensions
For those who are doing vulkan/wgpu interop or passthrough and need to enable features/extensions that wgpu does not expose, there is a new wgpu_hal::vulkan::Adapter::open_with_callback that allows the user to modify the pnext chains and extension lists populated by wgpu before we create a vulkan device. This should vastly simplify the experience, as previously you needed to create a device yourself.
Underlying api interop is a quickly evolving space, so we welcome all feedback!
type VkApi = wgpu::hal::api::Vulkan;
let adapter: wgpu::Adapter = ...;
let mut buffer_device_address_create_info = ash::vk::PhysicalDeviceBufferDeviceAddressFeatures { .. };
let hal_device: wgpu::hal::OpenDevice<VkApi> = adapter
.as_hal::<VkApi>()
.unwrap()
.open_with_callback(
wgpu::Features::empty(),
&wgpu::MemoryHints::Performance,
Some(Box::new(|args| {
// Add the buffer device address extension.
args.extensions.push(ash::khr::buffer_device_address::NAME);
// Extend the create info with the buffer device address create info.
*args.create_info = args
.create_info
.push_next(&mut buffer_device_address_create_info);
// We also have access to the queue create infos if we need them.
let _ = args.queue_create_infos;
})),
)
.unwrap();
let (device, queue) = adapter
.create_device_from_hal(hal_device, &wgpu::DeviceDescriptor { .. })
.unwrap();
By @Vecvec in #7829.
naga
- Added
no_stdsupport with default features disabled. By @Bushrat011899 in #7585. - [wgsl-in,ir] Add support for parsing rust-style doc comments via
naga::front::glsl::Frontend::new_with_options. By @Vrixyz in #6364. - When emitting GLSL, Uniform and Storage Buffer memory layouts are now emitted even if no explicit binding is given. By @cloone8 in #7579.
- Diagnostic rendering methods (i.e.,
naga::{front::wgsl::ParseError,WithSpan}::emit_error_to_string_with_path) now accept more types for theirpathargument via a new sealedAsDiagnosticFilePathtrait. By @atlv24, @bushrat011899, and @ErichDonGubler in #7643. - Add support for quad operations (requires
SUBGROUPfeature to be enabled). By @dzamkov and @valaphee in #7683. - Add support for
atomicCompareExchangeWeakin HLSL and GLSL backends. By @cryvosh in #7658
General
- Add support for astc-sliced-3d feature. By @mehmetoguzderin in #7577
- Added
wgpu_hal::dx12::Adapter::as_raw(). By @tronical in ##7852 - Add support for rendering to slices of 3D texture views and single layered 2D-Array texture views (this requires
VK_KHR_maintenance1which should be widely available on newer drivers). By @teoxoy in #7596 - Add extra acceleration structure vertex formats. By @Vecvec in #7580.
- Add acceleration structure limits. By @Vecvec in #7845.
- Add support for clip-distances feature for Vulkan and GL backends. By @dzamkov in #7730
- Added
wgpu_types::error::{ErrorType, WebGpuError}for classification of errors according to WebGPU'sGPUError's classification scheme, and implementWebGpuErrorfor existing errors. This allows users ofwgpu-coreto offload error classification onto the wgpu ecosystem, rather than having to do it themselves without sufficient information. By @ErichDonGubler in #6547.
Bug Fixes
General
- Fix error message for sampler array limit. By @LPGhatguy in #7704.
- Fix bug where using
BufferSlice::get_mapped_range_as_array_buffer()on a buffer would prevent you from ever unmapping it. Note that this API has changed and is nowBufferView::as_uint8array().
naga
- naga now infers the correct binding layout when a resource appears only in an assignment to
_. By @andyleiserson in #7540. - Implement
dot4U8Packedanddot4I8Packedfor all backends, using specialized intrinsics on SPIR-V, HLSL, and Metal if available, and polyfills everywhere else. By @robamler in #7494, #7574, and #7653. - Add polyfilled
pack4x{I,U}8Clampedbuilt-ins to all backends and WGSL frontend. By @ErichDonGubler in #7546. - Allow textureLoad's sample index arg to be unsigned. By @jimblandy in #7625.
- Properly convert arguments to atomic operations. By @jimblandy in #7573.
- Apply necessary automatic conversions to the
valueargument oftextureStore. By @jimblandy in #7567. - Properly apply WGSL's automatic conversions to the arguments to texture sampling functions. By @jimblandy in #7548.
- Properly evaluate
abs(most negative abstract int). By @jimblandy in #7507. - Generate vectorized code for
[un]pack4x{I,U}8[Clamp]on SPIR-V and MSL 2.1+. By @robamler in #7664. - Fix typing for
select, which had issues particularly with a lack of automatic type conversion. By @ErichDonGubler in #7572. - Allow scalars as the first argument of the
distancebuilt-in function. By @bernhl in #7530. - Don't panic when handling
f16for pipeline constants, i.e.,overrides in WGSL. By @ErichDonGubler in #7801. - Prevent aliased ray queries crashing naga when writing SPIR-V out. By @Vecvec in #7759.
DX12
- Get
vertex_index&instance_indexbuiltins working for indirect draws. By @teoxoy in #7535
Vulkan
- Fix OpenBSD compilation of
wgpu_hal::vulkan::drm. By @ErichDonGubler in #7810. - Fix warnings for unrecognized present mode. By @Wumpf in #7850.
Metal
- Remove extraneous main thread warning in
fn surface_capabilities(). By @jamesordner in #7692
WebGPU
- Fix setting unclipped_depth. By @atlv24 in #7841
- Implement
on_submitted_work_donefor WebGPU backend. By @drewcrawford in #7864
Changes
- Loosen Viewport validation requirements to match the new specs. By @ebbdrop in #7564
wgpuanddeno_webgpunow usewgpu-types::error::WebGpuErrorto classify errors. Any changes here are likely to be regressions; please report them if you find them! By @ErichDonGubler in #6547.
General
- Support BLAS compaction in wgpu. By @Vecvec in #7285.
- Removed
MaintainBasein favor of usingPollType. By @waywardmonkeys in #7508. - The
destroyfunctions for buffers and textures in wgpu-core are now infallible. Previously, they returned an error if called multiple times for the same object. This only affects the wgpu-core API; the wgpu API already allowed multipledestroycalls. By @andyleiserson in #7686 and #7720. - Remove
CommandEncoder::build_acceleration_structures_unsafe_tlasin favour ofas_haland apply simplifications allowed by this. By @Vecvec in #7513 - The type of the
sizeparameter tocopy_buffer_to_bufferhas changed fromBufferAddresstoimpl Into<Option<BufferAddress>>. This achieves the spec-defined behavior of the value being optional, while still accepting existing calls without changes. By @andyleiserson in #7659. - To bring wgpu's error reporting into compliance with the WebGPU specification, the error type returned from some functions has changed, and some errors may be raised at a different time than they were previously.
- The error type returned by many methods on
CommandEncoder,RenderPassEncoder,ComputePassEncoder, andRenderBundleEncoderhas changed toEncoderStateErrororPassStateError. These functions will return theEndedvariant of these errors if called on an encoder that is no longer active. Reporting of all other errors is deferred until a call tofinish(). - Variants holding a
CommandEncoderErrorin the error enumsClearError,ComputePassErrorInner,QueryError, andRenderPassErrorInnerhave been replaced with variants holding anEncoderStateError. - The definition of
enum CommandEncoderErrorhas changed significantly, to reflect which errors can be raised byCommandEncoder.finish(). There are also some errors that no longer appear directly inCommandEncoderError, and instead appear nested within theRenderPassorComputePassvariants. CopyErrorhas been removed. Errors that were previously aCopyErrorare now aCommandEncoderErrorreturned byfinish(). (The detailed reasons for copies to fail were and still are described byTransferError, which was previously a variant ofCopyError, and is now a variant ofCommandEncoderError).
- The error type returned by many methods on
naga
- Mark
readonly_and_readwrite_storage_textures&packed_4x8_integer_dot_productlanguage extensions as implemented. By @teoxoy in #7543 naga::back::hlsl::Writer::newhas a newpipeline_optionsargument.hlsl::PipelineOptions::default()can be passed as a default. Theshader_stageandentry_pointmembers ofpipeline_optionscan be used to write only a single entry point when using the HLSL and MSL backends (GLSL and SPIR-V already had this functionality). The Metal and DX12 HALs now write only a single entry point when loading shaders. By @andyleiserson in #7626.- Implemented
early_depth_testfor SPIR-V backend, enablingSHADER_EARLY_DEPTH_TESTfor Vulkan. Additionally, fixed conservative depth optimizations when usingearly_depth_test. The syntax for forcing early depth tests is now@early_depth_test(force)instead of@early_depth_test. By @dzamkov in #7676. ImplementedLanguageExtension::VARIANTSis now implemented manually rather than derived usingstrum(allowingstrumto become a dev-only dependency) so it is no longer a member of thestrum::VARIANTStrait. Unless you are using this trait as a bound this should have no effect.- Compaction changes, by @andyleiserson in #7703:
process_overridesnow compacts the module to remove unused items. It is no longer necessary to supply values for overrides that are not used by the active entry point.- The
compactCargo feature has been removed. It is no longer possible to exclude compaction support from the build. compactnow has an additional argument that specifies whether to remove unused functions, globals, and named types and overrides. For the previous behavior, passKeepUnused::Yes.
D3D12
- Remove the need for dxil.dll. By @teoxoy in #7566
- Ability to get the raw
IDXGIFactory4fromInstance. By @MendyBerger in #7827
Vulkan
- Use highest SPIR-V version supported by Vulkan API version. By @robamler in #7595
HAL
- Added initial
no_stdsupport towgpu-hal. By @bushrat011899 in #7599
Documentation
General
- Remove outdated information about
Adapter::request_device. By @tesselode in #7768
v25.0.2 (2025-05-24)
Bug Fixes
General
- Fix a possible deadlock within
Queue::write_buffer. By @RedMindZ in #7582 - Fix
raw-window-handledependency being too lenient. By @kpreid in #7526
WebGPU
- Insert fragment pipeline constants into fragment descriptor instead of vertex descriptor. By @DerSchmale in #7621
v25.0.1 (2025-04-11)
Bug Fixes
- Fix typos in various documentation. By @waywardmonkeys in #7510.
- Fix compile error when building with
profiling/profile-with-*feature enabled. By @waywardmonkeys in #7509. - Use
once_cell::race::OnceBoxinstead ofstd::sync::LazyLockto allownaga::proc::Namer::default()to be available without backend features being enabled. By @cwfitzgerald in #7517.
DX12
- Fix validation error when creating a non-mappable buffer using the committed allocation scheme. By @cwfitzgerald and @ErichDonGubler in #7519.
v25.0.0 (2025-04-10)
Major Features
Hashmaps Removed from APIs
Both PipelineCompilationOptions::constants and ShaderSource::Glsl::defines now take
slices of key-value pairs instead of hashmaps. This is to prepare for no_std
support and allow us to keep which hashmap hasher and such as implementation details. It
also allows more easily creating these structures inline.
By @cwfitzgerald in #7133
All Backends Now Have Features
Previously, the vulkan and gles backends were non-optional on windows, linux, and android and there was no way to disable them. We have now figured out how to properly make them disablable! Additionally, if you turn on the webgl feature, you will only get the GLES backend on WebAssembly, it won't leak into native builds, like previously it might have.
Warning
If you use wgpu with
default-features = falseand you want to retain thevulkanandglesbackends, you will need to add them to your feature list.-wgpu = { version = "24", default-features = false, features = ["metal", "wgsl", "webgl"] } +wgpu = { version = "25", default-features = false, features = ["metal", "wgsl", "webgl", "vulkan", "gles"] }
By @cwfitzgerald in #7076.
device.poll Api Reworked
This release reworked the poll api significantly to allow polling to return errors when polling hits internal timeout limits.
Maintain was renamed PollType. Additionally, poll now returns a result containing information about what happened during the poll.
-pub fn wgpu::Device::poll(&self, maintain: wgpu::Maintain) -> wgpu::MaintainResult
+pub fn wgpu::Device::poll(&self, poll_type: wgpu::PollType) -> Result<wgpu::PollStatus, wgpu::PollError>
-device.poll(wgpu::Maintain::Poll);
+device.poll(wgpu::PollType::Poll).unwrap();
pub enum PollType<T> {
/// On wgpu-core based backends, block until the given submission has
/// completed execution, and any callbacks have been invoked.
///
/// On WebGPU, this has no effect. Callbacks are invoked from the
/// window event loop.
WaitForSubmissionIndex(T),
/// Same as WaitForSubmissionIndex but waits for the most recent submission.
Wait,
/// Check the device for a single time without blocking.
Poll,
}
pub enum PollStatus {
/// There are no active submissions in flight as of the beginning of the poll call.
/// Other submissions may have been queued on other threads during the call.
///
/// This implies that the given Wait was satisfied before the timeout.
QueueEmpty,
/// The requested Wait was satisfied before the timeout.
WaitSucceeded,
/// This was a poll.
Poll,
}
pub enum PollError {
/// The requested Wait timed out before the submission was completed.
Timeout,
}
Warning
As part of this change, WebGL's default behavior has changed. Previously
device.poll(Wait)appeared as though it functioned correctly. This was a quirk caused by the bug that these PRs fixed. Now it will always returnTimeoutif the submission has not already completed. As many people rely on this behavior on WebGL, there is a new options inBackendOptions. If you want the old behavior, set the following on instance creation:instance_desc.backend_options.gl.fence_behavior = wgpu::GlFenceBehavior::AutoFinish;You will lose the ability to know exactly when a submission has completed, but
device.poll(Wait)will behave the same as it does on native.
By @cwfitzgerald in #6942 and #7030.
wgpu::Device::start_capture renamed, documented, and made unsafe
- device.start_capture();
+ unsafe { device.start_graphics_debugger_capture() }
// Your code here
- device.stop_capture();
+ unsafe { device.stop_graphics_debugger_capture() }
There is now documentation to describe how this maps to the various debuggers' apis.
By @cwfitzgerald in #7470
Ensure loops generated by SPIR-V and HLSL naga backends are bounded
Make sure that all loops in shaders generated by these naga backends are bounded
to avoid undefined behaviour due to infinite loops. Note that this may have a
performance cost. As with the existing implementation for the MSL backend this
can be disabled by using Device::create_shader_module_trusted().
By @jamienicol in #6929 and #7080.
Split up Features internally
Internally split up the Features struct and recombine them internally using a macro. There should be no breaking
changes from this. This means there are also namespaces (as well as the old Features::*) for all wgpu specific
features and webgpu feature (FeaturesWGPU and FeaturesWebGPU respectively) and Features::from_internal_flags which
allow you to be explicit about whether features you need are available on the web too.
WebGPU compliant dual source blending feature
Previously, dual source blending was implemented with a wgpu native only feature flag and used a custom syntax in wgpu.
By now, dual source blending was added to the WebGPU spec as an extension.
We're now following suite and implement the official syntax.
Existing shaders using dual source blending need to be updated:
struct FragmentOutput{
- @location(0) source0: vec4<f32>,
- @location(0) @second_blend_source source1: vec4<f32>,
+ @location(0) @blend_src(0) source0: vec4<f32>,
+ @location(0) @blend_src(1) source1: vec4<f32>,
}
With that wgpu::Features::DUAL_SOURCE_BLENDING is now available on WebGPU.
Furthermore, GLSL shaders now support dual source blending as well via the index layout qualifier:
layout(location = 0, index = 0) out vec4 output0;
layout(location = 0, index = 1) out vec4 output1;
By @wumpf in #7144
Unify interface for SpirV shader passthrough
Replace device create_shader_module_spirv function with a generic create_shader_module_passthrough function
taking a ShaderModuleDescriptorPassthrough enum as parameter.
Update your calls to create_shader_module_spirv and use create_shader_module_passthrough instead:
- device.create_shader_module_spirv(
- wgpu::ShaderModuleDescriptorSpirV {
- label: Some(&name),
- source: Cow::Borrowed(&source),
- }
- )
+ device.create_shader_module_passthrough(
+ wgpu::ShaderModuleDescriptorPassthrough::SpirV(
+ wgpu::ShaderModuleDescriptorSpirV {
+ label: Some(&name),
+ source: Cow::Borrowed(&source),
+ },
+ ),
+ )
By @syl20bnr in #7326.
Noop Backend
It is now possible to create a dummy wgpu device even when no GPU is available. This may be useful for testing of code which manages graphics resources. Currently, it supports reading and writing buffers, and other resource types can be created but do nothing.
To use it, enable the noop feature of wgpu, and either call Device::noop(), or add NoopBackendOptions { enable: true } to the backend options of your Instance (this is an additional safeguard beyond the Backends bits).
By @kpreid in #7063 and #7342.
SHADER_F16 feature is now available with naga shaders
Previously this feature only allowed you to use f16 on SPIR-V passthrough shaders. Now you can use it on all shaders, including WGSL, SPIR-V, and GLSL!
enable f16;
fn hello_world(a: f16) -> f16 {
return a + 1.0h;
}
By @FL33TW00D, @ErichDonGubler, and @cwfitzgerald in #5701
Bindless support improved and validation rules changed.
Metal support for bindless has significantly improved and the limits for binding arrays have been increased.
Previously, all resources inside binding arrays contributed towards the standard limit of their type (texture_2d arrays for example would contribute to max_sampled_textures_per_shader_stage). Now these resources will only contribute towards binding-array specific limits:
max_binding_array_elements_per_shader_stagefor all non-sampler resourcesmax_binding_array_sampler_elements_per_shader_stagefor sampler resources.
This change has allowed the metal binding array limits to go from between 32 and 128 resources, all the way 500,000 sampled textures. Additionally binding arrays are now bound more efficiently on Metal.
This change also enabled legacy Intel GPUs to support 1M bindless resources, instead of the previous 1800.
To facilitate this change, there was an additional validation rule put in place: if there is a binding array in a bind group, you may not use dynamic offset buffers or uniform buffers in that bind group. This requirement comes from vulkan rules on UpdateAfterBind descriptors.
By @cwfitzgerald in #6811, #6815, and #6952.
New Features
General
- Add
Buffermethods corresponding toBufferSlicemethods, so you can skip creating aBufferSlicewhen it offers no benefit, andBufferSlice::slice()for sub-slicing a slice. By @kpreid in #7123. - Add
BufferSlice::buffer(),BufferSlice::offset()andBufferSlice::size(). By @kpreid in #7148. - Add
impl From<BufferSlice> for BufferBindingandimpl From<BufferSlice> for BindingResource, allowingBufferSlices to be easily used in creating bind groups. By @kpreid in #7148. - Add
util::StagingBelt::allocate()so the staging belt can be used to write textures. By @kpreid in #6900. - Added
CommandEncoder::transition_resources()for native API interop, and allowing users to slightly optimize barriers. By @JMS55 in #6678. - Add
wgpu_hal::vulkan::Adapter::texture_format_as_rawfor native API interop. By @JMS55 in #7228. - Support getting vertices of the hit triangle when raytracing. By @Vecvec in #7183.
- Add
as_halfor both acceleration structures. By @Vecvec in #7303. - Add Metal compute shader passthrough. Use
create_shader_module_passthroughon device. By @syl20bnr in #7326. - new
Features::MSL_SHADER_PASSTHROUGHrun-time feature allows providing pass-through MSL Metal shaders. By @syl20bnr in #7326. - Added mesh shader support to
wgpu_hal. By @SupaMaggie70Incorporated in #7089
naga
- Add support for unsigned types when calling textureLoad with the level parameter. By @ygdrasil-io in #7058.
- Support @must_use attribute on function declarations. By @turbocrime in #6801.
- Support for generating the candidate intersections from AABB geometry, and confirming the hits. By @kvark in #7047.
- Make naga:🔙:spv::Function::to_words write the OpFunctionEnd instruction in itself, instead of making another call after it. By @junjunjd in #7156.
- Add support for texture memory barriers. By @Devon7925 in #7173.
- Add polyfills for
unpackSnorm4x8,unpackUnorm4x8,unpackSnorm2x16,unpackUnorm2x16for GLSL versions they aren't supported in. By @DJMcNab in #7408.
Examples
- Added an example that shows how to handle datasets too large to fit in a single
GPUBufferby distributing it across many buffers, and then having the shader receive them as abinding_arrayof storage buffers. By @alphastrata in #6138
Changes
General
wgpu::Instance::request_adapter()now returnsResultinstead ofOption; the error provides information about why no suitable adapter was returned. By @kpreid in #7330.- Support BLAS compaction in wgpu-hal. By @Vecvec in #7101.
- Avoid using default features in many dependencies, etc. By Brody in #7031
- Use
hashbrownto simplify no-std support. By Brody in #6938 & #6925. - If you use Binding Arrays in a bind group, you may not use Dynamic Offset Buffers or Uniform Buffers in that bind group. By @cwfitzgerald in #6811
- Rename
instance_idandinstance_custom_indextoinstance_indexandinstance_custom_databy @Vecvec in #6780
naga
- naga IR types are now available in the module
naga::ir(e.g.naga::ir::Module). The original names (e.g.naga::Module) remain present for compatibility. By @kpreid in #7365. - Refactored
usestatements to simplify futureno_stdsupport. By @bushrat011899 in #7256 - naga's WGSL frontend no longer allows using the
&operator to take the address of a component of a vector, which is not permitted by the WGSL specification. By @andyleiserson in #7284 - naga's use of
termcolorandstderrare now optional behind features of the same names. By @bushrat011899 in #7482
Vulkan
HAL queue callback support
- Add a way to notify with
Queue::submit()to Vulkan'svk::Semaphoreallocated outside of wgpu. By @sotaroikeda in #6813.
Bug Fixes
naga
- Fix some instances of functions which have a return type but don't return a value being incorrectly validated. By @jamienicol in #7013.
- Allow abstract expressions to be used in WGSL function return statements. By @jamienicol in #7035.
- Error if structs have two fields with the same name. By @SparkyPotato in #7088.
- Forward '--keep-coordinate-space' flag to GLSL backend in naga-cli. By @cloone8 in #7206.
- Allow template lists to have a trailing comma. By @KentSlaney in #7142.
- Allow WGSL const declarations to have abstract types. By @jamienicol in #7055 and #7222.
- Allows override-sized arrays to resolve to the same size without causing the type arena to panic. By @KentSlaney in #7082.
- Allow abstract types to be used for WGSL switch statement selector and case selector expressions. By @jamienicol in #7250.
- Apply automatic conversions to
letdeclarations, and acceptvecN()as a constructor for vectors (in any context). By @andyleiserson in #7367. - The
&&and||operators are no longer allowed on vectors. By @andyleiserson in #7368. - Prevent ray intersection function overwriting each other. By @Vecvec in #7497.
- Require that the level operand of an ImageQuery::Size expression is i32 or u32, per spec. By @jimblandy in #7426.
- Implement constant evaluation for the cross builtin. By @jimblandy in #7404.
- Properly handle automatic type conversions in calls to
MathFunctionbuiltins. By @jimblandy in #6833.
General
- Fix some validation errors when building acceleration structures. By @Vecvec in #7486.
- Avoid overflow in query set bounds check validation. By @ErichDonGubler in #6933.
- Add Flush to GL Queue::submit. By @cwfitzgerald in #6941.
- Reduce downlevel
max_color_attachmentslimit from 8 to 4 for better GLES compatibility. By @adrian17 in #6994. - Fix building a BLAS with a transform buffer by adding a flag to indicate usage of the transform buffer. By @Vecvec in #7062.
- Move incrementation of
Device::last_acceleration_structure_build_command_indexinto queue submit. By @Vecvec in #7462. - Implement indirect draw validation. By @teoxoy in #7140
Vulkan
- Stop naga causing undefined behavior when a ray query misses. By @Vecvec in #6752.
- In naga's SPIR-V backend, avoid duplicating SPIR-V OpTypePointer instructions. By @jimblandy in #7246.
Gles
- Support OpenHarmony render with
gles. By @richerfu in #7085
Dx12
- Fix HLSL storage format generation. By @Vecvec in #6993 and #7104
- Fix 3D storage texture bindings. By @SparkyPotato in #7071
- Fix DX12 composite alpha modes. By @amrbashir in #7117
- Bound check dynamic buffers. By @teoxoy in #6931
- Fix size of buffer. By @teoxoy in #7310
WebGPU
- Improve efficiency of dropping read-only buffer mappings. By @kpreid in #7007.
Performance
naga
- Replace
unicode-xidwithunicode-ident. By @CrazyboyQCD in #7135
Documentation
-
Improved documentation around pipeline caches and
TextureBlitter. By @DJMcNab in #6978 and #7003. -
Improved documentation of
PresentMode, buffer mapping functions, memory alignment requirements, texture formats’ automatic conversions, and various types and constants. By @kpreid in #7211 and #7283. -
Added a hello window example. By @laycookie in #6992.
Examples
- Call
pre_present_notify()before presenting. By @kjarosh in #7074.
v24.0.5 (2025-05-24)
Bug Fixes
General
- Fix a possible deadlock within
Queue::write_buffer. By @RedMindZ in #7582
WebGPU
- Insert fragment pipeline constants into fragment descriptor instead of vertex descriptor. By @DerSchmale in #7621
v24.0.4 (2025-04-03)
Metal
- Use resize observers for smoother resizing. By @madsmtm in #7026.
v24.0.3 (2025-03-19)
Bug Fixes
- Fix drop order in
Surface, solving segfaults on exit on some systems. By @ed-2100 in #6997
v24.0.2 (2025-02-26)
Bug Fixes
- Fix GLES renderpass clears causing violation of
max_color_attachmentslimit. By @adrian17 in #6994. - Fix a possible deadlock within
Queue::write_texture. By @metamuffin in #7004 - Decrement
max_storage_buffer_binding_sizeby 1 to matchmax_buffer_size. By @minus1ms in #7217
v24.0.1 (2025-01-22)
Bug Fixes
- Fix
wgpunot building with--no-default-featureson when targetingwasm32-unknown-unknown. By @wumpf in #6946. - Implement
CloneonShaderModule. By @a1phyr in #6937. - Fix
CopyExternalImageDestInfonot exported onwgpu. By @wumpf in #6962.
v24.0.0 (2025-01-15)
Major changes
Refactored Dispatch Between wgpu-core and webgpu
The crate wgpu has two different "backends", one which targets webgpu in the browser, one which targets wgpu_core on native platforms and webgl. This was previously very difficult to traverse and add new features to. The entire system was refactored to make it simpler. Additionally the new system has zero overhead if there is only one "backend" in use. You can see the new system in action by using go-to-definition on any wgpu functions in your IDE.
By @cwfitzgerald in #6619.
Most objects in wgpu are now Clone
All types in the wgpu API are now Clone.
This is implemented with internal reference counting, so cloning for instance a Buffer does copies only the "handle" of the GPU buffer, not the underlying resource.
Previously, libraries using wgpu objects like Device, Buffer or Texture etc. often had to manually wrap them in a Arc to allow passing between libraries.
This caused a lot of friction since if one library wanted to use a Buffer by value, calling code had to give up ownership of the resource which may interfere with other subsystems.
Note that this also mimics how the WebGPU javascript API works where objects can be cloned and moved around freely.
By @cwfitzgerald in #6665.
Render and Compute Passes Now Properly Enforce Their Lifetime
A regression introduced in 23.0.0 caused lifetimes of render and compute passes to be incorrectly enforced. While this is not a soundness issue, the intent is to move an error from runtime to compile time. This issue has been fixed and restored to the 22.0.0 behavior.
Bindless (binding_array) Grew More Capabilities
- DX12 now supports
PARTIALLY_BOUND_BINDING_ARRAYon Resource Binding Tier 3 Hardware. This is most D3D12 hardware D3D12 Feature Table for more information on what hardware supports this feature. By @cwfitzgerald in #6734.
Device::create_shader_module_unchecked Renamed and Now Has Configuration Options
create_shader_module_unchecked became create_shader_module_trusted.
This allows you to customize which exact checks are omitted so that you can get the correct balance of performance and safety for your use case. Calling the function is still unsafe, but now can be used to skip certain checks only on certain builds.
This also allows users to disable the workarounds in the msl-out backend to prevent the compiler from optimizing infinite loops. This can have a big impact on performance, but is not recommended for untrusted shaders.
let desc: ShaderModuleDescriptor = include_wgsl!(...)
- let module = unsafe { device.create_shader_module_unchecked(desc) };
+ let module = unsafe { device.create_shader_module_trusted(desc, wgpu::ShaderRuntimeChecks::unchecked()) };
By @cwfitzgerald and @rudderbucky in #6662.
wgpu::Instance::new now takes InstanceDescriptor by reference
Previously wgpu::Instance::new took InstanceDescriptor by value (which is overall fairly uncommon in wgpu).
Furthermore, InstanceDescriptor is now cloneable.
- let instance = wgpu::Instance::new(instance_desc);
+ let instance = wgpu::Instance::new(&instance_desc);
By @wumpf in #6849.
Environment Variable Handling Overhaul
Previously how various bits of code handled reading settings from environment variables was inconsistent and unideomatic.
We have unified it to (Type::from_env() or Type::from_env_or_default()) and Type::with_env for all types.
- wgpu::util::backend_bits_from_env()
+ wgpu::Backends::from_env()
- wgpu::util::power_preference_from_env()
+ wgpu::PowerPreference::from_env()
- wgpu::util::dx12_shader_compiler_from_env()
+ wgpu::Dx12Compiler::from_env()
- wgpu::util::gles_minor_version_from_env()
+ wgpu::Gles3MinorVersion::from_env()
- wgpu::util::instance_descriptor_from_env()
+ wgpu::InstanceDescriptor::from_env_or_default()
- wgpu::util::parse_backends_from_comma_list(&str)
+ wgpu::Backends::from_comma_list(&str)
By @cwfitzgerald in #6895
Backend-specific instance options are now in separate structs
In order to better facilitate growing more interesting backend options, we have put them into individual structs. This allows users to more easily understand what options can be defaulted and which they care about. All of these new structs implement from_env() and delegate to their respective from_env() methods.
- let instance = wgpu::Instance::new(&wgpu::InstanceDescriptor {
- backends: wgpu::Backends::all(),
- flags: wgpu::InstanceFlags::default(),
- dx12_shader_compiler: wgpu::Dx12Compiler::Dxc,
- gles_minor_version: wgpu::Gles3MinorVersion::Automatic,
- });
+ let instance = wgpu::Instance::new(&wgpu::InstanceDescriptor {
+ backends: wgpu::Backends::all(),
+ flags: wgpu::InstanceFlags::default(),
+ backend_options: wgpu::BackendOptions {
+ dx12: wgpu::Dx12BackendOptions {
+ shader_compiler: wgpu::Dx12ShaderCompiler::Dxc,
+ },
+ gl: wgpu::GlBackendOptions {
+ gles_minor_version: wgpu::Gles3MinorVersion::Automatic,
+ },
+ },
+ });
If you do not need any of these options, or only need one backend's info use the default() impl to fill out the remaining feelds.
By @cwfitzgerald in #6895
The diagnostic(…); directive is now supported in WGSL
naga now parses diagnostic(…); directives according to the WGSL spec. This allows users to control certain lints, similar to Rust's allow, warn, and deny attributes. For example, in standard WGSL (but, notably, not naga yet—see https://github.com/gfx-rs/wgpu/issues/4369) this snippet would emit a uniformity error:
@group(0) @binding(0) var s : sampler;
@group(0) @binding(2) var tex : texture_2d<f32>;
@group(1) @binding(0) var<storage, read> ro_buffer : array<f32, 4>;
@fragment
fn main(@builtin(position) p : vec4f) -> @location(0) vec4f {
if ro_buffer[0] == 0 {
// Emits a derivative uniformity error during validation.
return textureSample(tex, s, vec2(0.,0.));
}
return vec4f(0.);
}
…but we can now silence it with the off severity level, like so:
// Disable the diagnosic with this…
diagnostic(off, derivative_uniformity);
@group(0) @binding(0) var s : sampler;
@group(0) @binding(2) var tex : texture_2d<f32>;
@group(1) @binding(0) var<storage, read> ro_buffer : array<f32, 4>;
@fragment
fn main(@builtin(position) p : vec4f) -> @location(0) vec4f {
if ro_buffer[0] == 0 {
// Look ma, no error!
return textureSample(tex, s, vec2(0.,0.));
}
return vec4f(0.);
}
There are some limitations to keep in mind with this new functionality:
- We support
@diagnostic(…)rules asfnattributes, but prioritization for rules in statement positions (i.e.,if (…) @diagnostic(…) { … }is unclear. If you are blocked by not being able to parsediagnostic(…)rules in statement positions, please let us know in https://github.com/gfx-rs/wgpu/issues/5320, so we can determine how to prioritize it! - Standard WGSL specifies
error,warning,info, andoffseverity levels. These are all technically usable now! A caveat, though: warning- and info-level are only emitted tostderrvia thelogfaçade, rather than being reported through aResult::Errin naga or theCompilationInfointerface inwgpu{,-core}. This will require breaking changes in naga to fix, and is being tracked by https://github.com/gfx-rs/wgpu/issues/6458. - Not all lints can be controlled with
diagnostic(…)rules. In fact, only thederivative_uniformitytriggering rule exists in the WGSL standard. That said, naga contributors are excited to see how this level of control unlocks a new ecosystem of configurable diagnostics. - Finally,
diagnostic(…)rules are not yet emitted in WGSL output. This means thatwgsl-in→wgsl-outis currently a lossy process. We felt that it was important to unblock users who neededdiagnostic(…)rules (i.e., https://github.com/gfx-rs/wgpu/issues/3135) before we took significant effort to fix this (tracked in https://github.com/gfx-rs/wgpu/issues/6496).
By @ErichDonGubler in #6456, #6148, #6533, #6353, #6537.
New Features
naga
- Support atomic operations on fields of global structs in the SPIR-V frontend. By @schell in #6693.
- Clean up tests for atomic operations support in SPIR-V frontend. By @schell in #6692
- Fix an issue where
nagaCLI would incorrectly skip the first positional argument when--stdin-file-pathwas specified. By @ErichDonGubler in #6480. - Fix textureNumLevels in the GLSL backend. By @magcius in #6483.
- Support 64-bit hex literals and unary operations in constants #6616.
- Implement
quantizeToF16()for WGSL frontend, and WGSL, SPIR-V, HLSL, MSL, and GLSL backends. By @jamienicol in #6519. - Add support for GLSL
usampler*andisampler*. By @DavidPeicho in #6513. - Expose Ray Query flags as constants in WGSL. Implement candidate intersections. By @kvark in #5429
- Add new vertex formats (
{U,S}{int,norm}{8,16},Float16andUnorm8x4Bgra). By @nolanderc in #6632 - Allow for override-expressions in
workgroup_size. By @KentSlaney in #6635. - Add support for OpAtomicCompareExchange in SPIR-V frontend. By @schell in #6590.
- Implement type inference for abstract arguments to user-defined functions. By @jamienicol in #6577.
- Allow for override-expressions in array sizes. By @KentSlaney in #6654.
pointer_composite_accessWGSL language extension is implemented. By @sagudev in #6913
General
- Add unified documentation for ray-tracing. By @Vecvec in #6747
- Return submission index in
map_asyncandon_submitted_work_doneto track down completion of async callbacks. By @eliemichel in #6360. - Move raytracing alignments into HAL instead of in core. By @Vecvec in #6563.
- Allow for statically linking DXC rather than including separate
.dllfiles. By @DouglasDwyer in #6574. DeviceTypeandAdapterInfonow implHashby @cwfitzgerald in #6868- Add build support for Apple Vision Pro. By @guusw in #6611.
- Add
wgsl_language_featuresfor obtaining available WGSL language feature by @sagudev in #6814 - Image atomic support in shaders. By @atlv24 in #6706
- 64 bit image atomic support in shaders. By @atlv24 in #5537
- Add
no_stdsupport towgpu-types. By @bushrat011899 in #6892.
Vulkan
- Allow using some 32-bit floating-point atomic operations (load, store, add, sub, exchange) in shaders. It requires the extension
VK_EXT_shader_atomic_float. By @AsherJingkongChen in #6234.
Metal
- Allow using some 32-bit floating-point atomic operations (load, store, add, sub, exchange) in shaders. It requires Metal 3.0+ with Apple 7, 8, 9 or Mac 2. By @AsherJingkongChen in #6234.
- Add build support for Apple Vision Pro. By @guusw in #6611.
- Add
raw_handlemethod to access raw Metal textures in #6894.
D3D12
- Support DXR (DirectX Ray-tracing) in wgpu-hal. By @Vecvec in #6777
Changes
naga
- Show types of LHS and RHS in binary operation type mismatch errors. By @ErichDonGubler in #6450.
- The GLSL parser now uses less expressions for function calls. By @magcius in #6604.
- Add a note to help with a common syntax error case for global diagnostic filter directives. By @e-hat in #6718
- Change arithmetic operations between two i32 variables to wrap on overflow to match WGSL spec. By @matthew-wong1 in #6835.
- Add directives to suggestions in error message for parsing global items. By @e-hat in #6723.
- Automatic conversion for
overrideinitializers. By @sagudev in 6920
General
- Align Storage Access enums to the webgpu spec. By @atlv24 in #6642
- Make
Surface::as_haltake an immutable reference to the surface. By @jerzywilczek in #9999 - Add actual sample type to
CreateBindGroupError::InvalidTextureSampleTypeerror message. By @ErichDonGubler in #6530. - Improve binding error to give a clearer message when there is a mismatch between resource binding as it is in the shader and as it is in the binding layout. By @eliemichel in #6553.
Surface::configureandSurface::get_current_textureare no longer fatal. By @alokedesai in #6253- Rename
BlasTriangleGeometry::index_buffer_offsettoBlasTriangleGeometry::first_index. By @Vecvec in #6873
D3D12
- Avoid using FXC as fallback when the DXC container was passed at instance creation. Paths to
dxcompiler.dll&dxil.dllare also now required. By @teoxoy in #6643.
Vulkan
- Add a cache for samplers, deduplicating any samplers, allowing more programs to stay within the global sampler limit. By @cwfitzgerald in #6847
HAL
- Replace
usage: Range<T>, forBufferUses,TextureUses, andAccelerationStructureBarrierwith a newStateTransition<T>. By @atlv24 in #6703 - Change the
DropCallbackAPI to useFnOnceinstead ofFnMut. By @jerzywilczek in #6482
Bug Fixes
General
- Handle query set creation failure as an internal error that loses the
Device, rather than panicking. By @ErichDonGubler in #6505. - Ensure that
Features::TIMESTAMP_QUERYis set when using timestamp writes in render and compute passes. By @ErichDonGubler in #6497. - Check for device mismatches when beginning render and compute passes. By @ErichDonGubler in #6497.
- Lower
QUERY_SET_MAX_QUERIES(and enforced limits) from 8192 to 4096 to match WebGPU spec. By @ErichDonGubler in #6525. - Allow non-filterable float on texture bindings never used with samplers when using a derived bind group layout. By @ErichDonGubler in #6531.
- Replace potentially unsound usage of
PreHashedMapwithFastHashMap. By @jamienicol in #6541. - Add missing validation for timestamp writes in compute and render passes. By @ErichDonGubler in #6578, #6583.
- Check the status of the
TIMESTAMP_QUERYfeature before other validation. - Check that indices are in-bounds for the query set.
- Check that begin and end indices are not equal.
- Check that at least one index is specified.
- Check the status of the
- Reject destroyed buffers in query set resolution. By @ErichDonGubler in #6579.
- Fix panic when dropping
Deviceon some environments. By @Dinnerbone in #6681. - Reduced the overhead of command buffer validation. By @nical in #6721.
- Set index type to NONE in
get_acceleration_structure_build_sizes. By @Vecvec in #6802. - Fix
wgpu-infonot showing dx12 adapters. By @wumpf in #6844. - Use
transform_buffer_offsetwhen initialisingtransform_buffer. By @Vecvec in #6864.
naga
- Fix crash when a texture argument is missing. By @aedm in #6486
- Emit an error in constant evaluation, rather than crash, in certain cases where
vecNconstructors have less than N arguments. By @ErichDonGubler in #6508. - Fix an error in template list matching
>=ina<b>=c. By @KentSlaney in #6898. - Correctly validate handles in override-sized array types. By @jimblandy in #6882.
- Clean up validation of
Statement::ImageStore. By @jimblandy in #6729. - In compaction, avoid cloning the type arena. By @jimblandy in #6790
- In validation, forbid cycles between global expressions and types. By @jimblandy in #6800
- Allow abstract scalars in modf and frexp results. By @jimblandy in #6821
- In the WGSL front end, apply automatic conversions to values being assigned. By @jimblandy in #6822
- Fix a leak by ensuring that types that depend on expressions are correctly compacted. By @KentSlaney in #6934.
Vulkan
- Allocate descriptors for acceleration structures. By @Vecvec in #6861.
max_color_attachment_bytes_per_sampleis now correctly set to 128. By @cwfitzgerald in #6866
D3D12
- Fix no longer showing software rasterizer adapters. By @wumpf in #6843.
max_color_attachment_bytes_per_sampleis now correctly set to 128. By @cwfitzgerald in #6866
Examples
- Add multiple render targets example. By @kaphula in #5297
Testing
- Tests the early returns in the acceleration structure build calls with empty calls. By @Vecvec in #6651.
23.0.1 (2024-11-25)
This release includes patches for wgpu, wgpu-core and wgpu-hal. All other crates remain at 23.0.0.
Below changes were cherry-picked from 24.0.0 development line.
Bug fixes
General
- Fix Texture view leaks regression. By @xiaopengli89 in #6576
Metal
- Fix surface creation crashing on iOS. By @mockersf in #6535
Vulkan
- Fix surface capabilities being advertised when its query failed. By @wumpf in #6510
23.0.0 (2024-10-25)
Themes of this release
This release's theme is one that is likely to repeat for a few releases: convergence with the WebGPU specification! wgpu's design and base functionality are actually determined by two specifications: one for WebGPU, and one for the WebGPU Shading Language.
This may not sound exciting, but let us convince you otherwise! All major web browsers have committed to offering WebGPU in their environment. Even JS runtimes like Node and Deno have communities that are very interested in providing WebGPU! WebGPU is slowly eating the world, as it were. 😀 It's really important, then, that WebGPU implementations behave in ways that one would expect across all platforms. For example, if Firefox's WebGPU implementation were to break when running scripts and shaders that worked just fine in Chrome, that would mean sad users for both application authors and browser authors.
wgpu also benefits from standard, portable behavior in the same way as web browsers. Because of this behavior, it's generally fairly easy to port over usage of WebGPU in JavaScript to wgpu. It is also what lets wgpu go full circle: wgpu can be an implementation of WebGPU on native targets, but also it can use other implementations of WebGPU as a backend in JavaScript when compiled to WASM. Therefore, the same dynamic applies: if wgpu's own behavior were significantly different, then wgpu and end users would be sad, sad humans as soon as they discover places where their nice apps are breaking, right?
The answer is: yes, we do have sad, sad humans that really want their wgpu code to work everywhere. As Firefox and others use wgpu to implement WebGPU, the above example of Firefox diverging from standard is, unfortunately, today's reality. It mostly behaves the same as a standards-compliant WebGPU, but it still doesn't in many important ways. Of particular note is naga, its implementation of the WebGPU Shader Language. Shaders are pretty much a black-and-white point of failure in GPU programming; if they don't compile, then you can't use the rest of the API! And yet, it's extremely easy to run into a case like that from https://github.com/gfx-rs/wgpu/issues/4400:
fn gimme_a_float() -> f32 {
return 42; // fails in naga, but standard WGSL happily converts to `f32`
}
We intend to continue making visible strides in converging with specifications for WebGPU and WGSL, as this release has. This is, unfortunately, one of the major reasons that wgpu has no plans to work hard at keeping a SemVer-stable interface for the foreseeable future; we have an entire platform of GPU programming functionality we have to catch up with, and SemVer stability is unfortunately in tension with that. So, for now, you're going to keep seeing major releases and breaking changes. Where possible, we'll try to make that painless, but compromises to do so don't always make sense with our limited resources.
This is also the last planned major version release of 2024; the next milestone is set for January 1st, 2025, according to our regular 12-week cadence (offset from the originally planned date of 2024-10-09 for this release 😅). We'll see you next year!
Contributor spotlight: @sagudev
This release, we'd like to spotlight the work of @sagudev, who has made significant contributions to the wgpu ecosystem this release. Among other things, they contributed a particularly notable feature where runtime-known indices are finally allowed for use with const array values. For example, this WGSL shader previously wasn't allowed:
const arr: array<u32, 4> = array(1, 2, 3, 4);
fn what_number_should_i_use(idx: u32) -> u32 {
return arr[idx];
}
…but now it works! This is significant because this sort of shader rejection was one of the most impactful issues we are aware of for converging with the WGSL specification. There are more still to go—some of which we expect to even more drastically change how folks author shaders—but we suspect that many more will come in the next few releases, including with @sagudev's help.
We're excited for more of @sagudev's contributions via the Servo community. Oh, did we forget to mention that these contributions were motivated by their work on Servo? That's right, a third well-known JavaScript runtime is now using wgpu to implement its WebGPU implementation. We're excited to support Servo to becoming another fully fledged browsing environment this way.
Major Changes
In addition to the above spotlight, we have the following particularly interesting items to call out for this release:
wgpu-core is no longer generic over wgpu-hal backends
Dynamic dispatch between different backends has been moved from the user facing wgpu crate, to a new dynamic dispatch mechanism inside the backend abstraction layer wgpu-hal.
Whenever targeting more than a single backend (default on Windows & Linux) this leads to faster compile times and smaller binaries! This also solves a long standing issue with cargo doc failing to run for wgpu-core.
Benchmarking indicated that compute pass recording is slower as a consequence, whereas on render passes speed improvements have been observed. However, this effort simplifies many of the internals of the wgpu family of crates which we're hoping to build performance improvements upon in the future.
By @wumpf in #6069, #6099, #6100.
wgpu's resources no longer have .global_id() getters
wgpu-core's internals no longer use nor need IDs and we are moving towards removing IDs completely. This is a step in that direction.
Current users of .global_id() are encouraged to make use of the PartialEq, Eq, Hash, PartialOrd and Ord traits that have now been implemented for wgpu resources.
By @teoxoy in #6134.
set_bind_group now takes an Option for the bind group argument.
https://gpuweb.github.io/gpuweb/#programmable-passes-bind-groups specifies that bindGroup is nullable. This change is the start of implementing this part of the spec. Callers that specify a Some() value should have unchanged behavior. Handling of None values still needs to be implemented by backends.
For convenience, the set_bind_group on compute/render passes & encoders takes impl Into<Option<&BindGroup>>, so most code should still work the same.
By @bradwerth in #6216.
entry_points are now Optional
One of the changes in the WebGPU spec. (from about this time last year 😅) was to allow optional entry points in GPUProgrammableStage. In wgpu, this corresponds to a subset of fields in FragmentState, VertexState, and ComputeState as the entry_point member:
let render_pipeline = device.createRenderPipeline(wgpu::RenderPipelineDescriptor {
module,
entry_point: Some("cs_main"), // This is now `Option`al.
// …
});
let compute_pipeline = device.createComputePipeline(wgpu::ComputePipelineDescriptor {
module,
entry_point: None, // This is now `Option`al.
// …
});
When set to None, it's assumed that the shader only has a single entry point associated with the pipeline stage (i.e., @compute, @fragment, or @vertex). If there is not one and only one candidate entry point, then a validation error is returned. To continue the example, we might have written the above API usage with the following shader module:
// We can't use `entry_point: None` for compute pipelines with this module,
// because there are two `@compute` entry points.
@compute
fn cs_main() { /* … */ }
@compute
fn other_cs_main() { /* … */ }
// The following entry points _can_ be inferred from `entry_point: None` in a
// render pipeline, because they're the only `@vertex` and `@fragment` entry
// points:
@vertex
fn vs_main() { /* … */ }
@fragment
fn fs_main() { /* … */ }
wgpu's DX12 backend is now based on the windows crate ecosystem, instead of the d3d12 crate
wgpu has retired the d3d12 crate (based on winapi), and now uses the windows crate for interfacing with Windows. For many, this may not be a change that affects day-to-day work. However, for users who need to vet their dependencies, or who may vendor in dependencies, this may be a nontrivial migration.
By @MarijnS95 in #6006.
New Features
Wgpu
- Added initial acceleration structure and ray query support into wgpu. By @expenses @daniel-keitel @Vecvec @JMS55 @atlv24 in #6291
naga
- Support constant evaluation for
firstLeadingBitandfirstTrailingBitnumeric built-ins in WGSL. Front-ends that translate to these built-ins also benefit from constant evaluation. By @ErichDonGubler in #5101. - Add
firstandeithersampling types for@interpolate(flat, …)in WGSL. By @ErichDonGubler in #6181. - Support for more atomic ops in the SPIR-V frontend. By @schell in #5824.
- Support local
constdeclarations in WGSL. By @sagudev in #6156. - Implemented
const_assertin WGSL. By @sagudev in #6198. - Support polyfilling
inversein WGSL. By @chyyran in #6385. - Add base support for parsing
requires,enable, anddiagnosticdirectives. No extensions or diagnostic filters are yet supported, but diagnostics have improved dramatically. By @ErichDonGubler in #6352, #6424, #6437. - Include error chain information as a message and notes in shader compilation messages. By @ErichDonGubler in #6436.
- Unify naga CLI error output with the format of shader compilation messages. By @ErichDonGubler in #6436.
General
- Add
VideoFrametoExternalImageSourceenum. By @jprochazk in #6170. - Add
wgpu::util::new_instance_with_webgpu_detection&wgpu::util::is_browser_webgpu_supportedto make it easier to support WebGPU & WebGL in the same binary. By @wumpf in #6371.
Vulkan
- Allow using VK_GOOGLE_display_timing unsafely with the
VULKAN_GOOGLE_DISPLAY_TIMINGfeature. By @DJMcNab in #6149.
Metal
- Implement
atomicCompareExchangeWeak. By @AsherJingkongChen in #6265. - Unless an explicit
CAMetalLayeris provided, surfaces now render to a sublayer. This improves resizing behavior, fixing glitches during on window resize. By @madsmtm in #6107.
Bug Fixes
- Fix incorrect hlsl image output type conversion. By @atlv24 in #6123.
naga
- SPIR-V frontend splats depth texture sample and load results. Fixes issue #4551. By @schell in #6384.
- Accept only
vec3(notvecN) for thecrossbuilt-in. By @ErichDonGubler in #6171. - Configure
SourceLanguagewhen enabling debug info in SPV-out. By @kvark in #6256. - Do not consider per-polygon and flat inputs subgroup uniform. By @magcius in #6276.
- Validate all swizzle components are either color (rgba) or dimension (xyzw) in WGSL. By @sagudev in #6187.
- Fix detection of shl overflows to detect arithmetic overflows. By @sagudev in #6186.
- Fix type parameters to vec/mat type constructors to also support aliases. By @sagudev in #6189.
- Accept global
vars without explicit type. By @sagudev in #6199. - Fix handling of phony statements, so they are actually emitted. By @sagudev in #6328.
- Added
gl_DrawIDto glsl andDrawIndexto spv. By @ChosenName in #6325. - Matrices can now be indexed by value (#4337), and indexing arrays by value no longer causes excessive spilling (#6358). By @jimblandy in #6390.
- Add support for
textureQueryLevelsto the GLSL parser. By @magcius in #6325. - Fix unescaped identifiers in the Metal backend shader I/O structures causing shader miscompilation. By @ErichDonGubler in #6438.
General
- If GL context creation fails retry with GLES. By @Rapdorian in #5996.
- Bump MSRV for
d3d12/naga/wgpu-core/wgpu-hal/wgpu-types' to 1.76. By @wumpf in #6003. - Print requested and supported usages on
UnsupportedUsageerror. By @VladasZ in #6007. - Deduplicate bind group layouts that are created from pipelines with "auto" layouts. By @teoxoy #6049.
- Document
wgpu_halbounds-checking promises, and adaptwgpu_core's lazy initialization logic to the slightly weaker-than-expected guarantees. By @jimblandy in #6201. - Raise validation error instead of panicking in
{Render,Compute}Pipeline::get_bind_group_layouton native / WebGL. By @bgr360 in #6280. - BREAKING: Remove the last exposed C symbols in project, located in
wgpu_core::render::bundle::bundle_ffi, to allow multiple versions of wgpu to compile together. By @ErichDonGubler in #6272. - Call
flush_mapped_rangeswhen unmapping write-mapped buffers. By @teoxoy in #6089. - When mapping buffers for reading, mark buffers as initialized only when they have
MAP_WRITEusage. By @teoxoy in #6178. - Add a separate pipeline constants error. By @teoxoy in #6094.
- Ensure safety of indirect dispatch by injecting a compute shader that validates the content of the indirect buffer. By @teoxoy in #5714.
- Add conversions between
TextureFormatandStorageFormat. By @caelunshun in #6185
GLES / OpenGL
- Fix GL debug message callbacks not being properly cleaned up (causing UB). By @Imberflur in #6114.
- Fix calling
slice::from_raw_partswith unaligned pointers in push constant handling. By @Imberflur in #6341. - Optimise fence checking when
Queue::submitis called many times per frame. By @dinnerbone in #6427.
WebGPU
- Fix JS
TypeErrorexception inInstance::request_adapterwhen browser doesn't support WebGPU butwgpunot compiled withwebglsupport. By @bgr360 in #6197.
Vulkan
- Avoid undefined behaviour with adversarial debug label. By @DJMcNab in #6257.
- Add
.index_type(vk::IndexType::NONE_KHR)when creatingAccelerationStructureGeometryTrianglesDataKHRin the raytraced triangle example to prevent a validation error. By @Vecvec in #6282.
Changes
wgpu_hal::gles::Adapter::new_externalnow requires the context to be current when dropping the adapter and related objects. By @Imberflur in #6114.- Reduce the amount of debug and trace logs emitted by wgpu-core and wgpu-hal. By @nical in #6065.
- Rename
Rg11b10FloattoRg11b10Ufloat. By @sagudev in #6108. - Invalidate the device when we encounter driver-induced device loss or on unexpected errors. By @teoxoy in #6229.
- Make Vulkan error handling more robust. By @teoxoy in #6119.
- Add bounds checking to Buffer slice method. By @beholdnec in #6432.
- Replace
impl From<StorageFormat> for ScalarKindwithimpl From<StorageFormat> for Scalarso that byte width is included. By @atlv24 in #6451.
Internal
- Tracker simplifications. By @teoxoy in #6073 & #6088.
- D3D12 cleanup. By @teoxoy in #6200.
- Use
ManuallyDropin remaining places. By @teoxoy in #6092. - Move out invalidity from the
Registry. By @teoxoy in #6243. - Remove
backendfrom ID. By @teoxoy in #6263.
HAL
- Change the inconsistent
DropGuardbased API on Vulkan and GLES to a consistent, callback-based one. By @jerzywilczek in #6164.
Documentation
- Removed some OpenGL and Vulkan references from
wgpu-typesdocumentation. Fixed Storage texel types in examples. By @Nelarius in #6271. - Used
wgpu::include_wgsl!(…)more in examples and tests. By @ErichDonGubler in #6326.
Dependency Updates
GLES
- Replace
winapicode in WGL wrapper to use thewindowscrate. By @MarijnS95 in #6006. - Update
glutinto0.31withglutin-winitcrate. By @MarijnS95 in #6150 and #6176. - Implement
Adapter::new_external()for WGL (just like EGL) to import an external OpenGL ES context. By @MarijnS95 in #6152.
DX12
- Replace
winapicode to use thewindowscrate. By @MarijnS95 in #5956 and #6173. - Get
num_workgroupsbuiltin working for indirect dispatches. By @teoxoy in #5730.
HAL
- Update
parking_lotto0.12. By @mahkoh in #6287.
v22.1.0 (2024-07-17)
This release includes wgpu, wgpu-core and naga. All other crates remain at 22.0.0.
Added
naga
- Added back implementations of PartialEq for more IR types. By @teoxoy in #6045
Bug Fixes
General
- Fix profiling with
tracy. By @waywardmonkeys in #5988 - Fix function for checking bind compatibility to error instead of panic. By @sagudev #6012
- Fix crash when dropping the surface after the device. By @wumpf in #6052
- Fix length of copy in
queue_write_texture. By @teoxoy in #6009 - Fix error message that is thrown in create_render_pass to no longer say
compute_pass. By @matthew-wong1 #6041 - As a workaround for issue #4905,
wgpu-coreis undocumented unless--cfg wgpu_core_docfeature is enabled. By @kpreid in #5987
22.0.0 (2024-07-17)
Overview
Our first major version release!
For the first time ever, wgpu is being released with a major version (i.e., 22.* instead of 0.22.*)! Maintainership has decided to fully adhere to Semantic Versioning's recommendations for versioning production software. According to SemVer 2.0.0's Q&A about when to use 1.0.0 versions (and beyond):
How do I know when to release 1.0.0?
If your software is being used in production, it should probably already be 1.0.0. If you have a stable API on which users have come to depend, you should be 1.0.0. If you’re worrying a lot about backward compatibility, you should probably already be 1.0.0.
It is a well-known fact that wgpu has been used for applications and platforms already in production for years, at this point. We are often concerned with tracking breaking changes, and affecting these consumers' ability to ship. By releasing our first major version, we publicly acknowledge that this is the case. We encourage other projects in the Rust ecosystem to follow suit.
Note that while we start to use the major version number, wgpu is not "going stable", as many Rust projects do. We anticipate many breaking changes before we fully comply with the WebGPU spec., which we expect to take a small number of years.
Overview
A major (pun intended) theme of this release is incremental improvement. Among the typically large set of bug fixes, new features, and other adjustments to wgpu by the many contributors listed below, @wumpf and @teoxoy have merged a series of many simplifications to wgpu's internals and, in one case, to the render and compute pass recording APIs. Many of these change wgpu to use atomically reference-counted resource tracking (i.e., Arc<…>), rather than using IDs to manage the lifetimes of platform-specific graphics resources in a registry of separate reference counts. This has led us to diagnose and fix many long-standing bugs, and net some neat performance improvements on the order of 40% or more of some workloads.
While the above is exciting, we acknowledge already finding and fixing some (easy-to-fix) regressions from the above work. If you migrate to wgpu 22 and encounter such bugs, please engage us in the issue tracker right away!
Major Changes
Lifetime bounds on wgpu::RenderPass & wgpu::ComputePass
wgpu::RenderPass & wgpu::ComputePass recording methods (e.g. wgpu::RenderPass:set_render_pipeline) no longer impose a lifetime constraint to objects passed to a pass (like pipelines/buffers/bindgroups/query-sets etc.).
This means the following pattern works now as expected:
let mut pipelines: Vec<wgpu::RenderPipeline> = ...;
// ...
let mut cpass = encoder.begin_compute_pass(&wgpu::ComputePassDescriptor::default());
cpass.set_pipeline(&pipelines[123]);
// Change pipeline container - this requires mutable access to `pipelines` while one of the pipelines is in use.
pipelines.push(/* ... */);
// Continue pass recording.
cpass.set_bindgroup(...);
Previously, a set pipeline (or other resource) had to outlive pass recording which often affected wider systems,
meaning that users needed to prove to the borrow checker that Vec<wgpu::RenderPipeline> (or similar constructs)
aren't accessed mutably for the duration of pass recording.
Furthermore, you can now opt out of wgpu::RenderPass/wgpu::ComputePass's lifetime dependency on its parent wgpu::CommandEncoder using wgpu::RenderPass::forget_lifetime/wgpu::ComputePass::forget_lifetime:
fn independent_cpass<'enc>(encoder: &'enc mut wgpu::CommandEncoder) -> wgpu::ComputePass<'static> {
let cpass: wgpu::ComputePass<'enc> = encoder.begin_compute_pass(&wgpu::ComputePassDescriptor::default());
cpass.forget_lifetime()
}
⚠️ As long as a wgpu::RenderPass/wgpu::ComputePass is pending for a given wgpu::CommandEncoder, creation of a compute or render pass is an error and invalidates the wgpu::CommandEncoder.
forget_lifetime can be very useful for library authors, but opens up an easy way for incorrect use, so use with care.
This method doesn't add any additional overhead and has no side effects on pass recording.
By @wumpf in #5569, #5575, #5620, #5768 (together with @kpreid), #5671, #5794, #5884.
Querying shader compilation errors
Wgpu now supports querying shader compilation info.
This allows you to get more structured information about compilation errors, warnings and info:
...
let lighting_shader = ctx.device.create_shader_module(include_wgsl!("lighting.wgsl"));
let compilation_info = lighting_shader.get_compilation_info().await;
for message in compilation_info
.messages
.iter()
.filter(|m| m.message_type == wgpu::CompilationMessageType::Error)
{
let line = message.location.map(|l| l.line_number).unwrap_or(1);
println!("Compile error at line {line}");
}
By @stefnotch in #5410
64 bit integer atomic support in shaders.
Add support for 64 bit integer atomic operations in shaders.
Add the following flags to wgpu_types::Features:
-
SHADER_INT64_ATOMIC_ALL_OPSenables all atomic operations onatomic<i64>andatomic<u64>values. -
SHADER_INT64_ATOMIC_MIN_MAXis a subset of the above, enabling onlyAtomicFunction::MinandAtomicFunction::Maxoperations onatomic<i64>andatomic<u64>values in theStorageaddress space. These are the only 64-bit atomic operations available on Metal as of 3.1.
Add corresponding flags to naga::valid::Capabilities. These are supported by the
WGSL front end, and all naga backends.
Platform support:
-
On Direct3d 12, in
D3D12_FEATURE_DATA_D3D12_OPTIONS9, ifAtomicInt64OnTypedResourceSupportedandAtomicInt64OnGroupSharedSupportedare both available, then both wgpu features described above are available. -
On Metal,
SHADER_INT64_ATOMIC_MIN_MAXis available on Apple9 hardware, and on hardware that advertises both Apple8 and Mac2 support. This also requires Metal Shading Language 2.4 or later. Metal does not yet support the more generalSHADER_INT64_ATOMIC_ALL_OPS. -
On Vulkan, if the
VK_KHR_shader_atomic_int64extension is available with both theshader_buffer_int64_atomicsandshader_shared_int64_atomicsfeatures, then both wgpu features described above are available.
By @atlv24 in #5383
A compatible surface is now required for request_adapter() on WebGL2 + enumerate_adapters() is now native only.
When targeting WebGL2, it has always been the case that a surface had to be created before calling request_adapter().
We now make this requirement explicit.
Validation was also added to prevent configuring the surface with a device that doesn't share the same underlying WebGL2 context since this has never worked.
Calling enumerate_adapters() when targeting WebGPU used to return an empty Vec and since we now require users
to pass a compatible surface when targeting WebGL2, having enumerate_adapters() doesn't make sense.
By @teoxoy in #5901
New features
General
- Added
as_halforBufferto access wgpu created buffers form wgpu-hal. By @JasondeWolff in #5724 include_wgsl!is now callable in const contexts by @9SMTM6 in #5872- Added memory allocation hints to
DeviceDescriptorby @nical in #5875MemoryHints::Performance, the default, favors performance over memory usage and will likely cause large amounts of VRAM to be allocated up-front. This hint is typically good for games.MemoryHints::MemoryUsagefavors memory usage over performance. This hint is typically useful for smaller applications or UI libraries.MemoryHints::Manualallows the user to specify parameters for the underlying GPU memory allocator. These parameters are subject to change.- These hints may be ignored by some backends. Currently only the Vulkan and D3D12 backends take them into account.
- Add
HTMLImageElementandImageDataas external source for copying images. By @Valaphee in #5668
naga
-
Added -D, --defines option to naga CLI to define preprocessor macros by @theomonnom in #5859
-
Added type upgrades to SPIR-V atomic support. Added related infrastructure. Tracking issue is here. By @schell in #5775.
-
Implement
WGSL'sunpack4xI8,unpack4xU8,pack4xI8andpack4xU8. By @VlaDexa in #5424 -
Began work adding support for atomics to the SPIR-V frontend. Tracking issue is here. By @schell in #5702.
-
In hlsl-out, allow passing information about the fragment entry point to omit vertex outputs that are not in the fragment inputs. By @Imberflur in #5531
-
In spv-out, allow passing
acceleration_structureas a function argument. By @kvark in #5961let writer: naga::back::hlsl::Writer = /* ... */; -writer.write(&module, &module_info); +writer.write(&module, &module_info, None); -
HLSL & MSL output can now be added conditionally on the target via the
msl-out-if-target-appleandhlsl-out-if-target-windowsfeatures. This is used in wgpu-hal to no longer compile with MSL output whenmetalis enabled & MacOS isn't targeted and no longer compile with HLSL output whendx12is enabled & Windows isn't targeted. By @wumpf in #5919
Vulkan
- Added a
PipelineCacheresource to allow using Vulkan pipeline caches. By @DJMcNab in #5319
WebGPU
- Added support for pipeline-overridable constants to the WebGPU backend by @DouglasDwyer in #5688
Changes
General
- Unconsumed vertex outputs are now always allowed. Removed
StageError::InputNotConsumed,Features::SHADER_UNUSED_VERTEX_OUTPUT, and associated validation. By @Imberflur in #5531 - Avoid introducing spurious features for optional dependencies. By @bjorn3 in #5691
wgpu::Erroris nowSync, making it possible to be wrapped inanyhow::Errororeyre::Report. By @nolanderc in #5820- Added benchmark suite. By @cwfitzgerald in #5694, compute passes by @wumpf in #5767
- Improve performance of
.submit()by 39-64% (.submit()+.poll()by 22-32%). By @teoxoy in #5910 - The
tracewgpu feature has been temporarily removed. By @teoxoy in #5975
Metal
-
Removed the
linkCargo feature.This was used to allow weakly linking frameworks. This can be achieved with putting something like the following in your
.cargo/config.tomlinstead:[target.'cfg(target_vendor = "apple")'] rustflags = ["-C", "link-args=-weak_framework Metal -weak_framework QuartzCore -weak_framework CoreGraphics"]By @madsmtm in #5752
Bug Fixes
General
- Ensure render pipelines have at least 1 target. By @ErichDonGubler in #5715
wgpu::ComputePassnow internally takes ownership ofQuerySetfor bothwgpu::ComputePassTimestampWritesas well as timestamp writes and statistics query, fixing crashes when destroyingQuerySetbefore ending the pass. By @wumpf in #5671- Validate resources passed during compute pass recording for mismatching device. By @wumpf in #5779
- Fix staging buffers being destroyed too early. By @teoxoy in #5910
- Fix attachment byte cost validation panicking with native only formats. By @teoxoy in #5934
- [wgpu] Fix leaks from auto layout pipelines. By @teoxoy in #5971
- [wgpu-core] Fix length of copy in
queue_write_texture(causing UB). By @teoxoy in #5973 - Add missing same device checks. By @teoxoy in #5980
GLES / OpenGL
- Fix
ClearColorF,ClearColorUandClearColorIcommands being issued beforeSetDrawColorBuffers#5666 - Replace
glClearwithglClearBufferFbecauseglDrawBuffersrequires that the ith buffer must beCOLOR_ATTACHMENTiorNONE#5666 - Return the unmodified version in driver_info. By @Valaphee in #5753
naga
- In spv-out don't decorate a
BindingArray's type withBlockif the type is a struct with a runtime array by @Vecvec in #5776 - Add
packedas a keyword for GLSL by @kjarosh in #5855
v0.20.2 (2024-06-12)
This release force-bumps transitive dependencies of wgpu on wgpu-core and wgpu-hal to 0.21.1, to resolve some undefined behavior observable in the DX12 backend after upgrading to Rust 1.79 or later.
Bug Fixes
General
- Fix a
CommandBufferleak. By @cwfitzgerald and @nical in #5141
DX12
- Do not feed
&""toD3DCompile, by @workingjubilee in #5812.
v0.20.1 (2024-06-12)
This release included v0.21.0 of wgpu-core and wgpu-hal, due to breaking changes needed to solve vulkan validation issues.
Bug Fixes
This release fixes the validation errors whenever a surface is used with the vulkan backend. By @cwfitzgerald in #5681.
General
- Clean up weak references to texture views and bind groups to prevent memory leaks. By @xiaopengli89 in #5595.
- Fix segfault on exit is queue & device are dropped before surface. By @sagudev in #5640.
Metal
- Fix unrecognized selector crash on iOS 12. By @vladasz in #5744.
Vulkan
- Fix enablement of subgroup ops extension on Vulkan devices that don't support Vulkan 1.3. By @cwfitzgerald in #5624.
GLES / OpenGL
- Fix regression on OpenGL (EGL) where non-sRGB still used sRGB #5642
naga
- Work around shader consumers that have bugs handling
switchstatements with a single body for all cases. These are now written asdo {} while(false);loops in hlsl-out and glsl-out. By @Imberflur in #5654 - In hlsl-out, defer
continuestatements in switches by setting a flag and breaking from the switch. This allows such constructs to work with FXC which does not supportcontinuewithin a switch. By @Imberflur in #5654
v0.20.0 (2024-04-28)
Major Changes
Pipeline overridable constants
Wgpu supports now pipeline-overridable constants
This allows you to define constants in wgsl like this:
override some_factor: f32 = 42.1337; // Specifies a default of 42.1337 if it's not set.
And then set them at runtime like so on your pipeline consuming this shader:
// ...
fragment: Some(wgpu::FragmentState {
compilation_options: wgpu::PipelineCompilationOptions {
constants: &[("some_factor".to_owned(), 0.1234)].into(), // Sets `some_factor` to 0.1234.
..Default::default()
},
// ...
}),
// ...
By @teoxoy & @jimblandy in #5500
Changed feature requirements for timestamps
Due to a specification change write_timestamp is no longer supported on WebGPU.
wgpu::CommandEncoder::write_timestamp requires now the new wgpu::Features::TIMESTAMP_QUERY_INSIDE_ENCODERS feature which is available on all native backends but not on WebGPU.
By @wumpf in #5188
Wgsl const evaluation for many more built-ins
Many numeric built-ins have had a constant evaluation implementation added for them, which allows them to be used in a const context:
abs, acos, acosh, asin, asinh, atan, atanh, cos, cosh, round, saturate, sin, sinh, sqrt, step, tan, tanh, ceil, countLeadingZeros, countOneBits, countTrailingZeros, degrees, exp, exp2, floor, fract, fma, inverseSqrt, log, log2, max, min, radians, reverseBits, sign, trunc
By @ErichDonGubler in #4879, #5098
New native-only wgsl features
Subgroup operations
The following subgroup operations are available in wgsl now:
subgroupBallot, subgroupAll, subgroupAny, subgroupAdd, subgroupMul, subgroupMin, subgroupMax, subgroupAnd, subgroupOr, subgroupXor, subgroupExclusiveAdd, subgroupExclusiveMul, subgroupInclusiveAdd, subgroupInclusiveMul, subgroupBroadcastFirst, subgroupBroadcast, subgroupShuffle, subgroupShuffleDown, subgroupShuffleUp, subgroupShuffleXor
Availability is governed by the following feature flags:
wgpu::Features::SUBGROUPfor all operations exceptsubgroupBarrierin fragment & compute, supported on Vulkan, DX12 and Metal.wgpu::Features::SUBGROUP_VERTEX, for all operations exceptsubgroupBarriergeneral operations in vertex shaders, supported on Vulkanwgpu::Features::SUBGROUP_BARRIER, for support of thesubgroupBarrieroperation, supported on Vulkan & Metal
Note that there currently some differences between wgpu's native-only implementation and the open WebGPU proposal.
By @exrook and @lichtso in #5301
Signed and unsigned 64 bit integer support in shaders.
wgpu::Features::SHADER_INT64 enables 64 bit integer signed and unsigned integer variables in wgsl (i64 and u64 respectively).
Supported on Vulkan, DX12 (requires DXC) and Metal (with MSL 2.3+ support).
By @atlv24 and @cwfitzgerald in #5154
New features
General
- Implemented the
Unorm10_10_10_2VertexFormat by @McMackety in #5477 wgpu-types'straceandreplayfeatures have been replaced by theserdefeature. By @KirmesBude in #5149wgpu-core'sserial-passfeature has been removed. Useserdeinstead. By @KirmesBude in #5149- Added
InstanceFlags::GPU_BASED_VALIDATION, which enables GPU-based validation for shaders. This is currently only supported on the DX12 and Vulkan backends; other platforms ignore this flag, for now. By @ErichDonGubler in #5146, #5046.- When set, this flag implies
InstanceFlags::VALIDATION. - This has been added to the set of flags set by
InstanceFlags::advanced_debugging. Since the overhead is potentially very large, the flag is not enabled by default in debug builds when usingInstanceFlags::from_build_config. - As with other instance flags, this flag can be changed in calls to
InstanceFlags::with_envwith the newWGPU_GPU_BASED_VALIDATIONenvironment variable.
- When set, this flag implies
wgpu::Instancecan now report whichwgpu::Backendsare available based on the build configuration. By @wumpf #5167-wgpu::Instance::any_backend_feature_enabled() +!wgpu::Instance::enabled_backend_features().is_empty()- Breaking change:
wgpu_core::pipeline::ProgrammableStageDescriptoris now optional. By @ErichDonGubler in #5305. Features::downlevel{_webgl2,}_featureswas made const by @MultisampledNight in #5343- Breaking change:
wgpu_core::pipeline::ShaderErrorhas been moved tonaga. By @stefnotch in #5410 - More as_hal methods and improvements by @JMS55 in #5452
- Added
wgpu::CommandEncoder::as_hal_mut - Added
wgpu::TextureView::as_hal wgpu::Texture::as_halnow returns a user-defined type to match the other as_hal functions
- Added
naga
- Allow user to select which MSL version to use via
--metal-versionwith naga CLI. By @pcleavelin in #5392 - Support
arrayLengthfor runtime-sized arrays inside binding arrays (for WGSL input and SPIR-V output). By @kvark in #5428 - Added
--shader-stageand--input-kindoptions to naga-cli for specifying vertex/fragment/compute shaders, and frontend. by @ratmice in #5411 - Added a
create_validatorfunction to wgpu_coreDeviceto create nagaValidators. By @atlv24 #5606
WebGPU
- Implement the
device_set_device_lost_callbackmethod forContextWebGpu. By @suti in #5438 - Add support for storage texture access modes
ReadOnlyandReadWrite. By @JolifantoBambla in #5434
GLES / OpenGL
- Log an error when GLES texture format heuristics fail. By @PolyMeilex in #5266
- Cache the sample count to keep
get_texture_format_featurescheap. By @Dinnerbone in #5346 - Mark
DEPTH32FLOAT_STENCIL8as supported in GLES. By @Dinnerbone in #5370 - Desktop GL now also supports
TEXTURE_COMPRESSION_ETC2. By @Valaphee in #5568 - Don't create a program for shader-clearing if that workaround isn't required. By @Dinnerbone in #5348.
- OpenGL will now be preferred over OpenGL ES on EGL, making it consistent with WGL. By @valaphee in #5482
- Fill out
driveranddriver_info, with the OpenGL flavor and version, similar to Vulkan. By @valaphee in #5482
Metal
- Metal 3.0 and 3.1 detection. By @atlv24 in #5497
DX12
- Shader Model 6.1-6.7 detection. By @atlv24 in #5498
Other performance improvements
- Simplify and speed up the allocation of internal IDs. By @nical in #5229
- Use memory pooling for UsageScopes to avoid frequent large allocations. by @robtfm in #5414
- Eager release of GPU resources comes from device.trackers. By @bradwerth in #5075
- Support disabling zero-initialization of workgroup local memory in compute shaders. By @DJMcNab in #5508
Documentation
- Improved
wgpu_haldocumentation. By @jimblandy in #5516, #5524, #5562, #5563, #5566, #5617, #5618 - Add mention of primitive restart in the description of
PrimitiveState::strip_index_format. By @cpsdqs in #5350 - Document and tweak precise behaviour of
SourceLocation. By @stefnotch in #5386 and #5410 - Give short example of WGSL
push_constantsyntax. By @waywardmonkeys in #5393 - Fix incorrect documentation of
Limits::max_compute_workgroup_storage_sizedefault value. By @atlv24 in #5601
Bug Fixes
General
- Fix
serdefeature not compiling forwgpu-types. By @KirmesBude in #5149 - Fix the validation of vertex and index ranges. By @nical in #5144 and #5156
- Fix panic when creating a surface while no backend is available. By @wumpf #5166
- Correctly compute minimum buffer size for array-typed
storageanduniformvars. By @jimblandy #5222 - Fix timeout when presenting a surface where no work has been done. By @waywardmonkeys in #5200
- Fix registry leaks with de-duplicated resources. By @nical in #5244
- Fix linking when targeting android. By @ashdnazg in #5326.
- Failing to set the device lost closure will call the closure before returning. By @bradwerth in #5358.
- Fix deadlocks caused by recursive read-write lock acquisitions #5426.
- Remove exposed C symbols (
extern "C"+ [no_mangle]) from RenderPass & ComputePass recording. By @wumpf in #5409. - Fix surfaces being only compatible with first backend enabled on an instance, causing failures when manually specifying an adapter. By @Wumpf in #5535.
naga
- In spv-in, remove unnecessary "gl_PerVertex" name check so unused builtins will always be skipped. Prevents validation errors caused by capability requirements of these builtins #4915. By @Imberflur in #5227.
- In spv-out, check for acceleration and ray-query types when enabling ray-query extension to prevent validation error. By @Vecvec in #5463
- Add a limit for curly brace nesting in WGSL parsing, plus a note about stack size requirements. By @ErichDonGubler in #5447.
- In hlsl-out, fix accesses on zero value expressions by generating helper functions for
Expression::ZeroValue. By @Imberflur in #5587. - Fix behavior of
extractBitsandinsertBitswhenoffset + countoverflows the bit width. By @cwfitzgerald in #5305 - Fix behavior of integer
clampwhenminargument >maxargument. By @cwfitzgerald in #5300. - Fix
TypeInner::scalar_widthto be consistent with the rest of the codebase and return values in bytes not bits. By @atlv24 in #5532.
GLES / OpenGL
- GLSL 410 does not support layout(binding = ...), enable only for GLSL 420. By @bes in #5357
- Fixes for being able to use an OpenGL 4.1 core context provided by macOS with wgpu. By @bes in #5331.
- Fix crash when holding multiple devices on wayland/surfaceless. By @ashdnazg in #5351.
- Fix
first_instancegetting ignored in draw indexed whenARB_shader_draw_parametersfeature is present andbase_vertexis 0. By @valaphee in #5482
Vulkan
- Set object labels when the DEBUG flag is set, even if the VALIDATION flag is disabled. By @DJMcNab in #5345.
- Add safety check to
wgpu_hal::vulkan::CommandEncoderto make surediscard_encodingis not called in the closed state. By @villuna in #5557 - Fix SPIR-V type capability requests to not depend on
LocalTypecaching. By @atlv24 in #5590 - Upgrade
ashto0.38. By @MarijnS95 in #5504.
Tests
- Fix intermittent crashes on Linux in the
multithreaded_computetest. By @jimblandy in #5129. - Refactor tests to read feature flags by name instead of a hardcoded hexadecimal u64. By @atlv24 in #5155.
- Add test that verifies that we can drop the queue before using the device to create a command encoder. By @Davidster in #5211
0.19.5 (2024-07-16)
This release only releases wgpu-hal 0.19.5, which contains an important fix
for DX12.
Bug Fixes
DX12
v0.19.4 (2024-04-17)
Bug Fixes
General
- Don't depend on bind group and bind group layout entry order in backends. This caused incorrect severely incorrect command execution and, in some cases, crashes. By @ErichDonGubler in #5421.
- Properly clean up all write_buffer/texture temporary resources. By @robtfm in #5413.
- Fix deadlock in certain situations when mapping buffers using
wgpu-profiler. By @cwfitzgerald in #5517
WebGPU
- Correctly pass through timestamp queries to WebGPU. By @cwfitzgerald in #5527.
v0.19.3 (2024-03-01)
This release includes wgpu, wgpu-core, and wgpu-hal. All other crates are unchanged.
Major Changes
Vendored WebGPU Bindings from web_sys
--cfg=web_sys_unstable_apis is no longer needed in your RUSTFLAGS to compile for WebGPU!!!
While WebGPU's javascript api is stable in the browsers, the web_sys bindings for WebGPU are still improving. As such they are hidden behind the special cfg --cfg=web_sys_unstable_apis and are not available by default. Everyone who wanted to use our WebGPU backend needed to enable this cfg in their RUSTFLAGS. This was very inconvenient and made it hard to use WebGPU, especially when WebGPU is enabled by default. Additionally, the unstable APIs don't adhere to semver, so there were repeated breakages.
To combat this problem we have decided to vendor the web_sys bindings for WebGPU within the crate. Notably we are not forking the bindings, merely vendoring, so any improvements we make to the bindings will be contributed directly to upstream web_sys.
By @cwfitzgerald in #5325.
Bug Fixes
General
- Fix an issue where command encoders weren't properly freed if an error occurred during command encoding. By @ErichDonGubler in #5251.
- Fix incorrect validation causing all indexed draws on render bundles to fail. By @wumpf in #5430.
Android
- Fix linking error when targeting android without
winit. By @ashdnazg in #5326.
v0.19.2 (2024-02-29)
This release includes wgpu, wgpu-core, wgpu-hal, wgpu-types, and naga. All other crates are unchanged.
Added/New Features
General
wgpu::Idnow implementsPartialOrd/Ordallowing it to be put inBTreeMaps. By @cwfitzgerald and @9291Sam in #5176
OpenGL
- Log an error when OpenGL texture format heuristics fail. By @PolyMeilex in #5266
wgsl-out
- Learned to generate acceleration structure types. By @JMS55 in #5261
Documentation
- Fix link in
wgpu::Instance::create_surfacedocumentation. By @HexoKnight in #5280. - Fix typo in
wgpu::CommandEncoder::clear_bufferdocumentation. By @PWhiddy in #5281. Surfaceconfiguration incorrectly claimed thatwgpu::Instance::create_surfacewas unsafe. By @hackaugusto in #5265.
Bug Fixes
General
- Device lost callbacks are invoked when replaced and when global is dropped. By @bradwerth in #5168
- Fix performance regression when allocating a large amount of resources of the same type. By @nical in #5229
- Fix docs.rs wasm32 builds. By @cwfitzgerald in #5310
- Improve error message when binding count limit hit. By @hackaugusto in #5298
- Remove an unnecessary
cloneduring GLSL shader ingestion. By @a1phyr in #5118. - Fix missing validation for
Device::clear_bufferwhereoffset + size > buffer.sizewas not checked whensizewas omitted. By @ErichDonGubler in #5282.
DX12
- Fix
panic!when droppingInstancewithoutInstanceFlags::VALIDATION. By @hakolao in #5134
OpenGL
- Fix internal format for the
Etc2Rgba8Unormformat. By @andristarr in #5178 - Try to load
libX11.so.6in addition tolibX11.soon linux. #5307 - Make use of
GL_EXT_texture_shadow_lodto support sampling a cube depth texture with an explicit LOD. By @cmrschwarz in #5171.
glsl-in
- Fix code generation from nested loops. By @cwfitzgerald and @teoxoy in #5311
v0.19.1 (2024-01-22)
This release includes wgpu and wgpu-hal. The rest of the crates are unchanged since 0.19.0.
Bug Fixes
DX12
- Properly register all swapchain buffers to prevent error on surface present. By @dtzxporter in #5091
- Check for extra null states when creating resources. By @nical in #5096
- Fix depth-only and stencil-only views causing crashes. By @teoxoy in #5100
OpenGL
- In Surface::configure and Surface::present on Windows, fix the current GL context not being unset when releasing the lock that guards access to making the context current. This was causing other threads to panic when trying to make the context current. By @Imberflur in #5087.
WebGPU
- Improve error message when compiling WebGPU backend on wasm without the
web_sys_unstable_apisset. By @rukai in #5104
Documentation
- Document Wayland specific behavior related to
SurfaceTexture::present. By @i509VCB in #5093.
v0.19.0 (2024-01-17)
This release includes:
wgpuwgpu-corewgpu-halwgpu-typeswgpu-infonaga(skipped from 0.14 to 0.19)naga-cli(skipped from 0.14 to 0.19)d3d12(skipped from 0.7 to 0.19)
Improved Multithreading through internal use of Reference Counting
Large refactoring of wgpu’s internals aiming at reducing lock contention, and providing better performance when using wgpu on multiple threads.
By @gents83 in #3626 and thanks also to @jimblandy, @nical, @Wumpf, @Elabajaba & @cwfitzgerald
All Public Dependencies are Re-Exported
All of wgpu's public dependencies are now re-exported at the top level so that users don't need to take their own dependencies. This includes:
- wgpu-core
- wgpu-hal
- naga
- raw_window_handle
- web_sys
Feature Flag Changes
WebGPU & WebGL in the same Binary
Enabling webgl no longer removes the webgpu backend.
Instead, there's a new (default enabled) webgpu feature that allows to explicitly opt-out of webgpu if so desired.
If both webgl & webgpu are enabled, wgpu::Instance decides upon creation whether to target wgpu-core/WebGL or WebGPU.
This means that adapter selection is not handled as with regular adapters, but still allows to decide at runtime whether
webgpu or the webgl backend should be used using a single wasm binary.
By @wumpf in #5044
naga-ir Dedicated Feature
The naga-ir feature has been added to allow you to add naga module shaders without guessing about what other features needed to be enabled to get access to it.
By @cwfitzgerald in #5063.
expose-ids Feature available unconditionally
This feature allowed you to call global_id on any wgpu opaque handle to get a unique hashable identity for the given resource. This is now available without the feature flag.
By @cwfitzgerald in #4841.
dx12 and metal Backend Crate Features
wgpu now exposes backend feature for the Direct3D 12 (dx12) and Metal (metal) backend. These are enabled by default, but don't do anything when not targeting the corresponding OS.
By @daxpedda in #4815.
Direct3D 11 Backend Removal
This backend had no functionality, and with the recent support for GL on Desktop, which allows wgpu to run on older devices, there was no need to keep this backend. By @valaphee in #4828.
WGPU_ALLOW_UNDERLYING_NONCOMPLIANT_ADAPTER Environment Variable
This adds a way to allow a Vulkan driver which is non-compliant per VK_KHR_driver_properties to be enumerated. This is intended for testing new Vulkan drivers which are not Vulkan compliant yet.
By @i509VCB in #4754.
DeviceExt::create_texture_with_data allows Mip-Major Data
Previously, DeviceExt::create_texture_with_data only allowed data to be provided in layer major order. There is now a order parameter which allows you to specify if the data is in layer major or mip major order.
let tex = ctx.device.create_texture_with_data(
&queue,
&descriptor,
+ wgpu::util::TextureDataOrder::LayerMajor,
src_data,
);
By @cwfitzgerald in #4780.
Safe & unified Surface Creation
It is now possible to safely create a wgpu::Surface with wgpu::Instance::create_surface() by letting wgpu::Surface hold a lifetime to window.
Passing an owned value window to Surface will return a wgpu::Surface<'static>.
All possible safe variants (owned windows and web canvases) are grouped using wgpu::SurfaceTarget.
Conversion to wgpu::SurfaceTarget is automatic for any type implementing raw-window-handle's HasWindowHandle & HasDisplayHandle traits, i.e. most window types.
For web canvas types this has to be done explicitly:
let surface: wgpu::Surface<'static> = instance.create_surface(wgpu::SurfaceTarget::Canvas(my_canvas))?;
All unsafe variants are now grouped under wgpu::Instance::create_surface_unsafe which takes the
wgpu::SurfaceTargetUnsafe enum and always returns wgpu::Surface<'static>.
In order to create a wgpu::Surface<'static> without passing ownership of the window use
wgpu::SurfaceTargetUnsafe::from_window:
let surface = unsafe {
instance.create_surface_unsafe(wgpu::SurfaceTargetUnsafe::from_window(&my_window))?
};
The easiest way to make this code safe is to use shared ownership:
let window: Arc<winit::Window>;
// ...
let surface = instance.create_surface(window.clone())?;
All platform specific surface creation using points have moved into SurfaceTargetUnsafe as well.
For example:
Safety by @daxpedda in #4597 Unification by @wumpf in #4984
Add partial Support for WGSL Abstract Types
Abstract types make numeric literals easier to use, by automatically converting literals and other constant expressions from abstract numeric types to concrete types when safe and necessary. For example, to build a vector of floating-point numbers, naga previously made you write:
vec3<f32>(1.0, 2.0, 3.0)
With this change, you can now simply write:
vec3<f32>(1, 2, 3)
Even though the literals are abstract integers, naga recognizes
that it is safe and necessary to convert them to f32 values in
order to build the vector. You can also use abstract values as
initializers for global constants and global and local variables,
like this:
var unit_x: vec2<f32> = vec2(1, 0);
The literals 1 and 0 are abstract integers, and the expression
vec2(1, 0) is an abstract vector. However, naga recognizes that
it can convert that to the concrete type vec2<f32> to satisfy
the given type of unit_x.
The WGSL specification permits abstract integers and
floating-point values in almost all contexts, but naga's support
for this is still incomplete. Many WGSL operators and builtin
functions are specified to produce abstract results when applied
to abstract inputs, but for now naga simply concretizes them all
before applying the operation. We will expand naga's abstract type
support in subsequent pull requests.
As part of this work, the public types naga::ScalarKind and
naga::Literal now have new variants, AbstractInt and AbstractFloat.
By @jimblandy in #4743, #4755.
Instance::enumerate_adapters now returns Vec<Adapter> instead of an ExactSizeIterator
This allows us to support WebGPU and WebGL in the same binary.
- let adapters: Vec<Adapter> = instance.enumerate_adapters(wgpu::Backends::all()).collect();
+ let adapters: Vec<Adapter> = instance.enumerate_adapters(wgpu::Backends::all());
By @wumpf in #5044
device.poll() now returns a MaintainResult instead of a bool
This is a forward looking change, as we plan to add more information to the MaintainResult in the future.
This enum has the same data as the boolean, but with some useful helper functions.
- let queue_finished: bool = device.poll(wgpu::Maintain::Wait);
+ let queue_finished: bool = device.poll(wgpu::Maintain::Wait).is_queue_empty();
By @cwfitzgerald in #5053
New Features
General
- Added
DownlevelFlags::VERTEX_AND_INSTANCE_INDEX_RESPECTS_RESPECTIVE_FIRST_VALUE_IN_INDIRECT_DRAWto know if@builtin(vertex_index)and@builtin(instance_index)will respect thefirst_vertex/first_instancein indirect calls. If this is not present, both will always start counting from 0. Currently enabled on all backends except DX12. By @cwfitzgerald in #4722. - Added support for the
FLOAT32_FILTERABLEfeature (web and native, corresponds to WebGPU'sfloat32-filterable). By @almarklein in #4759. - GPU buffer memory is released during "lose the device". By @bradwerth in #4851.
- wgpu and wgpu-core cargo feature flags are now documented on docs.rs. By @wumpf in #4886.
- DeviceLostClosure is guaranteed to be invoked exactly once. By @bradwerth in #4862.
- Log vulkan validation layer messages during instance creation and destruction: By @exrook in #4586.
TextureFormat::block_sizeis deprecated, useTextureFormat::block_copy_sizeinstead: By @wumpf in #4647.- Rename of
DispatchIndirect,DrawIndexedIndirect, andDrawIndirecttypes in thewgpu::utilmodule toDispatchIndirectArgs,DrawIndexedIndirectArgs, andDrawIndirectArgs. By @cwfitzgerald in #4723. - Make the size parameter of
encoder.clear_bufferanOption<u64>instead ofOption<NonZero<u64>>. By @nical in #4737. - Reduce the
infolog level noise. By @nical in #4769, #4711 and #4772 - Rename
features&limitsfields ofDeviceDescriptortorequired_features&required_limits. By @teoxoy in #4803. SurfaceConfigurationnow exposesdesired_maximum_frame_latencywhich was previously hard-coded to 2. By setting it to 1 you can reduce latency under the risk of making GPU & CPU work sequential. Currently, on DX12 this affects theMaximumFrameLatency, on all other backends except OpenGL the size of the swapchain (on OpenGL this has no effect). By @emilk & @wumpf in #4899
OpenGL
@builtin(instance_index)now properly reflects the range provided in the draw call instead of always counting from 0. By @cwfitzgerald in #4722.- Desktop GL now supports
POLYGON_MODE_LINEandPOLYGON_MODE_POINT. By @valaphee in #4836.
naga
- naga's WGSL front end now allows operators to produce values with abstract types, rather than concretizing their operands. By @jimblandy in #4850 and #4870.
- naga's WGSL front and back ends now have experimental support for 64-bit floating-point literals:
1.0lfdenotes anf64value. There has been experimental support for anf64type for a while, but until now there was no syntax for writing literals with that type. As before, naga module validation rejectsf64values unlessnaga::valid::Capabilities::FLOAT64is requested. By @jimblandy in #4747. - naga constant evaluation can now process binary operators whose operands are both vectors. By @jimblandy in #4861.
- Add
--bulk-validateoption to naga CLI. By @jimblandy in #4871. - naga's
cargo xtask validatenow runs validation jobs in parallel, using the jobserver protocol to limit concurrency, and offers avalidate allsubcommand, which runs all available validation types. By @jimblandy in #4902. - Remove
spanandvalidatefeatures. Always fully validate shader modules, and always track source positions for use in error messages. By @teoxoy in #4706. - Introduce a new
Scalarstruct type for use in naga's IR, and update all frontend, middle, and backend code appropriately. By @jimblandy in #4673. - Add more metal keywords. By @fornwall in #4707.
- Add a new
naga::Literalvariant,I64, for signed 64-bit literals. #4711. - Emit and init
structmember padding always. By @ErichDonGubler in #4701. - In WGSL output, always include the
isuffix oni32literals. By @jimblandy in #4863. - In WGSL output, always include the
fsuffix onf32literals. By @jimblandy in #4869.
Bug Fixes
General
BufferMappedRangetrait is nowWasmNotSendSync, i.e. it isSend/Syncif not on wasm orfragile-send-sync-non-atomic-wasmis enabled. By @wumpf in #4818.- Align
wgpu_types::CompositeAlphaModeserde serialization to spec. By @littledivy in #4940. - Fix error message of
ConfigureSurfaceError::TooLarge. By @Dinnerbone in #4960. - Fix dropping of
DeviceLostCallbackCparams. By @bradwerth in #5032. - Fixed a number of panics. By @nical in #4999, #5014, #5024, #5025, #5026, #5027, #5028 and #5042.
- No longer validate surfaces against their allowed extent range on configure. This caused warnings that were almost impossible to avoid. As before, the resulting behavior depends on the compositor. By @wumpf in #4796.
DX12
- Fixed D3D12_SUBRESOURCE_FOOTPRINT calculation for block compressed textures which caused a crash with
Queue::write_textureon DX12. By @DTZxPorter in #4990.
Vulkan
- Use
VK_EXT_robustness2only when not using an outdated intel iGPU driver. By @TheoDulka in #4602.
WebGPU
- Allow calling
BufferSlice::get_mapped_rangemultiple times on the same buffer slice (instead of throwing a Javascript exception). By @DouglasDwyer in #4726.
WGL
- Create a hidden window per
wgpu::Instanceinstead of sharing a global one. By @Zoxc in #4603
naga
- Make module compaction preserve the module's named types, even if they are unused. By @jimblandy in #4734.
- Improve algorithm used by module compaction. By @jimblandy in #4662.
- When reading GLSL, fix the argument types of the double-precision floating-point overloads of the
dot,reflect,distance, andldexpbuiltin functions. Correct the WGSL generated for constructing 64-bit floating-point matrices. Add tests for all the above. By @jimblandy in #4684. - Allow naga's IR types to represent matrices with elements elements of any scalar kind. This makes it possible for naga IR types to represent WGSL abstract matrices. By @jimblandy in #4735.
- Preserve the source spans for constants and expressions correctly across module compaction. By @jimblandy in #4696.
- Record the names of WGSL
aliasdeclarations in naga IRTypes. By @jimblandy in #4733.
Metal
- Allow the
COPY_SRCusage flag in surface configuration. By @Toqozz in #4852.
Examples
- remove winit dependency from hello-compute example. By @psvri in #4699
- hello-compute example fix failure with
wgpu error: Validation Errorif arguments are missing. By @vilcans in #4939. - Made the examples page not crash on Chrome on Android, and responsive to screen sizes. By @Dinnerbone in #4958.
v0.18.2 (2023-12-06)
This release includes naga version 0.14.2. The crates wgpu-core, wgpu-hal are still at 0.18.1 and the crates wgpu and wgpu-types are still at 0.18.0.
Bug Fixes
naga
- When evaluating const-expressions and generating SPIR-V, properly handle
Composeexpressions whose operands areSplatexpressions. Such expressions are created and marked as constant by the constant evaluator. By @jimblandy in #4695.
v0.18.1 (2023-11-15)
(naga version 0.14.1)
Bug Fixes
General
- Fix panic in
Surface::configurein debug builds. By @cwfitzgerald in #4635 - Fix crash when all the following are true: By @teoxoy in ##4642
- Passing a naga module directly to
Device::create_shader_module. InstanceFlags::DEBUGis enabled.
- Passing a naga module directly to
DX12
- Always use HLSL 2018 when using DXC to compile HLSL shaders. By @daxpedda in #4629
Metal
- In Metal Shading Language output, fix issue where local variables were sometimes using variable names from previous functions. By @DJMcNab in #4594
v0.18.0 (2023-10-25)
For naga changelogs at or before v0.14.0. See naga's changelog.
Desktop OpenGL 3.3+ Support on Windows
We now support OpenGL on Windows! This brings support for a vast majority of the hardware that used to be covered by our DX11 backend. As of this writing we support OpenGL 3.3+, though there are efforts to reduce that further.
This allows us to cover the last 12 years of Intel GPUs (starting with Ivy Bridge; aka 3xxx), and the last 16 years of AMD (starting with Terascale; aka HD 2000) / NVidia GPUs (starting with Tesla; aka GeForce 8xxx).
By @Zoxc in #4248
Timestamp Queries Supported on Metal and OpenGL
Timestamp queries are now supported on both Metal and Desktop OpenGL. On Apple chips on Metal, they only support timestamp queries in command buffers or in the renderpass descriptor, they do not support them inside a pass.
Metal: By @Wumpf in #4008 OpenGL: By @Zoxc in #4267
Render/Compute Pass Query Writes
Addition of the TimestampWrites type to compute and render pass descriptors to allow profiling on tilers which do not support timestamps inside passes.
Added an example to demonstrate the various kinds of timestamps.
Additionally, metal now supports timestamp queries!
By @FL33TW00D & @wumpf in #3636.
Occlusion Queries
We now support binary occlusion queries! This allows you to determine if any of the draw calls within the query drew any pixels.
Use the new occlusion_query_set field on RenderPassDescriptor to give a query set that occlusion queries will write to.
let mut rpass = encoder.begin_render_pass(&wgpu::RenderPassDescriptor {
// ...
+ occlusion_query_set: Some(&my_occlusion_query_set),
});
Within the renderpass do the following to write the occlusion query results to the query set at the given index:
rpass.begin_occlusion_query(index);
rpass.draw(...);
rpass.draw(...);
rpass.end_occlusion_query();
These are binary occlusion queries, so the result will be either 0 or an unspecified non-zero value.
By @Valaphee in #3402
Shader Improvements
// WGSL constant expressions are now supported!
const BLAH: u32 = 1u + 1u;
// `rgb10a2uint` and `bgra8unorm` can now be used as a storage image format.
var image: texture_storage_2d<rgb10a2uint, write>;
var image: texture_storage_2d<bgra8unorm, write>;
// You can now use dual source blending!
struct FragmentOutput{
@location(0) source1: vec4<f32>,
@location(0) @second_blend_source source2: vec4<f32>,
}
// `modf`/`frexp` now return structures
let result = modf(1.5);
result.fract == 0.5;
result.whole == 1.0;
let result = frexp(1.5);
result.fract == 0.75;
result.exponent == 2i;
// `modf`/`frexp` are currently disabled on GLSL and SPIR-V input.
Shader Validation Improvements
// Cannot get pointer to a workgroup variable
fn func(p: ptr<workgroup, u32>); // ERROR
// Cannot create Inf/NaN through constant expressions
const INF: f32 = 3.40282347e+38 + 1.0; // ERROR
const NAN: f32 = 0.0 / 0.0; // ERROR
// `outerProduct` function removed
// Error on repeated or missing `@workgroup_size()`
@workgroup_size(1) @workgroup_size(2) // ERROR
fn compute_main() {}
// Error on repeated attributes.
fn fragment_main(@location(0) @location(0) location_0: f32) // ERROR
RenderPass StoreOp is now Enumeration
wgpu::Operations::store used to be an underdocumented boolean value,
causing misunderstandings of the effect of setting it to false.
The API now more closely resembles WebGPU which distinguishes between store and discard,
see WebGPU spec on GPUStoreOp.
// ...
depth_ops: Some(wgpu::Operations {
load: wgpu::LoadOp::Clear(1.0),
- store: false,
+ store: wgpu::StoreOp::Discard,
}),
// ...
By @wumpf in #4147
Instance Descriptor Settings
The instance descriptor grew two more fields: flags and gles_minor_version.
flags allow you to toggle the underlying api validation layers, debug information about shaders and objects in capture programs, and the ability to discard labels
gles_minor_version is a rather niche feature that allows you to force the GLES backend to use a specific minor version, this is useful to get ANGLE to enable more than GLES 3.0.
let instance = wgpu::Instance::new(InstanceDescriptor {
...
+ flags: wgpu::InstanceFlags::default()
+ gles_minor_version: wgpu::Gles3MinorVersion::Automatic,
});
gles_minor_version: By @PJB3005 in #3998
flags: By @nical in #4230
Many New Examples!
- Added the following examples: By @JustAnotherCodemonkey in #3885.
Revamped Testing Suite
Our testing harness was completely revamped and now automatically runs against all gpus in the system, shows the expected status of every test, and is tolerant to flakes.
Additionally, we have filled out our CI to now run the latest versions of WARP and Mesa. This means we can test even more features on CI than before.
By @cwfitzgerald in #3873
The GLES backend is now optional on macOS
The angle feature flag has to be set for the GLES backend to be enabled on Windows & macOS.
By @teoxoy in #4185
Added/New Features
Changes
General
- Omit texture store bound checks since they are no-ops if out of bounds on all APIs. By @teoxoy in #3975
- Validate
DownlevelFlags::READ_ONLY_DEPTH_STENCIL. By @teoxoy in #4031 - Add validation in accordance with WebGPU
setViewportvalid usage forx,yandthis.[[attachment_size]]. By @James2022-rgb in #4058 wgpu::CreateSurfaceErrorandwgpu::RequestDeviceErrornow give details of the failure, but no longer implementPartialEqand cannot be constructed. By @kpreid in #4066 and #4145- Make
WGPU_POWER_PREF=nonea valid value. By @fornwall in 4076 - Support dual source blending in OpenGL ES, Metal, Vulkan & DX12. By @freqmod in 4022
- Add stub support for device destroy and device validity. By @bradwerth in 4163 and in 4212
- Add trace-level logging for most entry points in wgpu-core By @nical in 4183
- Add
Rgb10a2Uintformat. By @teoxoy in 4199 - Validate that resources are used on the right device. By @nical in 4207
- Expose instance flags.
- Add support for the bgra8unorm-storage feature. By @jinleili and @nical in #4228
- Calls to lost devices now return
DeviceError::Lostinstead ofDeviceError::Invalid. By @bradwerth in #4238 - Let the
"strict_asserts"feature enable check that wgpu-core's lock-ordering tokens are unique per thread. By @jimblandy in #4258 - Allow filtering labels out before they are passed to GPU drivers by @nical in https://github.com/gfx-rs/wgpu/pull/4246
DeviceLostClosurecallback mechanism provided so user agents can resolveGPUDevice.lostPromises at the appropriate time by @bradwerth in #4645
Vulkan
- Rename
wgpu_hal::vulkan::Instance::required_extensionstodesired_extensions. By @jimblandy in #4115 - Don't bother calling
vkFreeCommandBufferswhenvkDestroyCommandPoolwill take care of that for us. By @jimblandy in #4059
DX12
- Bump
gpu-allocatorto 0.23. By @Elabajaba in #4198
Documentation
- Use WGSL for VertexFormat example types. By @ScanMountGoat in #4035
- Fix description of
Features::TEXTURE_COMPRESSION_ASTC_HDRin #4157
Bug Fixes
General
- Derive storage bindings via
naga::StorageAccessinstead ofnaga::GlobalUse. By @teoxoy in #3985. Queue::on_submitted_work_donecallbacks will now always be called after all previousBufferSlice::map_asynccallbacks, even when there are no active submissions. By @cwfitzgerald in #4036.- Fix
cleartexture views being leaked whenwgpu::SurfaceTextureis dropped before it is presented. By @rajveermalviya in #4057. - Add
Feature::SHADER_UNUSED_VERTEX_OUTPUTto allow unused vertex shader outputs. By @Aaron1011 in #4116. - Fix a panic in
surface_configure. By @nical in #4220 and #4227 - Pipelines register their implicit layouts in error cases. By @bradwerth in #4624
- Better handle explicit destruction of textures and buffers. By @nical in #4657
Vulkan
- Fix enabling
wgpu::Features::PARTIALLY_BOUND_BINDING_ARRAYnot being actually enabled in vulkan backend. By @39ali in#3772. - Don't pass
vk::InstanceCreateFlags::ENUMERATE_PORTABILITY_KHRunless theVK_KHR_portability_enumerationextension is available. By @jimblandy in#4038. - Enhancement of [#4038], using ash's definition instead of hard-coded c_str. By @hybcloud in#4044.
- Enable vulkan presentation on (Linux) Intel Mesa >= v21.2. By @flukejones in#4110
DX12
- DX12 doesn't support `Features::POLYGON_MODE_POINT``. By @teoxoy in #4032.
- Set
Features::VERTEX_WRITABLE_STORAGEbased on the right feature level. By @teoxoy in #4033.
Metal
- Ensure that MTLCommandEncoder calls endEncoding before it is deallocated. By @bradwerth in #4023
WebGPU
- Ensure that limit requests and reporting is done correctly. By @OptimisticPeach in #4107
- Validate usage of polygon mode. By @teoxoy in #4196
GLES
- enable/disable blending per attachment only when available (on ES 3.2 or higher). By @teoxoy in #4234
Documentation
- Add an overview of
RenderPassand how render state works. By @kpreid in #4055
Examples
- Created
wgpu-example::utilsmodule to contain misc functions and such that are common code but aren't part of the example framework. Add to it the functionsoutput_image_wasmandoutput_image_native, both for outputtingVec<u8>RGBA images either to the disc or the web page. By @JustAnotherCodemonkey in #3885. - Removed
captureexample as it had issues (did not run on wasm) and has been replaced byrender-to-texture(see above). By @JustAnotherCodemonkey in #3885.
v0.17.2 (2023-10-03)
Bug Fixes
Vulkan
- Fix x11 hang while resizing on vulkan. @Azorlogh in #4184.
v0.17.1 (2023-09-27)
Added/New Features
- Add
get_mapped_range_as_array_bufferfor faster buffer read-backs in wasm builds. By @ryankaplan in [#4042] (https://github.com/gfx-rs/wgpu/pull/4042).
Bug Fixes
DX12
- Fix panic on resize when using DX12. By @cwfitzgerald in #4106
Vulkan
- Suppress validation error caused by OBS layer. This was also fixed upstream. By @cwfitzgerald in #4002
- Work around bug in nvidia's vkCmdFillBuffer implementation. By @cwfitzgerald in #4132.
v0.17.0 (2023-07-20)
This is the first release that featured wgpu-info as a binary crate for getting information about what devices wgpu sees in your system. It can dump the information in both human readable format and json.
Major Changes
This release was fairly minor as breaking changes go.
wgpu types now !Send !Sync on wasm
Up until this point, wgpu has made the assumption that threads do not exist on wasm. With the rise of libraries like wasm_thread making it easier and easier to do wasm multithreading this assumption is no longer sound. As all wgpu objects contain references into the JS heap, they cannot leave the thread they started on.
As we understand that this change might be very inconvenient for users who don't care about wasm threading, there is a crate feature which re-enables the old behavior: fragile-send-sync-non-atomic-wasm. So long as you don't compile your code with -Ctarget-feature=+atomics, Send and Sync will be implemented again on wgpu types on wasm. As the name implies, especially for libraries, this is very fragile, as you don't know if a user will want to compile with atomics (and therefore threads) or not.
By @daxpedda in #3691
Power Preference is now optional
The power_preference field of RequestAdapterOptions is now optional. If it is PowerPreference::None, we will choose the first available adapter, preferring GPU adapters over CPU adapters.
By @Aaron1011 in #3903
initialize_adapter_from_env argument changes
Removed the backend_bits parameter from initialize_adapter_from_env and initialize_adapter_from_env_or_default. If you want to limit the backends used by this function, only enable the wanted backends in the instance.
Added a compatible surface parameter, to ensure the given device is able to be presented onto the given surface.
- wgpu::util::initialize_adapter_from_env(instance, backend_bits);
+ wgpu::util::initialize_adapter_from_env(instance, Some(&compatible_surface));
By @fornwall in #3904 and #3905
Misc Breaking Changes
- Change
AdapterInfo::{device,vendor}to beu32instead ofusize. By @ameknite in #3760
Changes
- Added support for importing external buffers using
buffer_from_raw(Dx12, Metal, Vulkan) andcreate_buffer_from_hal. By @AdrianEddy in #3355
Vulkan
- Work around Vulkan-ValidationLayers#5671 by ignoring reports of violations of VUID-vkCmdEndDebugUtilsLabelEXT-commandBuffer-01912. By @jimblandy in #3809.
Added/New Features
General
- Empty scissor rects are allowed now, matching the specification. by @PJB3005 in #3863.
- Add back components info to
TextureFormats. By @teoxoy in #3843. - Add
get_mapped_range_as_array_bufferfor faster buffer read-backs in wasm builds. By @ryankaplan in [#4042] (https://github.com/gfx-rs/wgpu/pull/4042).
Documentation
- Better documentation for draw, draw_indexed, set_viewport and set_scissor_rect. By @genusistimelord in #3860
- Fix link to
GPUVertexBufferLayout. By @fornwall in #3906 - Document feature requirements for
DEPTH32FLOAT_STENCIL8by @ErichDonGubler in #3734. - Flesh out docs. for
AdapterInfo::{device,vendor}by @ErichDonGubler in #3763. - Spell out which sizes are in bytes. By @jimblandy in #3773.
- Validate that
descriptor.usageis not empty increate_bufferby @nical in #3928 - Update
max_bindings_per_bind_grouplimit to reflect spec changes by @ErichDonGubler and @nical in #3943 #3942 - Add better docs for
Limits, listing the actual limits returned bydownlevel_defaultsanddownlevel_webgl2_defaultsby @JustAnotherCodemonkey in #3988
Bug Fixes
General
- Fix order of arguments to glPolygonOffset by @komadori in #3783.
- Fix OpenGL/EGL backend not respecting non-sRGB texture formats in
SurfaceConfiguration. by @liquidev in #3817 - Make write- and read-only marked buffers match non-readonly layouts. by @fornwall in #3893
- Fix leaking X11 connections. by @wez in #3924
- Fix ASTC feature selection in the webgl backend. by @expenses in #3934
- Fix Multiview to disable validation of TextureViewDimension and ArrayLayerCount. By @MalekiRe in #3779.
Vulkan
- Fix incorrect aspect in barriers when using emulated Stencil8 textures. By @cwfitzgerald in #3833.
- Implement depth-clip-control using depthClamp instead of VK_EXT_depth_clip_enable. By @AlbinBernhardssonARM #3892.
- Fix enabling
wgpu::Features::PARTIALLY_BOUND_BINDING_ARRAYnot being actually enabled in vulkan backend. By @39ali in#3772.
Metal
- Fix renderpasses being used inside of renderpasses. By @cwfitzgerald in #3828
- Support (simulated) visionOS. By @jinleili in #3883
DX12
- Disable suballocation on Intel Iris(R) Xe. By @xiaopengli89 in #3668
- Change the
max_buffer_sizelimit fromu64::MAXtoi32::MAX. By @nical in #4020
WebGPU
- Use
get_preferred_canvas_format()to fillformatsofSurfaceCapabilities. By @jinleili in #3744
Examples
- Publish examples to wgpu.rs on updates to trunk branch instead of gecko. By @paul-hansen in #3750
- Ignore the exception values generated by the winit resize event. By @jinleili in #3916
v0.16.3 (2023-07-19)
Changes
General
- Make the
Idtype that is exposed when using theexpose-idsfeature implementSendandSyncagain. This was unintentionally changed by the v0.16.0 release and is now fixed.
v0.16.2 (2023-07-09)
Changes
DX12
- Increase the
max_storage_buffers_per_shader_stageandmax_storage_textures_per_shader_stagelimits based on what the hardware supports. by @Elabajaba in [#3798]https://github.com/gfx-rs/wgpu/pull/3798
v0.16.1 (2023-05-24)
Bug Fixes
- Fix missing 4X MSAA support on some OpenGL backends. By @emilk in #3780
General
- Fix crash on dropping
wgpu::CommandBuffer. By @wumpf in #3726. - Use
u32s internally for bind group indices, rather thanu8. By @ErichDonGubler in #3743.
WebGPU
- Fix crash when calling
create_surface_from_canvas. By @grovesNL in #3718
v0.16.0 (2023-04-19)
Major changes
Shader Changes
type has been replaced with alias to match with upstream WebGPU.
- type MyType = vec4<u32>;
+ alias MyType = vec4<u32>;
TextureFormat info API
The TextureFormat::describe function was removed in favor of separate functions: block_dimensions, is_compressed, is_srgb, required_features, guaranteed_format_features, sample_type and block_size.
- let block_dimensions = format.describe().block_dimensions;
+ let block_dimensions = format.block_dimensions();
- let is_compressed = format.describe().is_compressed();
+ let is_compressed = format.is_compressed();
- let is_srgb = format.describe().srgb;
+ let is_srgb = format.is_srgb();
- let required_features = format.describe().required_features;
+ let required_features = format.required_features();
Additionally guaranteed_format_features now takes a set of features to assume are enabled.
- let guaranteed_format_features = format.describe().guaranteed_format_features;
+ let guaranteed_format_features = format.guaranteed_format_features(device.features());
Additionally sample_type and block_size now take an optional TextureAspect and return Options.
- let sample_type = format.describe().sample_type;
+ let sample_type = format.sample_type(None).expect("combined depth-stencil format requires specifying a TextureAspect");
- let block_size = format.describe().block_size;
+ let block_size = format.block_size(None).expect("combined depth-stencil format requires specifying a TextureAspect");
By @teoxoy in #3436
BufferUsages::QUERY_RESOLVE
Buffers used as the destination argument of CommandEncoder::resolve_query_set now have to contain the QUERY_RESOLVE usage instead of the COPY_DST usage.
let destination = device.create_buffer(&wgpu::BufferDescriptor {
// ...
- usage: wgpu::BufferUsages::COPY_DST | wgpu::BufferUsages::MAP_READ,
+ usage: wgpu::BufferUsages::QUERY_RESOLVE | wgpu::BufferUsages::MAP_READ,
mapped_at_creation: false,
});
command_encoder.resolve_query_set(&query_set, query_range, &destination, destination_offset);
By @JolifantoBambla in #3489
Renamed features
The following Features have been renamed.
SHADER_FLOAT16->SHADER_F16SHADER_FLOAT64->SHADER_F64SHADER_INT16->SHADER_I16TEXTURE_COMPRESSION_ASTC_LDR->TEXTURE_COMPRESSION_ASTCWRITE_TIMESTAMP_INSIDE_PASSES->TIMESTAMP_QUERY_INSIDE_PASSES
By @teoxoy in #3534
Anisotropic Filtering
Anisotropic filtering has been brought in line with the spec. The anisotropic clamp is now a u16 (was a Option<u8>) which must be at least 1.
If the anisotropy clamp is not 1, all the filters in a sampler must be Linear.
SamplerDescriptor {
- anisotropic_clamp: None,
+ anisotropic_clamp: 1,
}
By @cwfitzgerald in #3610.
TextureFormat Names
Some texture format names have changed to get back in line with the spec.
- TextureFormat::Bc6hRgbSfloat
+ TextureFormat::Bc6hRgbFloat
By @cwfitzgerald in #3671.
Misc Breaking Changes
- Change type of
mip_level_countandarray_layer_count(members ofTextureViewDescriptorandImageSubresourceRange) fromOption<NonZeroU32>toOption<u32>. By @teoxoy in #3445 - Change type of
bytes_per_rowandrows_per_image(members ofImageDataLayout) fromOption<NonZeroU32>toOption<u32>. By @teoxoy in #3529 - On Web,
Instance::create_surface_from_canvas()andcreate_surface_from_offscreen_canvas()now take the canvas by value. By @daxpedda in #3690
Added/New Features
General
- Added feature flags for ray-tracing (currently only hal):
RAY_QUERYandRAY_TRACING@daniel-keitel (started by @expenses) in #3507
Vulkan
- Implemented basic ray-tracing api for acceleration structures, and ray-queries @daniel-keitel (started by @expenses) in #3507
Hal
- Added basic ray-tracing api for acceleration structures, and ray-queries @daniel-keitel (started by @expenses) in #3507
Changes
General
- Added
TextureFormatFeatureFlags::MULTISAMPLE_X16. By @Dinnerbone in #3454 - Added
BufferUsages::QUERY_RESOLVE. By @JolifantoBambla in #3489 - Support stencil-only views and copying to/from combined depth-stencil textures. By @teoxoy in #3436
- Added
Features::SHADER_EARLY_DEPTH_TEST. By @teoxoy in #3494 - All
fxhashdependencies have been replaced withrustc-hash. By @james7132 in #3502 - Allow copying of textures with copy-compatible formats. By @teoxoy in #3528
- Improve attachment related errors. By @cwfitzgerald in #3549
- Make error descriptions all upper case. By @cwfitzgerald in #3549
- Don't include ANSI terminal color escape sequences in shader module validation error messages. By @jimblandy in #3591
- Report error messages from DXC compile. By @Davidster in #3632
- Error in native when using a filterable
TextureSampleType::Floaton a multisampleBindingType::Texture. By @mockersf in #3686 - On Web, the size of the canvas is adjusted when using
Surface::configure(). If the canvas was given an explicit size (via CSS), this will not affect the visual size of the canvas. By @daxpedda in #3690 - Added
Global::create_render_bundle_error. By @jimblandy in #3746
WebGPU
- Implement the new checks for readonly stencils. By @JCapucho in #3443
- Reimplement
adapter|device_features. By @jinleili in #3428 - Implement
command_encoder_resolve_query_set. By @JolifantoBambla in #3489 - Add support for
Features::RG11B10UFLOAT_RENDERABLE. By @mockersf in #3689
Vulkan
- Set
max_memory_allocation_sizeviaPhysicalDeviceMaintenance3Properties. By @jinleili in #3567 - Silence false-positive validation error about surface resizing. By @seabassjh in #3627
Bug Fixes
General
copyTextureToTexturesrc/dst aspects must both refer to all aspects of src/dst format. By @teoxoy in #3431- Validate before extracting texture selectors. By @teoxoy in #3487
- Fix fatal errors (those which panic even if an error handler is set) not including all of the details. By @kpreid in #3563
- Validate shader location clashes. By @emilk in #3613
- Fix surfaces not being dropped until exit. By @benjaminschaaf in #3647
WebGPU
- Fix handling of
Nonevalues fordepth_opsandstencil_opsinRenderPassDescriptor::depth_stencil_attachment. By @niklaskorz in #3660 - Avoid using
WasmAbifunctions for WebGPU backend. By @grovesNL in #3657
DX12
- Use typeless formats for textures that might be viewed as srgb or non-srgb. By @teoxoy in #3555
GLES
- Set FORCE_POINT_SIZE if it is vertex shader with mesh consist of point list. By @REASY in 3440
- Remove unwraps inside
surface.configure. By @cwfitzgerald in #3585 - Fix
copy_external_image_to_texture,copy_texture_to_textureandcopy_buffer_to_texturenot taking the specified index into account if the target texture is a cube map, 2D texture array or cube map array. By @daxpedda #3641 - Fix disabling of vertex attributes with non-consecutive locations. By @Azorlogh in #3706
Metal
- Fix metal erroring on an
array_strideof 0. By @teoxoy in #3538 create_texturereturns an error ifnew_texturereturns NULL. By @jinleili in #3554- Fix shader bounds checking being ignored. By @FL33TW00D in #3603
Vulkan
- Treat
VK_SUBOPTIMAL_KHRasVK_SUCCESSon Android due to rotation issues. By @James2022-rgb in #3525
Examples
- Use
BufferUsages::QUERY_RESOLVEinstead ofBufferUsages::COPY_DSTfor buffers used inCommandEncoder::resolve_query_setcalls inmipmapexample. By @JolifantoBambla in #3489
v0.15.3 (2023-03-22)
Bug Fixes
Metal
- Fix incorrect mipmap being sampled when using
MinLod <= 0.0andMaxLod >= 32.0or when the fragment shader samples different Lods in the same quad. By @cwfitzgerald in #3610.
GLES
- Fix
Vertex buffer is not big enough for the draw call.for ANGLE/Web when rendering with instance attributes on a single instance. By @wumpf in #3596 - Reset all queue state between command buffers in a submit. By @jleibs #3589
- Reset the state of
SAMPLE_ALPHA_TO_COVERAGEon queue reset. By @jleibs #3589
wgpu-0.15.2 (2023-03-08)
Bug Fixes
Metal
- Fix definition of
NSOperatingSystemVersionto avoid potential crashes. By @grovesNL in #3557
GLES
- Enable
WEBGL_debug_renderer_infobefore querying unmasked vendor/renderer to avoid crashing on emscripten in #3519
wgpu-0.15.1 (2023-02-09)
Changes
General
- Fix for some minor issues in comments on some features. By @Wumpf in #3455
Vulkan
- Improve format MSAA capabilities detection. By @jinleili in #3429
DX12
- Update gpu allocator to 0.22. By @Elabajaba in #3447
WebGPU
- Implement
CommandEncoder::clear_buffer. By @raphlinus in #3426
Bug Fixes
General
- Re-sort supported surface formats based on srgb-ness. By @cwfitzgerald in #3444
Vulkan
- Fix surface view formats validation error. By @jinleili in #3432
DX12
- Fix DXC validation issues when using a custom
dxil_path. By @Elabajaba in #3434
GLES
- Unbind vertex buffers at end of renderpass. By @cwfitzgerald in #3459
WebGPU
- Reimplement
{adapter|device}_features. By @jinleili in #3428
Documentation
General
- Build for Wasm on docs.rs. By @daxpedda in #3462
wgpu-0.15.0 (2023-01-25)
Major Changes
WGSL Top-Level let is now const
All top level constants are now declared with const, catching up with the wgsl spec.
let is no longer allowed at the global scope, only within functions.
-let SOME_CONSTANT = 12.0;
+const SOME_CONSTANT = 12.0;
See https://github.com/gfx-rs/naga/blob/master/CHANGELOG.md#v011-2023-01-25 for smaller shader improvements.
Surface Capabilities API
The various surface capability functions were combined into a single call that gives you all the capabilities.
- let formats = surface.get_supported_formats(&adapter);
- let present_modes = surface.get_supported_present_modes(&adapter);
- let alpha_modes = surface.get_supported_alpha_modes(&adapter);
+ let caps = surface.get_capabilities(&adapter);
+ let formats = caps.formats;
+ let present_modes = caps.present_modes;
+ let alpha_modes = caps.alpha_modes;
Additionally Surface::get_default_config now returns an Option and returns None if the surface isn't supported by the adapter.
- let config = surface.get_default_config(&adapter);
+ let config = surface.get_default_config(&adapter).expect("Surface unsupported by adapter");
Fallible surface creation
Instance::create_surface() now returns Result<Surface, CreateSurfaceError> instead of Surface. This allows an error to be returned if the given window is a HTML canvas and obtaining a WebGPU or WebGL 2 context fails. (No other platforms currently report any errors through this path.) By @kpreid in #3052
Queue::copy_external_image_to_texture on WebAssembly
A new api, Queue::copy_external_image_to_texture, allows you to create wgpu textures from various web image primitives. Specifically from HtmlVideoElement, HtmlCanvasElement, OffscreenCanvas, and ImageBitmap. This provides multiple low-copy ways of interacting with the browser. WebGL is also supported, though WebGL has some additional restrictions, represented by the UNRESTRICTED_EXTERNAL_IMAGE_COPIES downlevel flag. By @cwfitzgerald in #3288
Instance creation now takes InstanceDescriptor instead of Backends
Instance::new() and hub::Global::new() now take an InstanceDescriptor struct which contains both the existing Backends selection as well as a new Dx12Compiler field for selecting which Dx12 shader compiler to use.
- let instance = Instance::new(wgpu::Backends::all());
+ let instance = Instance::new(wgpu::InstanceDescriptor {
+ backends: wgpu::Backends::all(),
+ dx12_shader_compiler: wgpu::Dx12Compiler::Fxc,
+ });
Instance now also also implements Default, which uses wgpu::Backends::all() and wgpu::Dx12Compiler::Fxc for InstanceDescriptor
- let instance = Instance::new(wgpu::InstanceDescriptor {
- backends: wgpu::Backends::all(),
- dx12_shader_compiler: wgpu::Dx12Compiler::Fxc,
- });
+ let instance = Instance::default();
By @Elabajaba in #3356
Texture Format Reinterpretation
The new view_formats field in the TextureDescriptor is used to specify a list of formats the texture can be re-interpreted to in a texture view. Currently only changing srgb-ness is allowed (ex. Rgba8Unorm <=> Rgba8UnormSrgb).
let texture = device.create_texture(&wgpu::TextureDescriptor {
// ...
format: TextureFormat::Rgba8UnormSrgb,
+ view_formats: &[TextureFormat::Rgba8Unorm],
});
let config = wgpu::SurfaceConfiguration {
// ...
format: TextureFormat::Rgba8Unorm,
+ view_formats: vec![wgpu::TextureFormat::Rgba8UnormSrgb],
};
surface.configure(&device, &config);
MSAA x2 and x8 Support
Via the TEXTURE_ADAPTER_SPECIFIC_FORMAT_FEATURES feature, MSAA x2 and x8 are now supported on textures. To query for x2 or x8 support, enable the feature and look at the texture format flags for the texture format of your choice.
By @39ali in 3140
DXC Shader Compiler Support for DX12
You can now choose to use the DXC compiler for DX12 instead of FXC. The DXC compiler is faster, less buggy, and allows for new features compared to the old, unmaintained FXC compiler.
You can choose which compiler to use at Instance creation using the dx12_shader_compiler field in the InstanceDescriptor struct. Note that DXC requires both dxcompiler.dll and dxil.dll, which can be downloaded from https://github.com/microsoft/DirectXShaderCompiler/releases. Both .dlls need to be shipped with your application when targeting DX12 and using the DXC compiler. If the .dlls can't be loaded, then it will fall back to the FXC compiler. By @39ali and @Elabajaba in #3356
Suballocate DX12 buffers and textures
The DX12 backend can now suballocate buffers and textures from larger chunks of memory, which can give a significant increase in performance (in testing a 100x improvement has been seen in a simple scene with 200 write_buffer calls per frame, and a 1.4x improvement in Bistro using Bevy).
Previously wgpu-hal's DX12 backend created a new heap on the GPU every time you called write_buffer (by calling CreateCommittedResource), whereas now it uses gpu_allocator to manage GPU memory (and calls CreatePlacedResource with a suballocated heap). By @Elabajaba in #3163
Backend selection by features in wgpu-core
Whereas wgpu-core used to automatically select backends to enable
based on the target OS and architecture, it now has separate features
to enable each backend:
- "metal", for the Metal API on macOS and iOS
- "vulkan", for the Vulkan API (Linux, some Android, and occasionally Windows)
- "dx12", for Microsoft's Direct3D 12 API
- "gles", OpenGL ES, available on many systems
- "dx11", for Microsoft's Direct3D 11 API
None are enabled by default, but the wgpu crate automatically
selects these features based on the target operating system and
architecture, using the same rules that wgpu-core used to, so users
of wgpu should be unaffected by this change. However, other crates
using wgpu-core directly will need to copy wgpu's logic or write
their own. See the [target] section of wgpu/Cargo.toml for
details.
Similarly, wgpu-core now has emscripten and renderdoc features
that wgpu enables on appropriate platforms.
In previous releases, the wgpu-core crate decided which backends to
support. However, this left wgpu-core's users with no way to
override those choices. (Firefox doesn't want the GLES back end, for
example.) There doesn't seem to be any way to have a crate select
backends based on target OS and architecture that users of that crate
can still override. Default features can't be selected based on the
target, for example. That implies that we should do the selection as
late in the dependency DAG as feasible. Having wgpu (and
wgpu-core's other dependents) choose backends seems like the best
option.
By @jimblandy in #3254.
Changes
General
- Convert all
DefaultImplementations on Enums toderive(Default) - Implement
DefaultforCompositeAlphaMode - New downlevel feature
UNRESTRICTED_INDEX_BUFFERto indicate support for usingINDEXtogether with other non-copy/map usages (unsupported on WebGL). By @Wumpf in #3157 - Add missing
DEPTH_BIAS_CLAMPandFULL_DRAW_INDEX_UINT32downlevel flags. By @teoxoy in #3316 - Combine
Surface::get_supported_formats,Surface::get_supported_present_modes, andSurface::get_supported_alpha_modesintoSurface::get_capabilitiesandSurfaceCapabilities. By @cwfitzgerald in #3157 - Make
Surface::get_default_configreturn an Option to prevent panics. By @cwfitzgerald in #3157 - Lower the
max_buffer_sizelimit value for compatibility with Apple2 and WebGPU compliance. By @jinleili in #3255 - Limits
min_uniform_buffer_offset_alignmentandmin_storage_buffer_offset_alignmentis now always at least 32. By @wumpf #3262 - Dereferencing a buffer view is now marked inline. By @Wumpf in #3307
- The
strict_assertfamily of macros was moved towgpu-types. By @i509VCB in #3051 - Make
ObjectIdstructure and invariants idiomatic. By @teoxoy in #3347 - Add validation in accordance with WebGPU
GPUSamplerDescriptorvalid usage forlodMinClampandlodMaxClamp. By @James2022-rgb in #3353 - Remove panics in
Derefimplementations forQueueWriteBufferViewandBufferViewMut. Instead, warnings are logged, since reading from these types is not recommended. By @botahamec in [#3336] - Implement
view_formatsin the TextureDescriptor to match the WebGPU spec. By @jinleili in #3237 - Show more information in error message for non-aligned buffer bindings in WebGL #3414
- Update
TextureViewvalidation according to the WebGPU spec. By @teoxoy in #3410 - Implement
view_formatsin the SurfaceConfiguration to match the WebGPU spec. By @jinleili in #3409
Vulkan
- Set
WEBGPU_TEXTURE_FORMAT_SUPPORTdownlevel flag depending on the proper format support by @teoxoy in #3367. - Set
COPY_SRC/COPY_DSTonly based on Vulkan'sTRANSFER_SRC/TRANSFER_DSTby @teoxoy in #3366.
GLES
- Browsers that support
OVR_multiview2now report theMULTIVIEWfeature by @expenses in #3121. Limits::max_push_constant_sizeon GLES is now 256 by @Dinnerbone in #3374.- Creating multiple pipelines with the same shaders will now be faster, by @Dinnerbone in #3380.
WebGPU
- Implement
queue_validate_write_bufferby @jinleili in #3098 - Sync depth/stencil copy restrictions with the spec by @teoxoy in #3314
Added/New Features
General
- Implement
HashforDepthStencilStateandDepthBiasState - Add the
"wgsl"feature, to enable WGSL shaders inwgpu-coreandwgpu. Enabled by default inwgpu. By @daxpedda in #2890. - Implement
CloneforShaderSourceandShaderModuleDescriptorinwgpu. By @daxpedda in #3086. - Add
get_default_configforSurfaceto simplify user creation ofSurfaceConfiguration. By @jinleili in #3034 - Improve compute shader validation error message. By @haraldreingruber in #3139
- Native adapters can now use MSAA x2 and x8 if it's supported , previously only x1 and x4 were supported . By @39ali in 3140
- Implemented correleation between user timestamps and platform specific presentation timestamps via [
Adapter::get_presentation_timestamp]. By @cwfitzgerald in #3240 - Added support for
Features::SHADER_PRIMITIVE_INDEXon all backends. By @cwfitzgerald in #3272 - Implemented
TextureFormat::Stencil8, allowing for stencil testing without depth components. By @Dinnerbone in #3343 - Implemented
add_srgb_suffix()forTextureFormatfor converting linear formats to sRGB. By @Elabajaba in #3419 - Zero-initialize workgroup memory. By @teoxoy in #3174
GLES
- Surfaces support now
TextureFormat::Rgba8Unormand (non-web only)TextureFormat::Bgra8Unorm. By @Wumpf in #3070 - Support alpha to coverage. By @Wumpf in #3156
- Support filtering f32 textures. By @expenses in #3261
Vulkan
- Add
SHADER_INT16feature to enable theshaderInt16VkPhysicalDeviceFeature. By @Elabajaba in #3401
WebGPU
- Add
MULTISAMPLE_X2,MULTISAMPLE_X4andMULTISAMPLE_X8toTextureFormatFeatureFlags. By @39ali in 3140 - Sync
TextureFormat.describewith the spec. By @teoxoy in 3312
Metal
- Add a way to create
DeviceandQueuefrom raw Metal resources in wgpu-hal. By @AdrianEddy in #3338
Bug Fixes
General
- Update ndk-sys to v0.4.1+23.1.7779620, to fix checksum failures. By @jimblandy in #3232.
- Bother to free the
hal::Api::CommandBufferwhen awgpu_core::command::CommandEncoderis dropped. By @jimblandy in #3069. - Fixed the mipmap example by adding the missing WRITE_TIMESTAMP_INSIDE_PASSES feature. By @Olaroll in #3081.
- Avoid panicking in some interactions with invalid resources by @nical in (#3094)[https://github.com/gfx-rs/wgpu/pull/3094]
- Fixed an integer overflow in
copy_texture_to_textureby @nical #3090 - Remove
wgpu_types::Features::DEPTH24PLUS_STENCIL8, makingwgpu::TextureFormat::Depth24PlusStencil8available on all backends. By @Healthire in (#3151)[https://github.com/gfx-rs/wgpu/pull/3151] - Fix an integer overflow in
queue_write_textureby @nical in (#3146)[https://github.com/gfx-rs/wgpu/pull/3146] - Make
RenderPassCompatibilityErrorandCreateShaderModuleErrornot so huge. By @jimblandy in (#3226)[https://github.com/gfx-rs/wgpu/pull/3226] - Check for invalid bitflag bits in wgpu-core and allow them to be captured/replayed by @nical in (#3229)[https://github.com/gfx-rs/wgpu/pull/3229]
- Evaluate
gfx_select!'s#[cfg]conditions at the right time. By @jimblandy in #3253 - Improve error messages when binding bind group with dynamic offsets. By @cwfitzgerald in #3294
- Allow non-filtering sampling of integer textures. By @JMS55 in #3362.
- Validate texture ids in
Global::queue_texture_write. By @jimblandy in #3378. - Don't panic on mapped buffer in queue_submit. By @crowlKats in #3364.
- Fix being able to sample a depth texture with a filtering sampler. By @teoxoy in #3394.
- Make
make_spirv_rawandmake_spirvhandle big-endian binaries. By @1e1001 in #3411.
Vulkan
- Update ash to 0.37.1+1.3.235 to fix CI breaking by changing a call to the deprecated
debug_utils_set_object_name()function toset_debug_utils_object_name()by @elabajaba in #3273 - Document and improve extension detection. By @teoxoy in #3327
- Don't use a pointer to a local copy of a
PhysicalDeviceDriverPropertiesstruct after it has gone out of scope. In fact, don't make a local copy at all. Introduce a helper function for buildingCStrs from C character arrays, and remove someunsafeblocks. By @jimblandy in #3076.
DX12
- Fix
depth16Unormformats by @teoxoy in #3313 - Don't re-use
GraphicsCommandListwhencloseorresetfails. By @xiaopengli89 in #3204
Metal
- Fix texture view creation with full-resource views when using an explicit
mip_level_countorarray_layer_count. By @cwfitzgerald in #3323
GLES
- Fixed WebGL not displaying srgb targets correctly if a non-screen filling viewport was previously set. By @Wumpf in #3093
- Fix disallowing multisampling for float textures if otherwise supported. By @Wumpf in #3183
- Fix a panic when creating a pipeline with opaque types other than samplers (images and atomic counters). By @James2022-rgb in #3361
- Fix uniform buffers being empty on some vendors. By @Dinnerbone in #3391
- Fix a panic allocating a new buffer on webgl. By @Dinnerbone in #3396
WebGPU
- Use
loginstead ofprintlnin hello example by @JolifantoBambla in #2858
deno-webgpu
- Let
setVertexBufferandsetIndexBuffercalls onGPURenderBundleEncoderthrow an error if thesizeargument is zero, rather than treating that as "until the end of the buffer". By @jimblandy in #3171
Emscripten
- Let the wgpu examples
framework.rscompile again under Emscripten. By @jimblandy in #3246
Examples
- Log adapter info in hello example on wasm target by @JolifantoBambla in #2858
- Added new example
stencil-trianglesto show basic use of stencil testing. By @Dinnerbone in #3343
Testing/Internal
- Update the
minimum supported rust versionto 1.64 - Move
ResourceMetadatainto its own module. By @jimblandy in #3213 - Add WebAssembly testing infrastructure. By @haraldreingruber in #3238
- Error message when you forget to use cargo-nextest. By @cwfitzgerald in #3293
- Fix all suggestions from
cargo clippy
wgpu-0.14.2 (2022-11-28)
Bug Fixes
- Fix incorrect offset in
get_mapped_rangeby @nical in #3233
wgpu-0.14.1 (2022-11-02)
Bug Fixes
- Make
wgpu::TextureFormat::Depth24PlusStencil8available on all backends by making the feature unconditionally available and the feature unneeded to use the format. By @Healthire and @cwfitzgerald in #3165
wgpu-0.14.0 (2022-10-05)
Major Changes
@invariant Warning
When using CompareFunction::Equal or CompareFunction::NotEqual on a pipeline, there is now a warning logged if the vertex shader does not have a @invariant tag on it. On some machines, rendering the same triangles multiple times without an @invariant tag will result in slightly different depths for every pixel. Because the *Equal functions rely on depth being the same every time it is rendered, we now warn if it is missing.
-@vertex
-fn vert_main(v_in: VertexInput) -> @builtin(position) vec4<f32> {...}
+@vertex
+fn vert_main(v_in: VertexInput) -> @builtin(position) @invariant vec4<f32> {...}
Surface Alpha and PresentModes
Surface supports alpha_mode now. When alpha_mode is equal to PreMultiplied or PostMultiplied,
the alpha channel of framebuffer is respected in the compositing process, but which mode is available depends on
the different API and Device. If don't care about alpha_mode, you can set it to Auto.
SurfaceConfiguration {
// ...
+ alpha_mode: surface.get_supported_alpha_modes(&adapter)[0],
}
The function to enumerate supported presentation modes changed:
- pub fn wgpu::Surface::get_supported_modes(&self, adapter: &wgpu::Adapter) -> Vec<PresentMode>
+ pub fn wgpu::Surface::get_supported_present_modes(&self, adapter: &wgpu::Adapter) -> Vec<PresentMode>
Updated raw-window-handle to 0.5
This will allow use of the latest version of winit. As such the bound on create_surface is now RWH 0.5 and requires
both raw_window_handle::HasRawWindowHandle and raw_window_handle::HasRawDisplayHandle.
Added/New Features
- Add
Buffer::size()andBuffer::usage(); by @kpreid in #2923 - Split Blendability and Filterability into Two Different TextureFormatFeatureFlags; by @stakka in #3012
- Expose
alpha_modeon SurfaceConfiguration, by @jinleili in #2836 - Introduce fields for driver name and info in
AdapterInfo, by @i509VCB in #3037 - Add way to create gles hal textures from raw gl names to allow externally managed textures. By @i509VCB #3046
- Implemented
copy_external_image_to_textureon WebGPU, by @ybiletskyi in #2781
Bug Fixes
General
- Free
StagingBufferseven when an error occurs in the operation that consumes them. By @jimblandy in #2961 - Avoid overflow when checking that texture copies fall within bounds. By @jimblandy in #2963
- Improve the validation and error reporting of buffer mappings by @nical in #2848
- Fix compilation errors when using wgpu-core in isolation while targeting
wasm32-unknown-unknownby @Seamooo in #2922 - Fixed opening of RenderDoc library by @abuffseagull in #2930
- Added missing validation for
BufferUsagesmismatches whenFeatures::MAPPABLE_PRIMARY_BUFFERSis not enabled. By @imberflur in #3023 - Fixed
CommandEncodernot beingSendandSyncon web by @i509VCB in #3025 - Document meaning of
vendorinAdapterInfoif the vendor has no PCI id. - Fix missing resource labels from some Errors by @scoopr in #3066
Metal
- Add the missing
msg_send![view, retain]call withinfrom_viewby @jinleili in #2976 - Fix
max_buffermax_textureandmax_vertex_bufferslimits by @jinleili in #2978 - Remove PrivateCapabilities's
format_rgb10a2_unorm_surfacefield by @jinleili in #2981 - Fix validation error when copying into a subset of a single-layer texture by @nical in #3063
- Fix
_buffer_sizesencoding by @dtiselice in #3047
Vulkan
- Fix
astc_hdrformats support by @jinleili in [#2971]](https://github.com/gfx-rs/wgpu/pull/2971) - Update to naga
b209d911(2022-9-1) to avoid generating SPIR-V that violates Vulkan valid usage rulesVUID-StandaloneSpirv-Flat-06202andVUID-StandaloneSpirv-Flat-04744. By @jimblandy in #3008 - Fix bug where the Vulkan backend would panic when using a supported window and display handle but the dependent extensions are not available by @i509VCB in #3054.
GLES
- Report vendor id for Mesa and Apple GPUs. By @i509VCB #3036
- Report Apple M2 gpu as integrated. By @i509VCB #3036
WebGPU
- When called in a web worker,
Context::init()now usesweb_sys::WorkerGlobalContextto create awgpu::Instanceinstead of trying to access the unavailableweb_sys::Windowby @JolifantoBambla in #2858
Changes
General
- Changed wgpu-hal and wgpu-core implementation to pass RawDisplayHandle and RawWindowHandle as separate parameters instead of passing an impl trait over both HasRawDisplayHandle and HasRawWindowHandle. By @i509VCB in #3022
- Changed
Instance::as_hal<A>to just return anOption<&A::Instance>rather than taking a callback. By @jimb in #2991 - Added downlevel restriction error message for
InvalidFormatUsageserror by @Seamooo in #2886 - Add warning when using CompareFunction::*Equal with vertex shader that is missing @invariant tag by @cwfitzgerald in #2887
- Update Winit to version 0.27 and raw-window-handle to 0.5 by @wyatt-herkamp in #2918
- Address Clippy 0.1.63 complaints. By @jimblandy in #2977
- Don't use
PhantomDataforIdentityManager'sInputtype. By @jimblandy in #2972 - Changed naga variant in ShaderSource to
Cow<'static, Module>, to allow loading global variables by @daxpedda in #2903 - Updated the maximum binding index to match the WebGPU specification by @nical in #2957
- Add
unsafe_op_in_unsafe_fnto Clippy lints in the entire workspace. By @ErichDonGubler in #3044.
Metal
- Extract the generic code into
get_metal_layerby @jinleili in #2826
Vulkan
- Remove use of Vulkan12Features/Properties types. By @i509VCB in #2936
- Provide a means for
wgpuusers to accessvk::Queueand the queue index. By @anlumo in #2950 - Use the use effective api version for determining device features instead of wrongly assuming
VkPhysicalDeviceProperties.apiVersionis the actual version of the device. By @i509VCB in #3011 DropGuardhas been moved to the root of the wgpu-hal crate. By @i509VCB #3046
GLES
- Add
Rgba16Floatformat support for color attachments. By @jinleili in #3045 TEXTURE_COMPRESSION_ASTC_HDRfeature detection by @jinleili in #3042
Performance
- Made
StagingBelt::write_buffer()check more thoroughly for reusable memory; by @kpreid in #2906
Documentation
- Add WGSL examples to complement existing examples written in GLSL by @norepimorphism in #2888
- Document
wgpu_coreresource allocation. @jimblandy in #2973 - Expanded
StagingBeltdocumentation by @kpreid in #2905 - Fixed documentation for
Instance::create_surface_from_canvasandInstance::create_surface_from_offscreen_canvasregarding their safety contract. These functions are not unsafe. By @jimblandy #2990 - Document that
write_buffer_with()is sound but unwise to read from by @kpreid in #3006 - Explain why
Adapter::as_halandDevice::as_halhave to take callback functions. By @jimblandy in #2992
Dependency Updates
WebGPU
- Update wasm32 dependencies, set
alpha_modeon web target by @jinleili in #3040
Build Configuration
- Add the
"strict_asserts"feature, to enable additional internal run-time validation inwgpu-core. By @jimblandy in #2872.
Full API Diff
Manual concatenation of cargo public-api --diff-git-checkouts v0.13.2 v0.14.0 -p wgpu and cargo public-api --diff-git-checkouts v0.13.2 v0.14.0 -p wgpu-types
Removed items from the public API
=================================
-pub fn wgpu::Surface::get_supported_modes(&self, adapter: &wgpu::Adapter) -> Vec<PresentMode>
-pub const wgpu::Features::DEPTH24UNORM_STENCIL8: Self
-pub enum variant wgpu::TextureFormat::Depth24UnormStencil8
Changed items in the public API
===============================
-pub unsafe fn wgpu::Instance::as_hal<A: wgc::hub::HalApi, F: FnOnce(Option<&<A as >::Instance>) -> R, R>(&self, hal_instance_callback: F) -> R
+pub unsafe fn wgpu::Instance::as_hal<A: wgc::hub::HalApi>(&self) -> Option<&<A as >::Instance>
-pub unsafe fn wgpu::Instance::create_surface<W: raw_window_handle::HasRawWindowHandle>(&self, window: &W) -> wgpu::Surface
+pub unsafe fn wgpu::Instance::create_surface<W: raw_window_handle::HasRawWindowHandle + raw_window_handle::HasRawDisplayHandle>(&self, window: &W) -> wgpu::Surface
Added items to the public API
=============================
+pub fn wgpu::Buffer::size(&self) -> wgt::BufferAddress
+pub fn wgpu::Buffer::usage(&self) -> BufferUsages
+pub fn wgpu::Surface::get_supported_alpha_modes(&self, adapter: &wgpu::Adapter) -> Vec<CompositeAlphaMode>
+pub fn wgpu::Surface::get_supported_present_modes(&self, adapter: &wgpu::Adapter) -> Vec<PresentMode>
+#[repr(C)] pub enum wgpu::CompositeAlphaMode
+impl RefUnwindSafe for wgpu::CompositeAlphaMode
+impl Send for wgpu::CompositeAlphaMode
+impl Sync for wgpu::CompositeAlphaMode
+impl Unpin for wgpu::CompositeAlphaMode
+impl UnwindSafe for wgpu::CompositeAlphaMode
+pub const wgpu::Features::DEPTH24PLUS_STENCIL8: Self
+pub const wgpu::TextureFormatFeatureFlags::BLENDABLE: Self
+pub enum variant wgpu::CompositeAlphaMode::Auto = 0
+pub enum variant wgpu::CompositeAlphaMode::Inherit = 4
+pub enum variant wgpu::CompositeAlphaMode::Opaque = 1
+pub enum variant wgpu::CompositeAlphaMode::PostMultiplied = 3
+pub enum variant wgpu::CompositeAlphaMode::PreMultiplied = 2
+pub enum variant wgpu::TextureFormat::Depth16Unorm
+pub fn wgpu::CompositeAlphaMode::clone(&self) -> wgpu::CompositeAlphaMode
+pub fn wgpu::CompositeAlphaMode::eq(&self, other: &wgpu::CompositeAlphaMode) -> bool
+pub fn wgpu::CompositeAlphaMode::fmt(&self, f: &mut $crate::fmt::Formatter<'_>) -> $crate::fmt::Result
+pub fn wgpu::CompositeAlphaMode::hash<__H: $crate::hash::Hasher>(&self, state: &mut __H) -> ()
+pub struct field wgpu::AdapterInfo::driver: String
+pub struct field wgpu::AdapterInfo::driver_info: String
+pub struct field wgpu::SurfaceConfiguration::alpha_mode: wgpu_types::CompositeAlphaMode
wgpu-0.13.2 (2022-07-13)
Bug Fixes
General
- Prefer
DeviceType::DiscreteGpuoverDeviceType::OtherforPowerPreference::LowPowerso Vulkan is preferred over OpenGL again by @Craig-Macomber in #2853 - Allow running
get_texture_format_featureson unsupported texture formats (returning no flags) by @cwfitzgerald in #2856 - Allow multi-sampled textures that are supported by the device but not WebGPU if
TEXTURE_ADAPTER_SPECIFIC_FORMAT_FEATURESis enabled by @cwfitzgerald in #2856 get_texture_format_featuresonly lists the COPY_* usages if the adapter actually supports that usage by @cwfitzgerald in #2856- Fix bind group / pipeline deduplication not taking into account RenderBundle execution resetting these values by @shoebe #2867
- Fix panics that occur when using
as_halfunctions when the hal generic type does not match the hub being looked up in by @i509VCB #2871 - Add some validation in map_async by @nical in #2876
- Fix bugs when mapping/unmapping zero-sized buffers and ranges by @nical in #2877
- Fix out-of-bound write in
map_bufferwith non-zero offset by @nical in #2916 - Validate the number of color attachments in
create_render_pipelineby @nical in #2913 - Validate against the maximum binding index in
create_bind_group_layoutby @nical in #2892 - Validate that map_async's range is not negative by @nical in #2938
- Fix calculation/validation of layer/mip ranges in create_texture_view by @nical in #2955
- Validate the sample count and mip level in
copy_texture_to_bufferby @nical in #2958 - Expose the cause of the error in the
map_asynccallback in #2939
DX12
DownlevelCapabilities::default()now returns theANISOTROPIC_FILTERINGflag set to true so DX12 listsANISOTROPIC_FILTERINGas true again by @cwfitzgerald in #2851- Properly query format features for UAV/SRV usages of depth formats by @cwfitzgerald in #2856
Vulkan
- Vulkan 1.0 drivers that support
VK_KHR_multiviewnow properly report theMULTIVIEWfeature as supported by @i509VCB in #2934. - Stop using
VkPhysicalDevice11Featuresin Vulkan 1.1 which is confusingly provided in Vulkan 1.2 by @i509VCB in #2934.
GLES
- Fix depth stencil texture format capability by @jinleili in #2854
get_texture_format_featuresnow only returns usages for formats it actually supports by @cwfitzgerald in #2856
Hal
- Allow access to queue family index in Vulkan hal by @i509VCB in #2859
- Allow access to the EGLDisplay and EGLContext pointer in Gles hal Adapter and Device by @i509VCB in #2860
Documentation
- Update present_mode docs as most of them don't automatically fall back to Fifo anymore. by @Elabajaba in #2855
Hal
- Document safety requirements for
Adapter::from_externalin gles hal by @i509VCB in #2863 - Make
AdapterContexta publicly accessible type in the gles hal by @i509VCB in #2870
wgpu-0.13.1 (2022-07-02)
Bug Fixes
General
- Fix out of bounds access when surface texture is written to by multiple command buffers by @cwfitzgerald in #2843
GLES
- AutoNoVSync now correctly falls back to Fifo by @simbleau in #2842
- Fix GL_EXT_color_buffer_float detection on native by @cwfitzgerald in #2843
wgpu-0.13 (2022-06-30)
Major Changes
WGSL Syntax
WGSL syntax has changed in a couple ways. The new syntax is easier to read and work with.
Attribute declarations are written differently:
- [[group(1), binding(0)]]
+ @group(1) @binding(0)
Stage declarations are now separate attributes rather than part of the stage attribute:
- [[stage(vertex)]]
+ @vertex
Structs now use , as field separator and no longer need semicolons after the declaration:
- struct MyStruct {
- my_member: u32;
- };
+ struct MyStruct {
+ my_member: u32,
+ }
Surface API
The method of getting the preferred swapchain format has changed to allow viewing all formats supported by the surface.
- let format = surface.get_preferred_format(&adapter).unwrap();
+ let format = surface.get_supported_formats(&adapter)[0];
Presentation modes now need to match exactly what the surface supports. FIFO is always supported,
but all other modes vary from API to API and Device to Device. To get a list of all supported modes,
call the following. The order does not indicate preference.
let modes = surface.get_supported_present_modes(&adapter);
Timestamp Queries
Timestamp queries are now restricted behind multiple features to allow implementation on TBDR (Tile-Based Deferred Rendering) based GPUs, such as mobile devices and Apple's M chips.
Features::TIMESTAMP_QUERIES now allows for calling write_timestamp only on CommandEncoders.
Features::WRITE_TIMESTAMP_INSIDE_PASSES is needed to call write_timestamp on RenderPassEncoders or ComputePassEncoders.
map_async
The function for mapping buffers no longer returns a future, and instead calls a callback when the buffer is mapped.
This aligns with the use of the API more clearly - you aren't supposed to block and wait on the future to resolve, you are supposed to keep rendering and wait until the buffer maps on its own. Mapping and the flow of mapping is an under-documented area that we hope to improve in the future.
- let future = buffer.slice(..).map_async(MapMode::Read);
+ buffer.slice(..).map_async(MapMode::Read, || {
+ // Called when buffer is mapped.
+ })
Submission Indexes
Calling queue.submit now returns an opaque submission index that can be used as an argument to
device.poll to say which submission to wait to complete.
Other Breaking Changes
Device::create_shader_module now takes the shader descriptor by value:
- device.create_shader_module(&shader_module_descriptor)
+ device.create_shader_module(shader_module_descriptor)
Color attachments can be sparse, so they are now optional:
FragmentState {
- targets: &[color_target_state]
+ targets: &[Some(color_target_state)]
// ..
}
RenderPassDescriptor {
- color_attachments: &[render_pass_color_attachment]
+ color_attachments: &[Some(render_pass_color_attachment)]
// ..
}
RenderBundleEncoderDescriptor {
- color_formats: &[texture_format]
+ color_formats: &[Some(texture_format)]
// ..
}
Extent3d::max_mips now requires you to pass a TextureDimension to specify whether or not depth_or_array_layers should be ignored:
Extent3d {
width: 1920,
height: 1080,
depth_or_array_layers: 6,
- }.max_mips()
+ }.max_mips(wgpu::TextureDimension::D3)
Limits has a new field, max_buffer_size (not an issue if you don't define limits manually):
Limits {
// ...
+ max_buffer_size: 256 * 1024 * 1024, // adjust as you see fit
}
Features::CLEAR_COMMANDS is now unnecessary and no longer exists. The feature to clear buffers and textures is now part of upstream WebGPU.
DeviceDescriptor {
// ...
features: wgpu::Features::VERTEX_WRITABLE_STORAGE
| wgpu::Features::MAPPABLE_PRIMARY_BUFFERS
| wgpu::Features::TEXTURE_BINDING_ARRAY
| wgpu::Features::BUFFER_BINDING_ARRAY
| wgpu::Features::STORAGE_RESOURCE_BINDING_ARRAY
- | wgpu::Features::CLEAR_COMMANDS
,
}
ComputePass::dispatch has been renamed to ComputePass::dispatch_workgroups
- cpass.dispatch(self.work_group_count, 1, 1)
+ cpass.dispatch_workgroups(self.work_group_count, 1, 1)
Added/New Features
General
- Add
util::indirect::*helper structs by @IcanDivideBy0 in #2365 - Add
AddressMode::ClampToZeroby @laptou in #2364 - Add MULTISAMPLED_SHADING downlevel flag by @jinleili in #2425
- Allow non struct buffers in wgsl by @IcanDivideBy0 in #2451
- Prefix every wgpu-generated label with
(wgpu). by @kpreid in #2590 - Permit non-struct, non-array types as buffers. by @jimblandy in #2584
- Return
queue_emptyfor Device::poll by @xiaopengli89 in #2643 - Add
SHADER_FLOAT16feature by @jinleili in #2646 - Add DEPTH32FLOAT_STENCIL8 feature by @jinleili in #2664
- Add DEPTH24UNORM_STENCIL8 feature by @jinleili in #2689
- Implement submission indexes by @cwfitzgerald in #2700
- [WebGL] Add a downlevel capability for rendering to floating point textures by @expenses in #2729
- allow creating wgpu::Instance from wgpu_core::Instance by @i509VCB in #2763
- Force binding sizes to be multiples of 16 on webgl by @cwfitzgerald in #2808
- Add naga variant to ShaderSource by @rttad in #2801
- Implement Queue::write_buffer_with by @teoxoy in #2777
Vulkan
- Re-allow vk backend on Apple platforms via
vulkan-portabilityfeature by @jinleili in #2488 - vulkan: HDR ASTC formats support by @jinleili in #2496
Metal
- Implement push constants for metal backend by @TheOnlyMrCat in #2314
- Metal backend ASTC HDR formats support by @jinleili in #2477
- Add COPY_DST to Metal's surface usage bits by @vl4dimir in #2491
- Add
Features::MULTI_DRAW_INDIRECTto Metal by @expenses in #2737
GLES
- Support externally initialized contexts by @kvark in #2350
- Angle support on macOS by @jinleili in #2461
- Use EGL surfaceless platform when windowing system is not found by @sh7dm in #2339
- Do a downlevel check for anisotrophy and enable it in the webgl backend by @expenses in #2616
- OffscreenCanvas Support for WebGL Backend by @haraldreingruber-dedalus in #2603
DX12
- Support to create surface from visual on Windows by @xiaopengli89 in #2434
- Add raw_queue for d3d12 device by @xiaopengli89 in #2600
DX11
- Skeleton of a DX11 backend - not working yet by @cwfitzgerald in #2443
Hal
- Adapter and Instance as_hal functions by @i509VCB in #2663
- expose some underlying types in Vulkan hal by @i509VCB in #2667
- Add raw_device method for dx12, vulkan hal by @xiaopengli89 in #2360
- expose egl display in gles Instance hal by @i509VCB in #2670
- Add raw_adapter method for dx12 hal adapter by @xiaopengli89 in #2714
- Acquire texture:
Option<std::time::Duration>timeouts by @rib in #2724 - expose vulkan physical device capabilities, enabled device extensions by @i509VCB in #2688
Emscripten
- feature: emscripten by @caiiiycuk in #2422
- feature = emscripten, compatibility fixes for wgpu-native by @caiiiycuk in #2450
Changes
General
- Make ShaderSource #[non_exhaustive] by @fintelia in #2312
- Make
execute_bundles()receive IntoIterator by @maku693 in #2410 - Raise
wgpu_hal::MAX_COLOR_TARGETSto 8. by @jimblandy in #2640 - Rename dispatch -> dispatch_workgroups by @jinleili in #2619
- Update texture_create_view logic to match spec by @jinleili in #2621
- Move TEXTURE_COMPRESSION_ETC2 | ASTC_LDR to web section to match spec by @jinleili in #2671
- Check that all vertex outputs are consumed by the fragment shader by @cwfitzgerald in #2704
- Convert map_async from being async to being callback based by @cwfitzgerald in #2698
- Align the validation of Device::create_texture with the WebGPU spec by @nical in #2759
- Add InvalidGroupIndex validation at create_shader_module by @jinleili in #2775
- Rename MAX_COLOR_TARGETS to MAX_COLOR_ATTACHMENTS to match spec by @jinleili in #2780
- Change get_preferred_format to get_supported_formats by @stevenhuyn in #2783
- Restrict WriteTimestamp Inside Passes by @cwfitzgerald in #2802
- Flip span labels to work better with tools by @cwfitzgerald in #2820
Gles
- Make GLES DeviceType unknown by default by @PolyMeilex in #2647
Metal
- metal: check if in the main thread when calling
create_surfaceby @jinleili in #2736
Hal
- limit binding sizes to i32 by @kvark in #2363
Bug Fixes
General
- Fix trac(y/ing) compile issue by @cwfitzgerald in #2333
- Improve detection and validation of cubemap views by @kvark in #2331
- Don't create array layer trackers for 3D textures. by @ElectronicRU in #2348
- Limit 1D texture mips to 1 by @kvark in #2374
- Texture format MSAA capabilities by @kvark in #2377
- Fix write_buffer to surface texture @kvark in #2385
- Improve some error messages by @cwfitzgerald in #2446
- Don't recycle indices that reach EOL by @kvark in #2462
- Validated render usages for 3D textures by @kvark in #2482
- Wrap all validation logs with catch_unwinds by @cwfitzgerald in #2511
- Fix clippy lints by @a1phyr in #2560
- Free the raw device when
wgpu::Deviceis dropped. by @jimblandy in #2567 - wgpu-core: Register new pipelines with device's tracker. by @jimblandy in #2565
- impl Debug for StagingBelt by @kpreid in #2572
- Use fully qualified syntax for some calls. by @jimblandy in #2655
- fix: panic in
Storage::getby @SparkyPotato in #2657 - Report invalid pipelines in render bundles as errors, not panics. by @jimblandy in #2666
- Perform "valid to use with" checks when recording render bundles. by @jimblandy in #2690
- Stop using storage usage for sampling by @cwfitzgerald in #2703
- Track depth and stencil writability separately. by @jimblandy in #2693
- Improve InvalidScissorRect error message by @jinleili in #2713
- Improve InvalidViewport error message by @jinleili in #2723
- Don't dirty the vertex buffer for stride/rate changes on bundles. by @jimblandy in #2744
- Clean up render bundle index buffer tracking. by @jimblandy in #2743
- Improve read-write and read-only texture storage error message by @jinleili in #2745
- Change
WEBGPU_TEXTURE_FORMAT_SUPPORTto1 << 14instead of1 << 15by @expenses in #2772 - fix BufferMapCallbackC & SubmittedWorkDoneClosureC by @rajveermalviya in #2787
- Fix formatting of
TextureDimensionError::LimitExceeded. by @kpreid in #2799 - Remove redundant
#[cfg]conditions frombackend/direct.rs. by @jimblandy in #2811 - Replace android-properties with android_system_properties. by @nical in #2815
- Relax render pass color_attachments validation by @jinleili in #2778
- Properly Barrier Compute Indirect Buffers by @cwfitzgerald in #2810
- Use numeric constants to define
wgpu_types::Featuresvalues. by @jimblandy in #2817
Metal
- Fix surface texture clear view by @kvark in #2341
- Set preserveInvariance for shader options by @scoopr in #2372
- Properly set msl version to 2.3 if supported by @cwfitzgerald in #2418
- Identify Apple M1 GPU as integrated by @superdump in #2429
- Fix M1 in macOS incorrectly reports supported compressed texture formats by @superdump in #2453
- Msl: support unsized array not in structures by @kvark in #2459
- Fix
Surface::from_uiviewcan not guarantee set correctcontentScaleFactorby @jinleili in #2470 - Set
max_buffer_sizeby the correct physical device restriction by @jinleili in #2502 - Refactor
PrivateCapabilitiescreation by @jinleili in #2509 - Refactor texture_format_capabilities function by @jinleili in #2522
- Improve
push | pop_debug_markerby @jinleili in #2537 - Fix some supported limits by @jinleili in #2608
- Don't skip incomplete binding resources. by @dragostis in #2622
- Fix
Rgb9e5Ufloatcapabilities andsampler_lod_averagesupport by @jinleili in #2656 - Fix Depth24Plus | Depth24PlusStencil8 capabilities by @jinleili in #2686
- Get_supported_formats: sort like the old get_preferred_format and simplify return type by @victorvde in #2786
- Restrict hal::TextureUses::COLOR_TARGET condition within create_texture by @jinleili in #2818
DX12
- Fix UMA check by @kvark in #2305
- Fix partial texture barrier not affecting stencil aspect by @Wumpf in #2308
- Improve RowPitch computation by @kvark in #2409
Vulkan
- Explicitly set Vulkan debug message types instead of !empty() by @victorvde in #2321
- Use stencil read/write masks by @kvark in #2382
- Vulkan: correctly set INDEPENDENT_BLEND,make runnable on Android 8.x by @jinleili in #2498
- Fix ASTC format mapping by @kvark in #2476
- Support flipped Y on VK 1.1 devices by @cwfitzgerald in #2512
- Fixed builtin(primitive_index) for vulkan backend by @kwillemsen in #2716
- Fix PIPELINE_STATISTICS_QUERY feature support by @jinleili in #2750
- Add a vulkan workaround for large buffers. by @nical in #2796
GLES
- Fix index buffer state not being reset in reset_state by @rparrett in #2391
- Allow push constants trough emulation by @JCapucho in #2400
- Hal/gles: fix dirty vertex buffers that are unused by @kvark in #2427
- Fix texture description for bgra formats by @JCapucho in #2520
- Remove a
log::error!debugging statement from the gles queue by @expenses in #2630 - Fix clearing depth and stencil at the same time by @expenses in #2675
- Handle cubemap copies by @expenses in #2725
- Allow clearing index buffers by @grovesNL in #2740
- Fix buffer-texture copy for 2d arrays by @tuchs in #2809
Wayland
- Search for different versions of libwayland by @sh7dm in #2336
WebGPU
- Fix compilation on wasm32-unknown-unknown without
webglfeature by @jakobhellermann in #2355 - Solve crash on WebGPU by @cwfitzgerald in #2807
Emscripten
- Fix emscripten by @cwfitzgerald in #2494
Performance
- Do texture init via clear passes when possible by @Wumpf in #2307
- Bind group deduplication by @cwfitzgerald in #2623
- Tracking Optimization and Rewrite by @cwfitzgerald in #2662
Documentation
- Add defaults to new limits and correct older ones by @MultisampledNight in #/2303
- Improve shader source documentation by @grovesNL in #2315
- Fix typo by @rustui in #2393
- Add a ⭐ to the feature matrix of examples README by @yutannihilation in #2457
- Fix get_timestamp_period type in docs by @superdump in #2478
- Fix mistake in Access doc comment by @nical in #2479
- Improve shader support documentation by @cwfitzgerald in #2501
- Document the gfx_select! macro. by @jimblandy in #2555
- Add Windows 11 to section about DX12 by @HeavyRain266 in #2552
- Document some aspects of resource tracking. by @jimblandy in #2558
- Documentation for various things. by @jimblandy in #2566
- Fix doc links. by @jimblandy in #2579
- Fixed misspelling in documentation by @zenitopires in #2634
- Update push constant docs to reflect the API by @Noxime in #2637
- Exclude dependencies from documentation by @yutannihilation in #2642
- Document
GpuFuture. by @jimblandy in #2644 - Document random bits and pieces. by @jimblandy in #2651
- Add cross-references to each wgpu type's documentation. by @kpreid in #2653
- RenderPassDescriptor: make label lifetime match doc, and make names descriptive. by @kpreid in #2654
- Document
VertexStepMode. by @jimblandy in #2685 - Add links for SpirV documents. by @huandzh in #2697
- Add symlink LICENSE files into crates. by @dskkato in #2604
- Fix documentation links. by @jimblandy in #2756
- Improve push constant documentation, including internal docs. by @jimblandy in #2764
- Clarify docs for
wgpu_core'sIdandgfx_select!. by @jimblandy in #2766 - Update the Supported Platforms table in README by @jinleili in #2770
- Remove depth image from readme - we don't dictate direction of depth by @cwfitzgerald in #2812
Dependency Updates
- Update
ashto0.37by @a1phyr in #2557 - Update parking_lot to 0.12. by @emilio in #2639
- Accept both parking-lot 0.11 and 0.12, to avoid windows-rs. by @jimblandy in #2660
- Update web-sys to 0.3.58, sparse attachments support by @jinleili in #2813
- Remove use of inplace_it by @mockersf in #2889
deno-webgpu
- Clean up features in deno by @crowlKats in #2445
- Dont panic when submitting same commandbuffer multiple times by @crowlKats in #2449
- Handle error sources to display full errors by @crowlKats in #2454
- Pull changes from deno repo by @crowlKats in #2455
- Fix cts_runner by @crowlKats in #2456
- Update deno_webgpu by @crowlKats in #2539
- Custom op arity by @crowlKats in #2542
Examples
- Fix conserative-raster low res target getting zero sized on resize by @Wumpf in #2318
- Replace run-wasm-example.sh with aliased rust crate (xtask) by @rukai in #2346
- Get cargo-run-wasm from crates.io by @rukai in #2415
- Fix msaa-line example's unnecessary MSAA data store by @jinleili in #2421
- Make shadow example runnable on iOS Android devices by @jinleili in #2433
- Blit should only draw one triangle by @CurryPseudo in #2474
- Fix wasm examples failing to compile by @Liamolucko in #2524
- Fix incorrect filtering used in mipmap generation by @LaylBongers in #2525
- Correct program output ("Steps", not "Times") by @skierpage in #2535
- Fix resizing behaviour of hello-triangle example by @FrankenApps in #2543
- Switch from
cgmathtoglamin examples by @a1phyr in #2544 - Generate 1x1 mip level by @davidar in #2551
- Wgpu/examples/shadow: Don't run on llvmpipe. by @jimblandy in #2595
- Avoid new WGSL reserved words in wgpu examples. by @jimblandy in #2606
- Move texture-array example over to wgsl by @cwfitzgerald in #2618
- Remove the default features from wgpu-info by @jinleili in #2753
- Fix bunnymark test screenshot and replace rand with nanorand by @stevenhuyn in #2746
- Use FIFO swapchain in examples by @cwfitzgerald in #2790
Testing/Internal
- Test WebGPU backend with extra features by @kvark in #2362
- Lint deno_webgpu & wgpu-core by @AaronO in #2403
- IdentityManager:
from_indexmethod is unneeded. by @jimblandy in #2424 - Added id32 feature by @caiiiycuk in #2464
- Update dev deps by @rukai in #2493
- Use cargo nextest for running our tests by @cwfitzgerald in #2495
- Many Steps Towards GL Testing Working by @cwfitzgerald in #2504
- Rename ci.txt to ci.yml by @simon446 in #2510
- Re-enable GL testing in CI by @cwfitzgerald in #2508
- Expect shadow example to pass on GL by @kvark in #2541
- Simplify implementation of RefCount and MultiRefCount. by @jimblandy in #2548
- Provide a proper
newmethod forRefCount. by @jimblandy in #2570 - Add logging to LifetimeTracker::triage_suspected. by @jimblandy in #2569
- wgpu-hal: Work around cbindgen bug: ignore
gles::eglmodule. by @jimblandy in #2576 - Specify an exact wasm-bindgen-cli version in publish.yml. by @jimblandy in #2624
- Rename
timeout_ustotimeout_ns, to match actual units. by @jimblandy in #2645 - Move set_index_buffer FFI functions back into wgpu. by @jimblandy in #2661
- New function:
Global::create_buffer_error. by @jimblandy in #2673 - Actually use RenderBundleEncoder::set_bind_group in tests. by @jimblandy in #2678
- Eliminate wgpu_core::commands::bundle::State::raw_dynamic_offsets. by @jimblandy in #2684
- Move RenderBundleEncoder::finish's pipeline layout id into the state. by @jimblandy in #2755
- Expect shader_primitive_index tests to fail on AMD RADV POLARIS12. by @jimblandy in #2754
- Introduce
VertexStep: a stride and a step mode. by @jimblandy in #2768 - Increase max_outliers on wgpu water example reftest. by @jimblandy in #2767
- wgpu_core::command::bundle: Consolidate pipeline and vertex state. by @jimblandy in #2769
- Add type annotation to render pass code, for rust-analyzer. by @jimblandy in #2773
- Expose naga span location helpers by @nical in #2752
- Add create_texture_error by @nical in #2800
wgpu-hal 0.12.5 (2022-04-19)
- fix crashes when logging in debug message callbacks
- fix program termination when dx12 or gles error messages happen.
- implement validation canary
- DX12:
- Ignore erroneous validation error from DXGI debug layer.
wgpu-hal-0.12.4 (2022-01-24)
- Metal:
- check for MSL-2.3
wgpu-hal-0.12.3, deno-webgpu-? (2022-01-20)
- Metal:
- preserve vertex invariance
- Vulkan
- fix stencil read/write masks
- Gles:
- reset index binding properly
- DX12:
- fix copies into 1D textures
wgpu-core-0.12.2, wgpu-hal-0.12.2 (2022-01-10)
- fix tracy compile error
- fix buffer binding limits beyond 2Gb
- fix zero initialization of 3D textures
- Metal:
- fix surface texture views
- Gles:
- extend
libwaylandsearch paths
- extend
wgpu-core-0.12.1, wgpu-hal-0.12.1 (2021-12-29)
- zero initialization uses now render target clears when possible (faster and doesn't enforce COPY_DST internally if not necessary)
- fix use of MSAA targets in WebGL
- fix not providing
COPY_DSTflag for textures causing assertions in some cases - fix surface textures not getting zero initialized
- clear_texture supports now depth/stencil targets
- error message on creating depth/stencil volume texture
- Vulkan:
- fix validation error on debug message types
- DX12:
- fix check for integrated GPUs
- fix stencil subresource transitions
- Metal:
- implement push constants
wgpu-0.12 (2021-12-18)
- API:
MULTIVIEWfeatureDEPTH_CLIP_CONTROLfeature to replace the oldDEPTH_CLAMPTEXTURE_FORMAT_16BIT_NORMfeature- push/pop error scopes on the device
- more limits for compute shaders
SamplerBindingTypeinstead of booleans- sampler arrays are supported by
TEXTURE_BINDING_ARRAYfeature - "glsl" cargo feature for accepting GLSL shader code
- enforced MSRV-1.53
- correctness:
- textures are zero-initialized
- lots and lots of fixes
- validation:
- match texture-sampler pairs
- check
min_binding_sizelate at draw - check formats to match in
copy_texture_to_texture - allow
strip_index_formatto be none if unused - check workgroup sizes and counts
- shaders:
- please refer to naga-0.8 changelog
- nice error messages
wgpu-core-0.11.3, wgpu-hal-0.11.5, wgpu-0.11.1 (2021-12-01)
- Core:
- validate device descriptor before actually creating it
- fix validation of texture-sampler pairs
- Vulkan:
- fix running on Vulkan-1.1 instance
- improve detection of workaround for Intel+Nvidia on Linux
- fix resource limits on Vulkan-1.2
- fix the check for storage buffer requirement
- change internal semaphore logic to work around Linux+Intel bugs
- fix enabling extension-provided features
- GLES:
- fix running on old and bogus drivers
- fix stale samplers on bindings change
- fix integer textures
- fix querying work group parameters
- fix stale PBO bindings caused by resource copies
- fix rendering to cubemap faces
- fix
Rgba16Floatformat - fix stale vertex attributes when changing the pipeline
- Metal:
- fix window resizing for running in multiple processes
- Web:
- fix
set_index_bufferandset_vertex_bufferto have optional sizes
- fix
wgpu-core-0.11.2, wgpu-hal-0.11.4 (2021-10-22)
- fix buffer transition barriers
- Metal:
- disable RW buffers on macOS 10.11
- fix memory leaks in render pass descriptor
- WebGL:
- fix surface reconfiguration
- GLES:
- fix mapping when persistent mapping isn't supported
- allow presentation in Android emulator
- fix sRGB attributes on EGL-1.4 contexts
wgpu-hal-0.11.3 (2021-10-16)
- GL:
- fix mapping flags and buffer initialization
- fix context creation when sRGB is available
wgpu-core-0.11.1 (2021-10-15)
- fix bind group layout lifetime with regard to bind groups
wgpu-hal-0.11.2 (2021-10-12)
- GL/WebGL: fix vertex buffer bindings with non-zero first instance
- DX12: fix cube array view construction
wgpu-hal-0.11.1 (2021-10-09)
- Vulkan: fix NV optimus detection on Linux
- GL:
- fix indirect dispatch buffers
- WebGL:
- fix querying storage-related limits
- work around a browser bug in the clear shader
wgpu-0.11 (2021-10-07)
- Infrastructure:
- Deno WebGPU plugin is a part of the repository
- WebGPU CTS is ran on CI via Deno
- API:
- initial WebGL support
SwapchainFrameis removed.SurfaceTexture::present()needs to be called instead of dropping.- better SPIR-V control flow processing
- ability to request a software (fallback) adapter
- new limits for
min_uniform_buffer_offset_alignmentandmin_storage_buffer_offset_alignment - features:
- new
PARTIALLY_BOUND_BINDING_ARRAY NON_FILL_POLYGON_MODEis split intoPOLYGON_MODE_LINEandPOLYGON_MODE_POINT
- new
- fixes:
- many shader-related fixes in naga-0.7
- fix a panic in resource cleanup happening when they are dropped on another thread
- Vulkan:
- create SPIR-V per entry point to work around driver bugs
- expose higher descriptor limits based on descriptor indexing capabilities
- GL and Vulkan:
- Fix renderdoc device pointers
- optimization:
- on Vulkan, bounds checks are omitted if the platform can do them natively
wgpu-core-0.10.4, wgpu-0.10.2 (2021-09-23)
- fix
write_texturefor array textures - fix closing an encoder on validation error
- expose Metal surface creation
- panic with an actual error message in the default handler
wgpu-hal-0.10.7 (2021-09-14)
- Metal:
- fix stencil back-face state
- fix the limit on command buffer count
wgpu-hal-0.10.6 (2021-09-12)
- Metal:
- fix stencil operations
- fix memory leak on M1 when out of focus
- fix depth clamping checks
- fix unsized storage buffers beyond the first
wgpu-core-0.10.3, wgpu-hal-0.10.4 (2021-09-08)
- Vulkan:
- fix read access barriers for writable storage buffers
- fix shaders using cube array textures
- work around Linux Intel+Nvidia driver conflicts
- work around Adreno bug with
OpName
- DX12:
- fix storage binding offsets
- Metal:
- fix compressed texture copies
wgpu-core-0.10.2, wgpu-hal-0.10.3 (2021-09-01)
- All:
- fix querying the size of storage textures
- Vulkan:
- use render pass labels
- Metal:
- fix moving the surface between displays
- DX12:
- enable BC compressed textures
- GL:
- fix vertex-buffer and storage related limits
wgpu-core-0.10.1, wgpu-hal-0.10.2 (2021-08-24)
- All:
- expose more formats via adapter-specific feature
- fix creation of depth+stencil views
- validate cube textures to not be used as storage
- fix mip level count check for storage textures
- Metal:
- fix usage of work group memory
- DX12:
- critical fix of pipeline layout
v0.10 (2021-08-18)
- Infrastructure:
gfx-halis replaced by the in-house graphics abstractionwgpu-hal. Backends: Vulkan, Metal, D3D-12, and OpenGL ES-3.- examples are tested automatically for image snapshots.
- API:
crossfeature is removed entirely. Only Rust code from now on.- processing SPIR-V inputs for later translation now requires
spirvcompile feature enabled - new
Features::SPIRV_SHADER_PASSTHROUGHrun-time feature allows providing pass-through SPIR-V (orthogonal to the compile feature) - several bitflag names are renamed to plural:
TextureUsage,BufferUsage,ColorWrite. - the
SwapChainis merged intoSurface. Returned frames areTextureinstead ofTextureView. - renamed
TextureUsagebits:SAMPLED->TEXTURE_BINDING,STORAGE->STORAGE_BINDING. - renamed
InputStepModetoVertexStepMode. - readable storage textures are no longer a part of the base API. Only exposed via format-specific features, non-portably.
- implemented
Rgb9e5Ufloatformat. - added limits for binding sizes, vertex data, per-stage bindings, and others.
- reworked downlevel flags, added downlevel limits.
resolver = "2"is now required in top-level cargo manifests
- Fixed:
Device::create_query_setwould return an error when creating exactlyQUERY_SET_MAX_QUERIES(8192) queries. Now it only returns an error when trying to create more thanQUERY_SET_MAX_QUERIESqueries.
wgpu-core-0.9.2
- fix
Features::TEXTURE_SPECIFIC_FORMAT_FEATURESnot being supported for rendertargets
wgpu-core-0.9.1 (2021-07-13)
- fix buffer inits delayed by a frame
- fix query resolves to initialize buffers
- fix pipeline statistics stride
- fix the check for maximum query count
v0.9 (2021-06-18)
- Updated:
- naga to
v0.5.
- naga to
- Added:
Features::VERTEX_WRITABLE_STORAGE.Features::CLEAR_COMMANDSwhich allows you to usecmd_buf.clear_textureandcmd_buf.clear_buffer.
- Changed:
- Updated default storage buffer/image limit to
8from4.
- Updated default storage buffer/image limit to
- Fixed:
Buffer::get_mapped_rangecan now have a range of zero.- Fixed output spirv requiring the "kernel" capability.
- Fixed segfault due to improper drop order.
- Fixed incorrect dynamic stencil reference for Replace ops.
- Fixed tracking of temporary resources.
- Stopped unconditionally adding cubemap flags when the backend doesn't support cubemaps.
- Validation:
- Ensure that if resources are viewed from the vertex stage, they are read only unless
Features::VERTEX_WRITABLE_STORAGEis true. - Ensure storage class (i.e. storage vs uniform) is consistent between the shader and the pipeline layout.
- Error when a color texture is used as a depth/stencil texture.
- Check that pipeline output formats are logical
- Added shader label to log messages if validation fails.
- Ensure that if resources are viewed from the vertex stage, they are read only unless
- Tracing:
- Make renderpasses show up in the trace before they are run.
- Docs:
- Fix typo in
PowerPreference::LowPowerdescription.
- Fix typo in
- Player:
- Automatically start and stop RenderDoc captures.
- Examples:
- Handle winit's unconditional exception.
- Internal:
- Merged wgpu-rs and wgpu back into a single repository.
- The tracker was split into two different stateful/stateless trackers to reduce overhead.
- Added code coverage testing
- CI can now test on lavapipe
- Add missing extern "C" in wgpu-core on
wgpu_render_pass_execute_bundles - Fix incorrect function name
wgpu_render_pass_bundle_indexed_indirecttowgpu_render_bundle_draw_indexed_indirect.
wgpu-types-0.8.1 (2021-06-08)
- fix dynamic stencil reference for Replace ops
v0.8.1 (2021-05-06)
- fix SPIR-V generation from WGSL, which was broken due to "Kernel" capability
- validate buffer storage classes
- Added support for storage texture arrays for Vulkan and Metal.
v0.8 (2021-04-29)
- naga is used by default to translate shaders, SPIRV-Cross is optional behind
crossfeature - Features:
- buffers are zero-initialized
- downlevel limits for DX11/OpenGL support
- conservative rasterization (native-only)
- buffer resource indexing (native-only)
- API adjustments to the spec:
- Renamed
RenderPassColorAttachmentDescriptortoRenderPassColorAttachment:- Renamed the
attachmentmember toview
- Renamed the
- Renamed
RenderPassDepthStencilAttachmentDescriptortoRenderPassDepthStencilAttachment:- Renamed the
attachmentmember toview
- Renamed the
- Renamed
VertexFormatvalues- Examples:
Float3->Float32x3,Ushort2->Uint16x2
- Examples:
- Renamed the
depthvalue ofExtent3dtodepth_or_array_layers - Updated blending options in
ColorTargetState:- Renamed
BlendStatetoBlendComponent - Added
BlendStatestruct to hold color and alpha blend state - Moved
color_blendandalpha_blendmembers intoblendmember
- Renamed
- Moved
clamp_depthfromRastizerStatetoPrimitiveState - Updated
PrimitiveState:- Added
conservativemember for enabling conservative rasterization
- Added
- Updated copy view structs:
- Renamed
TextureCopyViewtoImageCopyTexture - Renamed
TextureDataLayouttoImageDataLayout - Changed
bytes_per_rowandrows_per_imagemembers ofImageDataLayoutfromu32toOption<NonZeroU32>
- Renamed
- Changed
BindingResource::Bindingfrom containing fields directly to containing aBufferBinding - Added
BindingResource::BufferArray
- Renamed
- Infrastructure:
- switch from
tracingtoprofiling - more concrete and detailed errors
- API traces include the command that crashed/panicked
- Vulkan Portability support is removed from Apple platforms
- switch from
- Validation:
- texture bindings
- filtering of textures by samplers
- interpolation qualifiers
- allow vertex components to be underspecified
wgpu-core-0.7.1 (2021-02-25)
- expose
wgc::device::queuesub-module in public - fix the indexed buffer check
- fix command allocator race condition
v0.7 (2021-01-31)
- Major API changes:
RenderPipelineDescriptorBindingType- new
ShaderModuleDescriptor - new
RenderEncoder
- Features:
- (beta) WGSL support, including the ability to bypass SPIR-V entirely
- (beta) implicit bind group layout support
- better error messages
- timestamp and pipeline statistics queries
- ETC2 and ASTC compressed textures
- (beta) targeting Wasm with WebGL backend
- reduced dependencies
- Native-only:
- clamp-to-border addressing
- polygon fill modes
- query a format for extra capabilities
f64support in shaders
- Validation:
- shader interface
- render pipeline descriptor
- vertex buffers
wgpu-0.6.2 (2020-11-24)
- don't panic in the staging belt if the channel is dropped
v0.6 (2020-08-17)
- Crates:
- C API is moved to another repository
player: standalone API replayer and tester
- Features:
- Proper error handling with all functions returning
Result - Graceful handling of "error" objects
- API tracing infrastructure
- uploading data with
write_buffer/write_texturequeue operations - reusable render bundles
- read-only depth/stencil attachments
- bind group layout deduplication
- Cows, cows everywhere
- Web+Native features:
- Depth clamping (feature)
- BC texture compression
- Native-only features:
- mappable primary buffers
- texture array bindings
- push constants
- multi-draw indirect
- Proper error handling with all functions returning
- Validation:
- all transfer operations
- all resource creation
- bind group matching to the layout
- experimental shader interface matching with naga
wgpu-core-0.5.6 (2020-07-09)
- add debug markers support
wgpu-core-0.5.5 (2020-05-20)
- fix destruction of adapters, swap chains, and bind group layouts
- fix command pool leak with temporary threads
- improve assertion messages
- implement
From<TextureFormat>forTextureComponentType
wgpu-core-0.5.4 (2020-04-24)
- fix memory management of staging buffers
wgpu-core-0.5.3 (2020-04-18)
- fix reading access to storage textures
- another fix to layout transitions for swapchain images
wgpu-core-0.5.2 (2020-04-15)
- fix read-only storage flags
- fix pipeline layout life time
- improve various assert messages
wgpu-core-0.5.1 (2020-04-10)
- fix tracking of swapchain images that are used multiple times in a command buffer
- fix tracking of initial usage of a resource across a command buffer
v0.5 (2020-04-06)
- Crates:
wgpu-types: common types between native and web targetswgpu-core: internal API for the native and remote wrappers
- Features:
- based on gfx-hal-0.5
- moved from Rendy to the new
gfx-memoryandgfx-descriptorcrates - passes are now recorded on the client side. The user is also responsible to keep all resources referenced in the pass up until it ends recording.
- coordinate system is changed to have Y up in the rendering space
- revised GPU lifetime tracking of all resources
- revised usage tracking logic
- all IDs are now non-zero
- Mailbox present mode
- Validation:
- active pipeline
- Fixes:
- lots of small API changes to closely match upstream WebGPU
- true read-only storage bindings
- unmapping dropped buffers
- better error messages on misused swapchain frames
wgpu-core-0.4.3 (2020-01-20)
- improved swap chain error handling
wgpu-core-0.4.2 (2019-12-15)
- fixed render pass transitions
wgpu-core-0.4.1 (2019-11-28)
- fixed depth/stencil transitions
- fixed dynamic offset iteration
v0.4 (2019-11-03)
- Platforms: removed OpenGL/WebGL support temporarily
- Features:
- based on gfx-hal-0.4 with the new swapchain model
- exposing adapters from all available backends on a system
- tracking of samplers
- cube map support with an example
- Validation:
- buffer and texture usage
wgpu-core-0.3.3 (2019-08-22)
- fixed instance creation on Windows
wgpu-core-0.3.1 (2019-08-21)
- fixed pipeline barriers that aren't transitions
v0.3 (2019-08-21)
- Platforms: experimental OpenGL/WebGL
- Crates:
- Rust API is moved out to another repository
- Features:
- based on gfx-hal-0.3 with help of
rendy-memoryandrendy-descriptor - type-system-assisted deadlock prevention (for locking internal structures)
- texture sub-resource tracking
raw-window-handleintegration instead ofwinit- multisampling with an example
- indirect draws and dispatches
- stencil masks and reference values
- native "compute" example
- everything implements
Debug
- based on gfx-hal-0.3 with help of
- Validation
- vertex/index/instance ranges at draw calls
- bing groups vs their expected layouts
- bind group buffer ranges
- required stencil reference, blend color
wgpu-core-0.2.6 (2019-04-04)
- fixed frame acquisition GPU waits
wgpu-core-0.2.5 (2019-03-31)
- fixed submission tracking
- added support for blend colors
- fixed bind group compatibility at the gfx-hal level
- validating the bind groups and blend colors
wgpu-core-0.2.3 (2019-03-20)
- fixed vertex format mapping
- fixed building with "empty" backend on Windows
- bumped the default descriptor pool size
- fixed host mapping alignments
- validating the uniform buffer offset
v0.2 (2019-03-06)
- Platforms: iOS/Metal, D3D11
- Crates:
wgpu-remote: remoting layer for the cross-process boundarygfx-examples: selected gfx pre-ll examples ported over
- Features:
- native example for compute
- "gfx-cube" and "gfx-shadow" examples
- copies between buffers and textures
- separate object identity for the remote client
- texture view tracking
- native swapchain resize support
- buffer mapping
- object index epochs
- comprehensive list of vertex and texture formats
- validation of pipeline compatibility with the pass
- Fixes
- fixed resource destruction
v0.1 (2019-01-24)
- Platforms: Linux/Vulkan, Windows/Vulkan, D3D12, macOS/Metal
- Crates:
wgpu-native: C API implementation of WebGPU, based on gfx-halwgpu-bindings: auto-generated C headerswgpu: idiomatic Rust wrapperexamples: native C examples
- Features:
- native examples for triangle rendering
- basic native swapchain integration
- concept of the storage hub
- basic recording of passes and command buffers
- submission-based lifetime tracking and command buffer recycling
- automatic resource transitions