oneAPI Deep Neural Network Library (oneDNN)
This is a patch release containing the following changes to v3.4:
This is a patch release containing the following changes to v3.3.5:
Intel Architecture Processors:
[...] 1 and 14.
[...] matmul and add operations and mixed int8 and bfloat16 data types with Graph API.
[...] reduction, softmax and layernorm operations with experimental Graph Compiler backend.
Intel Graphics Products:
AArch64-based Processors:
[...] -mcpu=generic to improve portability.
[...] --num-streams knob in benchdnn to support benchmarking in multi-stream scenarios.
This release contains contributions from the project core team as well as Alexander Grund @Flamefire, David Svantesson @davsva01, Fadi Arafeh @fadara01, Hugh Delaney @hdelan, Ilya Lavrenov @ilya-lavrenov, Jacob Kahn @jacobkahn, Nathan John Sircombe @nSircombe, Renato Barros Arantes @renato-arantes, Sergey Shalnov @shssf, Sunita Nadampalli @snadampal, and Svetlozar Georgiev @sgeor255. We would also like to thank everyone who asked questions and reported issues.
This is a patch release containing the following changes to v3.3.4:
[...] SEGFAULT in int8 convolution on processors with Intel AMX support (2a8e122b63b55f897c470d23f21003bb70f0e839)
This is a patch release containing the following changes to v3.3.3:
[...] SEGFAULT in 3D convolutions with different h and w parameters on Intel CPUs (b5f916ec068f783dbba2cd4f04a673e996f9efba)
This is a patch release containing the following changes to v3.3.2:
This is a patch release containing the following changes to v3.3.1:
This is a patch release containing the following changes to v3.3:
[...] avgpool_bwd operation in Graph API (d025ef6620b131f3487bb748866ddd9d7225c09f, 9e0602ad37afa18d46f407cb52577f1afead238b, e0dc1b3d070313052f5fd6ac739778d45b57859c)
[...] SEGFAULT in experimental Graph Compiler for fp32 MLP subgraph (42071057abb2fcbbca6ed67117bdb1a5ee3dc0cd)
[...] unimplemented on Intel GPUs (bf12207b0312c0174f0c47ae0d3abd70edc31957, 800b5e9613bd0994af82706ef024ad2b453be2b6, ec7054a2c79ae33d3db4ff04ce11360c2c896d56)
[...] any memory format tag.
[...] dnnl::graph::set_constant_tensor_cache() call.
This release contains contributions from the project core team as well as Amy Wignall @AmyWignall-arm, @baibeta, Benjamin Taylor @bentaylorhk-arm, Ilya Lavrenov @ilya-lavrenov, Kentaro Kawakami @kawakami-k, Milos Puzovic @milpuz01, Renato Barros Arantes @renato-arantes, @snadampal, @sparkyrider, and Thomas Köppe @tkoeppe. We would also like to thank everyone who asked questions and reported issues.