Simd Library Release Notes (2020).

Home | Release Notes | Download | Documentation | Issues | GitHub

2020 | 2019 | 2018 | 2017 | 2016 | 2015 | 2014 | 2013

January 3, 2020 (version 4.5.85)

Algorithms

New features
  • Base implementation, SSE2, AVX2, AVX-512F and NEON optimizations of function SynetUnaryOperation32fLayerForward.
  • Base implementation, SSE2, AVX2, AVX-512F and NEON optimizations of function SynetSoftplus32f.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel2x2Block2x2SetFilter.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel2x2Block2x2SetInput.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel2x2Block2x2SetOutput.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel2x2Block4x4SetFilter.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel2x2Block4x4SetInput.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel2x2Block4x4SetOutput.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel1x3Block1x4SetFilter.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel1x3Block1x4SetInput.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel1x3Block1x4SetOutput.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel1x5Block1x4SetFilter.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel1x5Block1x4SetInput.
  • Base implementation, SSE, AVX, AVX-512F and NEON optimizations of function WinogradKernel1x5Block1x4SetOutput.
Improving
  • Performance of Convolution32f framework (NHWC format, kernel=1x1x1).
  • Performance of Convolution32f framework (NHWC format, kernel=2x2).
  • Performance of Convolution32f framework (NHWC format, kernel=1x3).
  • Performance of Convolution32f framework (NHWC format, kernel=1x5).
Renaming
  • NeuralSigmoid to SynetSigmoid32f.
  • NeuralTanh to SynetTanh32f.
  • NeuralRelu to SynetRelu32f.
  • Winograd2x3SetFilter to WinogradKernel3x3Block2x2SetFilter.
  • Winograd2x3SetInput to WinogradKernel3x3Block2x2SetInput.
  • Winograd2x3SetOutput to WinogradKernel3x3Block2x2SetOutput.
  • Winograd3x3SetFilter to WinogradKernel3x3Block3x3SetFilter.
  • Winograd3x3SetInput to WinogradKernel3x3Block3x3SetInput.
  • Winograd3x3SetOutput to WinogradKernel3x3Block3x3SetOutput.
  • Winograd4x4SetFilter to WinogradKernel3x3Block4x4SetFilter.
  • Winograd4x4SetInput to WinogradKernel3x3Block4x4SetInput.
  • Winograd4x4SetOutput to WinogradKernel3x3Block4x4SetOutput.
Bug fixing
  • Error in Convolution32f framework (kernel greater than input size, NHWC format).
  • Potential crash in ContourDetector.

Test framework

New features
  • Tests for verifying functionality of function SynetUnaryOperation32fLayerForward.
  • Tests for verifying functionality of function SynetSoftplus32f.
  • Tests for verifying functionality of function WinogradKernel2x2Block2x2SetFilter.
  • Tests for verifying functionality of function WinogradKernel2x2Block2x2SetInput.
  • Tests for verifying functionality of function WinogradKernel2x2Block2x2SetOutput.
  • Tests for verifying functionality of function WinogradKernel2x2Block4x4SetFilter.
  • Tests for verifying functionality of function WinogradKernel2x2Block4x4SetInput.
  • Tests for verifying functionality of function WinogradKernel2x2Block4x4SetOutput.
  • Tests for verifying functionality of function WinogradKernel1x3Block1x4SetFilter.
  • Tests for verifying functionality of function WinogradKernel1x3Block1x4SetInput.
  • Tests for verifying functionality of function WinogradKernel1x3Block1x4SetOutput.
  • Tests for verifying functionality of function WinogradKernel1x5Block1x4SetFilter.
  • Tests for verifying functionality of function WinogradKernel1x5Block1x4SetInput.
  • Tests for verifying functionality of function WinogradKernel1x5Block1x4SetOutput.
Home
2020 | 2019 | 2018 | 2017 | 2016 | 2015 | 2014 | 2013