aboutsummaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2022-02-03Enable QU8 AAarch microkernels based on uarchFrank Barchard
2022-02-03Graph rewriting for FP16 inferenceMarat Dukhan
2022-02-03QS8 AArch32 GEMM benchmark build fixFrank Barchard
2022-02-02Add AArch32 GEMM benchmarks for Cortex A53 and Cortex A7Frank Barchard
2022-02-02QS8 GEMM benchmark for JIT add ISA checkFrank Barchard
2022-02-02Enable QS8/QC8 4x8 lane GEMM AArch32 microkernel for Cortex A7Frank Barchard
2022-02-02Include JIT_SRCS in XNNPACK buildMarat Dukhan
2022-02-02Support vld1r_32 with 1 or 2 register(s) in listZhi An Ng
2022-02-02Fix incorrect k argument to QC8/QS8 GEMM microkernel testZhi An Ng
2022-02-02Fix passing of kc JIT generator in F32 GEMM benchmarksZhi An Ng
2022-02-02Include missing <limits> header in 4x8 F32 GEMM codegen for A53Marat Dukhan
2022-02-02Make void* params argument of JIT generators constZhi An Ng
2022-02-02QS8 4x8 lane GEMM AArch32 microkernel for Cortex A7Frank Barchard
2022-02-02Enable QS8 4x8 lane GEMM AArch32 microkernel for Cortex A5r0 and A7Frank Barchard
2022-02-02Fix tfjs build by adding dependency on jitZhi An Ng
2022-02-02Specialize 6x8-aarch64-neonfma-cortex-a75 on min/max paramsZhi An Ng
2022-02-02QC8 4x8 lane GEMM AArch32 microkernel for Cortex A7Frank Barchard
2022-02-02Enable QU8 4x8 lane GEMM AArch32 microkernel for Cortex A53Frank Barchard
2022-02-02Enable QC8 4x8 lane GEMM AArch32 microkernel for Cortex A53Frank Barchard
2022-02-02Make SSE2 microkernels consistent with neon zip microkernels.Alan Kelly
2022-02-02Enable QS8 4x8 lane GEMM AArch32 microkernel for Cortex A53Frank Barchard
2022-02-02QS8 4x8 lane GEMM AArch32 microkernel for Cortex A53Frank Barchard
2022-02-02Add neon zip microkernel generatorAlan Kelly
2022-02-01Set F32 GEMM generator function for A75 if XNN_ENABLE_JIT is set (defaults to...Zhi An Ng
2022-02-01Store rows in direct order in F16 GEMM microkernelsMarat Dukhan
2022-02-01Explicitly disable -ffast-math for scalar & WAsm microkernelsMarat Dukhan
2022-02-01Guard JIT-related structs and functionality behind XNN_PLATFORM_JITZhi An Ng
2022-02-01Integrate JIT generated GEMM microkernels into create_convolution2d_nhwcZhi An Ng
2022-02-01Reoptimize QC8/QS8/QU8 GEMM/IGEMM WAsm SIMD microkernel selectionMarat Dukhan
2022-02-01QU8 GEMM/IGEMM WAsm SIMD microkernels with SR=4Marat Dukhan
2022-01-31Reoptimize NEON QC8/QS8 GEMM/IGEMM microkernels with SR > 1Marat Dukhan
2022-01-31Re-generate amalgamated FMA3 microkernelsMarat Dukhan
2022-01-31Reoptimize QS8/QC8 GEMM/IGEMM WAsm SIMD microkernels with swizzleMarat Dukhan
2022-01-31Pad K to a multiple of SR in GEMM/IGEMM microkernelsMarat Dukhan
2022-01-31Fix excessive memory allocation for packed weights in DeconvolutionMarat Dukhan
2022-01-31Improve test coverage for quantized Depthwise Convolutions in TFLite weight l...Marat Dukhan
2022-01-31Link LibM to indirection target in CMake buildMarat Dukhan
2022-01-31Make SSE2 microkernels consistent with neon zip microkernels.Alan Kelly
2022-01-31Make SSE2 microkernels consistent with neon zip microkernels.Alan Kelly
2022-01-31Make SSE2 microkernels consistent with neon zip microkernels.Alan Kelly
2022-01-31Integrate JIT generated GEMM microkernels into create_convolution2d_nhwcXNNPACK Team
2022-01-31Guard JIT-related structs and functionality behind XNN_PLATFORM_JITXNNPACK Team
2022-01-28Guard JIT-related structs and functionality behind XNN_PLATFORM_JITZhi An Ng
2022-01-28Integrate JIT generated GEMM microkernels into create_convolution2d_nhwcZhi An Ng
2022-01-28Check code_buffer capacity before attempting to release itZhi An Ng
2022-01-27Remove wb from JIT aarch32 instructions, use mem operand and ++ insteadZhi An Ng
2022-01-27Add F32 GEMM 6x8 aarch64 neonfma cortex a75 JIT microkernel to benchmarkZhi An Ng
2022-01-27Fix encoding of prfmZhi An Ng
2022-01-26QS8/QC8 4x8 dot product IGEMM AArch32 microkernel for Cortex A55Frank Barchard
2022-01-26Add default cases for switch, GCC warns that control reaches the end of non-v...Zhi An Ng