path: root/src
Age | Commit message | Author
2022-08-22 | bfly4m1 use a single pointer for data | Frank Barchard
2022-08-22 | Fix naming of the U64->U32 VSQRTSHIFT microkernel | Marat Dukhan
2022-08-22 | U64->U32 VSQRTSHIFT microkernel | Marat Dukhan
2022-08-22 | Unify WINDOW inferface with other microkernels | Marat Dukhan
2022-08-21 | Unify VLSHIFT interface with other VUNARY microkernels | Marat Dukhan
2022-08-21 | Refactor S16 RMAXABS microkernel | Marat Dukhan
2022-08-21 | U64 SQRT evaluation stubs | Marat Dukhan
2022-08-21 | Evaluation stubs for U32 SQRT using F32 SQRT | Marat Dukhan
2022-08-20 | Specialized M1 variant of bfly4 scalar | Frank Barchard
2022-08-19 | Rename xnn_qs8_minmax_params to xnn_qc8_conv_minmax_params | Marat Dukhan
2022-08-19 | Enable Relaxed SIMD microkernels for QS8/QU8 VCVT & VLRELU | Marat Dukhan
2022-08-18 | Specialized WINDOW microkernels for shift by constants | Frank Barchard
2022-08-18 | Refactor declarations of microkernel function pointers | Marat Dukhan
2022-08-17 | u32 filterbank-subtract scalar microkernel | Frank Barchard
2022-08-17 | Specialize binary elementwise operation task for 1D-4D cases | Marat Dukhan
2022-08-17 | Use mulext for filterback-accumulate to multiply 2 uint32 values to produce a... | Frank Barchard
2022-08-17 | Refactor U32 VLOG microkernels to use math_extmul_u32 | Marat Dukhan
2022-08-17 | BF16 GEMM microkernels for NEON & NEON-BF16 | Marat Dukhan
2022-08-17 | Depth to space nhwc uses transpose. | Alan Kelly
2022-08-17 | Depth to space nchw2nhwc uses transpose | Alan Kelly
2022-08-17 | Handle input and output strides in transpose normalization | Alan Kelly
2022-08-16 | Enable ARM SIMD32 microkernels for pre-NEON AArch32 processors | Marat Dukhan
2022-08-16 | Rename ARMV6SIMD to ARMSIMD32 | Marat Dukhan
2022-08-16 | WAsm Relaxed SIMD QS8/QU8 VCVT & VLRELU microkernels | Marat Dukhan
2022-08-16 | Move post-operation structs into separate file and lib | Zhi An Ng
2022-08-16 | Add eager API for transpose. | Alan Kelly
2022-08-15 | filterbank accumulate removed unused math header | Frank Barchard
2022-08-15 | Add fused operators support to convolution operators | Zhi An Ng
2022-08-13 | u32 filterbank-accumulate NEON and scalar microkernels | Frank Barchard
2022-08-12 | Change microkernel initialization functions to return the size of the initial... | Zhi An Ng
2022-08-09 | CS16 fftr scalar microkernel | Frank Barchard
2022-08-08 | bfly4 reorder math to do r and i in batches. | Frank Barchard
2022-08-08 | Remove unused FFT tables | Frank Barchard
2022-08-08 | Remove duplicate XNN_INTERNAL attribute from microparams initialization | Marat Dukhan
2022-08-07 | Extract xnnpack/microkernels.h into a separate Bazel target | Marat Dukhan
2022-08-06 | bfly4 template generate SAMPLE_TILE up to x4 | Frank Barchard
2022-08-06 | Fix indent for vsquareabs template | Frank Barchard
2022-08-05 | CS16 bfly4 microkernel scalar implementation | Frank Barchard
2022-08-03 | Rename table loglut to vlog | Frank Barchard
2022-08-03 | Return xnn_status instead of hard coded integers in JIT generators | Zhi An Ng
2022-08-03 | Add vld2_r to AArch32 assembler, this will be used later for fused operations. | Zhi An Ng
2022-08-02 | apply sort to gemm headers | Frank Barchard
2022-08-02 | U32 VLOG microkernel to compute natural log | Frank Barchard
2022-08-02 | Change JIT generator to return uint8_t instead of xnn_status to remove depend... | Zhi An Ng
2022-08-02 | Add some AArch32 assembler instructions | Zhi An Ng
2022-08-02 | Add extended multiplication math functions | Marat Dukhan
2022-08-01 | Convert AArch32 and AArch64 F32 GEMM and IGEMM microkernels used for default ... | Zhi An Ng
2022-08-01 | Do not redefine GNU_SOURCE if it's already defined by something already in th... | XNNPACK Team
2022-07-29 | S16 Front End Microkernel improve naming consistency | Frank Barchard
2022-07-28 | CS16 squareabs microkernel for NEON | Frank Barchard