Age | Commit message | Author
2 days | Softmax kernels for AVX/AVX512 generate smaller unrolled variants [upstream-master] | Frank Barchard
4 days | AVX512skx RSUM F16F32ACC microkernels accumulate into output | Frank Barchard
4 days | Don't use mmap/munmap/mprotect for XNN_PLATFORM_QURT: the functions aren't av... | XNNPACK Team
7 days | Fix GEMM config for xnn_qd8_f16_qc8w_igemm_minmax_ukernel_4x16c4__neondotfp16... | Frank Barchard
7 days | Rdsum microkernels are accumlating | Alan Kelly
7 days | Add SSE rdsum microkernels | Alan Kelly
7 days | Fix GEMM config for xnn_qd8_f16_qc8w_gemm_minmax_ukernel_2x8c2s4__neonfp16arith | Frank Barchard
7 days | Fix GEMM config for xnn_qd8_f16_qc8w_igemm_minmax_ukernel_4x16c8__neoni8mm | Frank Barchard
7 days | Fix GEMM cnofig for xnn_qd8_f16_qc8w_igemm_minmax_ukernel_4x16c4__neondotfp16... | Frank Barchard
8 days | Enable F16-F32ACC-RSUM AVX512SKX microkernel for sum of FP16 values | Frank Barchard
8 days | Fix GEMM cnofig for xnn_qd8_f32_qc8w_gemm_minmax_ukernel_2x8c2s4__neon_mlal | Frank Barchard
8 days | Introduce TransposeConv with dynamic range quantization Subgraph API | Artsiom Ablavatski
8 days | Introduce QD8 TranposeConv operator | Artsiom Ablavatski
8 days | RDSum microkernels are no longer minmax and have their own tester. | Alan Kelly
8 days | Clean-up rdsum benches | Alan Kelly
8 days | Internal config change | Alan Kelly
9 days | Merge pull request #6326 from ejparkqc:master | XNNPACK Team
9 days | Merge pull request #6267 from mcr229:fc_qcint32_bias | XNNPACK Team
9 days | F16-F32ACC-RSUM AVX512SKX microkernel for sum of FP16 values | Frank Barchard
9 days | Improve GEMM unittest performance | Frank Barchard
9 days | Change one of the flags for hexagon-sim to use v68 | EJ Park
9 days | F16-RMAX scalar optimized keep max sign complement | Frank Barchard
10 days | Removing build script using Hexagon v66 | EJ Park
10 days | Change Hexagon minimum supported version to v68 (from v66) | EJ Park
10 days | Add f32 rsum discontig benchmarks | Alan Kelly
10 days | Add f32 rsum discontig neon microkernels | Alan Kelly
10 days | Add rsum discontiguous ukernels. | Alan Kelly
10 days | F16-RMAX - enable rmax F16C microkernel for F16C instead of AVX2 | Frank Barchard
10 days | Fix math.h spelling - compliment changed to complement | Frank Barchard
10 days | F16-RMAX - enable rmax scalar for all platforms | Frank Barchard
10 days | F16-RMINMAX - move math_min_f16 and math_max_f16 to math.h | Frank Barchard
10 days | F16-RMAX benchmark include f16c_u32 for AVX | Frank Barchard
11 days | Mean op can handle arbitrary reduction axis in the contiguous axes. | Alan Kelly
11 days | Fix missing `#include`s in the `XNNPACK/src` subdirectory. | Pedro Gonnet
11 days | Remove printf that were used during debugging reduce and resize | Frank Barchard
12 days | GEMM unittest step thru NC by NextPrime | Frank Barchard
13 days | Step thru k-block test range using prime numbers | Frank Barchard
14 days | Add iterative `vsqrt` microkernels for `x86_64`, which computes `x*rsqrt(x)`,... | Pedro Gonnet
14 days | Pass VNNI and AMX flags to hardware-config.c | Frank Barchard
14 days | Add support for broadcasting of scalar weights to Prelu | Alan Kelly
2024-04-19 | Automated Code Change | XNNPACK Team
2024-04-18 | Rsum ukernels accumulate into output. | Alan Kelly
2024-04-17 | X8-PACKW use unaligned_store_s32 | Frank Barchard
2024-04-17 | Use the FARF() macros for debug logging on Hexagon, rather than qurt_printf()... | XNNPACK Team
2024-04-17 | Fix order of external values in slinky | Dillon Sharlet
2024-04-17 | Re-generate tests after template update | Alan Kelly
2024-04-17 | Use the `ReplicableRandomDevice` instead of `std::random_device`/`std::mt1993... | Pedro Gonnet
2024-04-17 | Enable AVX2 F16-F32ACC GEMM for improved performance | Frank Barchard
2024-04-17 | Fix missing `#include`s in `XNNPACK/test` subdirectory. | Pedro Gonnet
2024-04-16 | Rollback of new `f32-vsqrt` microkernels. | Pedro Gonnet