Commit 82f51e9
[0.6] Add support arm opt fp32 intrinsics [MOD-9011] (#636)
* Add support arm opt fp32 intrinsics [MOD-9011] (#617)
* Add arm support
* Changed the arm cpu info
* Add ip test
* Add to tests
* Added tests andbm
* fix tests
* Add github benchmakrs
* Check 1
* only arm
* change ami
* Try ireland
* Try different image
* try image
* back to old image
* larger image
* Add option to change env
* back to default region
* Created new image
* Try to add the x86 to check
* Try different machine
* added include
* Try without opti on arm
* Change to c6g
* added matrix region
* change to west
* try the i8
* Try oregon
* Change subnet id
* Now subnet
* Change subnet
* add subnet
* Try group id
* Change to vpc id
* change subnet
* Change ami
* Try without subnet
* add security group again
* Change the subnets
* Change to ids
* Change sg
* psubnet
* Try different
* different
* to a file
* print
* p
* leave empty
* empty
* Try different account
* Run 2 arm machines
* Move both to us-west-2
* Try workflow
* Change name
* Changes
* Change the secrets
* Add supprted arch
* Add defaults
* Support all
* Change the jq
* Change machine to t4g
* Change the name
* Change the machine
* fix the stop
* only benchamrk
* add the secrets
* region secret
* benchmark region
* Change timeout
* Added support for arch name in benchamrks
* change th json
* changed to v9.0
* Change the check
* add v9
* Check alt version of armv9
* added check
* add arc_arch
* changed to CONCAT_WITH_UNDERSCORE_ARCH
* change the check
* Add full check
* fix the instruct
* Added the cmake
* fix the support
* put it back to cmake
* back
* change the condition
* No armpl for now
* cland format
* remove the opt
* Changed to one machine
* Added BENCHMARK_ARCH
* fix endif
* Remove secrets call
* pr changes
* Changes
* change to compile
* add sve
* add #endif
* add armpl
* add to cmake
* remove armpl
* add install
* Add ARCH=$(uname -m)
* change the path to armpl
* suuport check for armv7
* change the armpl
* Change or OR
* add neon supported for spaces
* add sve
* add support
* align
* format
* change error
* change
* Removed the ifdef
* Add comments
* clang
* Change names
* format
* Try fp32 neon simd
* add l2
* add cmake
* add SVE
* fix sve l2
* PR changes
* Change to 1
* fix the l2
* fix format
* add desciriopn for chunk == 1
* Change functions
* Add include
* Change the cast
* add resudual
* formatting
* Move th consexpt
* remove template armpl
* Back to armpl
* back to armpl_neon
* include
* armnpl
* add choose
* fix the residual div
* raise the residuals values
* back to char
* Remove prefetch
* Revert implemetion chooser
* Remove armpl
* Revert remove error
* Remove comment
* Remove empty line
* format
* Add support macos
* add sudo
* Add absolute path
* find all libs
* Change folder
* Now set for real
* Remove armpl from pull
* change the templates
* change chunk size to 1
* Back to 4
* Removed the for
* Change to 2 sums
* Changed
* Add get opt func
* Change the var name
* format
* Pr fixes
* PR
* pr
* pr fix
* PR
* added conversion
* small dim for intel only
* Test smallDimChooser only for intel
(cherry picked from commit b996755)
* changes
* Revert "changes"
This reverts commit a0a00c5.
* Revert "Add support arm opt fp32 intrinsics [MOD-9011] (#617)"
This reverts commit c97d347.
* Changes
* Format
* Change tests
* fix tests
* format
* PR changes
* new line
* tests
* format
* Add support arm opt fp32 intrinsics [MOD-9011] (#617)
* Add arm support
* Changed the arm cpu info
* Add ip test
* Add to tests
* Added tests andbm
* fix tests
* Add github benchmakrs
* Check 1
* only arm
* change ami
* Try ireland
* Try different image
* try image
* back to old image
* larger image
* Add option to change env
* back to default region
* Created new image
* Try to add the x86 to check
* Try different machine
* added include
* Try without opti on arm
* Change to c6g
* added matrix region
* change to west
* try the i8
* Try oregon
* Change subnet id
* Now subnet
* Change subnet
* add subnet
* Try group id
* Change to vpc id
* change subnet
* Change ami
* Try without subnet
* add security group again
* Change the subnets
* Change to ids
* Change sg
* psubnet
* Try different
* different
* to a file
* print
* p
* leave empty
* empty
* Try different account
* Run 2 arm machines
* Move both to us-west-2
* Try workflow
* Change name
* Changes
* Change the secrets
* Add supprted arch
* Add defaults
* Support all
* Change the jq
* Change machine to t4g
* Change the name
* Change the machine
* fix the stop
* only benchamrk
* add the secrets
* region secret
* benchmark region
* Change timeout
* Added support for arch name in benchamrks
* change th json
* changed to v9.0
* Change the check
* add v9
* Check alt version of armv9
* added check
* add arc_arch
* changed to CONCAT_WITH_UNDERSCORE_ARCH
* change the check
* Add full check
* fix the instruct
* Added the cmake
* fix the support
* put it back to cmake
* back
* change the condition
* No armpl for now
* cland format
* remove the opt
* Changed to one machine
* Added BENCHMARK_ARCH
* fix endif
* Remove secrets call
* pr changes
* Changes
* change to compile
* add sve
* add #endif
* add armpl
* add to cmake
* remove armpl
* add install
* Add ARCH=$(uname -m)
* change the path to armpl
* suuport check for armv7
* change the armpl
* Change or OR
* add neon supported for spaces
* add sve
* add support
* align
* format
* change error
* change
* Removed the ifdef
* Add comments
* clang
* Change names
* format
* Try fp32 neon simd
* add l2
* add cmake
* add SVE
* fix sve l2
* PR changes
* Change to 1
* fix the l2
* fix format
* add desciriopn for chunk == 1
* Change functions
* Add include
* Change the cast
* add resudual
* formatting
* Move th consexpt
* remove template armpl
* Back to armpl
* back to armpl_neon
* include
* armnpl
* add choose
* fix the residual div
* raise the residuals values
* back to char
* Remove prefetch
* Revert implemetion chooser
* Remove armpl
* Revert remove error
* Remove comment
* Remove empty line
* format
* Add support macos
* add sudo
* Add absolute path
* find all libs
* Change folder
* Now set for real
* Remove armpl from pull
* change the templates
* change chunk size to 1
* Back to 4
* Removed the for
* Change to 2 sums
* Changed
* Add get opt func
* Change the var name
* format
* Pr fixes
* PR
* pr
* pr fix
* PR
* added conversion
* small dim for intel only
* Test smallDimChooser only for intel
(cherry picked from commit b996755)
* changes
* Revert "changes"
This reverts commit a0a00c5.
* Revert "Add support arm opt fp32 intrinsics [MOD-9011] (#617)"
This reverts commit c97d347.
* Changes
* Format
* Change tests
* fix tests
* format
* PR changes
* new line
* tests
* format
* Add support arm opt fp32 intrinsics [MOD-9011] (#617)
* Add arm support
* Changed the arm cpu info
* Add ip test
* Add to tests
* Added tests andbm
* fix tests
* Add github benchmakrs
* Check 1
* only arm
* change ami
* Try ireland
* Try different image
* try image
* back to old image
* larger image
* Add option to change env
* back to default region
* Created new image
* Try to add the x86 to check
* Try different machine
* added include
* Try without opti on arm
* Change to c6g
* added matrix region
* change to west
* try the i8
* Try oregon
* Change subnet id
* Now subnet
* Change subnet
* add subnet
* Try group id
* Change to vpc id
* change subnet
* Change ami
* Try without subnet
* add security group again
* Change the subnets
* Change to ids
* Change sg
* psubnet
* Try different
* different
* to a file
* print
* p
* leave empty
* empty
* Try different account
* Run 2 arm machines
* Move both to us-west-2
* Try workflow
* Change name
* Changes
* Change the secrets
* Add supprted arch
* Add defaults
* Support all
* Change the jq
* Change machine to t4g
* Change the name
* Change the machine
* fix the stop
* only benchamrk
* add the secrets
* region secret
* benchmark region
* Change timeout
* Added support for arch name in benchamrks
* change th json
* changed to v9.0
* Change the check
* add v9
* Check alt version of armv9
* added check
* add arc_arch
* changed to CONCAT_WITH_UNDERSCORE_ARCH
* change the check
* Add full check
* fix the instruct
* Added the cmake
* fix the support
* put it back to cmake
* back
* change the condition
* No armpl for now
* cland format
* remove the opt
* Changed to one machine
* Added BENCHMARK_ARCH
* fix endif
* Remove secrets call
* pr changes
* Changes
* change to compile
* add sve
* add #endif
* add armpl
* add to cmake
* remove armpl
* add install
* Add ARCH=$(uname -m)
* change the path to armpl
* suuport check for armv7
* change the armpl
* Change or OR
* add neon supported for spaces
* add sve
* add support
* align
* format
* change error
* change
* Removed the ifdef
* Add comments
* clang
* Change names
* format
* Try fp32 neon simd
* add l2
* add cmake
* add SVE
* fix sve l2
* PR changes
* Change to 1
* fix the l2
* fix format
* add desciriopn for chunk == 1
* Change functions
* Add include
* Change the cast
* add resudual
* formatting
* Move th consexpt
* remove template armpl
* Back to armpl
* back to armpl_neon
* include
* armnpl
* add choose
* fix the residual div
* raise the residuals values
* back to char
* Remove prefetch
* Revert implemetion chooser
* Remove armpl
* Revert remove error
* Remove comment
* Remove empty line
* format
* Add support macos
* add sudo
* Add absolute path
* find all libs
* Change folder
* Now set for real
* Remove armpl from pull
* change the templates
* change chunk size to 1
* Back to 4
* Removed the for
* Change to 2 sums
* Changed
* Add get opt func
* Change the var name
* format
* Pr fixes
* PR
* pr
* pr fix
* PR
* added conversion
* small dim for intel only
* Test smallDimChooser only for intel
(cherry picked from commit b996755)
* changes
* Revert "changes"
This reverts commit a0a00c5.
* Revert "Add support arm opt fp32 intrinsics [MOD-9011] (#617)"
This reverts commit c97d347.
* Changes
* Format
* Change tests
* fix tests
* format
* PR changes
* new line
* unndeed
* remove
* changes
* format
* [0.6] [MOD-9303] Update GoogleTest tag to support CMake 4.0 compatibility (#637)
[MOD-9303] Update GoogleTest tag to support CMake 4.0 compatibility (#630)
* use googletest 1.16.0
remove pin cmake version (not needed as we meet cmake requirments of cmake 4.0)
* disbale isntall cmake in codeql
* disable temp flow
remove install cmake from codeql
(cherry picked from commit b983c5b)
* Build the test at runtime
* Fix length
---------
Co-authored-by: Omer <lerman25@gmail.com>
Co-authored-by: meiravgri <109056284+meiravgri@users.noreply.github.com>1 parent 1476b87 commit 82f51e9
File tree
25 files changed
+893
-21
lines changed- cmake
- src/VecSim/spaces
- IP
- L2
- functions
- tests
- benchmark
- spaces_benchmarks
- unit
25 files changed
+893
-21
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
93 | 93 | | |
94 | 94 | | |
95 | 95 | | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
96 | 122 | | |
97 | 123 | | |
98 | 124 | | |
99 | 125 | | |
100 | 126 | | |
101 | 127 | | |
102 | 128 | | |
| 129 | + | |
103 | 130 | | |
104 | 131 | | |
105 | 132 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
13 | 16 | | |
14 | 17 | | |
15 | 18 | | |
16 | 19 | | |
17 | 20 | | |
18 | | - | |
19 | 21 | | |
| 22 | + | |
20 | 23 | | |
21 | | - | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
22 | 29 | | |
| 30 | + | |
23 | 31 | | |
24 | 32 | | |
| 33 | + | |
25 | 34 | | |
26 | 35 | | |
27 | 36 | | |
| |||
52 | 61 | | |
53 | 62 | | |
54 | 63 | | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
55 | 82 | | |
56 | 83 | | |
57 | 84 | | |
58 | | - | |
59 | 85 | | |
60 | 86 | | |
61 | 87 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
0 commit comments