AArch64 Instruction support & minor bug fixes #273

FinnWilkinson · 2022-11-16T20:39:53Z

As well as some AArch64 instruction support, the following fixes and optimisations are implemented:

- Added example SME core (A64FX with in-core SME support)
- Fixed MIPS calculation in CoreWrapper.cc
- Fixed store instruction group allocation logic for non-SVE instructions
- Corrected SME instruction group allocation in
- Reassigned the LdAddr iterator after erasing an element in the LSQ load conflict logic (originally in PR #272, "Reassign iterator after erasing element in std::vector")
- Moved the optimisations of PR #274 ("Minor optimizations") into this PR:
  - Refactored the ReorderBuffer::commitMicroOps method to use binary search to find the list of uops with the same instruction id. This improves the complexity from O(n) to O(log n).
  - Changed the DispatchIssueUnit::tick method to use a dynamic array instead of initializing a vector on each method call. This eliminates ~14 million memory allocations for stream-triad.
  - Replaced std::vector with std::list in LoadStoreQueue::startLoad, as it provides O(1) deletion compared to O(n) (worst case).
- Removed the explicit copy constructor from the AArch64 Instruction class
- Changed MatrixRow-Count config option to Matrix-Count
  - Makes it easier for the user to understand how many physical ZA registers they are defining
  - Hides the internal ZA implementation, again making it easier for users to understand and use

FinnWilkinson · 2023-01-27T15:16:10Z

#rerun tests

jj16791

One small comment about a misspelling but looks good otherwise

src/lib/arch/aarch64/Instruction_decode.cc

dANW34V3R

One comment about the TODO. The rest looks good

src/lib/arch/aarch64/Instruction_execute.cc

src/lib/pipeline/ReorderBuffer.cc

jj16791

A few comments for points of discussion

src/include/simeng/pipeline/DispatchIssueUnit.hh

src/lib/pipeline/LoadStoreQueue.cc

src/include/simeng/pipeline/ReorderBuffer.hh

src/lib/pipeline/ReorderBuffer.cc

FinnWilkinson added the bug and enhancement labels Nov 16, 2022

FinnWilkinson requested review from dANW34V3R, jj16791 and rahahahat November 16, 2022 20:39

FinnWilkinson self-assigned this Nov 16, 2022

FinnWilkinson force-pushed the dev branch from 0e8fd49 to 26efa1c November 29, 2022 15:16

FinnWilkinson added 11 commits December 7, 2022 12:35

Added OPENBLAS_NUM_THREADS to environment variables, and implemented the mbind syscall.

4b9c5cc

Implemented 4s version of the ST1 (multi structure) NEON instruction with tests.

423ee8a

Implemented the 2s and 4s variants of the ST1 (multi structures, post index) NEON instructions with tests.

9a25067

Implemented scalar double-precision FCVTZU (vector, integer) instruction with tests.

b5ec08a

Removed OpenBLAS environment variable from Linux Process.

79c05c5

Implemented 32-bit (fixed-point) scvtf instruction with tests.

2081324

Added A64fx with SME model config as an example implementation of SME.

efa9b64

Fixed instruction grouping allocation in Instruction Decode for AArch64.

a71dbd8

Fixed instruction group allocation in Instruction_decode.

89f5382

Fixed coreWrapper MIPS calculation.

7bb8ea5
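The PR does not say what was wrong with the MIPS calculation, so the following is only a minimal sketch of the conventional definition (retired instructions divided by elapsed seconds, scaled by 10^6). The function name `computeMips` and its parameters are assumptions for illustration and are not taken from CoreWrapper.cc.

```cpp
#include <chrono>
#include <cstdint>

// Hypothetical helper, not SimEng's actual code: millions of instructions
// retired per second of elapsed time.
double computeMips(std::uint64_t retiredInstructions,
                   std::chrono::duration<double> elapsedSeconds) {
  // Guard against a zero or negative interval to avoid dividing by zero.
  if (elapsedSeconds.count() <= 0.0) return 0.0;
  return static_cast<double>(retiredInstructions) / elapsedSeconds.count() /
         1e6;
}
```

Taking the interval as `std::chrono::duration<double>` means callers can pass milliseconds or nanoseconds and the unit conversion is done by the type system rather than by hand.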

Updated how physical matrix register count is defined to make it easier for the user.

67f2781

FinnWilkinson force-pushed the RIKEN-code-support branch from 08b5165 to 67f2781 December 7, 2022 12:43

FinnWilkinson and others added 6 commits January 27, 2023 15:19

Updated AArch64 Instruction methods to accommodate for larger number of SME src/dest operands.

b660e31

Undo previous commit

4bd94ac

Removed version control conflict marker from ModelConfig.hh.

33000fc

Fixed formatting issues.

dbd8816

Removed un-needed explicit copy constructor from AArch64 Instruction class.

34d83af

Reassign LdAddr iterator after erasing element in LSQ.

b262fcc
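For context on why the iterator has to be reassigned: `std::vector::erase` invalidates the iterator it is given and every iterator after it, and returns a valid iterator to the element that followed the erased one. A minimal sketch of the pattern, assuming a plain vector of addresses; `removeConflicts` and its parameters are hypothetical and do not mirror the LSQ code:

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical example: erase every address in [lo, hi) from the vector,
// returning how many were removed.
std::size_t removeConflicts(std::vector<std::uint64_t>& addrs,
                            std::uint64_t lo, std::uint64_t hi) {
  std::size_t removed = 0;
  auto it = addrs.begin();
  while (it != addrs.end()) {
    if (*it >= lo && *it < hi) {
      // erase() invalidates `it`; continue from the iterator it returns.
      it = addrs.erase(it);
      ++removed;
    } else {
      ++it;
    }
  }
  return removed;
}
```

Using `it` after the erase without reassigning it is undefined behaviour, even if it appears to work in practice.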

FinnWilkinson mentioned this pull request Jan 30, 2023

Reassign iterator after erasing element in std::vector #272

Closed

jj16791 requested changes Jan 30, 2023

src/lib/arch/aarch64/Instruction_decode.cc (outdated, resolved)

FinnWilkinson added 2 commits January 30, 2023 11:31

Use binary search to find insn in ROB->commitMicroOps

27ea2de

The instructions in ROB are sorted by their insnID, hence we can use binary search to efficiently find them.
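A rough sketch of the idea, assuming the buffer is a random-access container kept in non-decreasing insnID order. The `MicroOp` struct and `findMicroOps` function are hypothetical stand-ins, not the types used in ReorderBuffer.cc:

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <deque>
#include <utility>

// Hypothetical stand-in for a ROB entry.
struct MicroOp {
  std::uint64_t insnId;        // ID of the parent macro-instruction
  std::uint8_t microOpIndex;   // position of this uop within the instruction
};

// Returns the half-open index range [first, last) of entries whose insnId
// matches. Precondition: `buffer` is sorted by insnId.
std::pair<std::size_t, std::size_t> findMicroOps(
    const std::deque<MicroOp>& buffer, std::uint64_t insnId) {
  // First entry whose insnId is not less than the target.
  auto first = std::lower_bound(
      buffer.begin(), buffer.end(), insnId,
      [](const MicroOp& op, std::uint64_t id) { return op.insnId < id; });
  // First entry whose insnId is greater than the target.
  auto last = std::upper_bound(
      first, buffer.end(), insnId,
      [](std::uint64_t id, const MicroOp& op) { return id < op.insnId; });
  return {static_cast<std::size_t>(first - buffer.begin()),
          static_cast<std::size_t>(last - buffer.begin())};
}
```

An empty range (`first == last`) means no entry with that ID is present. Both searches are logarithmic because `std::deque` provides random-access iterators.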

Use dynamic array to track dispatches

ca1c615

This avoids having to make a new vector for every tick of DispatchIssueUnit
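To illustrate the allocation saving, here is a minimal sketch that allocates a per-port counter array once at construction and only zeroes it on each tick. The class `DispatchCounter`, its members, and the per-port limit are assumptions made for the example and are not DispatchIssueUnit's real interface:

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical, simplified stand-in for a dispatch unit.
class DispatchCounter {
 public:
  explicit DispatchCounter(std::size_t numPorts)
      : numPorts_(numPorts),
        dispatches_(std::make_unique<std::uint16_t[]>(numPorts)) {}

  // Called once per cycle. `ports` holds the port requested by each uop.
  // Returns how many uops were accepted, at most `maxPerPort` per port.
  std::size_t tick(const std::vector<std::uint16_t>& ports,
                   std::uint16_t maxPerPort) {
    // Reset the counters in place instead of constructing a new vector.
    std::fill_n(dispatches_.get(), numPorts_, std::uint16_t{0});
    std::size_t accepted = 0;
    for (std::uint16_t port : ports) {
      if (port < numPorts_ && dispatches_[port] < maxPerPort) {
        ++dispatches_[port];
        ++accepted;
      }
    }
    return accepted;
  }

  std::uint16_t dispatchedTo(std::size_t port) const {
    return dispatches_[port];
  }

 private:
  std::size_t numPorts_;
  std::unique_ptr<std::uint16_t[]> dispatches_;  // allocated once
};
```

The heap allocation happens once in the constructor, so a simulation that ticks millions of times pays only for the reset on each cycle.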

Minor improvements in LSQ::startLoad

85a23d9

Replace std::vector with std::list as it supports O(1) deletion.
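A small sketch of the trade-off, assuming entries are removed while the container is being walked. `PendingLoad` and `startReadyLoads` are hypothetical names chosen for the example, not the ones in LoadStoreQueue.cc:

```cpp
#include <cstddef>
#include <cstdint>
#include <list>

// Hypothetical pending-load record.
struct PendingLoad {
  std::uint64_t seqId;
  bool ready;
};

// Removes every ready load from the list and returns how many were started.
std::size_t startReadyLoads(std::list<PendingLoad>& pending) {
  std::size_t started = 0;
  for (auto it = pending.begin(); it != pending.end();) {
    if (it->ready) {
      // Unlinking a list node is O(1); a vector would shift every later
      // element down by one, which is O(n) in the worst case.
      it = pending.erase(it);
      ++started;
    } else {
      ++it;
    }
  }
  return started;
}
```

The usual caveat is that `std::list` gives up contiguous storage, so iteration is less cache-friendly; whether the swap is a net win depends on how often elements are erased relative to how often the container is traversed.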

dANW34V3R reviewed Jan 30, 2023

src/lib/arch/aarch64/Instruction_execute.cc (resolved)

Fixed spelling mistake.

f55fe1f

FinnWilkinson mentioned this pull request Jan 30, 2023

Minor optimizations #274

Closed

dANW34V3R reviewed Jan 30, 2023

src/lib/pipeline/ReorderBuffer.cc (outdated, resolved)

jj16791 requested changes Jan 30, 2023

src/include/simeng/pipeline/DispatchIssueUnit.hh (outdated, resolved)

src/lib/pipeline/LoadStoreQueue.cc (outdated, resolved)

src/include/simeng/pipeline/ReorderBuffer.hh (resolved)

Attended to PR comments.

c880c4d

dANW34V3R reviewed Jan 30, 2023

src/lib/pipeline/ReorderBuffer.cc (resolved)

jj16791 approved these changes Jan 30, 2023

Updated comments in ROB->commitMicroOps.

13bbd33

dANW34V3R approved these changes Jan 30, 2023

jj16791 approved these changes Jan 30, 2023

FinnWilkinson merged commit bfac331 into dev Jan 30, 2023

FinnWilkinson deleted the RIKEN-code-support branch June 8, 2023 10:06
