feat: flash attention support for hexagon-npu #16
Conversation
… method; enhance tests for FLASH_ATTN_EXT
…th new properties and methods
…parsing and improve OpTensor handling
Co-authored-by: Copilot <[email protected]>
Pull Request Overview
Adds Flash Attention support for the Hexagon NPU backend by enhancing log parsing, test coverage, CLI scripts, and documentation.
- Extend `log_parser` and its tests to recognize `FLASH_ATTN_EXT` operations
- Introduce `--flash-attn` flags in device-run and benchmark scripts (bash and PowerShell)
- Update build docs with Hexagon SDK requirements and add a Python test workflow
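As a rough illustration of the log-parser change described above, recognizing `FLASH_ATTN_EXT` lines could be sketched as follows. This is a hypothetical minimal version: the function name and regex are assumptions, not the actual `scripts/log_parser.py` implementation, which handles the op via item subclassing.

```python
import re

# Hypothetical sketch only: the real scripts/log_parser.py implements this
# via OpTensor item subclassing rather than a bare regex check.
FLASH_ATTN_RE = re.compile(r"\bFLASH_ATTN_EXT\b")

def is_flash_attn_line(line: str) -> bool:
    """Return True if a profiler log line refers to a FLASH_ATTN_EXT op."""
    return bool(FLASH_ATTN_RE.search(line))
```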
Reviewed Changes
Copilot reviewed 12 out of 12 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| scripts/tests/test_log_parser.py | Expanded unit tests to cover FLASH_ATTN_EXT parsing |
| scripts/run_device_model.sh | Added `-f`/`--flash-attn` argument and forwarding in bash script |
| scripts/run_device_model.ps1 | Added `-f`/`--FlashAttention` switch and arg assembly in PS script |
| scripts/log_parser.py | Implemented parsing and item subclassing for FLASH_ATTN_EXT |
| scripts/batch_run_benchmarks_and_save_log.sh | Added `--flash-attn` support for bash benchmark runner |
| scripts/batch_run_benchmarks_and_save_log.ps1 | Added `-f`/`--FlashAttention` support for PowerShell benchmark |
| llama.cpp | Updated submodule commit reference |
| docs/how-to-build.md | Added instructions to install Hexagon SDK |
| .github/workflows/python_tests.yml | New workflow to run Python unit tests for log parser |
Comments suppressed due to low confidence (3)
scripts/tests/test_log_parser.py:8

- [nitpick] The test name suggests checking a method called `get_name`, but it actually verifies `dtype`, `shape`, and `permute` behavior. Consider renaming to `test_op_tensor_initialization_sets_dtype_shape_and_permute` for clarity.

  `def test_given_op_tensor_when_get_name_then_return_correct_name(self):`
.github/workflows/python_tests.yml:41

- This step is labeled `Test with pytest` but invokes the unittest module incorrectly. Either switch to a `pytest` invocation or use `python -m unittest test_log_parser -v` to run the tests as written.

  `python3 -m test_log_parser -v`
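To illustrate the distinction this comment draws: `python -m unittest <module>` asks unittest's own CLI to load and run the module's `TestCase` classes, whereas `python -m <module>` only runs tests if the module calls `unittest.main()` itself. A minimal, self-contained equivalent (the test case here is a hypothetical stand-in, not the real `test_log_parser`):

```python
import unittest

class TestLogParser(unittest.TestCase):
    # Hypothetical placeholder test; the real suite lives in
    # scripts/tests/test_log_parser.py.
    def test_flash_attn_ext_tag(self):
        self.assertIn("FLASH_ATTN_EXT", "op: FLASH_ATTN_EXT")

# Roughly what `python -m unittest test_log_parser -v` does under the hood:
# load TestCase classes from the named module and run them.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestLogParser)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```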
scripts/log_parser.py:131

- [nitpick] The return type hint `dict[str:(...)]` is invalid Python syntax. Consider using `dict[str, Union[OpTensor.DataType, int, float, bool, list]]` or the `-> Dict[str, Any]` pattern to satisfy type checkers.

  `def __parse_prop(prop: str) -> dict[str:(OpTensor.DataType | int | float | bool | list)]:`
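A hedged sketch of the corrected annotation: `dict[K, V]` takes a comma-separated key and value type, while the original `dict[str:(...)]` accidentally wrote a slice. The function body below is a stand-in for illustration, not the real parsing logic in `scripts/log_parser.py`, and `str` replaces `OpTensor.DataType` to keep the example self-contained.

```python
from __future__ import annotations

# Stand-in sketch; the real __parse_prop handles OpTensor.DataType values
# and richer property shapes.
def parse_prop(prop: str) -> dict[str, int | float | bool | list | str]:
    """Parse a single `key=value` property string into a one-entry dict."""
    key, _, value = prop.partition("=")
    if value.isdigit():
        return {key: int(value)}
    return {key: value}
```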
@@ -37,6 +40,10 @@ if ($Verbose) {
     $extraArgs = "-v"
 }

 if ($FlashAttention) {
The variable `$extraArgs` is only set inside the `if ($Verbose)` block and may be undefined here. Initialize `$extraArgs = ""` before any conditionals to prevent null or concatenation errors.
This reverts commit 63c8829.
Related to chraac/llama.cpp#45