Remove deprecated `accessed_flags` flags field from `ZydisDecodedInstruction` #262

flobernd · 2021-11-03T07:10:16Z

Removes a bunch of deprecated code related to accessed CPU flags.

In its current state, this PR loses the fine-grained information about FPU flag actions:

TESTED, TESTED_MODIFIED -> READ
MODIFIED, TESTED_MODIFIED, SET_0, SET_1, UNDEFINED -> WRITTEN

I'm planning to bring back this functionality in a follow-up commit.
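To illustrate the collapse described in this comment, here is a minimal sketch in C. The `FlagAction` enum, the `FlagMasks` struct and `collapse_flag_actions` are hypothetical stand-ins written for this example; they are not the library's own definitions, and the bit positions are arbitrary.

```c
#include <stdint.h>

/* Hypothetical per-flag actions mirroring the names used in this comment. */
typedef enum {
    FLAG_ACTION_NONE,
    FLAG_ACTION_TESTED,
    FLAG_ACTION_TESTED_MODIFIED,
    FLAG_ACTION_MODIFIED,
    FLAG_ACTION_SET_0,
    FLAG_ACTION_SET_1,
    FLAG_ACTION_UNDEFINED
} FlagAction;

/* Hypothetical coarse result: one bit per flag in each mask. */
typedef struct {
    uint32_t read;
    uint32_t written;
} FlagMasks;

/* Collapse an array of per-flag actions (index = bit position) into the
   two coarse masks. SET_0, SET_1 and UNDEFINED all end up as "written",
   which is exactly the information that gets lost. */
static FlagMasks collapse_flag_actions(const FlagAction *actions, unsigned count)
{
    FlagMasks masks = {0, 0};
    for (unsigned i = 0; i < count && i < 32; ++i) {
        const uint32_t bit = (uint32_t)1 << i;
        switch (actions[i]) {
        case FLAG_ACTION_TESTED:
            masks.read |= bit;
            break;
        case FLAG_ACTION_TESTED_MODIFIED:
            masks.read |= bit;
            masks.written |= bit;
            break;
        case FLAG_ACTION_MODIFIED:
        case FLAG_ACTION_SET_0:
        case FLAG_ACTION_SET_1:
        case FLAG_ACTION_UNDEFINED:
            masks.written |= bit;
            break;
        case FLAG_ACTION_NONE:
        default:
            break;
        }
    }
    return masks;
}
```

Once collapsed, a flag that was set to 0 and a flag left undefined look identical in `written`, so the distinction cannot be recovered from the masks alone.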

athre0z · 2021-11-06T16:48:54Z

Hmm. I seem to remember that we decided that there is precisely zero value in being able to distinguish set 1, set 0 and undefined from write for any use-case that we could think of, unless we also provide full semantic information. What changed?

flobernd · 2021-11-06T16:55:55Z

The only drawback of including the extra info would be a slightly larger binary (maybe 1 KiB).

I personally would be fine keeping only the cpu_flags_read/written, but at the same time I'm wondering if any user possibly uses the advanced info for anything.

ZehMatt · 2021-11-06T17:14:14Z

I personally think having more granular detail would be desired, I was a bit surprised that it got reduced to the two fields only.

flobernd · 2021-11-06T17:29:34Z

I think there were some instructions for which the affected flags change depending on semantics. But these were only like 1-2 exceptions for which we defined the "worst case" set.

athre0z · 2021-11-06T17:30:22Z

> I personally think having more granular detail would be desired, I was a bit surprised that it got reduced to the two fields only.

Interesting. Could you perhaps provide some insights in how you'd profit from that in any real world project? Note that there are only 81 instructions that use SET_0 and 3 that use SET_1, which are the two cases where I could imagine that someone might build assumptions on it for e.g. an optimizer. For UNDEFINED, any optimizer would always have to assume that the flag might be tainted, which is essentially exactly the same as WRITE.

On the other hand, with the new design, users will now have to inspect (or OR together) the 4 different write flags in order to determine if any sort of write occurred.
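A minimal sketch of what that OR could look like on the user side, assuming a hypothetical struct with one bit mask per action. `GranularFlags`, `flags_written` and `flags_read` are names invented for this example and are not taken from the library.

```c
#include <stdint.h>

/* Hypothetical layout: one bit mask per action, one bit per flag. */
typedef struct {
    uint32_t tested;
    uint32_t modified;
    uint32_t set_0;
    uint32_t set_1;
    uint32_t undefined;
} GranularFlags;

/* Any kind of write: OR the four write-like masks together. */
static uint32_t flags_written(const GranularFlags *flags)
{
    return flags->modified | flags->set_0 | flags->set_1 | flags->undefined;
}

/* Reads map directly onto the tested mask. */
static uint32_t flags_read(const GranularFlags *flags)
{
    return flags->tested;
}
```

Callers that only care about "was this flag touched at all" would need a helper like this, or precomputed read/written masks kept alongside the granular ones.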

Corresponding instruction DB queries

$ jq '.[] | select(.affected_flags and any(.affected_flags[] | values; . == "0")) | .mnemonic' instructions.json | sort | uniq | wc -l
81

$ jq '.[] | select(.affected_flags and any(.affected_flags[] | values; . == "1")) | .mnemonic' instructions.json | sort | uniq | wc -l
1

flobernd · 2021-11-06T18:18:19Z

> On the other hand, with the new design, users will now have to inspect (or OR together) the 4 different write flags in order to determine if any sort of write occurred

With the old design before the flags_read/written masks, it was even worse tho 😃

Maybe we can go for a compromise and keep the flags_read/written while adding the other mask as additional info.

ZehMatt · 2021-11-07T00:38:29Z

> > I personally think having more granular detail would be desired, I was a bit surprised that it got reduced to the two fields only.
>
> Interesting. Could you perhaps provide some insights in how you'd profit from that in any real world project? Note that there are only 81 instructions that use SET_0 and 3 that use SET_1, which are the two cases where I could imagine that someone might build assumptions on it for e.g. an optimizer. For UNDEFINED, any optimizer would always have to assume that the flag might be tainted, which is essentially exactly the same as WRITE.

Well I would go with modified, consumed, undefined. As for the undefined flags there are quite interesting uses in obfuscations, so allowing the analysis to spot use of undefined flags without specifically checking for the instruction makes it a bit easier. The interesting thing about those undefined flags is that they are usually concrete per CPU and just undefined in a general sense.
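As a rough sketch of the kind of analysis meant here, the following C function scans a straight-line instruction sequence and reports the first instruction that tests a flag an earlier instruction left undefined. `InsnFlags` and `find_undefined_flag_use` are hypothetical names for this example, the per-action masks are an assumed layout, and control flow is ignored entirely.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical per-instruction flag info: one bit mask per action. */
typedef struct {
    uint32_t tested;
    uint32_t modified;
    uint32_t set_0;
    uint32_t set_1;
    uint32_t undefined;
} InsnFlags;

/* Returns the index of the first instruction that tests a flag which a
   previous instruction left undefined, or -1 if there is none. */
static long find_undefined_flag_use(const InsnFlags *insns, size_t count)
{
    uint32_t pending = 0; /* flags currently holding an undefined value */
    for (size_t i = 0; i < count; ++i) {
        if (insns[i].tested & pending) {
            return (long)i;
        }
        /* A defined write clears the undefined state of that flag. */
        pending &= ~(insns[i].modified | insns[i].set_0 | insns[i].set_1);
        /* Flags this instruction leaves undefined become pending. */
        pending |= insns[i].undefined;
    }
    return -1;
}
```

With only a combined written mask, the same check would need a per-mnemonic table of which flags each instruction leaves undefined.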

athre0z · 2021-11-07T16:19:36Z

I started a poll about this on Twitter yesterday and also asked around in some Discord communities. While nobody I talked to seemed to have a particularly strong opinion about it, it seems like quite a few people have previously profited from having this distinction in one way or another. Looking at this data, I'm going to pull a 180 on my previous position and suggest that we just keep things as implemented in this PR. :)


flobernd · 2021-11-09T08:09:44Z

> I started a poll about this on Twitter yesterday and also asked around in some Discord communities.

Thanks for taking care 👍

athre0z

LGTM! Sorry for taking so long to review.

include/Zydis/DecoderTypes.h

flobernd · 2021-11-10T05:42:57Z

> LGTM! Sorry for taking so long to review.

No worries at all! Thanks.

flobernd requested a review from athre0z November 3, 2021 07:10

flobernd force-pushed the remove-deprecated-code branch 3 times, most recently from ea27991 to 5580ced on November 5, 2021 09:34

athre0z added the A-decoder (Area: Decoder), C-cleanup (Category: Cleanup of code and refactoring work) and P-medium (Priority: Medium) labels Nov 6, 2021

flobernd force-pushed the remove-deprecated-code branch from 5580ced to b9717e4 on November 7, 2021 09:29

athre0z added this to the v4.0.0 milestone Nov 7, 2021

Remove deprecated accessed_flags flags field from `ZydisDecodedInstruction`

ad037b5

flobernd force-pushed the remove-deprecated-code branch from b9717e4 to ad037b5 on November 9, 2021 08:08

flobernd added 3 commits November 9, 2021 09:18

Fix msvc pedantic warnings

3338da6

Restore advanced FPU flag information

74d0a1c

Remove duplicate code in ZydisInfo

29fce53

flobernd force-pushed the remove-deprecated-code branch from 5226272 to 29fce53 on November 9, 2021 10:36

athre0z previously approved these changes Nov 9, 2021


Remove legacy code

8e26ad4

flobernd dismissed athre0z’s stale review via 8e26ad4 November 10, 2021 08:39

flobernd merged commit 7d6ee06 into master Nov 10, 2021

athre0z deleted the remove-deprecated-code branch November 10, 2021 09:31
