[Perf] Linux/x64: 19 Regressions on 3/20/2025 1:07:06 AM +00:00 #113912

performanceautofiler · 2025-03-25T18:31:31Z

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	522637540793a45275ac6a3136a0cd245264564c
Compare	f28528958c15a3b6a6d75970e09ec8d15322586a
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Buffers.Text.Tests.Utf8ParserTests

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
TryParseUInt32Hex - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	7.42 ns	9.64 ns	1.30	0.03	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Buffers.Text.Tests.Utf8ParserTests*'

System.Buffers.Text.Tests.Utf8ParserTests.TryParseUInt32Hex(value: FFFFFFFFFFFFFFFF)

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	522637540793a45275ac6a3136a0cd245264564c
Compare	f28528958c15a3b6a6d75970e09ec8d15322586a
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in PerfLabTests.LowLevelPerf

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector
GenericGenericMethod - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	112.84 μs	133.82 μs	1.19	0.01	False
GenericClassGenericStaticMethod - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	111.48 μs	133.83 μs	1.20	0.03	False
StaticDelegate - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	200.77 μs	222.73 μs	1.11	0.02	False
ClassVirtualMethod - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	111.70 μs	133.90 μs	1.20	0.02	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'PerfLabTests.LowLevelPerf*'

PerfLabTests.LowLevelPerf.GenericGenericMethod

ETL Files

Histogram

JIT Disasms

PerfLabTests.LowLevelPerf.GenericClassGenericStaticMethod

ETL Files

Histogram

JIT Disasms

PerfLabTests.LowLevelPerf.StaticDelegate

ETL Files

Histogram

JIT Disasms

PerfLabTests.LowLevelPerf.ClassVirtualMethod

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	522637540793a45275ac6a3136a0cd245264564c
Compare	f28528958c15a3b6a6d75970e09ec8d15322586a
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Memory.Span<Int32>

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
IndexOfAnyTwoValues - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	9.60 ns	11.54 ns	1.20	0.04	False
IndexOfAnyTwoValues - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	149.36 ns	171.73 ns	1.15	0.03	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Memory.Span&lt;Int32&gt;*'

System.Memory.Span<Int32>.IndexOfAnyTwoValues(Size: 33)

ETL Files

Histogram

JIT Disasms

System.Memory.Span<Int32>.IndexOfAnyTwoValues(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	522637540793a45275ac6a3136a0cd245264564c
Compare	f28528958c15a3b6a6d75970e09ec8d15322586a
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Collections.IndexerSetReverse<Int32>

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
List - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	524.73 ns	574.25 ns	1.09	0.03	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Collections.IndexerSetReverse&lt;Int32&gt;*'

System.Collections.IndexerSetReverse<Int32>.List(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	522637540793a45275ac6a3136a0cd245264564c
Compare	f28528958c15a3b6a6d75970e09ec8d15322586a
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Collections.ContainsKeyFalse<Int32, Int32>

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
ConcurrentDictionary - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	1.58 μs	1.85 μs	1.17	0.11	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Collections.ContainsKeyFalse&lt;Int32, Int32&gt;*'

System.Collections.ContainsKeyFalse<Int32, Int32>.ConcurrentDictionary(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	522637540793a45275ac6a3136a0cd245264564c
Compare	f28528958c15a3b6a6d75970e09ec8d15322586a
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Collections.Tests.Perf_BitArray

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector
BitArrayByteArrayCtor - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	155.19 ns	181.80 ns	1.17	0.01	False
BitArraySetLengthGrow - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	215.32 ns	242.45 ns	1.13	0.04	False
BitArraySetLengthShrink - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	155.01 ns	182.69 ns	1.18	0.02	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Collections.Tests.Perf_BitArray*'

System.Collections.Tests.Perf_BitArray.BitArrayByteArrayCtor(Size: 512)

ETL Files

Histogram

JIT Disasms

System.Collections.Tests.Perf_BitArray.BitArraySetLengthGrow(Size: 512)

ETL Files

Histogram

JIT Disasms

System.Collections.Tests.Perf_BitArray.BitArraySetLengthShrink(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	522637540793a45275ac6a3136a0cd245264564c
Compare	f28528958c15a3b6a6d75970e09ec8d15322586a
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in Span.IndexerBench

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
CoveredIndex2 - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	638.74 ns	696.39 ns	1.09	0.01	False
CoveredIndex1 - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	662.00 ns	1.10 μs	1.67	0.00	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'Span.IndexerBench*'

Span.IndexerBench.CoveredIndex2(length: 1024)

ETL Files

Histogram

JIT Disasms

Span.IndexerBench.CoveredIndex1(length: 1024)

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	522637540793a45275ac6a3136a0cd245264564c
Compare	f28528958c15a3b6a6d75970e09ec8d15322586a
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Collections.AddGivenSize<Int32>

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
Stack - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	679.93 ns	810.37 ns	1.19	0.08	False
Queue - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	1.02 μs	1.13 μs	1.11	0.04	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Collections.AddGivenSize&lt;Int32&gt;*'

System.Collections.AddGivenSize<Int32>.Stack(Size: 512)

ETL Files

Histogram

JIT Disasms

System.Collections.AddGivenSize<Int32>.Queue(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	522637540793a45275ac6a3136a0cd245264564c
Compare	f28528958c15a3b6a6d75970e09ec8d15322586a
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Text.RegularExpressions.Tests.Perf_Regex_Industry_Leipzig

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
Count - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	327.33 ms	354.25 ms	1.08	0.01	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Text.RegularExpressions.Tests.Perf_Regex_Industry_Leipzig*'

System.Text.RegularExpressions.Tests.Perf_Regex_Industry_Leipzig.Count(Pattern: ".{2,4}(Tom|Sawyer|Huckleberry|Finn)", Options: Compiled)

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	522637540793a45275ac6a3136a0cd245264564c
Compare	f28528958c15a3b6a6d75970e09ec8d15322586a
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Tests.Perf_UInt32

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
ParseSpan - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	7.94 ns	9.77 ns	1.23	0.11	False
Parse - Duration of single invocation 📝 - Benchmark Source ADX - Test Multi Config Graph	9.62 ns	10.98 ns	1.14	0.07	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Tests.Perf_UInt32*'

System.Tests.Perf_UInt32.ParseSpan(value: "0")

ETL Files

Histogram

JIT Disasms

System.Tests.Perf_UInt32.Parse(value: "12345")

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

LoopedBard3 · 2025-03-26T06:53:31Z

A mix of noise/bimodal, the rest are likely due to #113575 based on linked improvements. FYI @EgorBo. Commit range: 230a4ce...254b55a

Related regressions:

[Perf] Linux/x64: 3 Regressions on 3/20/2025 1:07:06 AM +00:00 perf-autofiling-issues#52351
[Perf] Windows/x64: 2 Regressions on 3/20/2025 1:07:06 AM +00:00 perf-autofiling-issues#52337
[Perf] Windows/x64: 13 Regressions on 3/20/2025 1:07:06 AM +00:00 perf-autofiling-issues#52324
[Perf] Linux/x64: 3 Regressions on 3/16/2025 10:54:17 PM +00:00 perf-autofiling-issues#52301 (ModPow)
[Perf] Linux/arm64: 1 Regression on 3/20/2025 12:28:49 AM +00:00 perf-autofiling-issues#52489
[Perf] Linux/arm64: 14 Regressions on 3/20/2025 4:50:58 AM +00:00 perf-autofiling-issues#52490
[Perf] Linux/x64: 2 Regressions on 3/29/2025 2:11:00 AM +00:00 perf-autofiling-issues#52681
[Perf] Windows/arm64: 1 Regression on 3/20/2025 4:50:58 AM +00:00 perf-autofiling-issues#52920
[Perf] Windows/arm64: 1 Regression on 3/20/2025 2:16:23 PM +00:00 perf-autofiling-issues#53303

EgorBo · 2025-03-26T13:31:55Z

@EgorBot -intel -amd -commit 254b55a vs previous --filter System.Collections.Tests.Perf_BitArray.BitArrayByteArrayCtor

EgorBo · 2025-03-26T13:55:43Z

Does not look like my change

EgorBo · 2025-03-26T13:56:00Z

@EgorBot -intel -amd -commit 0666ad4 vs previous --filter System.Collections.Tests.Perf_BitArray.BitArrayByteArrayCtor

EgorBo · 2025-03-26T13:56:19Z

@EgorBot -intel -amd -commit 48ace18 vs previous --filter System.Collections.Tests.Perf_BitArray.BitArrayByteArrayCtor

EgorBo · 2025-03-26T14:11:05Z

@EgorBot -intel -amd -nonativepgo -commit 0666ad4 vs previous --filter System.Collections.Tests.Perf_BitArray.BitArrayByteArrayCtor

EgorBo · 2025-03-26T14:22:44Z

Confirmed that it is same issue as #113108 (regressed by #113491, see EgorBot/runtime-utils#332 (comment)) cc @amanasifkhalid

EgorBo · 2025-03-26T14:23:36Z

@amanasifkhalid do we need to keep this one open or #113108 is enough?

EgorBo · 2025-03-26T14:24:22Z

NOTE: seems like it's Intel specific, so likely the JCC erratum

amanasifkhalid · 2025-03-26T14:31:51Z

@EgorBo I'd like to keep this open for now, just so I can include it in my tables of regressions/improvements. I'll close this if I don't find anything actionable.

dotnet-policy-service · 2025-03-26T14:35:56Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

amanasifkhalid · 2025-04-11T23:19:18Z

I'm not sure if it'll make it into .NET 10, but I think these regressions are a good motivator for improving loop inversion so we don't have to rely on ad-hoc fgOptimizeBranch passes, which (as seen above) sometimes make loop structures amenable to block layout, and sometimes don't. So I'm going to keep this issue around for now.

amanasifkhalid · 2025-04-29T16:11:24Z

I'm not sure if it'll make it into .NET 10, but I think these regressions are a good motivator for improving loop inversion

I don't know if we'll get to graph-based loop inversion in .NET 10. Moving to future.

performanceautofiler bot added arch-x64 os-linux Linux OS (any supported distro) runtime-coreclr specific to the CoreCLR runtime untriaged New issue has not been triaged by the area owner labels Mar 25, 2025

performanceautofiler bot mentioned this issue Mar 25, 2025

[SENTINEL] Autofile run complete at 3/25/2025 6:45:15 PM +00:00. 12 issues filed. dotnet/perf-autofiling-issues#52321

Closed

LoopedBard3 transferred this issue from dotnet/perf-autofiling-issues Mar 26, 2025

dotnet-issue-labeler bot added the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label Mar 26, 2025

LoopedBard3 added tenet-performance Performance related issue tenet-performance-benchmarks Issue from performance benchmark labels Mar 26, 2025

EgorBot mentioned this issue Mar 26, 2025

Benchmarks for #113912 (EgorBo) EgorBot/runtime-utils#330

Open

EgorBot mentioned this issue Mar 26, 2025

Benchmarks for #113912 (EgorBo) EgorBot/runtime-utils#331

Closed

EgorBot mentioned this issue Mar 26, 2025

Benchmarks for #113912 (EgorBo) EgorBot/runtime-utils#332

Open

EgorBot mentioned this issue Mar 26, 2025

Benchmarks for #113912 (EgorBo) EgorBot/runtime-utils#333

Open

EgorBo assigned amanasifkhalid Mar 26, 2025

EgorBo added area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI and removed needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners labels Mar 26, 2025

JulieLeeMSFT removed the untriaged New issue has not been triaged by the area owner label Mar 26, 2025

JulieLeeMSFT added this to the 10.0.0 milestone Mar 26, 2025

amanasifkhalid mentioned this issue Apr 11, 2025

Widespread perf regressions due to RPO layout #102763

Closed

amanasifkhalid modified the milestones: 10.0.0, Future Apr 29, 2025

[Perf] Linux/x64: 19 Regressions on 3/20/2025 1:07:06 AM +00:00 #113912

[Perf] Linux/x64: 19 Regressions on 3/20/2025 1:07:06 AM +00:00 #113912

Comments

performanceautofiler bot commented Mar 25, 2025

Run Information

Regressions in System.Buffers.Text.Tests.Utf8ParserTests

Repro

System.Buffers.Text.Tests.Utf8ParserTests.TryParseUInt32Hex(value: FFFFFFFFFFFFFFFF)

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in PerfLabTests.LowLevelPerf

Repro

PerfLabTests.LowLevelPerf.GenericGenericMethod

ETL Files

Histogram

JIT Disasms

PerfLabTests.LowLevelPerf.GenericClassGenericStaticMethod

ETL Files

Histogram

JIT Disasms

PerfLabTests.LowLevelPerf.StaticDelegate

ETL Files

Histogram

JIT Disasms

PerfLabTests.LowLevelPerf.ClassVirtualMethod

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in System.Memory.Span<Int32>

Repro

System.Memory.Span<Int32>.IndexOfAnyTwoValues(Size: 33)

ETL Files

Histogram

JIT Disasms

System.Memory.Span<Int32>.IndexOfAnyTwoValues(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in System.Collections.IndexerSetReverse<Int32>

Repro

System.Collections.IndexerSetReverse<Int32>.List(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in System.Collections.ContainsKeyFalse<Int32, Int32>

Repro

System.Collections.ContainsKeyFalse<Int32, Int32>.ConcurrentDictionary(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in System.Collections.Tests.Perf_BitArray

Repro

System.Collections.Tests.Perf_BitArray.BitArrayByteArrayCtor(Size: 512)

ETL Files

Histogram

JIT Disasms

System.Collections.Tests.Perf_BitArray.BitArraySetLengthGrow(Size: 512)

ETL Files

Histogram

JIT Disasms

System.Collections.Tests.Perf_BitArray.BitArraySetLengthShrink(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in Span.IndexerBench

Repro

Span.IndexerBench.CoveredIndex2(length: 1024)

LoopedBard3 commented Mar 26, 2025 •

edited

Loading

EgorBo commented Mar 26, 2025 •

edited

Loading