Metrics for categorical forecasting #2767


Draft · jonasblanc wants to merge 5 commits into master

Conversation

jonasblanc (Collaborator)

Checklist before merging this PR:

  • Mentioned all issues that this PR fixes or addresses.
  • Summarized the updates of this PR under Summary.
  • Added an entry under Unreleased in the Changelog.

Fixes #.

Summary

Implement metrics for categorical forecasting.

TODO:

  • extend the list of supported categorical metrics
  • add categorical-specific tests
  • adapt the metrics decorator to support per-class scores and cross-class averaging
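
For context, these are the kinds of scores in question, shown here with scikit-learn on plain label arrays purely for illustration (the PR implements equivalents that operate on darts TimeSeries):

```python
import numpy as np
from sklearn.metrics import accuracy_score, f1_score

# toy categorical forecast: actual vs. predicted labels per time step
actual = np.array([0, 1, 2, 1, 0, 2])
pred = np.array([0, 1, 1, 1, 0, 2])

print(accuracy_score(actual, pred))  # 0.833... -> fraction of exact hits
print(f1_score(actual, pred, average="macro"))  # cross-class (macro) averaged F1
```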

Other Information

codecov bot commented Apr 11, 2025

Codecov Report

Attention: Patch coverage is 74.44444% with 23 lines in your changes missing coverage. Please review.

Project coverage is 95.03%. Comparing base (8e60174) to head (96492e7).

| Files with missing lines | Patch % | Lines |
|---|---|---|
| darts/metrics/categorical_metrics.py | 74.44% | 23 Missing ⚠️ |
Additional details and impacted files
```diff
@@            Coverage Diff             @@
##           master    #2767      +/-   ##
==========================================
- Coverage   95.23%   95.03%   -0.21%
==========================================
  Files         145      146       +1
  Lines       15092    15182      +90
==========================================
+ Hits        14373    14428      +55
- Misses        719      754      +35
```


@dennisbader (Collaborator) left a comment

Thanks @jonasblanc, this is a nice start 🚀

Some thoughts:

  • remove per-time-step metric support for all metrics, except maybe accuracy itself
  • add support for probabilistic series (see the sketch after this list):
    • if the predictions are sampled, take the label with the highest count
    • if the predictions are likelihood parameters, take the label with the highest probability
  • for a first collection, we should have accuracy, precision, recall, and f1
  • if we don't add time series support for the confusion matrix etc., then we should make these functions private
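
As a minimal sketch of the probabilistic-series point, assuming the hypothetical helper names below (the real reduction would live behind darts' metric decorators):

```python
import numpy as np

def labels_from_samples(samples: np.ndarray) -> np.ndarray:
    # samples: (n_time, n_samples) array of non-negative integer labels;
    # keep the label with the highest count at each time step
    return np.array([np.bincount(step).argmax() for step in samples])

def labels_from_params(probs: np.ndarray, classes: np.ndarray) -> np.ndarray:
    # probs: (n_time, n_classes) array of class probabilities;
    # keep the label with the highest probability at each time step
    return classes[probs.argmax(axis=1)]
```

Once the predictions are reduced to one label per time step, the deterministic metrics apply unchanged.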

Collaborator

We could call it classification.py; then it's a bit nicer to import:

```python
from darts.metrics.classification import acc
```

jonasblanc (Collaborator, Author)

Maybe forecasting_classification, so that it is not confused with the upcoming TS classification?

```python
    intersect,
    remove_nan_union=False,
)
return np.mean(
```
Collaborator

Here you already return the value aggregated over time, but your metric is defined as a per-time-step metric since it accepts time_reduction.

Either return the time-dependent accuracy here (e.g. a boolean 1/0 for whether it's a hit at each time step) and add a mean accuracy metric that aggregates it (see metrics.ae and metrics.mae as an example), or call it macc directly and remove the time_reduction.
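
A minimal sketch of the suggested split, with simplified ndarray signatures in place of darts' TimeSeries-based API:

```python
import numpy as np

def acc(y_true: np.ndarray, y_pred: np.ndarray) -> np.ndarray:
    # per-time-step accuracy: 1.0 for a hit, 0.0 for a miss (analogous to metrics.ae)
    return (y_true == y_pred).astype(float)

def macc(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    # mean accuracy aggregated over time (analogous to metrics.mae)
    return float(acc(y_true, y_pred).mean())
```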


```python
@multi_ts_support
@multivariate_support
def bacc(
```
Collaborator

These metrics below should not allow time_reduction; they should always return the aggregated metric.
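
For example, a minimal sketch of an always-aggregated balanced accuracy on plain label arrays (the PR's actual bacc sits behind the @multi_ts_support and @multivariate_support decorators shown above and takes TimeSeries arguments):

```python
import numpy as np

def bacc(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    # balanced accuracy: mean of the per-class recalls,
    # returned as a single value with no time_reduction
    classes = np.unique(y_true)
    recalls = [np.mean(y_pred[y_true == c] == c) for c in classes]
    return float(np.mean(recalls))
```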
