Specify PK for TF-PSA-Crypto 1.0 #203

gilles-peskine-arm · 2025-03-14T17:51:39Z

Design for pk.h in TF-PSA-Crypto 1.0.

The design attempts to minimize the amount of work we do before 1.0. A lot of things are left under the carpet. In particular, Mbed TLS library code and tests, as well as sample programs, are allowed to call functions that are now declared as private.

Resolves Mbed-TLS/mbedtls#8452

PR checklist

changelog provided | not required because: doc only
framework PR not required
mbedtls development PR not required because: new stuff, doc only
mbedtls 3.6 PR not required because: new stuff
mbedtls 2.28 PR not required because: new stuff
tests not required because: doc only

Signed-off-by: Gilles Peskine <[email protected]>

mpg

Looks pretty good, thanks! I only have a couple of question. The rest is very minor (and some comments are just notes that don't require any kind of action).

docs/architecture/pk-4.md

mpg · 2025-03-17T09:08:43Z

docs/architecture/pk-4.md

+* The poorly defined type `mbedtls_pk_type_t` and the associated function `mbedtls_pk_can_do()`. Use PSA metadata instead.
+* Mechanism names: `mbedtls_pk_get_name()`.
+* The RSA-oriented length function: `mbedtls_pk_get_len()`. Use `mbedtls_pk_get_bitlen()`.
+* PSS-extended functions: `mbedtls_pk_sign_ext()`, `mbedtls_pk_verify_ext()`. PSA has less flexibility than the PK API. Use PSA APIs to get all the flexibility that PSA can have.


Consider the following scenario: a TLS 1.2/1.3 server has an RSA key that it wants to use for both protocols, which means making either v1.5 signatures (for 1.2) or PSS signatures (for 1.3). I think the way we support it now is with sign_ext() - nor for the extra fancy PSS settings, just for the ability to say we want PSS.

How is this kind of use case handled in the new PK? Do people need to create two contexts, one for v1.5 one for PSS?

(I don't think mbedtls_pk_set_algorithm() helps here, because mutating the context every time you want to do a signature would cause all kinds of threading issues.)

You need two contexts. The idea is that under the hood, this can be a PSA key, and PSA only allows one algorithm family. (Our PSA implementation can support two algorithms, but in the medium term we don't want to assume that this extension is present — even TF-M doesn't expose it.)

Ok, but then we need to document this as a breaking change between 3.x and 4.x/1.x. In 3.x I can parse an RSA key with pk_parse(), pass it to ssl_conf_own_cert() and things will just work with TLS 1.2 and 1.3. In 4.x I'll need to create a copy of the context with different permissions and then call ssl_conf_own_cert() twice. I think that's perfectly OK but ought to be mentioned in the ChangeLog and documented in the migration guide.

Good point, I hadn't really thought this through. I'm at least going to add this to a documentation task.

But now I wonder if we should go further and actually stop using mbedtls_pk_sign_ext in TLS. (Verify is fine, you can always export public keys). Because otherwise existing users are going to have applications that keep working just fine with 4.0, either because they didn't notice this entry in the migration guide or they didn't think it applied to them since their application kept working. And then it'll break in some 1.x PK refactoring,

docs/architecture/pk-4.md

Signed-off-by: Gilles Peskine <[email protected]>

mpg

LGTM, could not find anything worse than a harmless typo.

mpg · 2025-03-19T08:50:56Z

docs/architecture/pk-4.md

-* Create an object that wraps the source object. The wrapper object is only valid as long as the source object is valid, and destroying the wrapper object does not affect the source object. Resource management is tricky, but this has a low overhead and works for keys whose material cannot be copied.
+* Create an object that aliases the source object: wrap a PSA key in a PK context, or peek at the underlying PSA key of a PK context. The wrapper/underlying object is only valid as long as the source object is valid. A PK context created by wrapping an existing PSA key does not destroy the PSA key. Resource management is tricky, but this has a low overhead and works for keys whose material cannot be copied.
+
+There is currently no way to access the underlying PSA key of a PK context. A nw function to [access the underlying PSA key of a PK context](#access-the-underlying-psa-key-of-a-pk-context) is not planned for TF-PSA-Crypto 1.0.


docs/architecture/pk-4.md

A TLS 1.2+1.3 server with an RSA key needs the key to allow both PKCS#1v1.5 and PSS. The current method is to parse a key and use both algorithms as needed. This won't work in an all-PSA world. Signed-off-by: Gilles Peskine <[email protected]>

Signed-off-by: Gilles Peskine <[email protected]>

davidhorstmann-arm

A few questions / clarifications and a smattering of typos, but no big problems with the design as a whole.

docs/architecture/pk-4.md

Signed-off-by: Gilles Peskine <[email protected]>

davidhorstmann-arm

LGTM, thanks!

mpg

Looks generally good to me. However, I'm feeling somewhat uncomfortable about this whole set_algorithm() thing. I'll keep thinking about it.

mpg · 2025-03-20T09:52:37Z

docs/architecture/pk-4.md

+
+An alternative approach is to require copying the key after parsing. This is what we're effectively doing when the application wants to use the key through PSA: it calls `mbedtls_pk_parse_xxx()`, then `mbedtls_pk_import_into_psa()` (after which it can free the intermediate PK object). But what if the application wants to use the key through PK? The current workflow can be:
+
+* `mbedtls_pk_set_algorithm()` — but as noted above this is not future-proof.


I think for keys that were just parsed (or more generally, are exportable - or even just copyable) we can make it do an export-import under the hood, so it keeps working when all (private) keys are backed by PSA.

For keys that can't be copied it would fail though. And things that will fail in 1.x should already fail in 1.0. We can achieve that by adding an extra check, or already using the future-proof implementation in 1.0.

gilles-peskine-arm · 2025-03-20T10:07:55Z

I'm feeling somewhat uncomfortable about this whole set_algorithm() thing.

Same, the more I sleep on it the more wrong it feels.

gilles-peskine-arm · 2025-03-23T23:26:26Z

docs/architecture/pk-4.md

+
+ACTION (https://github.com/Mbed-TLS/TF-PSA-Crypto/pull/204): populate the field `pk->psa_algorithm` when populating `pk`. Use the same approach as the existing function `mbedtls_pk_get_psa_attributes()` for a signature usage.
+
+ACTION (https://github.com/Mbed-TLS/TF-PSA-Crypto/pull/204): implement a new function to change the algorithm associated with a PK context:


Having thought about it further, this is definitely wrong. It can't be implemented in a reasonable way on top of a PSA key, since you can't change the PSA key's policy, and having a PK algorithm that doesn't match the policy of the underlying PSA key is pointless.

We need a different solution for the following scenario:

I parse a key.

Based on some of the key's metadata, I decide which signature algorithm to use it with.

I sign with pk.

There is already such a workflow. It's somewhat inefficient, but I think we can live with it.

Parse the key.

mbedtls_pk_get_psa_attributes() and mbedtls_pk_import_into_psa(). This workflow was created precisely to solve this problem.

mbedtls_pk_copy_from_psa() or, to save some resources, mbedtls_pk_wrap_psa().

Signed-off-by: Gilles Peskine <[email protected]>

mpg · 2025-04-24T10:26:28Z

Trying to summarize my thoughts. This is mostly redundant with what you've written already, but (re)writing it myself helps me organize my thoughts.

I think the public key case is the easiest, so I'll focus on the private key case for now.

It seems to me there's two ways a user might want a private RSA key to be used:

with a fixed algorithm decided in advance (for example, only PSS);
with either algorithm depending on what's negotiated in TLS.

Also, there are two ways a user might get their RSA key:

by parsing a file / string of bytes;
as an existing PSA key.

So that's a matrix of 4 cases to consider:

Parse -> fixed alg.
Parse -> any alg.
PSA key -> fixed alg.
PSA key -> any alg.

The 4th case is not guaranteed to always work (ie if the key is not exportable), and that's a feature not a bug.

Finally, users might either have access to the key_enrolment_alg extension and want to take advantage of it, or not. It's not clear to me if allowing users to take advantage of it when it's available should be a design goal here.

gilles-peskine-arm · 2025-04-24T10:29:57Z

Finally, users might either have access to the key_enrolment_alg extension and want to take advantage of it, or not. It's not clear to me if allowing users to take advantage of it when it's available should be a design goal here.

I consider it an optional goal: nice to have, but not a blocker.

mpg · 2025-04-24T10:36:42Z

So far the general idea is:

Parse -> fixed alg
- if the desired alg is the one that parsing sets by default (v1.5 currently), nothing to do;
- if it's the other one, want to change the alg on an existing PK context.
Parse -> any alg: want to create a copy with the other alg.
PSA -> fixed alg: nothing to do, PSA already has a set alg.
PSA -> any alg: want to create a copy with the other alg (might fail though).

So, it seems to me we have two basic wants:
a. change the alg on an existing PK context
b. create a copy of an existing PK context with another alg

It's easy to express (a) in terms of (b): just copy then destroy the original (though maybe not optimal on resources). It's also easy to express (b) in terms of (a) plus a generic "copy as is" function, but that "copy as is" function is unlikely to be useful for anything else.

mpg · 2025-04-24T10:55:26Z

Consider the following scenario: I have a PSA key that allows both v1.5 (primary alg) and PSS (enrolment alg). I create a PK context from it using pk_wrap_psa(). I then call set_algorithm(PSS) on it.

Should I expect to still be able to do v1.5 signatures with the resulting PK context? It's not clear to me what would be the least surprising here.

More generally, I think the fundamental discomfort I have with set_algorithm is that it becomes unclear what part of the code is responsible for expressing and enforcing policies. I think I'd really like a model where policies are clearly expressed and enforced by PSA only (and PK really is just a wrapper).

The only problem with that is that it kind of exposes implementation details: right now, PK is not just a wrapper around PSA, and the plan is that making it a wrapper will be a 4.x thing. I'm almost tempted to suggest making it a wrapper right now (ie, make RSA key storage work the same as ECC key storage currently works (with MBEDTLS_PK_USE_PSA_EC_DATA): private keys are stored as PSA keys, and public keys as the data needed by psa_import(). But that doesn't really change the issue that it makes implementation details almost part of the API.

mpg · 2025-04-24T10:57:57Z

docs/architecture/pk-4.md

+In both TLS and X.509, we want the following control flow:
+
+1. Parse a certificate, creating a PK context containing a public key.
+2. Determine which signature verification algorithm to use. This information does not come from the certiciate.


Typo: certiciate

gilles-peskine-arm · 2025-04-24T11:07:39Z

In all the variants that I've considered, PK is explicitly not responsible for enforcing policies. However, since PK is the interface, it unavoidably has some control over expressing policies. For example, given that we don't want to change the parsing interface (because we don't have a good alternative), parsing needs to pick some default policy. And that, in turn, means that there has to be some way of using a different policy. There is already a way through import_into_psa. I do wonder if we should just get rid of pk_sign and pk_verify... But it would mean significantly more pre-4.0 work in the code that's using it, so I don't think it would really be a win.

mpg · 2025-04-25T10:52:06Z

What we want to do now (4.0) depends on lot of what we plan to do later (4.x). We don't want to paint ourselves into a corner where we no longer can make the changes we want to do in 4.x without breaking users code. (And as was mentioned in a few other contexts, I think we shouldn't rely just on documentation: things that will break in 4.x should break in 4.0 already.)

So, I'd like to discuss what we want to do next, in 4.x, which is move RSA keys from being stored as rsa_contexts to being held as PSA keys for all PK contexts (for private keys at least), even those created by PKparse. This will associate a policy with each key, which is a significant change from 3.6 where PK essentially didn't have policies.

If at this point, if relying on the "enrolment algorithm" extension is an option, we could make PKparse create RSA keys that are usable with both v1.5 and PSS. This would be the least disruptive compared to current (3.6) behaviour. (PK contexts created by wrapping a PSA key would inherit its policy, as they already to do in 3.6 with setup_opaque().

Technically I think we could do this, as making TLS/X.509 work with other implementations of PSA crypto is not an official goal until 5.0 (where we're likely to remove PK) - but unofficially perhaps there are people who still want to use PK in a context where the "enrolment alg" extension can't be relied on?

If we can't rely on this extension, that means in 4.x (when all PK (private-key) contexts are backed by PSA) then RSA keys suddenly become single-alg. Which means this change should happen in 4.0. Which means either we move that change forward to 4.0, or in the meantime PK has to enforce policies for RSA keys so that code that will break in 4.x breaks in 4.0 already.

Wdyt?

gilles-peskine-arm · 2025-04-25T10:57:32Z

unofficially perhaps there are people who still want to use PK in a context where the "enrolment alg" extension can't be relied on?

TF-M, last I heard.

things that will break in 4.x should break in 4.0 already

That's a good point. So far in the design I've tried to ensure that the new way of doing things will be ok for 4.x, but I haven't added enforcement that “cheating” ways no longer work. I'll think about it.

RSA keys suddenly become single-alg. Which means this change should happen in 4.0. Which means either we move that change forward to 4.0, or in the meantime PK has to enforce policies for RSA keys so that code that will break in 4.x breaks in 4.0 already.

That is mostly the case with mbedtls_pk_set_algorithm: although PK allows you to cheat, you have to do it explicitly by calling a function on the context. You can no longer ignore the policy by calling mbedtls_pk_sign_ext().

mpg · 2025-04-28T10:26:58Z

You can no longer ignore the policy by calling mbedtls_pk_sign_ext().

Because we're removing this function anyway, right? So there's just pk_sign() and when people call it, that will use the algorithm associated with the key.

So we'll need some changes in library/ssl_tls13_generic.c as it's currently calling sign_ext().

You said earlier prototyping wasn't really relevant here because the issue is not to have things work in 4.0 but that they keep working in 4.x. If we add the requirement that things that will fail in 4.x should already fail in 4.0, does this point still stand in your opinion?

At this point, I'm under the impression that we're going in the following direction:

For public keys, we're keeping verify_ext() (presumably with the type argument changed from mbedtls_pk_type_t to psa_algorithm_t) which supports the use cases in X.509 and TLS and works regardless of the underlying implementation (since public keys are always exportable).
For private keys, we're not keeping sign_ext() as it's not compatible PSA-based key storage (and quite questionable from a "crypto best practices" perspective).
For TLS, if a user wants their RSA key to be usable with both v1.5 and an PSS, they will have to explicitly opt in by somehow (see below) creating two PK contexts, one for each alg, and call set_own_cert() twice, one with each key.
We need to adapt the TLS code always call sign() not sign_ext(), and select the correct key for the desired algorithm (or error out if we can't find a key that matches one of the negotiated algs).
For private keys, can_do_ext() should probably only return true for the algorithm that would be used if pk_sign was called on that key. (Currently can_do_ext() returns true if the argument matches the key's enrolment alg, but presumably pk_sign() would always use the key's "primary" alg.)

I feel like the undecided point so far was mostly the "somehow" above for creating two PK contexts. It seems to me the main contenders are:

Don't add new functions (but perhaps documentation and examples) and just let people use mbedtls_pk_import_into_psa() + copy_from_psa()/wrap_psa() (currently called setup_opaque()).
Add pk_set_algorithm() (will fail if the key is PSA-based and not exportable, but will always work on a key that was just parsed). That's cool if the user just wants to use the other alg, but if they want to use both, they need to manage creating a 2nd context with the same key material on their own. (Well, if the key has just been parsed, they could just parse it again to another context and call set_alg on the 2nd context.)
Add pk_copy() that allows changing the alg on the copy. (Again, will fail if the key is PSA-based and not exportable, but will always work on freshly parsed keys.) That's cool if the users wants two context, but slightly less cool if they just want one context with the other alg.
Add both pk_set_alg() and pk_copy().

It seems to me that this part is really just about user convenience. So I think there are two relatively independent parts to this:

A. Whether we want to keep verify_ext() and/or sign_ext(), and if we remove sign_ext(), clearly specify sign() and can_do_ext() behaviour, and: (1) how we'll implement this when RSA keys are stored as rsa_context and when they're held in PSA, and (2) how to adapt our TLS code.
B. What convenience function(s) (if any) we want to provide for users to create a PK context with the desired alg, or a two contexts with different algs from the same key material.

Thanks to Manuel for raising this concern. Signed-off-by: Gilles Peskine <[email protected]>

mpg · 2025-05-05T13:57:37Z

docs/architecture/pk-4.md

+
+If the effective capabilities of `mbedtls_ssl_conf_own_cert()` change, we need to be careful not to end up in a situation where:
+
+1. An application works fine with Mbed TLS 3.6, relying only on documented behavior.


Fun (not) game: dive into 3.6 documentation and see if anything says that when you pass the result of pk_parse_key() (on an RSA) key to mbedtls_ssl_conf_own_cert() then you'll be able to do handshake signatures with any of v1.5 or PSS as a result. I'm honestly not sure the documentation says anything to that effect (or the opposite).

I'm still strongly inclined to consider that something that shouldn't be broken in 3.6.x, and that should be advertised as a breaking change in 4.0 if we decide to change it, of course. It's just that seeing the wording here made me think, part of our problems every time we touch PK is it's underspecified (and lacks a clear design).

We don't say it explicitly, but since PK doesn't claim to set a policy on RSA keys, passing a PK object containing a freshly parsed RSA key to a consumer (here SSL) allows that consumer to use both v1.5 and PSS.

Specify PK for TF-PSA-Crypto 1.0

bf3ddf7

Signed-off-by: Gilles Peskine <[email protected]>

gilles-peskine-arm added size-s Estimated task size: small (~2d) needs-review Every commit must be reviewed by at least two team members needs-reviewer This PR needs someone to pick it up for review priority-high High priority - will be reviewed soon labels Mar 14, 2025

gilles-peskine-arm added this to Roadmap pull requests (new board) Mar 14, 2025

github-project-automation bot moved this to In Development in Roadmap pull requests (new board) Mar 14, 2025

gilles-peskine-arm mentioned this pull request Mar 14, 2025

Make PK partially private #204

Open

5 tasks

Fix internal links

5f4c22c

Signed-off-by: Gilles Peskine <[email protected]>

This was referenced Mar 14, 2025

Update unit tests for the new PK specification #207

Open

Update the PSA migration guide for the new PK #209

Open

Actions created

d23f982

Signed-off-by: Gilles Peskine <[email protected]>

gilles-peskine-arm mentioned this pull request Mar 14, 2025

Remove mbedtls_pk_type_t from the API #210

Open

mpg reviewed Mar 18, 2025

View reviewed changes

gilles-peskine-arm added 3 commits March 18, 2025 23:15

Typos

f2294a9

Signed-off-by: Gilles Peskine <[email protected]>

Clarifications

57ff454

Signed-off-by: Gilles Peskine <[email protected]>

Reasoning for keeping restartable PK unchanged

cab288b

Signed-off-by: Gilles Peskine <[email protected]>

gilles-peskine-arm requested a review from mpg March 18, 2025 22:21

mpg previously approved these changes Mar 19, 2025

View reviewed changes

davidhorstmann-arm self-requested a review March 19, 2025 10:27

gilles-peskine-arm removed the needs-reviewer This PR needs someone to pick it up for review label Mar 19, 2025

gilles-peskine-arm added 2 commits March 19, 2025 15:27

mbedtls_pk_set_algorithm() is not future-proof

ffde533

Signed-off-by: Gilles Peskine <[email protected]>

gilles-peskine-arm dismissed mpg’s stale review via ffde533 March 19, 2025 14:38

davidhorstmann-arm requested changes Mar 19, 2025

View reviewed changes

gilles-peskine-arm added 3 commits March 19, 2025 18:27

typos

62ace9d

Signed-off-by: Gilles Peskine <[email protected]>

Clarifications

e77bff7

Signed-off-by: Gilles Peskine <[email protected]>

Note the danger of indirect references from PK to PSA that can be broken

39a1d8c

Signed-off-by: Gilles Peskine <[email protected]>

gilles-peskine-arm requested review from davidhorstmann-arm and mpg March 19, 2025 17:31

davidhorstmann-arm previously approved these changes Mar 19, 2025

View reviewed changes

mpg reviewed Mar 20, 2025

View reviewed changes

gilles-peskine-arm added the needs-work label Mar 23, 2025

gilles-peskine-arm commented Mar 23, 2025

View reviewed changes

Analyze the use of RSA-PSS in TLS

1f1e4b1

Signed-off-by: Gilles Peskine <[email protected]>

gilles-peskine-arm dismissed davidhorstmann-arm’s stale review via 1f1e4b1 April 11, 2025 07:28

A few alternatives to mbedtls_pk_set_algorithm() that involve copying

beede2d

Signed-off-by: Gilles Peskine <[email protected]>

gabor-mezei-arm self-requested a review April 15, 2025 11:08

mpg reviewed Apr 24, 2025

View reviewed changes

gilles-peskine-arm mentioned this pull request Apr 30, 2025

Unit tests for mbedtls_ssl_conf_own_cert with respect to RSA policies Mbed-TLS/mbedtls#10160

Open

How to ensure 1.x won't break unwitting code

4771c6b

Thanks to Manuel for raising this concern. Signed-off-by: Gilles Peskine <[email protected]>

mpg reviewed May 5, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Specify PK for TF-PSA-Crypto 1.0 #203

Specify PK for TF-PSA-Crypto 1.0 #203

gilles-peskine-arm commented Mar 14, 2025 •

edited by mpg

Loading

mpg left a comment

mpg Mar 17, 2025

gilles-peskine-arm Mar 18, 2025

mpg Mar 19, 2025

gilles-peskine-arm Mar 19, 2025

mpg left a comment

mpg Mar 19, 2025

davidhorstmann-arm left a comment

davidhorstmann-arm left a comment

mpg left a comment

mpg Mar 20, 2025

gilles-peskine-arm commented Mar 20, 2025

gilles-peskine-arm Mar 23, 2025

mpg commented Apr 24, 2025

gilles-peskine-arm commented Apr 24, 2025

mpg commented Apr 24, 2025

mpg commented Apr 24, 2025

mpg Apr 24, 2025

gilles-peskine-arm commented Apr 24, 2025

mpg commented Apr 25, 2025

gilles-peskine-arm commented Apr 25, 2025

mpg commented Apr 28, 2025

mpg May 5, 2025

gilles-peskine-arm May 5, 2025


		An alternative approach is to require copying the key after parsing. This is what we're effectively doing when the application wants to use the key through PSA: it calls `mbedtls_pk_parse_xxx()`, then `mbedtls_pk_import_into_psa()` (after which it can free the intermediate PK object). But what if the application wants to use the key through PK? The current workflow can be:

		* `mbedtls_pk_set_algorithm()` — but as noted above this is not future-proof.


		ACTION (https://github.com/Mbed-TLS/TF-PSA-Crypto/pull/204): populate the field `pk->psa_algorithm` when populating `pk`. Use the same approach as the existing function `mbedtls_pk_get_psa_attributes()` for a signature usage.

		ACTION (https://github.com/Mbed-TLS/TF-PSA-Crypto/pull/204): implement a new function to change the algorithm associated with a PK context:


		If the effective capabilities of `mbedtls_ssl_conf_own_cert()` change, we need to be careful not to end up in a situation where:

		1. An application works fine with Mbed TLS 3.6, relying only on documented behavior.

Specify PK for TF-PSA-Crypto 1.0 #203

Are you sure you want to change the base?

Specify PK for TF-PSA-Crypto 1.0 #203

Conversation

gilles-peskine-arm commented Mar 14, 2025 • edited by mpg Loading

PR checklist

mpg left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mpg left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

davidhorstmann-arm left a comment

Choose a reason for hiding this comment

davidhorstmann-arm left a comment

Choose a reason for hiding this comment

mpg left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gilles-peskine-arm commented Mar 20, 2025

Choose a reason for hiding this comment

mpg commented Apr 24, 2025

gilles-peskine-arm commented Apr 24, 2025

mpg commented Apr 24, 2025

mpg commented Apr 24, 2025

Choose a reason for hiding this comment

gilles-peskine-arm commented Apr 24, 2025

mpg commented Apr 25, 2025

gilles-peskine-arm commented Apr 25, 2025

mpg commented Apr 28, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gilles-peskine-arm commented Mar 14, 2025 •

edited by mpg

Loading