Support FIPS enabled machines with MD5 hashing #15299


Merged
merged 2 commits into vllm-project:main on Mar 27, 2025

Conversation

MattTheCuber
Contributor

FIPS-enabled machines prohibit MD5 hashing for security reasons: calling hashlib.md5 in Python on such a machine raises ValueError: [digital envelope routines] unsupported. There are typically two resolutions to this problem: pass the usedforsecurity=False flag, or switch to SHA-1 hashing. The usedforsecurity flag is easier to implement and was introduced in Python 3.9, which is the minimum Python version for this library, so that is the method I chose.
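For illustration, a minimal sketch of the failure and the flag-based fix (the surrounding framing here is mine, not this PR's actual diff):

```python
import hashlib

# On a FIPS-enabled machine, a plain MD5 call fails:
#   hashlib.md5(b"payload")
#   -> ValueError: [digital envelope routines] unsupported

# Marking the digest as non-security-relevant (Python 3.9+) allows it:
digest = hashlib.md5(b"payload", usedforsecurity=False).hexdigest()
print(digest)
```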


👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which starts with a small, essential subset of CI tests to catch errors quickly. You can run additional CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Member

@mgoin mgoin left a comment


Looks reasonable to me (cc security expert @russellb :)). You could consider making an "insecure_md5" function that provides a common wrapper for hashlib.md5(..., usedforsecurity=False)
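A possible shape for that wrapper (the name insecure_md5 comes from the suggestion above; this sketch is illustrative, not code from this PR):

```python
import hashlib

def insecure_md5(data: bytes = b""):
    """MD5 for non-cryptographic uses (fingerprints, cache keys).

    usedforsecurity=False keeps this working on FIPS-enabled machines,
    where security-relevant MD5 is disabled.
    """
    return hashlib.md5(data, usedforsecurity=False)
```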

@mgoin added the ready label (ONLY add when PR is ready to merge/full CI is needed) on Mar 26, 2025
@MattTheCuber
Contributor Author

> You could consider making an "insecure_md5" function that provides a common wrapper for hashlib.md5(..., usedforsecurity=False)

I can do that. Let me know if you want this.

@jaredmcqueen

Excited to see traction on this. Thanks @mgoin and @MattTheCuber!

@russellb
Member

Thank you for the report and PR! I definitely want to get this fixed.

I'm a little torn on the change. Hashing here does have security implications. See this advisory where I changed this code recently:

GHSA-rm76-4mrf-v9r8

It was previously easy to predict KV cache conflicts when using vLLM with Python 3.12. After that change it's much more difficult, since we now always start from a random base number that's different each time you run vLLM. It's a fair argument that using MD5 makes it more feasible to compute and abuse hash collisions, but that doesn't seem practical unless you could discover the random number used as the base of our hash calculations.
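As a rough illustration of that scheme (a generic sketch of the random-base idea, not vLLM's actual prefix-caching code; the names here are hypothetical):

```python
import secrets

# Chosen once per process, so hash values differ on every run and an
# attacker cannot precompute inputs that collide in the key space.
_RANDOM_BASE = secrets.randbits(64)

def block_hash(parent_hash: int, token_ids: tuple) -> int:
    # Each block's hash chains off its parent, rooted at the random
    # base, using Python's built-in hash().
    return hash((parent_hash, token_ids))

root = block_hash(_RANDOM_BASE, (1, 2, 3))
child = block_hash(root, (4, 5, 6))
```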

I think I'm on the side of just accepting the proposed change, since it's "secure enough" that exploiting MD5 hash collisions seems impractical with our current code. I didn't want to approve the change without giving that context first, though.

If you all are still comfortable with the change with the understanding that there IS a slight security implication here, then I'm OK with it.

Switching to SHA-1 isn't trivial, since SHA-1 hashes take up a bit more memory than MD5 hashes and are slightly more expensive to calculate. We'd really need to measure the impact of the change carefully.

@MattTheCuber
Contributor Author

Changing to SHA-1 is definitely the right solution in this context. SHA-1 is FIPS-certified and is the best hashing standard when you need to balance security and speed. I can send you performance evaluations later (if you want) since I don't have them handy, but SHA-1 is significantly faster than MD5 hashing. I can make that swap tomorrow if you would like; I just don't know whether anything external uses the hashes and would break. If you can confirm nothing does, I don't see a reason not to swap. Unless you are computing hundreds of thousands of hashes, I don't think memory will be a problem.
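For the memory point, the digest sizes are easy to check directly (standard hashlib values: MD5 is 16 bytes, SHA-1 is 20, SHA-256 is 32):

```python
import hashlib

for name in ("md5", "sha1", "sha256"):
    h = hashlib.new(name, usedforsecurity=False)
    print(name, h.digest_size, "bytes")
```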

@russellb
Member

Thanks for the response. I had a feeling consensus wouldn't stay on MD5 given that context. I'll share this PR with some of the others who maintain this part of the code.

@russellb
Member

It looks like SHA-1 is being retired from FIPS? Maybe we should look at skipping past SHA-1 to something else:

https://csrc.nist.gov/news/2022/nist-transitioning-away-from-sha-1-for-all-apps

@russellb
Member

OK, I need to back up. I sincerely apologize: I didn't actually read the patch, just assumed which code it was about, and I was completely wrong. I thought this was about how we do hashing for prefix caching, but that uses Python's built-in hash() function, not MD5.

Please forget everything I said in the last few messages, it is not relevant here.

I'm fine with this change, though switching to another algorithm should also be fine, I think. I'll go ahead and approve as-is.

@MattTheCuber
Contributor Author

Oh man, I didn't realize that. I guess switching to SHA-2 would be the best choice? I will test SHA-2 vs. MD5 performance tomorrow; I know SHA-1 is much faster, but I don't know about SHA-2. Although, unless you are doing hundreds of thousands of hashes in a short time, the slowdown will likely be insignificant.
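A quick way to run that comparison (a throwaway timeit sketch, not a rigorous benchmark; the payload size is an assumption, and real measurements should use representative inputs):

```python
import hashlib
import timeit

payload = b"x" * 4096  # assumption: a few KB per hashed item

for name in ("md5", "sha1", "sha256"):
    t = timeit.timeit(
        lambda: hashlib.new(name, payload, usedforsecurity=False).digest(),
        number=100_000,
    )
    print(f"{name}: {t:.3f}s for 100k hashes")
```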

@russellb
Member

Thanks again for raising this. Let me know if you want to talk about any other issues that affect security compliance!

@russellb russellb merged commit 7a6d45b into vllm-project:main Mar 27, 2025
42 checks passed
lengrongfu pushed a commit to lengrongfu/vllm that referenced this pull request Apr 2, 2025
kylesayrs pushed a commit to neuralmagic/vllm that referenced this pull request Apr 2, 2025
Alex4210987 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Apr 5, 2025
lulmer pushed a commit to lulmer/vllm that referenced this pull request Apr 7, 2025
nishith-fujitsu pushed a commit to nishith-fujitsu/vllm that referenced this pull request Apr 9, 2025