
Nullpointer when using Azure Open AI and streaming #2691


Open
berjanjonker opened this issue Apr 10, 2025 · 7 comments · May be fixed by #2789

Comments

@berjanjonker commented Apr 10, 2025

Bug description
While using the AzureOpenAI chat client, streaming doesn't work; it fails with the NullPointerException below. When I subscribe to and print the .content() Flux, I noticed that the last received token is null.

2025-04-10 17:38:22.611 [http-nio-8080-exec-7] ERROR o.a.c.c.C.[.[.[.[dispatcherServlet].log - Servlet.service() for servlet [dispatcherServlet] threw exception
java.lang.NullPointerException: Cannot invoke "com.azure.ai.openai.models.ChatResponseMessage.getToolCalls()" because "responseMessage" is null
at org.springframework.ai.azure.openai.AzureOpenAiChatModel.buildGeneration(AzureOpenAiChatModel.java:498)
Suppressed: reactor.core.publisher.FluxOnAssembly$OnAssemblyException:
Assembly trace from producer [reactor.core.publisher.FluxMapFuseable] :
reactor.core.publisher.Flux.map(Flux.java:6588)
org.springframework.ai.azure.openai.AzureOpenAiChatModel.lambda$internalStream$13(AzureOpenAiChatModel.java:381)

Environment
Spring-AI 1.0.0-M6
Chat Model: Azure OpenAI

Steps to reproduce
Set a custom content filter on your model: Azure AI Foundry->Safety+Security->Create content filter->Output Filter->Streaming mode (Preview)->Asynchronous Filter
chatClient.prompt().user("How are you?").stream().content().doOnEach(data -> System.out.println(data.get()));

//output
How
can
I
assist
you
today
?
null

Expected behavior
AzureOpenAiChatModel should be null safe or null values should be filtered out.

Minimal Complete Reproducible example
See above. When I switch to another vendor like Anthropic, the result is as expected (without a null at the end of the stream).

@dev-jonghoonpark (Contributor)

I got the same result with the OpenAI module.

@dev-jonghoonpark (Contributor)

I have found that this issue is not related to Spring AI.

doOnEach handles multiple events.
In the provided code, it calls the onComplete event as the final step after all tasks are completed.
Since there is no data in the onComplete event, it results in null.

Using doOnNext instead of doOnEach will resolve the issue.
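
A minimal standalone sketch of the difference on a plain Reactor Flux (not tied to Spring AI):

import reactor.core.publisher.Flux;

public class SignalDemo {
    public static void main(String[] args) {
        Flux<String> words = Flux.just("How", "can", "I", "assist", "you", "today", "?");

        // doOnEach receives every Signal: each onNext value plus the terminal
        // onComplete, whose Signal.get() is null -- the trailing null above.
        words.doOnEach(signal -> System.out.println(signal.getType() + ": " + signal.get()))
             .blockLast();

        // doOnNext receives only emitted values, so it never prints null.
        words.doOnNext(System.out::println)
             .blockLast();
    }
}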

@ReloadingPeace

> I have found that this issue is not related to Spring AI.
>
> doOnEach handles multiple events. In the provided code, it calls the onComplete event as the final step after all tasks are completed. Since there is no data in the onComplete event, it results in null.
>
> Using doOnNext instead of doOnEach will resolve the issue.

I am a beginner, so please forgive any mistakes. Here is my opinion:
I think the null pointer, caused by AzureOpenAiChatModel not processing the ChatResponse properly, cannot be avoided on the consumer side, whether you use doOnNext or doOnEach.

@berjanjonker (Author) commented Apr 17, 2025

> I have found that this issue is not related to Spring AI.
>
> doOnEach handles multiple events. In the provided code, it calls the onComplete event as the final step after all tasks are completed. Since there is no data in the onComplete event, it results in null.
>
> Using doOnNext instead of doOnEach will resolve the issue.

> I am a beginner, so please forgive any mistakes. Here is my opinion: I think the null pointer, caused by AzureOpenAiChatModel not processing the ChatResponse properly, cannot be avoided on the consumer side, whether you use doOnNext or doOnEach.

I agree. I created a PR (#2789) to make the processing of ChatResponses more robust.
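
A hedged sketch of the kind of guard such a fix implies, modeled on the stack trace above rather than the actual diff in PR #2789; safeToolCalls is a hypothetical helper name:

import com.azure.ai.openai.models.ChatCompletionsToolCall;
import com.azure.ai.openai.models.ChatResponseMessage;
import java.util.List;

// Hypothetical helper, not the actual change in PR #2789: treat a missing
// message as having no tool calls instead of dereferencing null.
static List<ChatCompletionsToolCall> safeToolCalls(ChatResponseMessage responseMessage) {
    return (responseMessage == null || responseMessage.getToolCalls() == null)
            ? List.of()
            : responseMessage.getToolCalls();
}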

@markpollack (Member)

Thanks so much! Will review.

@markpollack added this to the 1.0.0-RC1 milestone Apr 18, 2025
@markpollack (Member)

I am testing with M7. The snippet to reproduce this error

chatClient.prompt().user("How are you?").stream().content().doOnEach(data -> System.out.println(data.get())); 

Passes with M7.

Also, the test in PR #2789 passes without the fix in the code.

I'm not sure what is going on here, though I suppose the extra checks in the PR don't hurt.

Thoughts?

@berjanjonker (Author) commented Apr 22, 2025

Thanks for testing @markpollack!
I did some testing with different setups in Azure and found the cause: it is the content filtering option asynchronous-filter.
As you can see in the example below, with the asynchronous filter enabled the last data message is null/empty and the content-filter result is sent at a later moment.

data: {"id":"","object":"","created":0,"model":"","prompt_annotations":[{"prompt_index":0,"content_filter_results":{"hate":{"filtered":false,"severity":"safe"},"self_harm":{"filtered":false,"severity":"safe"},"sexual":{"filtered":false,"severity":"safe"},"violence":{"filtered":false,"severity":"safe"}}}],"choices":[],"usage":null} 

data: {"id":"chatcmpl-7rCNsVeZy0PGnX3H6jK8STps5nZUY","object":"chat.completion.chunk","created":1692913344,"model":"gpt-35-turbo","choices":[{"index":0,"finish_reason":null,"delta":{"role":"assistant"}}],"usage":null} 

data: {"id":"chatcmpl-7rCNsVeZy0PGnX3H6jK8STps5nZUY","object":"chat.completion.chunk","created":1692913344,"model":"gpt-35-turbo","choices":[{"index":0,"finish_reason":null,"delta":{"content":"Color"}}],"usage":null} 

data: {"id":"chatcmpl-7rCNsVeZy0PGnX3H6jK8STps5nZUY","object":"chat.completion.chunk","created":1692913344,"model":"gpt-35-turbo","choices":[{"index":0,"finish_reason":null,"delta":{"content":" is"}}],"usage":null} 

data: {"id":"chatcmpl-7rCNsVeZy0PGnX3H6jK8STps5nZUY","object":"chat.completion.chunk","created":1692913344,"model":"gpt-35-turbo","choices":[{"index":0,"finish_reason":null,"delta":{"content":" a"}}],"usage":null} 

data: {"id":"chatcmpl-7rCNsVeZy0PGnX3H6jK8STps5nZUY","object":"chat.completion.chunk","created":1692913344,"model":"gpt-35-turbo","choices":[{"index":0,"finish_reason":"stop","delta":{}}],"usage":null} 

data: {"id":"","object":"","created":0,"model":"","choices":[{"index":0,"finish_reason":null,"content_filter_results":{"hate":{"filtered":false,"severity":"safe"},"self_harm":{"filtered":false,"severity":"safe"},"sexual":{"filtered":false,"severity":"safe"},"violence":{"filtered":false,"severity":"safe"}},"content_filter_offsets":{"check_offset":506,"start_offset":44,"end_offset":571}}],"usage":null} 

data: [DONE]

You can reproduce this if you set a custom content filter on your model: Azure AI Foundry->Safety+Security->Create content filter->Output Filter->Streaming mode (Preview)->Asynchronous Filter
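
A hedged sketch of how the streaming path could skip these extra chunks, assuming Azure's ChatCompletions.getChoices() and ChatChoice.getDelta(); dropFilterOnlyChunks is an illustration, not the fix in #2789:

import com.azure.ai.openai.models.ChatCompletions;
import reactor.core.publisher.Flux;

// Hypothetical pre-filter: the two annotation-only chunks above either have
// an empty "choices" array or a choice without a delta, so drop any chunk
// that carries no real delta before mapping it to a ChatResponse.
static Flux<ChatCompletions> dropFilterOnlyChunks(Flux<ChatCompletions> chunks) {
    return chunks.filter(chunk -> chunk.getChoices() != null
            && chunk.getChoices().stream().anyMatch(choice -> choice.getDelta() != null));
}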
