Enabling HTTP caching of inference results · pytorch/serve#1527

2 comments · 4 reactions · 1 assignee · Java · 790 forks

enhancement, help wanted


TorchServe sets cache-prevention headers on its HTTP responses. This prevents some reverse proxies, such as nginx, from caching inference results.

It would be great to have a way to override this behavior, either from the model handler, or through some configuration key.
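One way the configuration-key option could look is sketched below in plain Java. This is not TorchServe code: the class name `CacheHeaderPolicy`, the keys `allow_response_caching` and `response_cache_max_age`, and the exact header values are all assumptions made for illustration. The sketch only shows the decision logic, keeping the no-cache headers by default and emitting a cacheable `Cache-Control` header when the operator opts in.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Properties;

/** Hypothetical helper for illustration; not part of TorchServe. */
final class CacheHeaderPolicy {
    // Hypothetical configuration keys, invented for this sketch.
    static final String ALLOW_CACHING_KEY = "allow_response_caching";
    static final String MAX_AGE_KEY = "response_cache_max_age";

    private CacheHeaderPolicy() {}

    /**
     * Returns the caching-related response headers for the given configuration.
     * Caching stays disabled unless the operator explicitly enables it.
     */
    static Map<String, String> headersFor(Properties config) {
        Map<String, String> headers = new LinkedHashMap<>();
        boolean allowCaching =
                Boolean.parseBoolean(config.getProperty(ALLOW_CACHING_KEY, "false"));

        if (allowCaching) {
            // Let shared caches (for example a reverse proxy) store the response.
            long maxAgeSeconds = Long.parseLong(config.getProperty(MAX_AGE_KEY, "60"));
            headers.put("Cache-Control", "public, max-age=" + maxAgeSeconds);
        } else {
            // Default: typical cache-prevention headers (values are illustrative).
            headers.put("Pragma", "no-cache");
            headers.put("Cache-Control", "no-cache, no-store, must-revalidate, private");
            headers.put("Expires", "Thu, 01 Jan 1970 00:00:00 GMT");
        }
        return headers;
    }
}
```

In a real change, the returned map would presumably be copied onto the outgoing response at the point where the server currently sets its cache-prevention headers. A per-model override from the handler would need a separate mechanism and is not covered by this sketch.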


Research direction: Investigate the file frontend/server/src/main/java/org/pytorch/serve/util/NettyUtils.java, specifically lines 187-190 where caching prevention headers are set. Consider adding a configuration option or model handler method to allow overriding these headers. Review existing configuration mechanisms in TorchServe to understand how to implement a toggle for caching. Look at discussions in the issue comments for any additional context or proposed solutions.
Tech stack: java, python
Domain: backend, performance
Issue type: Feature
Difficulty: 3
Estimated time: Half day
Activity status: Stale
Clarity: Clear
Prerequisites: Java, Netty, TorchServe internals, HTTP caching concepts
Newbie friendliness: 40