-
-
Notifications
You must be signed in to change notification settings - Fork 436
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Last-Modified or ETag headers on critical REST API endpoints #3329
Comments
A few screenshots to illustrate: Refreshing the main UI home page when every static resource has already been cached: Here the loading of the UI is delayed by:
Here each item picker is forced to independently make calls to An item cache will a reliable way to determine when it's out-of-date would be a huge improvement in this case. Also important to know, there are "dynamic" fields returned with these entities, independent of their "configuration", for example the state of items or the status of things/rules, these ideally would not be taken into account for the "last modified" date too avoid too many cache invalidations - this would be by convention: if you send a |
I would be very hesitant to ignore state for caching purposes just because a caching request header is present. Many many tools will silently and automatically add caching headers, and they expect that it will not change the semantics of the request. If we could add a param to the items endpoint that simply excludes state from the response, it would make this far more tractable. I like the idea of just having registry listeners that update an ETag on every change, regardless of the change. Honestly I don't think we should even bother trying to persist the ETag across restarts (and re-loading of registries!). |
You're absolutely right.
Could work indeed, or a generic parameter to exclude fields which aren't to be cached (or the opposite, one to specify that the intent is to cache so the |
Why does each item picker need to make independent calls? And wouldn‘t compressing the response also improve the situation? I feel it‘s a bit anti-pattern to have a registry and then cache the registry in the REST resource. |
I don't think he's saying to have the REST resource cache it, but to have the browser cache it. The REST resource would keep track of an ETag, and on any update to the registry invalidate it. |
They're independent components, they don't talk to each other, so either they're retrieving the data ad-hoc (which ensures it's up-to-date; that's what they do now) or there's client-side caching involved, but then there's the question of when to invalidate that cache, which this proposal would solve. |
From @spacemanspiff2007 :
That still leaves tracking metadata changes to which do not generate an event. |
This implements a new optional `cacheable` parameter for these REST endpoints: - `/rest/items` - `/rest/things` - `/rest/rules` When this parameter is set, a flat list of all elements excluding non-cacheable fields (e.g. "state", "transformedState", "stateDescription", "commandDescription" for items, "statusInfo", "firmwareStatus", "properties" for things, "status" for rules) will be retrieved along with a `Last-Modified` HTTP response header. When unknown, the Last-Modified header will be set to the date of the request. Also only when this parameter is set, and a `If-Modified-Since` header is found in the request, that header will be compared to the last known modified date for the corresponding cacheable list. The last modified date will be reset when any change is made on the elements of the underlying registry. If the `If-Modified-Since` date is equal or more recent than the last modified date, then a 304 Not Modified response with no content will be served instead of the usual 200 OK, informing the client that its cache is still valid at the provided date. All other request parameters will be ignored except for "metadata" in the `/rest/items` endpoint. When a metadata selector is set, the resulting item list will be considered like a completely different resource, i.e. it will have its own last modified date. Regarding metadata, the approach to invalidating last modified dates is very conservative: when any metadata is changed, all cacheable lists of items will have their last modified date reset even if the change was in a metadata namespace that wasn't requested. This also implements the abovedescribed behavior for the `/rest/ui/components/{namespace}` endpoint, but no `cacheable` parameter is necessary. The last modified date is tracked by namespace. Signed-off-by: Yannick Schaus <[email protected]>
With #3335 combined with openhab/openhab-webui#1661 the total transferred size when loading the UI's home page can be as low as 7 kB - regardless of how many items, pages or widgets you have (if they have not changed). This could be a huge improvement for those accessing their instance on myopenhab.org with poor upload bandwidth for example. The item pickers example is also far better - the requests are made normally to the server to revalidate the freshness but the response is empty and the 304 response tells the browser to reuse the version from its cache. Note that this is all made by the browser as discussed before, I didn't have to implement any caching, only add the parameters to the URLs. It was therefore important to have a proper |
Again from @spacemanspiff2007 since he still apparently cannot comment here himself. That still leaves tracking metadata changes to which do not generate an event. Correct - but that could be changed. I'm still not convinced if the additional complexity is worth a theoretical bandwidth saving. So this is more a "it's possible that this might save some bandwidth but only under very specific circumstances and we're not even sure if and how often the use is hitting those circumstances". |
TBF, what I identified is that something that's currently possible with JSR223 scripting, but I consider a violation of the abstraction (calling GenericItem.setLabel() on an item provided from the GenericItemProvider) would be even less likely to continue to work, since there's no GenericItemProvider.update() method in order to notify the registry and subsequently clear any downstream cache. I would not consider that a blocker for implementing proper HTTP caching. And anyone that's truly wanting to continue with such (ab)uses, could (ab)use it even more by calling AbstractRegistry.updated directly, masquerading as the GenericItemProvider. Yes, gross, which is why I don't endorse this usage in the first place, but don't take excessive steps to prevent it either. |
* Closes #3329. This implements a new optional `cacheable` parameter for these REST endpoints: - `/rest/items` - `/rest/things` - `/rest/rules` When this parameter is set, a flat list of all elements excluding non-cacheable fields (e.g. "state", "transformedState", "stateDescription", "commandDescription" for items, "statusInfo", "firmwareStatus", "properties" for things, "status" for rules) will be retrieved along with a `Last-Modified` HTTP response header. When unknown, the Last-Modified header will be set to the date of the request. Also only when this parameter is set, and a `If-Modified-Since` header is found in the request, that header will be compared to the last known modified date for the corresponding cacheable list. The last modified date will be reset when any change is made on the elements of the underlying registry. If the `If-Modified-Since` date is equal or more recent than the last modified date, then a 304 Not Modified response with no content will be served instead of the usual 200 OK, informing the client that its cache is still valid at the provided date. All other request parameters will be ignored except for "metadata" in the `/rest/items` endpoint. When a metadata selector is set, the resulting item list will be considered like a completely different resource, i.e. it will have its own last modified date. Regarding metadata, the approach to invalidating last modified dates is very conservative: when any metadata is changed, all cacheable lists of items will have their last modified date reset even if the change was in a metadata namespace that wasn't requested. This also implements the abovedescribed behavior for the `/rest/ui/components/{namespace}` endpoint, but no `cacheable` parameter is necessary. The last modified date is tracked by namespace. Signed-off-by: Yannick Schaus <[email protected]>
* Closes openhab#3329. This implements a new optional `cacheable` parameter for these REST endpoints: - `/rest/items` - `/rest/things` - `/rest/rules` When this parameter is set, a flat list of all elements excluding non-cacheable fields (e.g. "state", "transformedState", "stateDescription", "commandDescription" for items, "statusInfo", "firmwareStatus", "properties" for things, "status" for rules) will be retrieved along with a `Last-Modified` HTTP response header. When unknown, the Last-Modified header will be set to the date of the request. Also only when this parameter is set, and a `If-Modified-Since` header is found in the request, that header will be compared to the last known modified date for the corresponding cacheable list. The last modified date will be reset when any change is made on the elements of the underlying registry. If the `If-Modified-Since` date is equal or more recent than the last modified date, then a 304 Not Modified response with no content will be served instead of the usual 200 OK, informing the client that its cache is still valid at the provided date. All other request parameters will be ignored except for "metadata" in the `/rest/items` endpoint. When a metadata selector is set, the resulting item list will be considered like a completely different resource, i.e. it will have its own last modified date. Regarding metadata, the approach to invalidating last modified dates is very conservative: when any metadata is changed, all cacheable lists of items will have their last modified date reset even if the change was in a metadata namespace that wasn't requested. This also implements the abovedescribed behavior for the `/rest/ui/components/{namespace}` endpoint, but no `cacheable` parameter is necessary. The last modified date is tracked by namespace. Signed-off-by: Yannick Schaus <[email protected]> GitOrigin-RevId: 6e83d3f
Idea from openhab/openhab-webui#1650 (comment):
The Main UI (and possibly others) makes relatively frequent REST API calls to get all things, or item, or rules, or UI components to make sure the data is up-to-date. For instance every time an item or thing picker is rendered, a call to
/rest/items
resp./rest/things
is performed./rest/items
is also called every time the home page is shown, to compute the model cards (locations, equipment, points) with the latest data.It could be a huge performance improvement if these critical and costly endpoints supported either the
ETag
orLast-Modified
HTTP response headers, and their equivalent request headers (If-None-Match
andIf-Modified-Since
).If the preconditions don't match (see JAX-RS - Handling 'Last-Modified' and 'ETag' headers with Request Injection) then a 304 Not Modified response would be sent instead.
The UI and other clients would store the last known list of entities in a client-side cache (possibly even surviving page refreshes with IndexedDB), as well as the last sent EntityTag or modified date retrieved from the response headers, which would then be sent along with future requests. It would react to a 200 OK response by updating the cache before returning the data to the caller, and to a 304 Not Modified response by simply returning the cache.
To determine if the data has changed, resources backed by a Registry could keep the last sent EntityTag or last modified date as well, and add a change listener to reset this last ETag or date when a change was notified.
For items it would surely be a little more complicated since changes to metadata would have to be handled as well.
The text was updated successfully, but these errors were encountered: