I\'m trying to configure Infinispan as a hibernate 2nd level cache. Everything is fine, but I want to tweak the default configuration, i.e. the values that all caches shared
The default cache configuration for entities is named entity
:
Cache configuration can differ for each type of data stored in the cache. In order to override the cache configuration template, use property
hibernate.cache.infinispan.data-type.cfg
wheredata-type
can be one of:
entity
Entities indexed by@Id
or@EmbeddedId
attribute.
immutable-entity
Entities tagged with@Immutable
annotation or set asmutable=false
in mapping file.
naturalid
Entities indexed by their@NaturalId
attribute.
collection
All collections.
timestamps
Mapping entity type → last modification timestamp. Used for query caching.
query
Mapping query → query result.
pending-puts
Auxiliary caches for regions using invalidation mode caches.
The default for collection
, immutable-entity
and naturalid
is also the configuration specified for entity
, so you don't have to configure them separately (if you don't want separate configurations of course), as can be seen in the documentation and the source code.
Note
In general, making a Hibernate L2 cache distributed may not be a good idea, because entity instances are stored in disassembled hydrated state in the L2 cache, meaning that only ids of the associated entities are stored together with the parent entity state.
Suppose you have the following entities (A
, B
, C
are all cachable):
@Entity
public class A {
@ManyToOne
private B b;
@OneToMany
private Collection cs;
}
Even if the cs
collection were cachable also, to fully assemble an entity A
instance from the cache you would have the following network round trips to the other nodes of the cluster:
A
state.B
state based on id stored in the b
association.cs
ids.cs
collection, fetch the C
entity state, one by one.Obviously, if you are assembling of collection of A
instances (from a result of a query for example), all of the above is executed for each instance of A
.
This all implies that reading data from the database directly (with properly configured lazy loading, using batch size for example), could be much more efficient than all of the network round trips in a distributed cache.
Also, that's one of the reasons why the entity/collection cache should run in the invalidation cluster mode (data are cached only on the node that reads/writes it, but invalidated on other nodes when changed).