DDD: choose relationship or only id reference with JPA/Hibernate

Question

Here is a situation that confuses me. I have two tables: users and articles. One user can write multiple articles, and one article can have only one author. From this business rule, I have two entities: If I follow the JPA style, the Article should be like this: This makes the query service quite easy. For example, I may

Accepted Answer

The problem

The write/command and read/query needs are orthogonal, and they pull the model in opposite directions. That creates tension in a unified model and can (and often does) lead to a huge mess.

Commands

On the one hand, you want aggregate roots (ARs) to be very behavior-focused and to own only the minimal amount of data necessary to enforce invariants in a strongly consistent way. That makes for a model that is easy to test, scalable, and concurrency-friendly, and that lets you immediately identify which data is part of the transactional boundary. When ARs are properly modeled, a command will generally involve a single AR per transaction.

Queries

On the other hand, queries tend to need data from across multiple ARs, which encourages defining each and every relationship as an object reference in the domain model. That works directly against our command-side goal. We are then left with a model that we need to optimize with lazy loading, that is quite opaque about which data gets persisted when saving an object (you have to check the cascade configs), that is harder to set up for tests, that introduces direct coupling between ARs, etc.

The solution? CQRS!

The solution is actually very simple and akin to why we have bounded contexts. Rather than attempting to make a single model fulfill different goals, we can have two models: the command model and the query model.

That's generally referred to as Command Query Responsibility Segregation (CQRS). In its most complex and optimized form, CQRS could mean having an entirely different database (even in kind) to process reads, which allows you to optimize indexes for reads rather than writes, de-normalize data to avoid joins, etc.

Fortunately, most systems do not need that kind of scalability (and complexity), and you can implement CQRS with a much simpler approach: a logical read/write segregation.
In practical terms, that generally just means having two sets of services or handlers: command services/handlers and query services/handlers. For example, you may have a CommandOrderService and a QueryOrderService to process commands and queries respectively.

While the command services would usually load ARs from repositories, execute commands on them, and save them back, the query services are free to use any practical means to gather the data. Sometimes that means leveraging repositories and aggregating data at the application level; sometimes it means executing raw SQL, leveraging database-specific features, and bypassing the domain model entirely.

The point is that with that very simple command/query service split you can focus on optimizing the domain model for writes/commands, and then resort to any data strategy you like to fulfill query needs without polluting your command processing flows. Query services tend to require a range of different dependencies and will often be much more coupled to the infrastructure, which isn't something you'd want for commands but is a perfectly fine trade-off for queries.

There are many examples of such lightweight CQRS implementations in practice, but you can have a look at the application layer of the Implementing Domain-Driven Design (IDDD) Collaboration BC's code on GitHub.

Challenges

Even though I made it sound simple, you are still likely to face challenges. For instance, a different model for commands and queries means you can't easily reuse query object specifications for both. If you used to model authorization rules as AR specifications, you might now have to duplicate those rules on the query side or write custom translators (e.g. spec to SQL).

Another common challenge is mapping complex specialized hierarchies. For instance, you might have a case management system where there are hundreds of different case specializations, each with its own schema.
Manually crafting queries to load the data and then mapping those graphs effectively could be tedious. For that reason, I sometimes use dedicated query entities (not the domain model) where I map object relationships and let the ORM do the work.

Sometimes you may even store JSON in the DB and leverage your DB's JSON-indexing features to process queries, etc.

In the context of Spring specifically, you may need additional boilerplate to integrate Pageable with hand-made queries, or even with a JPAQuery written with Querydsl.

As you can see, there's no one-size-fits-all strategy for processing queries, and that's fine, because it's all carefully abstracted away in a different logical model where you can do whatever works.

Conclusion

You can't imagine how often I could have written a query in 2 minutes (and have) and mapped it manually to a DTO, but instead got drawn deep into forcing it to work through Spring Data with awful annotations and ended up with a sub-optimal and overly complex solution.

Queries are also so much easier to process against a homogeneous data model. Ever tried to query specialized types where the root of the hierarchy didn't own the data you need? It's very impractical with ORMs.

Anyway, in my experience lightweight CQRS has always been better than running queries through the domain model, despite the new challenges that can come with it.
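
To make the lightweight split a bit more concrete for the users/articles case from the question, here is a minimal sketch in plain Java (16+ for the record). Every name in it (Article, InMemoryDb, CommandArticleService, QueryArticleService, ArticleSummary) is made up for illustration, and the in-memory maps are only a stand-in for whatever JPA repositories or SQL you would really use. The command side keeps just the author's id on the Article aggregate, while the query side joins articles and users into a flat DTO without going through aggregate behavior.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Command side: the Article aggregate holds only the author's id,
// not an object reference to a User aggregate.
final class Article {
    private final long id;
    private final long authorId; // id reference to the (hypothetical) User aggregate
    private String title;

    Article(long id, long authorId, String title) {
        this.id = id;
        this.authorId = authorId;
        rename(title);
    }

    // Behavior guarded by an invariant that needs only the aggregate's own data.
    void rename(String newTitle) {
        if (newTitle == null || newTitle.trim().isEmpty()) {
            throw new IllegalArgumentException("title must not be blank");
        }
        this.title = newTitle;
    }

    long id() { return id; }
    long authorId() { return authorId; }
    String title() { return title; }
}

// Hypothetical stand-in for the database (assumption: real code would use JPA or SQL).
final class InMemoryDb {
    final Map<Long, Article> articles = new LinkedHashMap<>();
    final Map<Long, String> userNames = new LinkedHashMap<>();
}

// Command service: load the aggregate, execute the command on it, save it back.
final class CommandArticleService {
    private final InMemoryDb db;

    CommandArticleService(InMemoryDb db) { this.db = db; }

    void renameArticle(long articleId, String newTitle) {
        Article article = db.articles.get(articleId);
        if (article == null) {
            throw new IllegalArgumentException("unknown article " + articleId);
        }
        article.rename(newTitle);
        db.articles.put(articleId, article);
    }
}

// Read model: a flat DTO shaped for the screen, not for enforcing invariants.
record ArticleSummary(long articleId, String title, String authorName) {}

// Query service: free to "join" data in whatever way is practical.
final class QueryArticleService {
    private final InMemoryDb db;

    QueryArticleService(InMemoryDb db) { this.db = db; }

    List<ArticleSummary> articlesWithAuthors() {
        List<ArticleSummary> result = new ArrayList<>();
        for (Article a : db.articles.values()) {
            // Plays the role of a SQL join between articles and users.
            result.add(new ArticleSummary(a.id(), a.title(), db.userNames.get(a.authorId())));
        }
        return result;
    }
}
```

In a real Spring/JPA application the query service would more likely run a JPQL or native SQL projection straight into the DTO, but the shape of the split stays the same: only the command service ever touches aggregate behavior.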
