Most message queues, and some data stores, do not support distributed transactions. This may cause problems when message handlers modify business data. Business data and messages may become inconsistent in the form of zombie records or ghost messages (see below).
The NServiceBus outbox feature ensures consistency between business data and messages. It simulates an atomic transaction, distributed across both the data store used for business data and the message queue used for messaging.
IMessageSessionwon't be handled by the outbox. The outbox requires an incoming message context. Use the
IMessageHandlerContextinstance to dispatch messages handled by the outbox.
Consider a message handler that creates a
User in the business database, and also publishes a
UserCreated event. If a failure occurs during the execution of the message handler, two scenarios may occur, depending on the order of operations.
- Zombie record: The message handler creates the
Userin the database first, then publishes the
UserCreatedevent. If a failure occurs between these two operations:
Useris created in the database, but the
UserCreatedevent is not published.
- The message handler does not complete, so the message is retried, and both operations are repeated. This results in a duplicate
Userin the database, known as a zombie record, which is never announced to the rest of the system.
- Ghost message: The message handler publishes the
UserCreatedevent first, then creates the
Userin the database. If a failure occurs between these two operations:
UserCreatedevent is published, but the
Useris not created in the database.
- The rest of the system is notified about the creation of the
User, but the
Userdoes not exist in the database. This causes further errors in other message handlers which expect the
Userto exist in the database.
To avoid these problems, developers of distributed systems have two options:
- Ensure all message handlers are idempotent. This means each message handler can handle the same message multiple times without adverse side effects. This is often difficult to achieve.
- Implement infrastructure which guarantees consistency between business data and messages. This avoids the need for all messages handlers to be idempotent.
The outbox feature is the infrastructure described in the second option.
The outbox feature guarantees that each message is processed once and only once, using the database transaction used to store business data.
Returning to the earlier example of a message handler which creates a
User and then publishes a
UserCreated event, the following process occurs. Details are described following the diagram.
There is no single transaction that spans all the operations. The operations occur in two separate phases:
- Receive the incoming message from the queue.
- Do not acknowledge receipt of the message yet, so that if processing fails, the message will be delivered again.
- Begin a transaction in the database.
- Check outbox storage in the database to see if the incoming message has already been processed. This is called deduplication.
- If the message has already been processed, skip to step 6.
- If the message has not yet been processed, continue to step 4.
- Execute the message handler for the incoming message
- Any outgoing messages are not immediately sent.
- Store any outgoing messages in outbox storage in the database.
- Commit the transaction in the database.
- This is the operation that ensures consistency between messaging and database operations.
- Check if the outgoing messages have already been sent.
- If the outgoing messages have already been sent, the incoming message is a duplicate, so skip to step 4.
- If the outgoing messages have not yet been sent, continue to Step 2.
- Send the outgoing messages to the queue.
- If processing fails at this point, duplicate messages may be sent to the queue. Any duplicates will have the same
MessageId, so they will be deduplicated by the outbox feature (in phase 1, step 3) in the endpoint that receives them.
- If processing fails at this point, duplicate messages may be sent to the queue. Any duplicates will have the same
- Update outbox storage to show that the outgoing messages have been sent.
- Acknowledge (ACK) receipt of the incoming message so that it is removed from the queue and will not be delivered again.
In phase 1, outgoing messages are not immediately sent. They are serialized and persisted in outbox storage. This occurs in a single transaction (steps 2 to 6) which also includes changes to business data made by message handlers. This guarantees that changes to business data are not made without capturing outgoing messages, and vice-versa.
In phase 2, outgoing messages are sent to the messaging infrastructure and outbox storage is updated to indicate that the outgoing messages have been sent (steps 1 to 3). Due to possible failures, a given message may be sent multiple times. For example, if an exception is thrown in step 3 (failure to update outbox storage) the message will be read from outbox storage again and will be sent again. As long as the receiving endpoints use the outbox feature, these duplicates will be handled by the deduplication in phase 1, step 3.
- For best performance, outbox data should be stored in the same database as business data. For more information, see Transaction scope below.
- The outbox works in an NServiceBus message handler without any additional dependencies. For use outside the context of a message handler, use the TransactionalSession package.
- Because deduplication is done using
MessageId, messages sent outside of an NServiceBus message handler (i.e. from a Web API) cannot be deduplicated unless they are sent with the same
- The outbox is expected to generate duplicate messages from time to time, especially if there is unreliable communication between the endpoint and the message broker.
- Endpoints using the outbox feature should not send messages to endpoints using DTC (see below) as the DTC-enabled endpoints treat duplicates coming from outbox-enabled endpoints as multiple messages.
The performance of the outbox feature depends on the scope of the transactions used to update business data and outbox data. Transactions may be scoped to a single database, multiple databases on a single server, or multiple databases on multiple servers.
- Transactions scoped to a single database are supported by all persisters and usually have the best performance, so it is usually best to store outbox data in the same database as business data.
- Transactions scoped to multiple databases on a single server, also known as cross-database transactions, are supported by some persisters, such as those which use SQL Server. These transactions may have reasonably good performance but they may introduce other concerns. For example, cross-database transactions are not supported by all types of tables in SQL Server.
- Transactions scoped to multiple databases on multiple servers, also known as distributed transactions, are supported by persisters which use SQL Server, but they require the use of MSDTC and should generally be avoided for the same reasons as those listed in the comparison with MSDTC below.
The Microsoft Distributed Transaction Coordinator (DTC) is a Windows technology that enlists multiple local transactions (called resource managers) within a single distributed
TransactionScope. All enlisted transactions either complete successfully as a set or are all rolled back.
MSDTC uses a chatty protocol due to the need for multiple resource managers to continually check on each other to make sure they are prepared to commit their results. An example of where this works well is a distributed transaction including consuming messages from an MSMQ message queue and storing business data in SQL Server. The communication with MSMQ is local and has very low latency, and there are only two resource managers to coordinate, with only one communication path between them. Early versions of NServiceBus primarily used MSMQ and SQL Server and systems built with those versions tended to use MSDTC.
The more resource managers involved and/or the higher the latency between them, the worse MSDTC performs. Cloud environments, in particular, have much higher latency and the message queue and the data store are often not even located in the same server rack.
The rise of cloud infrastructure and decline of MSMQ have contributed to the overall decline in use of MSDTC in the software development industry.
The outbox feature is designed to provide the same level of consistency between business data and messages provided by MSDTC, without the need to coordinate multiple resource managers.
In order to enable the Outbox, use the following code API:
Each NServiceBus persistence package may contain specific configuration options, such as a time to keep deduplication data and a deduplication data cleanup interval. For details, refer to the specific page for each persistence package in the persistence section below.
When converting a system from using the DTC to the outbox, care must be taken to ensure the system does not process duplicate messages incorrectly.
Because the outbox feature uses an "at least once" consistency guarantee at the transport level, endpoints that enable the outbox may occasionally send duplicate messages. These duplicate messages will be properly handled by deduplication in other outbox-enabled endpoints, but will be processed more than once by endpoints which still use the DTC.
In order to gradually convert an entire system from the DTC to the outbox:
- Enable the outbox on any endpoints that receive messages but do not send or publish any messages.
- Enable the outbox on any endpoints that only send or publish messages to already-converted endpoints, where the outbox will be able to properly handle any duplicate messages.
- Progress outward until all endpoints are converted.
The outbox only uses the incoming message identifier as a unique key for deduplicating messages. If the sender does not use the outbox when sending messages, it is responsible for ensuring that the message identifier value is consistent when that message is sent multiple times.
To determine if a message has been processed before, the identification data for each outbox record is retained. The duration that this data is retained for varies depending on the persistence chosen for the outbox. The default duration, as well as the frequency of data removal, can be overridden for all outbox persisters.
After the outbox data retention period has lapsed, a retried message will be seen as the first of its kind and will be reprocessed. It is important to ensure that the retention period of outbox data is longer than the maximum time the message can be retried, including delayed retries and manual retries via ServicePulse.
Depending on the throughput of the system's endpoints, the outbox cleanup interval may need to be run more frequently. Increased frequency will allow each cleanup operation to purge the fewest records possible each time it runs. Purging fewer records will make the purge operation run faster which will ensure that it completes before the next purge operation is due to start.
The amount of storage space required for the outbox can be calculated as follows:
Total outbox records = Message throughput per second * Deduplication period in seconds
A single outbox record, after all transport operations have been dispatched, usually requires less than 50 bytes, most of which are taken for storing the original message ID as this is a string value.
The outbox feature requires persistence in order to perform deduplication and to store outgoing downstream messages.
For more information on the outbox persistence options available refer to the dedicated persistence pages: