The distributed nature of parallel, message-driven systems makes them more difficult to debug than their sequential, synchronous, and centralized counterparts. For these reasons, NServiceBus provides built-in message auditing for every endpoint
By default, auditing is not enabled and must be configured for each endpoint where auditing is required.
It is recommended to specify a central auditing queue for all related endpoints (i.e., endpoints that belong to the same system). This gives an overview of the entire system in one place and can help see how messages correlate. The audit queue can be considered a central record of everything that happened in the system.
When auditing NServiceBus messages, it is important to have the capability to process messages sent to the audit queue: The Particular Service Platform, specifically ServiceControl, processes messages from these auditing queues to provide diagnostic and visualization features. For more information, see the ServiceInsight and ServicePulse documentation.
How it works
Auditing shows the end state of the flow of messages. When configured to audit, NServiceBus captures a copy of every successfully processed message and forwards it to a specified audit queue. When a message fails to be processed it is forwarded to the configured error queue and not be copied to the audit queue.
This means that a message is eventually forwarded to either the audit queue or the error queue, but while it's in flight it will be in the endpoint queue until all immediate and delayed processing attempts have exceeded.
Outgoing messages will only end in the audit queue when successfully processed.
Events
Because auditing only shows processed messages, published messages only appear if there are subscribers for that message. If there are multiple subscribers there will be an audit message for each subscriber in the audit queue after each subscriber successfully processes their copy of the event message. If there is no subscriber or the routing in the messaging infrastructure is misconfigured no event message will be copied to the audit queue.
Performance impact
Enabling auditing has an impact on storage and network resources.
Message throughput
Enabling auditing on all endpoints doubles the global message throughput, which can sometimes be troublesome in high message volume environments.
In such environments, it might make sense only to enable auditing on relevant endpoints that need auditing.
Queue storage capacity
Auditing successfully processed messages can result in storing huge amounts of message data. First in the audit queue and second in any application that will store these messages. If the audit storage logic stops processing audit messages, the audit queue size can grow very fast. Messaging infrastructure has a size limit on the amount of data that can be stored in a queue. If this storage limit is reached, messages can no longer be processed in all the endpoints that have auditing enabled.
Make sure the size limit is increased to allow for scheduled and unscheduled downtime.
Audit store capacity
Perform capacity planning on the store where messages will be written.
When using ServiceControl, read the ServiceControl capacity planning documentation.
Filtering audit messages
In some cases, it might be useful to exclude certain message types from being forwarded to the audit queue. This can be accomplished with a custom behavior in the pipeline.
Message headers
The audited message is enriched with additional headers, which contain information related to processing the message:
- Processing start and end times.
- Processing host ID and name.
- Processing machine address.
- Processing endpoint.
Handling Audit messages
Audit messages can be handled in various ways: saved in a database, custom logged, etc. However, it is important not to leave the messages in the audit queue, as most queuing technologies have upper-bound limits on their queue sizes and depth. By not processing these messages, the limits of the underlying queue technology may be reached.
Configuring auditing
Configure the target audit queue using the configuration API.
endpointConfiguration.AuditProcessedMessagesTo("targetAuditQueue");
Audit configuration options
Two settings control auditing:
Queue Name
The queue name to forward audit messages.
OverrideTimeToBeReceived
To force a TimeToBeReceived on audit messages by setting OverrideTimeToBeReceived
use the configuration syntax below.
Note that while the phrasing is "forwarding a message" in the implementation, it is actually "cloning and sending a new message". This is important when considering TimeToBeReceived since the time taken to receive and process the original message is not part of the TimeToBeReceived of the new audit message. In effect, the audit message receives the full-time allotment of whatever TimeToBeReceived is used.
MSMQ
MSMQ forces the same TimeToBeReceived on all messages in a transaction. Therefore, OverrideTimeToBeReceived is unsupported when using the MSMQ Transport. If OverrideTimeToBeReceived is detected when using MSMQ, an exception will be thrown with the following text:
Setting a custom OverrideTimeToBeReceived for audits is not supported on transactional MSMQ
Default Value
If no OverrideTimeToBeReceived is defined then:
Versions 5 and below: TimeToBeReceived of the original message will be used.
Versions 6 and above: No TimeToBeReceived will be set.
Filtering audit messages
When auditing is enabled, all messages are audited by default. To control which message types are audited, see the audit filter sample.
Filtering individual properties
Auditing works by sending an exact copy of the received message to the audit queue, so filtering out individual properties is not supported.
For sensitive properties, e.g. credit card numbers or passwords, use message property encryption. For large properties, consider the data bus feature to avoid including the actual payload in the audited message.
Adding additional audit information
Additional information can be added to audit messages using a custom behavior, as shown in the following snippet. The additional data will be contained in the audit message headers.
public class CustomAuditDataBehavior : Behavior<IAuditContext>
{
public override Task Invoke(IAuditContext context, Func<Task> next)
{
context.AuditMetadata["myKey"] = "MyValue";
return next();
}
}
Message headers count towards the message size and the additional audit information has to honor the transport's message header limitations.
Custom audit action
The example below shows how to extend the pipeline with a behavior that:
- Stores the message body in an external storage
- Excludes the body from the message sent to the audit queue
- Adds a metadata entry that links to the stored body
public class EnableExternalBodyStorageBehavior : Behavior<IAuditContext>
{
private readonly IExternalBodyStorage storage;
public EnableExternalBodyStorageBehavior(IExternalBodyStorage storage)
{
this.storage = storage;
}
public async override Task Invoke(IAuditContext context, Func<Task> next)
{
var message = context.Message;
var bodyUrl = await storage.StoreBody(message.MessageId, message.Body);
context.AuditMetadata["body-url"] = bodyUrl;
context.AuditAction = new SkipAuditMessageBody();
await next();
}
class SkipAuditMessageBody : RouteToAudit
{
public override IReadOnlyCollection<IRoutingContext> GetRoutingContexts(IAuditActionContext context)
{
var routingContexts = base.GetRoutingContexts(context);
foreach (var routingContext in routingContexts)
{
// clear out the message body
routingContext.Message.UpdateBody(ReadOnlyMemory<byte>.Empty);
}
return routingContexts;
}
}
}
In addition, the behavior needs to be registered in the pipeline.